Fine-tuning LLaMA on Custom Data for QA Tasks


Student Name: Rahul Purswani
Defense Date:
Location: Eaton Hall, Room 2001B
Chair: David Johnson
Committee Members: Drew Davidson, Prasad Kulkarni

Abstract:

Fine-tuning large language models (LLMs) for domain-specific use cases, such as question answering, offers valuable insight into how their performance can be tailored to specialized information needs. In this project, we focused on the University of Kansas (KU) as our target domain. We began by scraping structured and unstructured content from official KU webpages, covering a wide array of student-facing topics, including campus resources, academic policies, and support services. From this content, we generated a diverse set of question-answer pairs to form a high-quality training dataset. LLaMA 3.2 was then fine-tuned on this dataset to improve its ability to answer KU-specific queries with greater relevance and accuracy. Our evaluation revealed mixed results: while the fine-tuned model outperformed the base model on most domain-specific questions, the original model still had an edge in handling ambiguous or out-of-scope prompts. These findings highlight the strengths and limitations of domain-specific fine-tuning and provide practical takeaways for customizing LLMs for real-world QA applications.
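The announcement does not include the project's training code. For context, below is a minimal sketch of how a fine-tune like the one described might look, assuming the Hugging Face transformers, datasets, and peft libraries. The checkpoint name, the ku_qa_pairs.jsonl file, the prompt template, and all hyperparameters are illustrative assumptions, not the project's actual setup.

    from datasets import load_dataset
    from peft import LoraConfig, get_peft_model
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer,
                              TrainingArguments)

    # Assumed checkpoint; the LLaMA 3.2 weights are gated on Hugging Face.
    MODEL_ID = "meta-llama/Llama-3.2-1B"

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    tokenizer.pad_token = tokenizer.eos_token  # LLaMA ships without a pad token
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    # Hypothetical JSONL of scraped KU question-answer pairs,
    # one {"question": ..., "answer": ...} object per line.
    dataset = load_dataset("json", data_files="ku_qa_pairs.jsonl", split="train")

    def format_example(example):
        # Concatenate question and answer into one causal-LM training string.
        text = (f"Question: {example['question']}\n"
                f"Answer: {example['answer']}{tokenizer.eos_token}")
        return tokenizer(text, truncation=True, max_length=512)

    tokenized = dataset.map(format_example, remove_columns=dataset.column_names)

    # LoRA keeps the fine-tune lightweight: only low-rank adapter weights
    # on the attention projections are trained.
    peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                             target_modules=["q_proj", "v_proj"],
                             task_type="CAUSAL_LM")
    model = get_peft_model(model, peft_config)

    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir="llama32-ku-qa",
                               num_train_epochs=3,
                               per_device_train_batch_size=4,
                               learning_rate=2e-4),
        train_dataset=tokenized,
        # mlm=False makes the collator build next-token-prediction labels.
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
    )
    trainer.train()

A parameter-efficient method such as LoRA is a common choice for a project of this scale, since full fine-tuning of even a small LLaMA variant requires substantially more GPU memory; whether this project used adapters or full fine-tuning is not stated in the abstract.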

Degree Type: MS Project Defense
Degree Field: Computer Science