Leveraging a Large Language Model (LLM) for personalized learning in computing education is promising, yet cloud-based LLMs pose risks around data security and privacy. To address these concerns, we developed and deployed a locally hosted Small Language Model (SLM) that uses Retrieval-Augmented Generation (RAG) to support computing students’ learning. Previous work has demonstrated that SLMs can match or surpass popular LLMs (gpt-3.5-turbo and gpt-4-32k) in handling conversational data from a CS1 course. We deployed SLMs with RAG (SLM + RAG) in a large course in which more than 250 students used our system, fielding nearly 2,000 student questions, while we evaluated data privacy, scalability, and the feasibility of local deployment. This paper provides a comprehensive guide for deploying SLM + RAG systems, detailing model selection, vector database choice, embedding methods, and pipeline frameworks. We share practical insights from our deployment, including scalability concerns, accuracy versus context length trade-offs, guardrails and hallucination reduction, and maintaining data privacy. We address the “Impossible Triangle” in RAG systems, which holds that high accuracy, short context length, and low time consumption cannot all be achieved simultaneously. Furthermore, our novel RAG framework, Intelligence Concentration (IC), categorizes information into multiple layers of abstraction within Milvus collections, mitigating these trade-offs and enabling educational assistants to deliver more relevant and personalized responses to students quickly.
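
To illustrate the layered-collection idea behind Intelligence Concentration, the sketch below shows one way such retrieval could be organized with Milvus: query a concise, abstract collection first and descend to more detailed collections only when the best match is weak, keeping the prompt context short without sacrificing accuracy. This is a minimal illustration, not the authors' implementation; the collection names, embedding model, similarity metric, and threshold are assumptions.

```python
from pymilvus import MilvusClient
from sentence_transformers import SentenceTransformer

client = MilvusClient("course_assistant.db")          # local Milvus Lite file (assumption)
embedder = SentenceTransformer("all-MiniLM-L6-v2")    # local embedding model (assumption)

# Hypothetical layers, ordered from most abstract (shortest context) to most detailed.
LAYERS = ["course_summaries", "topic_notes", "raw_materials"]
SIM_THRESHOLD = 0.60                                  # illustrative cutoff, assumes a cosine metric

def retrieve(question: str, top_k: int = 3):
    """Return hits from the most abstract layer that answers the query well enough."""
    query_vec = embedder.encode(question).tolist()
    hits = []
    for collection in LAYERS:
        hits = client.search(
            collection_name=collection,
            data=[query_vec],
            limit=top_k,
            output_fields=["text"],
        )[0]
        # If the best hit in this layer is similar enough, stop here to keep the
        # retrieved context short; otherwise descend to a more detailed layer.
        if hits and hits[0]["distance"] >= SIM_THRESHOLD:
            return collection, hits
    return LAYERS[-1], hits
```

Under these assumptions, most routine questions would be answered from the abstract layer with little context, while harder questions fall through to richer collections, which is one way a system could soften the accuracy versus context-length versus latency trade-off described above.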