Blogs (3) >>

This program is tentative and subject to change.

Fri 28 Feb 2025 10:45 - 11:03 at Meeting Rooms 310-311 - AI/Machine Learning

Leveraging a Large Language Model (LLM) for personalized learning in computing education is promising, yet cloud-based LLMs pose risks around data security and privacy. To address these concerns, we developed and deployed a locally stored Small Language Model (SLM) utilizing Retrieval-Augmented Generation (RAG) methods to support computing students’ learning. Previous work has demonstrated that SLMs can match or surpass popular LLMs (gpt-3.5-turbo and gpt-4-32k) in handling conversational data from a CS1 course. We deployed SLMs with RAG (SLM + RAG) in a large course having more than 250 students use our system, fielding nearly 2,000 student questions while evaluating data privacy, scalability, and feasibility of local deployments. This paper provides a comprehensive guide for deploying SLM + RAG systems, detailing model selection, vector database choice, embedding methods, and pipeline frameworks. We share practical insights from our deployment, including scalability concerns, accuracy versus context length trade-offs, guardrails and hallucination reduction, as well as data privacy maintenance. We address the “Impossible Triangle” in RAG systems, which states that achieving high accuracy, short context length, and low time consumption simultaneously is not feasible. Furthermore, our novel RAG framework, Intelligence Concentration (IC), categorizes information into multiple layers of abstraction within Milvus collections mitigating trade-offs and enables educational assistants to deliver more relevant and personalized responses to students quickly.

This program is tentative and subject to change.

Fri 28 Feb

Displayed time zone: Eastern Time (US & Canada) change

10:45 - 12:00
AI/Machine LearningPapers at Meeting Rooms 310-311
10:45
18m
Talk
Integrating Small Language Models with Retrieval-Augmented Generation in Computing Education: Key Takeaways, Setup, and Practical Insights
Papers
Zezhu Yu University of Toronto, Suqing Liu University of Toronto Mississauga, Paul Denny The University of Auckland, Andreas Bergen University of Toronto Mississauga, Michael Liut University of Toronto Mississauga
11:03
18m
Talk
Leveraging Undergraduate Perspectives to Redefine AI Literacy
Papers
Jack Ebert University of Maryland, College Park, Kristina Kramarczuk University of Maryland, College Park
11:22
18m
Talk
PhysioML: A Web-Based Tool for Machine Learning Education with Real-Time Physiological Data
Papers
Bryan Y. Hernández-Cuevas University of Alabama, Myles Lewis University of Alabama, Wesley Junkins University of Alabama, Chris Crawford University of Alabama, Andre Denham University of Alabama, Feiya Luo University of Alabama
11:41
18m
Talk
Fostering Creativity: Student-Generative AI Teaming in an Open-Ended CS0 Assignment
Papers
Daniel Filcik U.S. Military Academy, Edward Sobiesk United States Military Academy, Suzanne Matthews United States Military Academy