There is both great hope and great concern about the future of Computer Science practice and education in light of the recent advent of large language models (LLMs).

We present the first study to extensively evaluate the ability of such models to solve problems in Computer Science Theory. We tested 183 exam-level problems across 16 topics in Computer Science Theory, ranging from preliminary data structures to algorithm design paradigms to the Theory of Computation. Our results use recent popular models (GPT-4 and GPT-4o) without image recognition abilities. This is a rapidly evolving field, with model performance continuously improving; we present our results primarily as an indication of what these models can already achieve (equivalently, how they can already be useful) today, fully expecting them to improve even further in the near future.

Our results show that what was very recently a state-of-the-art model (GPT-4) is able to solve a majority (78%) of free-response problems in Data Structures and Algorithms with little to no guidance. The latest model, GPT-4o, was able to solve nearly half of the Theory of Computation problems we posed, and the problems it could not solve fell into predictable categories. When broken down by topic, the model was able to solve 85% of problems in four of the nine data structures and algorithms topics and at least half in four others. Other problems, namely more visual ones, either required more substantial coaching or seem to still be beyond the capabilities of the language model, for now.