Investigating the Capabilities of Generative AI in Solving Data Structures, Algorithms, and Computability Problems
This program is tentative and subject to change.
There is both great hope and great concern about the future of Computer Science practice and education with respect to the recent advent of large language models (LLMs).
We present the first study to extensively evaluate the ability of such a model to solve problems in Computer Science Theory. We tested 183 exam-level problems across 16 specific topics in various Computer Science Theory related topics, ranging from preliminary data structures to algorithm design paradigms to Theory of Computation. Our results use the recent popular models (GPT-4 and GPT-4o) without image recognition abilities. This is a rapidly evolving field, with model performance continuously improving. We present our results primarily as an indication of what they can already achieve—equivalently how they can already be useful—today, fully expecting them to improve even further in the near future.
Our results show that what was very recently a state-of-the-art model (GPT-4) is able to solve a majority (78%) of free-response problems in Data Structure and algorithm with little to no guidance. The latest model, GPT-4o, was able to solve nearly half the Theory of Computation problems we posed, with predictable categories for problems it could not solve. When broken down by topic, the model was able to solve 85% of problems in four out of the nine topics in data structures and algorithms and at least half in four others. Other problems, namely more visual problems, either required more substantial coaching, or seem to still be beyond the capabilities of the language model–- for now.
This program is tentative and subject to change.
Thu 27 FebDisplayed time zone: Eastern Time (US & Canada) change
15:45 - 17:00 | |||
15:45 18mTalk | Construction and Preliminary Validation of a Dynamic Programming Concept Inventory Papers Matthew Ferland University of Southern California, Varun Nagaraj Rao Princeton University, Arushi Arora University of California, Irvine, Drew van der Poel Northeastern, Michael Luu University of California, Irvine, Randy Huynh University of California Irvine, Frederick Reiber Boston University, Sandra Ossman UC Irvine, Seth Poulsen Utah State University, Michael Shindler University of California, Irvine | ||
16:03 18mTalk | Investigating the Capabilities of Generative AI in Solving Data Structures, Algorithms, and Computability Problems Papers Ofek Gila University of California, Irvine, Shahar Broner University of California, Irvine, Yubin Kim UC Irvine, Computer Science Department, Nero Li UC Irvine, Computer Science Department, Katrina Mizuo UC Irvine, Computer Science Department, Elijah Sauder UC Irvine, Computer Science Department, Claire To UC Irvine, Computer Science Department, Albert Wang UC Irvine, Computer Science Department, Michael Shindler University of California, Irvine | ||
16:22 18mTalk | Reflections on Teaching Algorithm Courses Papers J. Ángel Velázquez-Iturbide Universidad Rey Juan Carlos | ||
16:41 18mTalk | Student Utilization of Metacognitive Strategies in Solving Dynamic Programming Problems Papers Jonathan Liu University of Chicago, Erica Goodwin University of Chicago, Diana Franklin University of Chicago |