Blogs (3) >>

This program is tentative and subject to change.

Thu 27 Feb 2025 14:22 - 14:41 at Meeting Rooms 315-316 - Data Science #1

Data scientists often need to read and understand messy and undocumented code that relies on large software libraries. What makes data science experts more effective than novices at this task? To understand expert practices, we conducted a think-aloud study where 4 novice and 5 expert data scientists reasoned about an unfamiliar data analysis script with realistic complexity that used the Python pandas library. Surprisingly, familiarity of the pandas package had relatively minor importance for experts. Instead, experts consistently performed three practices that novices did not: experts examined the data in detail rather than fixating on surface-level code features; experts consistently verified their assumptions about how the data was transformed; and experts navigated lengthy program outputs in a goal-directed way. Using these findings, we provide a practical set of guidelines for data science pedagogy and for future tools to support data science learners.

This program is tentative and subject to change.

Thu 27 Feb

Displayed time zone: Eastern Time (US & Canada) change

13:45 - 15:00
Data Science #1Papers at Meeting Rooms 315-316
13:45
18m
Talk
Approachable Machine Learning Education: A Spiral Pedagogy Approach with Experiential Learning
Papers
Meiying Qin York University
14:03
18m
Talk
A Window into DataWorks: Developing an Integrated Work-Training Curriculum for Novice Adults
Papers
Lara Schenck Georgia Institute of Technology, Dana Priest DataWorks at Georgia Tech, Gabe Dubose Emory University, Zajerria Godfrey Maynard Jackson High School, Annabel Rothschild Georgia Institute of Technology, Benjamin Shapiro Georgia State University, Betsy Disalvo Georgia Institute of Technology
14:22
18m
Talk
"I'm not sure, but...": Expert Practices that Enable Effective Code Comprehension in Data Science
Papers
Christopher Lum UC San Diego, Guoxuan Xu UC San Diego, Sam Lau University of California at San Diego
14:41
18m
Talk
Larger than Life In-Class Demonstrations for Introductory Machine Learning
Papers
Henry Chai Carnegie Mellon University, Matthew R. Gormley Carnegie Mellon University