"I'm not sure, but...": Expert Practices that Enable Effective Code Comprehension in Data Science
This program is tentative and subject to change.
Data scientists often need to read and understand messy and undocumented code that relies on large software libraries. What makes data science experts more effective than novices at this task? To understand expert practices, we conducted a think-aloud study where 4 novice and 5 expert data scientists reasoned about an unfamiliar data analysis script with realistic complexity that used the Python pandas library. Surprisingly, familiarity of the pandas package had relatively minor importance for experts. Instead, experts consistently performed three practices that novices did not: experts examined the data in detail rather than fixating on surface-level code features; experts consistently verified their assumptions about how the data was transformed; and experts navigated lengthy program outputs in a goal-directed way. Using these findings, we provide a practical set of guidelines for data science pedagogy and for future tools to support data science learners.
This program is tentative and subject to change.
Thu 27 FebDisplayed time zone: Eastern Time (US & Canada) change
13:45 - 15:00 | |||
13:45 18mTalk | Approachable Machine Learning Education: A Spiral Pedagogy Approach with Experiential Learning Papers Meiying Qin York University | ||
14:03 18mTalk | A Window into DataWorks: Developing an Integrated Work-Training Curriculum for Novice Adults Papers Lara Schenck Georgia Institute of Technology, Dana Priest DataWorks at Georgia Tech, Gabe Dubose Emory University, Zajerria Godfrey Maynard Jackson High School, Annabel Rothschild Georgia Institute of Technology, Benjamin Shapiro Georgia State University, Betsy Disalvo Georgia Institute of Technology | ||
14:22 18mTalk | "I'm not sure, but...": Expert Practices that Enable Effective Code Comprehension in Data Science Papers Christopher Lum UC San Diego, Guoxuan Xu UC San Diego, Sam Lau University of California at San Diego | ||
14:41 18mTalk | Larger than Life In-Class Demonstrations for Introductory Machine Learning Papers |