Abstract
The success of data scientists in developing machine learning models is contingent on an iterative development process for detecting patterns in data, finding and extracting useful features, and maximizing their model's performance. However, it is often the case that they struggle during model development and become stuck and unable to make significant progress. We collected qualitative and quantitative data from the workflow of data scientists that allow us to learn from and examine such moments of stuckness. We used this data to develop a model for predicting stuckness based on real-time indicators, such as code artifacts, and then used the model to develop an innovative algorithm that determines precisely when a potential stuckness intervention should occur: as close as possible to the beginning of actual stuckness. Our algorithm's performance indicates the potential efficacy of predicting data scientist stuckness algorithmically under real-world circumstances and for real-world needs.
Original language | English |
---|---|
Title of host publication | Proceedings - 2022 IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC 2022 |
Editors | Paolo Bottoni, Gennaro Costagliola, Michelle Brachman, Mark Minas |
Publisher | IEEE Computer Society |
ISBN (Electronic) | 9781665442145 |
DOIs | |
State | Published - 2022 |
Externally published | Yes |
Event | 2022 IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC 2022 - Rome, Italy Duration: 12 Sep 2022 → 16 Sep 2022 |
Publication series
Name | Proceedings of IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC |
---|---|
Volume | 2022-September |
ISSN (Print) | 1943-6092 |
ISSN (Electronic) | 1943-6106 |
Conference
Conference | 2022 IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC 2022 |
---|---|
Country/Territory | Italy |
City | Rome |
Period | 12/09/22 → 16/09/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE Computer Society. All rights reserved.
Funding
ACKNOWLEDGMENTS This research was funded by JPMorgan Chase & Co. Any views or opinions expressed herein are solely those of the authors listed, and may differ from the views and opinions expressed by JPMorgan Chase & Co. or its affiliates. This material is not a product of the Research Department of J.P. Morgan Securities LLC. This material should not be construed as an individual recommendation for any particular client and is not intended as a recommendation of particular securities, financial instruments or strategies for a particular client. This material does not constitute a solicitation or offer in any jurisdiction.
Funders | Funder number |
---|---|
JPMorgan Chase & Co |