I'm leading this project under the mentorship of the Data Science group director.
Semi-structured interviews, brand design, contextual enquiry, market research, competitive analysis, literature reviews, wireframing, usability testing and development.
Wireframe, Research Findings, Long term and Short term goals
June 2022 - May 2023
By integrating automation in the workflow of a data scientist in a controlled manner, Plato aims to assist and accelerate their knowledge discovery process, while making sure the human remains in the drivers seat.
Keeping in mind the curiousity and creativity that is necessary for this exploration, the focus of Plato is to steer away from complete automation and look to augment human capabilities through data visualization and pattern discovery.
Despite the increasing trend in automating different parts of the DS lifecyle, and increasing usage patterns of autoML tools such as VertexAI and AzureML, data exploration continues to be a painstakingly time consuming and manual process. Due to the amount of creativity required in this phase, they must be carefully designed to place the control in the hands of the human. At what points can automation be introduced to augment human creativity and curiousity?
My research goals of Round 1 were to understand how users would explore an unfamiliar tabular dataset for a classification task and their experience with existing autoML tools. To explore this, I conducted 45 minute semi-structured with 6 data scientists within a research team.
Some users relied heavily on modeling to understand the data, and were looking for tools that would give them a stronger grasp.
Users talked about feeling lost while studying large datasets, not knowing what to do with the data, and the danger of overlooking patterns.
Users talked about existing autoML tools being a “no-brainer” and “blackboxes”.
My research goals for the usability tests were to understand how quickly users were able to navigate through the dataset at different levels of detail. I also looked to understand how well the system supported their data exploration workflow.
I conducted 45 minute task-based usability tests with 5 data scientists.
2 users thought the system architecture provided a simple entrypoint and helped them where to look. All users found the generated insights and reports useful to their analysis.
4/5 users had doubts whenever any qualitative terms were used to describe the data. They found it easier trusting the graphs and numbers.
4/5 users wanted to start with visualizations of characteristics more descriptive of columns before understanding quality
2/5 users wanted to visualize relationships at the column level while another user wanted to visualize the schema in a more efficient manner.
I'm currently working on acquiring licenses to share some of my designs and test results. Please check back sometime at the end of March for a more comprehensive case study.