Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"

<p dir="ltr">20 participants installed and interacted with a thematic analysis coding assistant (TACA), an interactive machine learning desktop application designed to train a classifier on user-defined coded datasets to generate additional coding suggestions. The interviews were con...

Full description

Saved in:
Bibliographic Details
Main Author: Federico Milana (9430286) (author)
Other Authors: Enrico Costanza (6784982) (author), Mirco Musolesi (6769889) (author), Amid Ayobi (6773444) (author)
Published: 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1852023555286368256
author Federico Milana (9430286)
author2 Enrico Costanza (6784982)
Mirco Musolesi (6769889)
Amid Ayobi (6773444)
author2_role author
author
author
author_facet Federico Milana (9430286)
Enrico Costanza (6784982)
Mirco Musolesi (6769889)
Amid Ayobi (6773444)
author_role author
dc.creator.none.fl_str_mv Federico Milana (9430286)
Enrico Costanza (6784982)
Mirco Musolesi (6769889)
Amid Ayobi (6773444)
dc.date.none.fl_str_mv 2025-01-16T14:20:01Z
dc.identifier.none.fl_str_mv 10.5522/04/28182962.v1
dc.relation.none.fl_str_mv https://figshare.com/articles/dataset/Dataset_for_Understanding_Interaction_with_Machine_Learning_through_a_Thematic_Analysis_Coding_Assistant_A_User_Study_/28182962
dc.rights.none.fl_str_mv CC0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Human-computer interaction
human-AI interaction (HAII)
Interactive Machine Learning
user study
human-computer interaction tool
dc.title.none.fl_str_mv Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
dc.type.none.fl_str_mv Dataset
info:eu-repo/semantics/publishedVersion
dataset
description <p dir="ltr">20 participants installed and interacted with a thematic analysis coding assistant (TACA), an interactive machine learning desktop application designed to train a classifier on user-defined coded datasets to generate additional coding suggestions. The interviews were conducted with the participants after they interacted with the tool for 20 minutes, or until no more benefits were perceived. The questions were aimed to understand the experience of the participants with TACA and their perceptions of the ML model.<br><br></p><ul><li>The <b>coded_transcripts.docx</b> file contains the anonymised interview transcripts coded with codes appearing as comments. The document is split into Study 1 (5 participants) and Study 2 (15 participants). The participants in Study 1 imported their own dataset into TACA, while the participants in Study 2 used a set of newspaper restaurant reviews that were given to them by the researchers. Participant IDs follow the structure "S[study number]_P[participant number]", e.g. "S2_P1".<br></li><li>The <b>themes.csv</b> file shows all the codes below each corresponding theme, the result of conducting thematic analysis on the interview transcripts.<br></li><li>The <b>restaurant_reviews.docx</b> file is the collection of 21 restaurant reviews from the newspaper The Guardian (<a href="https://www.theguardian.com/food/restaurants+tone/reviews" target="_blank">Restaurants + Reviews | Food | The Guardian</a>) that was given to 15 of the 20 participants who did not have their own dataset available for the study.<br></li><li>The <b>logs</b> folder contains an anonymised interaction log file for each participant with the interface of TACA named with the corresponding participant ID. The interaction logs for participants S1_P4 and S2_P5 are missing due to an issue in data storage.</li></ul><p dir="ltr"><br></p>
eu_rights_str_mv openAccess
id Manara_a265aa8d6d2cd030d29d0b5d464cba6d
identifier_str_mv 10.5522/04/28182962.v1
network_acronym_str Manara
network_name_str ManaraRepo
oai_identifier_str oai:figshare.com:article/28182962
publishDate 2025
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC0
spelling Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"Federico Milana (9430286)Enrico Costanza (6784982)Mirco Musolesi (6769889)Amid Ayobi (6773444)Human-computer interactionhuman-AI interaction (HAII)Interactive Machine Learninguser studyhuman-computer interaction tool<p dir="ltr">20 participants installed and interacted with a thematic analysis coding assistant (TACA), an interactive machine learning desktop application designed to train a classifier on user-defined coded datasets to generate additional coding suggestions. The interviews were conducted with the participants after they interacted with the tool for 20 minutes, or until no more benefits were perceived. The questions were aimed to understand the experience of the participants with TACA and their perceptions of the ML model.<br><br></p><ul><li>The <b>coded_transcripts.docx</b> file contains the anonymised interview transcripts coded with codes appearing as comments. The document is split into Study 1 (5 participants) and Study 2 (15 participants). The participants in Study 1 imported their own dataset into TACA, while the participants in Study 2 used a set of newspaper restaurant reviews that were given to them by the researchers. Participant IDs follow the structure "S[study number]_P[participant number]", e.g. "S2_P1".<br></li><li>The <b>themes.csv</b> file shows all the codes below each corresponding theme, the result of conducting thematic analysis on the interview transcripts.<br></li><li>The <b>restaurant_reviews.docx</b> file is the collection of 21 restaurant reviews from the newspaper The Guardian (<a href="https://www.theguardian.com/food/restaurants+tone/reviews" target="_blank">Restaurants + Reviews | Food | The Guardian</a>) that was given to 15 of the 20 participants who did not have their own dataset available for the study.<br></li><li>The <b>logs</b> folder contains an anonymised interaction log file for each participant with the interface of TACA named with the corresponding participant ID. The interaction logs for participants S1_P4 and S2_P5 are missing due to an issue in data storage.</li></ul><p dir="ltr"><br></p>2025-01-16T14:20:01ZDatasetinfo:eu-repo/semantics/publishedVersiondataset10.5522/04/28182962.v1https://figshare.com/articles/dataset/Dataset_for_Understanding_Interaction_with_Machine_Learning_through_a_Thematic_Analysis_Coding_Assistant_A_User_Study_/28182962CC0info:eu-repo/semantics/openAccessoai:figshare.com:article/281829622025-01-16T14:20:01Z
spellingShingle Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
Federico Milana (9430286)
Human-computer interaction
human-AI interaction (HAII)
Interactive Machine Learning
user study
human-computer interaction tool
status_str publishedVersion
title Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
title_full Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
title_fullStr Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
title_full_unstemmed Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
title_short Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
title_sort Dataset for "Understanding Interaction with Machine Learning through a Thematic Analysis Coding Assistant: A User Study"
topic Human-computer interaction
human-AI interaction (HAII)
Interactive Machine Learning
user study
human-computer interaction tool