Exploring diversity through machine learning: a case for the use of decision trees in social science research

The literature provides multiple measures of diversity along a single demographic dimension, but when it comes to studying the interaction of multiple diversity types (e.g. age, gender, and race), the field of useable measures diminishes. We present the use of decision trees as a machine learning te...

Full description

Saved in:
Bibliographic Details
Main Author: Srour, F. Jordan (author)
Other Authors: Karkoulian, Silva (author)
Format: article
Published: 2021
Online Access:http://hdl.handle.net/10725/17291
https://doi.org/10.1080/13645579.2021.1933064
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://www.tandfonline.com/doi/full/10.1080/13645579.2021.1933064
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The literature provides multiple measures of diversity along a single demographic dimension, but when it comes to studying the interaction of multiple diversity types (e.g. age, gender, and race), the field of useable measures diminishes. We present the use of decision trees as a machine learning technique to automatically identify the interactions across diversity types to predict different levels of a dependent variable. In order to demonstrate the power of decision trees, we use five types of surface-level diversity (age, gender, education level, religion, and region of origin) measured via the standardized Blau index as independent variables and knowledge sharing as the dependent variable. The results of our decision tree approach relative to linear regression show that decision trees serve as a powerful tool to identify key demographic faultlines without a priori specification of a model structure.