Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking

Machine learning has been key to significant information technology discoveries in myriad disciplines. However, it has received mixed outlook in the social science field. This study aims to use the methods of learning from real data set on human trafficking, which is a serious social problem of toda...

Full description

Saved in:
Bibliographic Details
Main Author: Caoli, Arsenio Jr. (author)
Published: 2019
Subjects:
Online Access:https://bspace.buid.ac.ae/handle/1234/1579
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1862980614674710529
author Caoli, Arsenio Jr.
author_facet Caoli, Arsenio Jr.
author_role author
dc.creator.none.fl_str_mv Caoli, Arsenio Jr.
dc.date.none.fl_str_mv 2019-09
2020-03-30T11:44:38Z
2020-03-30T11:44:38Z
dc.format.none.fl_str_mv application/vnd.openxmlformats-officedocument.wordprocessingml.document
application/pdf
dc.identifier.none.fl_str_mv 2016128152
https://bspace.buid.ac.ae/handle/1234/1579
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv The British University in Dubai (BUiD)
dc.subject.none.fl_str_mv human trafficking
machine learning
pattern mining
Multiple Imputation using Chained Equations (MICE)
agglomerative hierarchical clustering
dc.title.none.fl_str_mv Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
dc.type.none.fl_str_mv Dissertation
description Machine learning has been key to significant information technology discoveries in myriad disciplines. However, it has received mixed outlook in the social science field. This study aims to use the methods of learning from real data set on human trafficking, which is a serious social problem of today. The Counter-Trafficking Data Collaborative (CTDC) dataset, which is an initiative of the International Organization for Migration (IOM) for human trafficking was used for the experimental study. The exploration of the dataset revealed 61% of missing data — another incentive for the applicability of machine learning via multiple imputation using chained equations (MICE) instead of single imputation or deletion. Agglomerative hierarchical clustering using Gower's Distance was used for pattern discovery of the categorical type of data in this research, with a comparison to Fuzzy k-mode clustering. Results show that MICE had a level of effectiveness in handling missing data, while agglomerative hierarchical clustering was successful in identifying distinct and describable clusters from three time periods that the imputed dataset was segmented.
id budr_8b416ef1e435c4263f74b2e64d910350
identifier_str_mv 2016128152
language_invalid_str_mv en
network_acronym_str budr
network_name_str The British University in Dubai repository
oai_identifier_str oai:bspace.buid.ac.ae:1234/1579
publishDate 2019
publisher.none.fl_str_mv The British University in Dubai (BUiD)
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Machine Learning in the Analysis of Social Problems: The Case of Global Human TraffickingCaoli, Arsenio Jr.human traffickingmachine learningpattern miningMultiple Imputation using Chained Equations (MICE)agglomerative hierarchical clusteringMachine learning has been key to significant information technology discoveries in myriad disciplines. However, it has received mixed outlook in the social science field. This study aims to use the methods of learning from real data set on human trafficking, which is a serious social problem of today. The Counter-Trafficking Data Collaborative (CTDC) dataset, which is an initiative of the International Organization for Migration (IOM) for human trafficking was used for the experimental study. The exploration of the dataset revealed 61% of missing data — another incentive for the applicability of machine learning via multiple imputation using chained equations (MICE) instead of single imputation or deletion. Agglomerative hierarchical clustering using Gower's Distance was used for pattern discovery of the categorical type of data in this research, with a comparison to Fuzzy k-mode clustering. Results show that MICE had a level of effectiveness in handling missing data, while agglomerative hierarchical clustering was successful in identifying distinct and describable clusters from three time periods that the imputed dataset was segmented.The British University in Dubai (BUiD)2020-03-30T11:44:38Z2020-03-30T11:44:38Z2019-09Dissertationapplication/vnd.openxmlformats-officedocument.wordprocessingml.documentapplication/pdf2016128152https://bspace.buid.ac.ae/handle/1234/1579enoai:bspace.buid.ac.ae:1234/15792023-08-12T05:39:02Z
spellingShingle Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
Caoli, Arsenio Jr.
human trafficking
machine learning
pattern mining
Multiple Imputation using Chained Equations (MICE)
agglomerative hierarchical clustering
title Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
title_full Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
title_fullStr Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
title_full_unstemmed Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
title_short Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
title_sort Machine Learning in the Analysis of Social Problems: The Case of Global Human Trafficking
topic human trafficking
machine learning
pattern mining
Multiple Imputation using Chained Equations (MICE)
agglomerative hierarchical clustering
url https://bspace.buid.ac.ae/handle/1234/1579