Human Rights Violations and Machine Learning - Cluster Analysis of Countries using the CIRIGHTS Dataset

Ståhle, Emilie; Haubrich, Emilie

Human Rights Violations and Machine Learning - Cluster Analysis of Countries using the CIRIGHTS Dataset

Mark

Ståhle, Emilie ^LU and Haubrich, Emilie (2023) DABN01 20231
Department of Statistics
Department of Economics

Abstract: This master's thesis explores the use of unsupervised machine learning techniques to cluster countries based on their degree of human rights violations. Accordingly, the study evaluates the performance of two clustering methods, K-Means clustering and Latent Class Analysis (LCA), using two cluster validation metrics (Silhouette Coefficient and Dunn Index), as well as an Accuracy measure using the Human Rights index. It analyses the characteristics of clusters and the assignments thereof over four decades to provide compact insights for policymakers. The results, in turn, show that both clustering methods perform equally well, however, LCA is chosen for the bulk of the analysis out of respect for the categorical nature of the data.... (More); This master's thesis explores the use of unsupervised machine learning techniques to cluster countries based on their degree of human rights violations. Accordingly, the study evaluates the performance of two clustering methods, K-Means clustering and Latent Class Analysis (LCA), using two cluster validation metrics (Silhouette Coefficient and Dunn Index), as well as an Accuracy measure using the Human Rights index. It analyses the characteristics of clusters and the assignments thereof over four decades to provide compact insights for policymakers. The results, in turn, show that both clustering methods perform equally well, however, LCA is chosen for the bulk of the analysis out of respect for the categorical nature of the data. Consequently, cluster profiling identifies three clusters with varying levels of human rights scores, although, looking at each variable and decade individually, we see that they do not all follow the same order of magnitude that the overall cluster scores suggest. Furthermore, the probability transition matrix shows that, generally, countries do not change significantly over time, in terms of their level of respect for human rights. Finally, policy advice for stable countries involves using cluster 1 as a “gold standard”, incentivizing cluster 2, and taking a proactive approach for cluster 3. In turn, for unstable countries, advice includes incentivizing further improvements for countries that have shown positive progress, understanding reasons for decline, and stabilising and monitoring closely those that have shown fluctuating tendencies. The paper concludes that unsupervised machine learning for detecting human rights violations is useful, efficient, and provides insights into patterns that are not immediately apparent. Furthermore, it is a useful instrument to summarise these patterns in a clear and interpretable way. (Less)

Please use this url to cite or link to this publication: http://lup.lub.lu.se/student-papers/record/9120322

author

Ståhle, Emilie ^LU and Haubrich, Emilie

supervisor

Simon Reese

organization

course

DABN01 20231

year

2023

type

H1 - Master's Degree (One Year)

subject

Business and Economics

keywords

Human Rights Violation, Clustering, K-Means, Latent Class Analysis, Transition Matrix

language

English

id

9120322

date added to LUP

2023-11-21 12:55:08

date last changed

2023-11-21 12:55:08

@misc{9120322,
  abstract     = {{This master's thesis explores the use of unsupervised machine learning techniques to cluster countries based on their degree of human rights violations. Accordingly, the study evaluates the performance of two clustering methods, K-Means clustering and Latent Class Analysis (LCA), using two cluster validation metrics (Silhouette Coefficient and Dunn Index), as well as an Accuracy measure using the Human Rights index. It analyses the characteristics of clusters and the assignments thereof over four decades to provide compact insights for policymakers. The results, in turn, show that both clustering methods perform equally well, however, LCA is chosen for the bulk of the analysis out of respect for the categorical nature of the data. Consequently, cluster profiling identifies three clusters with varying levels of human rights scores, although, looking at each variable and decade individually, we see that they do not all follow the same order of magnitude that the overall cluster scores suggest. Furthermore, the probability transition matrix shows that, generally, countries do not change significantly over time, in terms of their level of respect for human rights. Finally, policy advice for stable countries involves using cluster 1 as a “gold standard”, incentivizing cluster 2, and taking a proactive approach for cluster 3. In turn, for unstable countries, advice includes incentivizing further improvements for countries that have shown positive progress, understanding reasons for decline, and stabilising and monitoring closely those that have shown fluctuating tendencies. The paper concludes that unsupervised machine learning for detecting human rights violations is useful, efficient, and provides insights into patterns that are not immediately apparent. Furthermore, it is a useful instrument to summarise these patterns in a clear and interpretable way.}},
  author       = {{Ståhle, Emilie and Haubrich, Emilie}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Human Rights Violations and Machine Learning - Cluster Analysis of Countries using the CIRIGHTS Dataset}},
  year         = {{2023}},
}

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Human Rights Violations and Machine Learning - Cluster Analysis of Countries using the CIRIGHTS Dataset