Attention to women's rights in NGO press releases, 1996–2018: A curated, coded dataset of organizational attention to women and violence against women

<p dir="ltr">This dataset contains 31,937 press releases from four leading human rights organizations (Human Rights Watch, Amnesty International, Freedom House, ACLU) spanning 1996-2018, with document-level indicators of attention to women's rights and violence against women.<...

Descrición completa

Gardado en:
Detalles Bibliográficos
Autor Principal: Margarita Solntseva (22563608) (author)
Outros autores: Charles Crabtree (22496435) (author)
Publicado: 2025
Subjects:
Tags: Engadir etiqueta
Sen Etiquetas, Sexa o primeiro en etiquetar este rexistro!
Descripción
Summary:<p dir="ltr">This dataset contains 31,937 press releases from four leading human rights organizations (Human Rights Watch, Amnesty International, Freedom House, ACLU) spanning 1996-2018, with document-level indicators of attention to women's rights and violence against women.</p><p><br></p><p dir="ltr">Each document is classified with:<br>• Two binary labels: (1) women/girls depicted as victims; (2) women's/gender rights as main topic</p><p dir="ltr">• Single best-fitting UDHR category (18 categories)<br>• Country ISO3 code for geographic identification</p><p dir="ltr"><br>Human validation with 200 documents demonstrates strong intercoder reliability (Cohen's κ = 0.848 for women-as-victims, κ = 0.826 for women's-rights-as-topic, κ = 0.783 for UDHR categories, κ = 1.000 for country identification).</p><p dir="ltr"><br>The dataset enables temporal and cross-national analysis of organizational attention patterns, with straightforward merges to standard country-level datasets (CIRI, PTS, V-Dem) via ISO3 codes.</p><p dir="ltr">Package Contents:</p><p dir="ltr">• Core datasets (classified_docs.csv: 31,937 documents; classified_locs.csv: geographic mapping)<br>• Validation materials (intercoder reliability metrics, validation sample, UDHR codebook)</p><p dir="ltr">• Diagnostic files (unmatched country assignments)<br>• Visualizations of data</p><p dir="ltr">• Comprehensive README with usage examples in R and Python</p><p dir="ltr">All data are derived from publicly available organizational press releases. The classification pipeline combines LLM-based coding with deterministic fallbacks and rigorous human validation, ensuring valid measurements suitable for cross-national comparative research and temporal analysis of advocacy priorities.</p>