Attention to women's rights in NGO press releases, 1996–2018: A curated, coded dataset of organizational attention to women and violence against women
<p dir="ltr">This dataset contains 31,937 press releases from four leading human rights organizations (Human Rights Watch, Amnesty International, Freedom House, ACLU) spanning 1996-2018, with document-level indicators of attention to women's rights and violence against women.<...
Gardado en:
| Autor Principal: | |
|---|---|
| Outros autores: | |
| Publicado: |
2025
|
| Subjects: | |
| Tags: |
Engadir etiqueta
Sen Etiquetas, Sexa o primeiro en etiquetar este rexistro!
|
| Summary: | <p dir="ltr">This dataset contains 31,937 press releases from four leading human rights organizations (Human Rights Watch, Amnesty International, Freedom House, ACLU) spanning 1996-2018, with document-level indicators of attention to women's rights and violence against women.</p><p><br></p><p dir="ltr">Each document is classified with:<br>• Two binary labels: (1) women/girls depicted as victims; (2) women's/gender rights as main topic</p><p dir="ltr">• Single best-fitting UDHR category (18 categories)<br>• Country ISO3 code for geographic identification</p><p dir="ltr"><br>Human validation with 200 documents demonstrates strong intercoder reliability (Cohen's κ = 0.848 for women-as-victims, κ = 0.826 for women's-rights-as-topic, κ = 0.783 for UDHR categories, κ = 1.000 for country identification).</p><p dir="ltr"><br>The dataset enables temporal and cross-national analysis of organizational attention patterns, with straightforward merges to standard country-level datasets (CIRI, PTS, V-Dem) via ISO3 codes.</p><p dir="ltr">Package Contents:</p><p dir="ltr">• Core datasets (classified_docs.csv: 31,937 documents; classified_locs.csv: geographic mapping)<br>• Validation materials (intercoder reliability metrics, validation sample, UDHR codebook)</p><p dir="ltr">• Diagnostic files (unmatched country assignments)<br>• Visualizations of data</p><p dir="ltr">• Comprehensive README with usage examples in R and Python</p><p dir="ltr">All data are derived from publicly available organizational press releases. The classification pipeline combines LLM-based coding with deterministic fallbacks and rigorous human validation, ensuring valid measurements suitable for cross-national comparative research and temporal analysis of advocacy priorities.</p> |
|---|