NCSE v2.0: A Dataset of OCR-Processed 19th Century English Newspapers

NCSE v2.0 Dataset Repository<p dir="ltr">This repository contains the NCSE v2.0 dataset and associated supporting data used in the paper "Reading the unreadable: Creating a dataset of 19th century English newspapers using image-to-text language models".</p><h2>D...

Full description

Saved in:
Bibliographic Details
Main Author: Jonno Bourne (6498233) (author)
Published: 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!