When a user approaches a newspaper collection, they usually have three principal means of retrieving the information they want: browsing individual issues, jumping to a relevant article by means of an index, or searching the full text of the collection. Considering the material and archival histories of these objects, it...
Our blog posts expand on elements of the Atlas, including selection decisions, lessons learned from our archival partners, and some of the ontological issues we found when working with digitised newspaper metadata.
How Do You Solve A Problem Like OCR?
The complexity of Optical Character Recognition as a problem isn’t always fully understood by users. In 1999, Stephen V. Rice, George Nagy and Thomas A. Nartker offered an illustrated guide to OCR that suggested “A seven year old child can identify characters with far greater accuracy than the leading OCR...
Issues with Editions; Editions with Issues
In the case of newspapers, the digital edition is a copy of a version that is unique among many possible editions. It may have a damaged page or may be missing some material available in another edition, such as advertisements. … Unlike print, digital remediations of historical texts are accessible...