Inside Google Blog Search: “U.S. Copyright Office records. Records from 1978 onward are online but not downloadable in bulk. The Copyright Office hasn’t digitized their earlier records, but Carnegie Mellon scanned them as part of their Universal Library Project, and the tireless folks at Project Gutenberg and the Distributed Proofreaders painstakingly corrected the OCR. Thanks to the efforts of Google software engineer Jarkko Hietaniemi, we’ve gathered the records from both sources, massaged them a bit for easier parsing, and combined them into a single XML file available for download here.”
Sorry, comments are closed for this post.