United States Copyright Office Datasets

This dataset contains approximately 22 million copyright registration records and 15 million other records from January 1, 1978, to June 27, 2025. It includes information on authors, types of works registered, publication status, and other relevant copyright information. An overview of the data structure and variable names in the parsed .csv files are available in this ReadMe. More detailed descriptions of the fields, variables, and definitions can be found in the Library of Congress Copyright Data as Distributed in the Marc Format document. Previous versions of this dataset are archived and dated below.

Disclaimer: This data set does not replace or supersede the online public catalog or existing search practices established by the U.S. Copyright Office, and the data set should not be relied on for legal matters. For information on searching copyright records, please refer to How to Investigate the Copyright Status of a Work (Circular 22). For information regarding requests to remove personal information from Copyright Office public records, please refer to Privacy: Public Copyright Registration Records (Circular 18).