Library of Congress Web Archive
The Library has been archiving born-digital Web content through its Web Archiving program since 2000. Thousands of sites have been preserved in a variety of event and thematic Web archives, selected by subject specialists. The program is part of a continuing effort by the Library to evaluate, select, collect, catalog, provide access to, and preserve digital materials for researchers today and in the future. This site provides information for researchers who are interested in using the Web Archives, information for site owners whose sites might be included in the archives, and information about the tools, infrastructure, and technical details that enable the Library to carry out this work.
Archived Web pages from 1996 onward, available on the Internet Archive.
The 16 TB collection includes over 311,000 datasets harvested during 2024 and 2025, a complete archive of federal public datasets linked by data.gov. It will be updated daily as new datasets are added to data.gov.