About archiving content and data on GitHub
You can archive content and data on GitHub for other people to view and reference.
GitHub intends to keep your public repositories available unless you remove them. In some cases, we may make public content unavailable, for example if:
- We receive a DMCA Takedown Notice for content in a repository.
- We determine that a repository's content violates our Community Guidelines or Terms of Service.
Academics and researchers can reference this information in data management plans.
By default, all public repositories are included in the GitHub Archive Program, a partnership between GitHub and organizations such as Software Heritage Foundation and Internet Archive to ensure the long-term preservation of the world's open source software.
The GitHub Archive Program enables third-party partners to archive public repositories using the public API. These partners archive different types of data at varying frequencies and make the data available to the public. The GitHub Archive Program also protects the data by storing multiple copies, on an ongoing basis, across various data formats and locations, including a very-long-term archive designed to last at least 1,000 years. For more information, see GitHub Archive Program.
Responsible use of archives includes respecting users' privacy. For more information, see "Public information on GitHub."
You can opt out of the GitHub Archive Program for your repository. For more information, see "Opting into or out of the GitHub Archive Program for your public repository."
Libraries and researchers may require legal protections to create archives of publicly available content. If you want third parties to consider your work on GitHub for archiving, you can add an open source license to your projects. An open source license gives contributors explicit permissions to copy and distribute the material in your repositories.