Hacking and Programming

Daily News and Weekly Tutorials


GitHub finally preserved giant open-source archive in an Arctic vault

GitHub completes sending snapshot of all public repositories taken to an archiving vault in the Arctic at Svalbard, Norway.

According to GitHub Archive Program, GitHub finally completed sending snapshots of active repositories to the Arctic Vault situated in Norway. On February 2, 2020, GitHub took a snapshot of all active public repositories on GitHub platform for archiving. Over the last several months, github partner Piql created a 21TB of repository data to 186 reels of piqlFilm, a type of digital photosensitive archival film. GitHub finally completed the process on 8 July, 2020.

GitHub on their blog said that

“Our original plan was for our team to fly to Norway and personally escort the world’s open source code to the Arctic, but as the world continues to endure a global pandemic, we had to adjust our plans. We stayed in close contact with our partners. We’re happy to report that the code was successfully deposited in the Arctic Code Vault on July 8, 2020.”

Github also created a badge named “Arctic Code Vault Contributor” for the users whose repositories have been successfully stored in vault. This badge can be found under highlights section of users profile.

As said in GitHub Archive Program website, The 2 February, 2020 snapshot archived in the  Arctic Code Vault will sweep up every active public GitHub repository, in addition to significant dormant repositories. The snapshot will include every repository with any commits b/w the announcement at GitHub Universe on November 13, 2019 and 2 February 2020, every repository with at least a star and any commits from the year before the snapshot (02/03/2019 – 02/02/2020), and every repository with at least 250 stars.

The snapshot will consist of the HEAD of the default branch of each repository, minus any binaries larger than 100KB in size—depending on available space, repositories with more stars may retain binaries. Each repository will be packaged as a single TAR file. For greater data density and integrity, most of the data will be stored QR encoded, and compressed. A human-readable index and guide about location of each repository and how to recover data