Software Heritage, first steps:
What is it and how to use it?


13 December 2022
Les entrepôts de données: un outil au service des principes FAIR

Joenio Marques da Costa

Research Software Engineer at LISIS
CorTexT platform and Risis Core Facility (RCF) project

softwareheritage.org/2022/11/22/22nd-ambassador-joenio-marques-da-costa

Universal software source code archive



180 million software projects

why?

  • 2015, Google Code and Gitorious.org shutdown
  • 2019, BitBucket announces Mercurial VCS sunset
  • 2020, BitBucket erases 250.000+ repositories
  • 2021, Inria’s old gforge.inria.fr was shut down
  • 2022, GitLab.com considers erasing all projects that are inactive for a year

Software Heritage

=

GitHub, GitLab, BitBucket, Google Code … ?

Version Control with Git

Features

Software Heritage archive

archive.softwareheritage.org

Software Heritage identifiers

Intrinsic identifiers for digital objects.

One type of identifier can’t answer all use cases, we need both intrinsic identifiers and extrinsic identifiers for software research outputs.

softwareheritage.org/2020/07/09/intrinsic-vs-extrinsic-identifiers

What can be identified with a SWHID?

softwareheritage.org/faq/#32_What_can_be_identified_with_a_SWHID

SWHID howtos:

Examples of SWHID use:

Extra references:

CorTexT Manager


docs.cortext.net

Thanks!

joenio@joenio.me


This presentation is available at:

http://joenio.me/software-heritage-olio

export presentation to pdf (require chromium browser)

(source-code: https://gitlab.com/joenio/joenio.gitlab.io)

Licença Creative Commons

Presentation history

Where and when this presentation was done