• Home
  • About
  • Schedule
  • Sessions
  • Team
  • Resources
  • Mechanics
  • FAQ
  • Code of Conduct
  • Acknowledgements

Contents

  • Overview
  • Slides
  • Abstract
    • DataLad - Decentralized Management of Digital Objects for Open Science
  • What is DataLad?
  • Announcement Email

Introduction to DataLad - Talk by Dr. Adina Wagner

Session 13

Published

January 26, 2024

Add to calendar! Slides Source DataLad DataLad Handbook What is DataLad?

Overview

  • Title: “DataLad - Decentralized Management of Digital Objects for Open Science” (see abstract)
  • Speaker: Dr. Adina Wagner
  • Date: Friday, 26th of January 2024
  • Time: 10:15 to 11:45 am (90 minutes)
  • Place: Hybrid (registration details below)
    • In person at Von-Melle-Park 5, 20146 Hamburg, William Stern Room (Room 4054)
    • Zoom (details will be send after registration)
  • Language: English
  • Context: Part of the course “Version Control of Code & Data”
  • Thanks: Supported by the the Digital & Data Literacy in Teaching Lab
  • Registration: To register, please send an email to the organizer Dr. Lennart Wittkuhn

Slides

  • HTML version of the slides
  • PDF version of the slides

Abstract

DataLad - Decentralized Management of Digital Objects for Open Science

With a general awareness of a reproducibility crisis in many scientific areas and increasing importance of research data management in science and policy making, data-driven fields require convenient and scalable data management solutions. Standing on the shoulders of Git and git-annex (git-annex.branchable.com, Joey Hess), DataLad provides a decentralized solution that enables the joint management of code, data, and complete containerized computational environments in a scalable and distributed fashion. With features such as unambiguous version control, a wide spectrum of data transport mechanisms, convenient provenance capture, and re-execution for verification or as an alternative to storage and transport, it enables and facilitates many aspects of open and reproducible science: collaboration, sharing, analytical transparency, computational reproducibility of digital research objects, and disk-space aware storage and computing workflows on infrastructure that ranges from personal laptops up to supercomputers.

What is DataLad?

Announcement Email

Dear all,

we cordially invite you to a hybrid talk by Dr. Adina Wagner on “DataLad - Decentralized Management of Digital Objects for Open Science”.

  • Date: Friday, 26th of January 2024
  • Time: 10:15 to 11:45 am (90 minutes)
  • Place: Hybrid, in person at Von-Melle-Park 5, 20146 Hamburg, William Stern Room (Room 4054) and via Zoom
  • Registration: To register, please send an email to the organizer Dr. Lennart Wittkuhn. Please indicate whether you would like to join in-person or online.
  • Details: For more information, please visit the event website

The link to the Zoom meeting will be sent to all registered participants a few days before the event.

The talk is part of the course on “Version Control of Code & Data”, generously supported by the Digital & Data Literacy in Teaching Lab program at University of Hamburg.

Abstract

With a general awareness of a reproducibility crisis in many scientific areas and increasing importance of research data management in science and policy making, data-driven fields require convenient and scalable data management solutions. Standing on the shoulders of Git and git-annex (git-annex.branchable.com, Joey Hess), DataLad provides a decentralized solution that enables the joint management of code, data, and complete containerized computational environments in a scalable and distributed fashion. With features such as unambiguous version control, a wide spectrum of data transport mechanisms, convenient provenance capture, and re-execution for verification or as an alternative to storage and transport, it enables and facilitates many aspects of open and reproducible science: collaboration, sharing, analytical transparency, computational reproducibility of digital research objects, and disk-space aware storage and computing workflows on infrastructure that ranges from personal laptops up to supercomputers.

© 2023 – 2024 Dr. Lennart Wittkuhn
  • Funded by the Digital and Data Literacy in Teaching Lab
License: CC BY-SA 4.0