Data Infrastructure

Research Data Management for Psychology and Neuroscience
Course at University of Hamburg, RTG 2753: Emotional Learning and Memory
License: CC BY 4.0

Schedule

Day | Date       | Time          | Title
2   | 2026-02-06 | 09:30 - 10:00 | Introduction to Version Control
2   | 2026-02-06 | 10:00 - 12:00 | Version Control of Data with DataLad
2   | 2026-02-06 | 12:00 - 13:00 | Lunch Break
2   | 2026-02-06 | 13:00 - 14:00 | Data publication
2   | 2026-02-06 | 14:00 - 16:00 | Integrating with data infrastructure at UHH (and beyond)
2   | 2026-02-06 | 16:00 - 16:30 | Summary & Outlook

1 This session: Data Infrastructure

Objectives

💡 You know about the different data infrastructure services available at the University of Hamburg.
💡 You can distinguish between local storage, cloud storage, and object storage solutions.
💡 You know how to access and use UHH Disk, UHH Cloud, and Object Storage services.
💡 You understand the role of version control systems like GitLab for research data management.
💡 You are familiar with the UHH Research Data Repository and its features for long-term data preservation.

Overview of UHH Data Services

The University of Hamburg provides various data infrastructure services to support research:

  • UHH Disk: Network storage for daily research work
  • UHH Cloud: Collaborative cloud platform
  • Object Storage: Large-scale, automated data storage
  • GitLab: Version control and code collaboration
  • Research Data Repository: Long-term data preservation and publication

💡 Key principle: Different services for different stages of the research data lifecycle

2 Data Infrastructure at UHH

UHH Disk

What is it?

  • Network-attached storage service
  • Accessible from UHH network and VPN
  • Designed for active research data

Key Features:

  • High-performance access
  • Regular backups
  • Integration with UHH IT infrastructure
  • Suitable for daily research workflows

Best for:

  • Active research projects
  • Frequent data access
  • Integration with campus IT
  • Personal research storage

Storage Capacity:

  • Staff: Maximum 500 GB per personal directory
  • Students: Maximum 100 GB per personal directory
  • Project spaces: Available on request via ServicePortal

Access Methods:

  • Within UHH network: \\uhhdisk.uni-hamburg.de\home
  • RRZ-managed devices: Drive letter K:
  • Remote access: VPN required
  • Protocols: SMB/CIFS, NFS (for servers)
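
The quotas above are easier to respect if you check your usage from time to time. A minimal sketch, assuming the share is already mounted (for example as drive K:) and that the quota is counted in decimal gigabytes; both function names are made up for this illustration and are not part of any UHH tool:

```python
import os

STAFF_QUOTA_GB = 500      # from the slide: staff personal directory limit
STUDENT_QUOTA_GB = 100    # from the slide: student personal directory limit
BYTES_PER_GB = 10**9      # assumption: quota is counted in decimal gigabytes


def directory_size_bytes(root):
    """Sum the sizes of all regular files below root."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):  # skip symlinks to avoid double counting
                total += os.path.getsize(path)
    return total


def quota_used_fraction(used_bytes, quota_gb):
    """Return the used share of the quota (1.0 means the quota is full)."""
    return used_bytes / (quota_gb * BYTES_PER_GB)
```

On an RRZ-managed Windows device, a call such as `quota_used_fraction(directory_size_bytes("K:\\"), STAFF_QUOTA_GB)` would report how full a staff directory is. Walking a large network share can take a while.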

🔗 More information

UHH Cloud

What is it?

  • Cloud-based collaboration platform (using Nextcloud)
  • File sharing and synchronization
  • Web-based access from anywhere

Key Features:

  • Cross-platform synchronization
  • Collaborative workspaces
  • External sharing capabilities
  • Mobile and desktop applications

Best for:

  • Team collaboration
  • Remote access
  • File sharing
  • Cross-device synchronization

Storage Capacity:

  • Quota limits are not publicly documented
  • Request additional storage via RRZ if needed

Access Methods:

  • Web interface: Available via UHH login
  • Desktop clients: Available for Windows, macOS, Linux
  • Mobile apps: Available for iOS and Android
  • Sync & Share: Cross-device synchronization
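
Because UHH Cloud is built on Nextcloud, files can in principle also be reached over WebDAV, which is what scripts and tools such as DataLad typically use. The sketch below only builds the URL. The path layout `remote.php/dav/files/<user>/` is the standard Nextcloud one; the host name is a placeholder, and whether WebDAV access is enabled for your account is an assumption to check with RRZ:

```python
from urllib.parse import quote

# Hypothetical host name; replace it with the address of the UHH Cloud instance.
CLOUD_HOST = "cloud.example.org"


def webdav_url(username, remote_path, host=CLOUD_HOST):
    """Build the WebDAV URL of a file in a Nextcloud user's storage."""
    # Percent-encode the path but keep the slashes between its segments.
    encoded_path = quote(remote_path.strip("/"), safe="/")
    return f"https://{host}/remote.php/dav/files/{quote(username, safe='')}/{encoded_path}"
```

For example, `webdav_url("bax1234", "project/raw data/sub-01.tsv")` yields a URL in which the space is encoded as `%20`; the user name here is invented.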

🔗 More information

Object Storage

What is it?

  • Large-scale storage system
  • API-based access
  • Designed for automated workflows

Key Features:

  • Virtually unlimited capacity
  • Programmatic access (S3-compatible)
  • Cost-effective for large datasets
  • Integration with computational workflows

Best for:

  • Large datasets
  • Automated processing
  • Long-term storage
  • Computational workflows

Access Methods:

  • S3 API: Compatible with Amazon S3 protocols
  • REST API: HTTP-based access
  • SDK support: Python boto3, AWS CLI, etc.
  • Integration: CMS, research data management systems
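
To make "S3-compatible" concrete, here is a hedged sketch of an upload with boto3. The endpoint URL, bucket name, and credentials are issued by the service and are not shown on this slide, so they appear only as parameters. The key layout in `object_key` is a hypothetical naming scheme, not a UHH convention:

```python
def object_key(project, subject, filename):
    """Compose a key such as 'project/sub-03/file' (hypothetical layout)."""
    parts = [project.strip("/"), f"sub-{subject:02d}", filename]
    return "/".join(parts)


def upload_file(local_path, bucket, key, endpoint_url, access_key, secret_key):
    """Upload one file to an S3-compatible endpoint using boto3."""
    import boto3  # third-party; imported here so object_key works without it

    client = boto3.client(
        "s3",
        endpoint_url=endpoint_url,           # assumption: provided by RRZ
        aws_access_key_id=access_key,
        aws_secret_access_key=secret_key,
    )
    client.upload_file(local_path, bucket, key)
```

Object storage has no real folders: the slashes in a key are only a naming convention, which is why a consistent key scheme matters for automated workflows.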

Use Cases:

  • Backend for file services and archival solutions
  • Multimedia file delivery
  • Measurement data and simulation results
  • Photo, video, and digitized book collections

🔗 More information

GitLab at UHH

What is it?

  • Git-based version control system
  • Code and document collaboration
  • Integrated CI/CD pipelines

Key Features:

  • Version tracking for code and documents
  • Issue tracking and project management
  • Collaborative development workflows
  • Integration with other UHH services

Best for:

  • Source code management
  • Collaborative writing
  • Project documentation
  • Reproducible research workflows

Two Instances Available:

  • Research & Teaching: https://gitlab.rrz.uni-hamburg.de
    • GitLab Ultimate features via GitLab Education
  • Other purposes: https://gitlab-ce.rrz.uni-hamburg.de
    • GitLab Community Edition (open-source)

Access Methods:

  • Web interface: Browser-based project management
  • Git clients: Command line, desktop apps
  • HTTPS: Encrypted data transfer
  • SSH: Available within UHH network
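
The HTTPS and SSH access methods differ only in the clone URL you hand to Git. A small sketch of both forms, assuming the instance follows the usual GitLab URL layout with the SSH user `git`; the project path in the usage note is invented:

```python
GITLAB_HOST = "gitlab.rrz.uni-hamburg.de"  # Research & Teaching instance from the slide


def clone_urls(project_path, host=GITLAB_HOST):
    """Return the HTTPS and SSH clone URLs for a 'group/project' path."""
    path = project_path.strip("/")
    if path.endswith(".git"):
        path = path[: -len(".git")]
    return {
        "https": f"https://{host}/{path}.git",
        "ssh": f"git@{host}:{path}.git",  # assumption: default GitLab SSH user
    }
```

`clone_urls("my-group/my-analysis")["https"]` can be passed to `git clone` from anywhere, while the SSH variant works only from within the UHH network (or over VPN), as noted above.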

🔗 More information

UHH Research Data Repository

Purpose:

  • Long-term preservation of research data
  • Publication and citation of datasets
  • Compliance with funder requirements
  • Support for open science practices

Managed by the Centre for Sustainable Research Data Management at the University of Hamburg

🔗 More information

Storage Capacity:

  • Standard upload: Up to 50 GB via web interface
  • Larger datasets: Available upon request
  • File formats: Any digital format supported
  • Retention: Minimum 10 years (renewable)

Access Methods:

  • URL: www.fdr.uni-hamburg.de
  • Login: B-Kennung (UHH username and password)
  • Upload: Drag & drop or “Choose Files” interface
  • API: Available for automated uploads

Access Control Options:

  • Open Access: Public access with license selection
  • Embargoed Access: Delayed public access with end date
  • Restricted Access: Conditional access with approval
  • Closed Access: Metadata only, no file access

Key Features of RDR

Versioning System:

  • Concept-DOI: Cites entire entry across all versions
  • Version-DOI: Cites specific version of the data
  • Enables proper citation of evolving datasets

Data Storage:

  • Stored on RRZ long-term storage (S3 storage)
  • Daily backups with multiple copies
  • Separate geographic locations for redundancy
  • Minimum 10-year retention period
  • MD5 checksums for file integrity verification
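
You can run the same kind of integrity check on your side, for example after downloading a file from the repository. A minimal sketch using Python's standard library; where the repository displays the expected checksum is not covered here, so `expected_md5` is simply whatever value you copied from the record:

```python
import hashlib


def md5_checksum(path, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file without loading it into memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file(path, expected_md5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_checksum(path) == expected_md5.strip().lower()
```

MD5 is adequate for detecting accidental corruption during transfer or storage. It is not a safeguard against deliberate tampering.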

Metadata Licensing:

  • Metadata licensed under CC0 (Creative Commons Zero)
  • Email addresses remain private
  • Supports data discovery and reuse

Copyright Requirements:

  • You must be the author or have usage rights
  • Co-authors must agree on publication decisions
  • No violation of copyright, privacy, or confidentiality laws

Rights Retention:

  • No rights transferred to University of Hamburg
  • Original copyright remains with authors
  • Usage governed by license specified during upload

3 Choosing the Right Service

Decision Matrix

Use Case            | Recommended Service      | Storage Capacity                | Why?
Daily research work | UHH Disk                 | Staff: 500 GB, students: 100 GB | High-performance, integrated access
Team collaboration  | UHH Cloud                | Not publicly specified          | Sharing, synchronization, remote access
Large datasets      | Object Storage           | Virtually unlimited             | Cost-effective, S3-compatible
Code development    | GitLab                   | Project-based                   | Version control, collaboration
Data publication    | Research Data Repository | 50 GB standard, more on request | Long-term preservation, citation

Data Lifecycle Integration

Collection & Processing → UHH Disk / Object Storage

Analysis & Development → GitLab + UHH Disk

Collaboration → UHH Cloud

Publication & Archiving → Research Data Repository

💡 Best practice: Use multiple services throughout your research project lifecycle

4 Exercises

Exercise 1

Task: Compare the different storage solutions offered by the University of Hamburg.

  1. Visit the UHH Disk, UHH Cloud, Object Storage and Research Data Repository service pages

  2. Discuss which solution would be most appropriate for different research scenarios:

    • Daily research work with frequent file access
    • Long-term archival storage
    • Collaborative projects with external partners
    • Large datasets requiring automated processing

Exercise 2

Task: Familiarize yourself with the UHH Research Data Repository features.

  1. Explore the repository features mentioned in the documentation:
    • Versioning system (Concept-DOI vs. Version-DOI)
    • Citation export options
    • Metadata licensing
  2. Discuss the importance of:
    • Long-term preservation (10+ years)
    • Multiple backup locations
    • Copyright and licensing considerations
  3. Consider a research project from your field and determine:
    • What type of data would you publish?
    • What metadata would be essential?
    • Which license would be appropriate?

Exercise 3

Task: Integrate DataLad with UHH Cloud.

Relevant Links:

5 Resources

University of Hamburg Data Services

Research Data Management

6 Questions & Discussion

❓ Any questions about UHH data infrastructure services?

💬 Discussion: How might these services fit into your research workflow?