Objectives
π‘ You understand the importance of well-structured data organization for research.
π‘ You can design logical and intuitive folder structures.
π‘ You can apply file naming best practices and unique identifiers.
π‘ You understand ISO 8601 timestamps and proper sorting methods.
π‘ You can choose appropriate file formats for preservation.
π‘ You can implement effective document versioning strategies.
π‘ You understand ASCII/UTF-8 encoding advantages for text files.
π‘ You can identify and solve common file organization problems.
Exercises
Exercise 1: Project template comparison
Task: Compare the following project templates and discuss advantages and disadvantages of each approach.
Turing Way template
.
βββ LICENSE
βββ README.md
βββ CODE_OF_CONDUCT.md
βββ CONTRIBUTING.md
βββ data
β βββ processed # Final, canonical datasets
β βββ raw # Original, immutable data
βββ docs # Sphinx documentation
βββ models # Trained models, predictions, summaries
βββ notebooks # Jupyter notebooks (numbered)
βββ reports # Generated analysis (HTML, PDF, LaTeX)
β βββ figures # Generated graphics and figures
βββ project_management # Meeting notes, planning resources
βββ src # Source code
βββ data # Scripts to download/generate data
βββ models # Scripts to train models
βββ visualisation # Scripts for visualizationsRepository Structure Template by The Turing Way. Used under the LICENSE CC-BY 4.0. Reused without any modifications.
Heidi Seibold template
.
βββ README.md
βββ analysis # All things data analysis
β βββ src # Functions and source files
βββ comm
β βββ internal_comm # Internal communication, meeting notes
β βββ journal_comm # Communication with journal, peer review
βββ data
β βββ data_clean # Clean version of data
β βββ data_raw # Raw data (don't touch)
βββ dissemination
β βββ manuscripts
β βββ posters
β βββ presentations
βββ documentation # Data management plan, etc.
βββ misc # Miscellaneous filesResearch Project Template by Heidi Seibold. No license use specified. For source code, see here. Reused without modifications.
analysistemplates template
.
βββ 01_Data
β βββ 01_Raw
β βββ 02_Clean
βββ 02_Analysis
β βββ 01_Scripts
β βββ 02_Results
β βββ 03_Figures
β βββ 04_Tables
βββ 03_Manuscript
β βββ 01_Text
β βββ 02_Final_figures
βββ 04_Presentation
βββ 05_Misc
βββ 06_Analysis_for_publication # Optional
βββ README.md
βββ .gitignore # Optional
βββ renv # Optionalanalysis template packages by Jonas Hagenbeck. Used under the MIT License. Reused without any modifications.
WORCS template
| File/Folder | Description | Usage |
|---|---|---|
| _pkgdown.yml | YAML for package website | do not edit |
| DESCRIPTION | R-package DESCRIPTION | do not edit |
| LICENSE.md | Project license | do not edit |
| README.md | Read this file to get started! | do not edit |
| README.Rmd | R-markdown source for readme.md | human editable |
| docs/ | Package website | machine-written |
| paper/ | WORCS paper source files | human editable |
| R/ | R-package source code | human editable |
| vignettes/ | R-package vignettes | human editable |
WORCS project structure by Van Lissa et al. (2021). Used under the GNU General Public License. No changes were made.
Exercise 2: Design your project folder structure
- Create a new directory called
my-research-projectin your home directory. - Design and create a folder structure for your own or a fictional research project e.g., studying βEffects of Social Media on Sleep Patterns in College Studentsβ. Consider:
- Where will you store raw survey data?
- Where will you keep processed analysis results?
- Where will you organize your documentation?
- Where will you store your analysis scripts?
- Create at least 5-6 folders that reflect good organization principles from the slides.
- Navigate through your structure and verify all directories exist.
Exercise 3: File naming practice
- In your projectβs raw data folder, create the following files using good naming practices (remember: no spaces, use proper date formats, include leading zeros):
- A survey data file from January 13, 2024
- A survey data file from April 21, 2024
- A survey data file from December 3, 2025
- Sleep tracking data from participant 007
- Sleep tracking data from participant 023
- Sleep tracking data from participant 156
- List the files and verify they sort in logical order.
- Now create the bad versions of these filenames in a separate
bad-examplesfolder:- Use spaces in names
- Use inconsistent date formats
- Avoid leading zeros
- Use special characters
- Compare how the two folders look when listed.
Exercise 4: README documentation
- Create a
README.mdfile in your main project directory. - Document the following in your README:
- Brief project description
- Explanation of your folder structure
- Your file naming convention rules
- Data collection dates and methods
- Contact information
- Include at least one example of your naming convention with explanation.
Exercise 5: File format decisions
- Create a
documentationfolder in your project. - In this folder, create files representing different types of documentation using appropriate file formats:
- A data dictionary/codebook
- A research protocol document
- A list of participant information
- Analysis notes
- Use the recommended file formats from the slides (
.txt,.csv,.md, etc.). - Add a comment in each file explaining why you chose that format.
Exercise 6: Versioning practice
- Create a file called
analysis-script.Rin your scripts folder. - Add some sample R code (even if basic) to the file.
- Create 3 versions of this file using proper versioning conventions:
- An initial draft version
- A revised version with minor changes
- A major revision with significant updates
- Practice both numbering and date-based versioning approaches.
- Document your versioning system in a
versioning-notes.txtfile.
Challenges & solutions in research project organization
Reflect on your own experience organizing research projects and discuss with the group:
Common challenges:
- What problems have you encountered in your own projects?
- What made it hard to find files or understand the structure later?
- Have you ever lost work or wasted time due to poor organization?
- What mistakes did you or your team make that you would avoid now?
Potential solutions & what helps:
- What strategies or conventions have worked well for you?
- Which tools or systems do you rely on to keep projects organized?
- What would you recommend to a colleague starting a new project?
- What is one change you plan to make to your current workflow?
Slides
To export the slides to PDF, do the following:
- Toggle into Print View using the E key (or using the Navigation Menu).
- Open the in-browser print dialog (CTRL/CMD+P).
- Change the Destination setting to Save as PDF.
- Change the Layout to Landscape.
- Change the Margins to None.
- Enable the Background graphics option.
- Click Save.
Note: This feature has been confirmed to work in Google Chrome, Chromium as well as in Firefox.
These instructions were copied from the Quarto documentation (MIT License) and slightly modified.