The Repro Book

A Guide to Reproducible Research

Figure 1: This illustration is created by Scriberia with The Turing Way community. Used under a CC-BY 4.0 licence. DOI: 10.5281/zenodo.3695300 (Version 3, direct download link).

Preface

Welcome

Welcome to the world of reproducibility and “The Repro Book - A Guide to Reproducible Research”! The purpose of this book is to empower scientists, researchers, and students with the knowledge and skills needed to conduct a research project that is easy to reproduce.

Who is this book for?

This guide is meant to be a gentle introduction to reproducibility for (aspiring) scientists who want to conduct research more reproducible. Whether you are conducting experiments, writing code, collaborating with scientific peers, or managing complex data sets, this book conveys useful concepts with hands-on tips and exercises to enhance the reproducibility of your research project. While this book was developed with scientists in mind, it is open to anyone who wants to learn more about reproducibility. We try to avoid technical jargon as much as we can. When we discuss best practices about reproducible guidelines, we try to offer multiple alternatives but also give opinionated recommendations as guidance to new users.

What is the purpose of this book?

Reproducibility can take your scientific projects to the next level. By enhancing reproducibility, you will strengthen your results and evidence from your analysis (The Turing Way Community 2022). Of course, conducting research projects considering reproducibility also adds some layer of complexity to your workflow. We aim to simplify this workflow as much as possible for you. Whether you are new to reproducibility or have dabbled in it before, this book aims to add something for all levels of expertise.

How to use this book?

We believe in learning by doing and try to focus on implementation as much as possible. The concepts introduced in this book are accompanied by practical examples and hands on exercises. Feel free to follow the exercises to gain the necessary “muscle memory” to start enhancing the reproducibility of your work step-by-step. Further, we recommend to work with a current research projects of yourself to apply different steps making your research more reproducible. However, we also provide an example project, which you can download here.

This book was initially created for the full-semester course “Open Science in Practice: A Toolkit for Research” at University of Hamburg, Germany. The book is therefore structured for a course that is spread out across multiple sessions. Each chapter and the accompanying exercises should roughly fill 90 minutes of class time during a course.

This book is available in HTML format and will soon be available as PDF version. However, we recommend using the HTML version for the most up-to-date content. The HTML version in continuously updated with new material, corrections, and interactive elements, ensuring you always have access to the latest information. The PDF version will be a static snapshot and may become outdated over time, making it useful for offline reading and archival but less reliable for the most current information.

How can I contribute?

This book is constantly evolving and meant as a living resource, and your input can make it even better! If you spot typos, have suggestions for improvement, or want to contribute new content, we welcome your involvement! If you find a typo, have an idea for a new chapter or want to see a specific topic covered you are more than welcome to open an issue or submit a pull request in the GitHub repository of this book.