%%{init: {'theme':'forest'}}%% flowchart LR R(Reproducibility) R --> X(Replication) R --> Y(Preregistration) T(Trustworthy) X --> T Y --> T R --> T
A Talk on How to Do the Same Thing More Than Once
Max Planck Institute for Human Development
same thing in, same thing out
SEXY
SEXY
SEXY
SEXY
SEXY
SEXY
SEXY
SEXY
Accurate, unbiased, complete, and insightful reporting of the analytic treatment of data (be it quantitative or qualitative) must be a component of all research reports.
— Publication manual of the American Psychological Association, 7th ed.
Perhaps.
Perhaps.
But it does not hurt to want reproducibility out of selfish reasons.
Nous connaissons la vérité non seulement par la raison mais encore par le coeur […]
We know the truth, not only by the reason, but also by the heart.
Pascal, B. (1670). Pensées. (Lines 110–282).
1-3. stolen from David Marr 1982/2010 book Vision
Redoing activities try to exclude mistakes in these domains:
or
\[ \mathit{R}^2_{\text{adj.}} = \mathit{R}^2 - (1-R^2)\frac{p}{n - p - 1} \]
\[ C_p = \frac{\sum_{i=1}^n(\hat{y}_i- y_i)^2}{\sigma^2_e}-n-2p \]
\[ \text{df}(\hat{y}) = \frac{\sum^n_{i = 1}{\text{Cov}(\hat{y}_i, y_i)}}{\sigma^2_e} \]
\[ \text{df}(\hat{y}) = \frac{\sum^n_{i = 1}{\text{Cov}(\hat{y}_i, y_i)}}{\sigma^2_e} \]
This covariance requires:
This covariance requires:
\[ \mathit{R}^2_{\text{adj.}} = \mathit{R}^2 - (1 - \mathit{R}^2)\frac{n - df}{df - 1} \]
\[ C_p = \frac{\sum_{i=1}^n(\hat{y}_i - y_i)^2}{\sigma^2_e} - 3n + 2df \]
Require the Hessian around the solution for their “overfit” correction.
Computed via:
Require the Hessian around the solution for their “overfit” correction.
Computed via:
is
on subsamples.
First, computational reproducibility must ensure that the same data lead to the same results.
First, computational reproducibility must ensure that the same data lead to the same results.
Second, computational reproducibility must make the inductive process repeatable on similar data.
%%{init: {'theme':'forest'}}%% flowchart LR R(Reproducibility) R --> X(Replication) R --> Y(Preregistration) T(Trustworthy) X --> T Y --> T R --> T
For reproducibility, it really needs to be reproducible and checkable by a stranger with little time or energy to spare, because even the author will soon enough be that stranger.
Category | Type | Trans / Driven wheels | Fuel / air-con |
---|---|---|---|
E: Economy | F: SUV | A: Auto (drive unspecified) | R: Unspecified Fuel With AC |
I: Intermediate | T: Convertible | B: Auto 4WD | D: Diesel With AC |
S: Standard | C: 2/4 Door | D: Auto AWD | H: Hybrid With AC |
F: Fullsize | D: 4-5 Door | M: Manual (drive unspecified) | E: Electric With AC |
P: Premium | S: Sport | V: Petrol With AC |
Ride it like you stole it
R + RMarkdown + Docker + Make + Git
Julia + RMarkdown + Pkg.jl + GitHub Actions + Git
https://github.com/formal-methods-mpi/pkgmanuscript/blob/main/Dockerfile
Lua + Quarto + GitHub Actions + GitHub Actions + Git
Python + Quarto + Docker + GitHub Actions + Git
https://github.com/formal-methods-mpi/projects/pull/41/files
R + RMarkdown + Docker + Make + Git
%%{init: {'theme':'forest'}}%% flowchart LR R(Reproducibility) R --> C(Collaboration) R --> X(Replication) R --> Y(Preregistration) T(Trustworthy) X --> T Y --> T C --> Cu(Cumulative) T --> Cu C --> T