Solved – Complete substantive examples of reproducible research using R

Tags: r, references, reproducible-research

The Question: Are there any good examples of reproducible research using R that are freely available online?

Ideal Example:
Specifically, ideal examples would provide:

  • The raw data (and ideally metadata explaining the data),
  • All R code including data import, processing, analyses, and output generation,
  • Sweave or some other approach for linking the final output to the final document,
  • All in a format that is easily downloadable and compilable on a reader's computer.

Ideally, the example would be a journal article or a thesis where the emphasis is on an actual applied topic as opposed to a statistical teaching example.

Reasons for interest:
I'm particularly interested in applied topics in journal articles and theses, because several additional issues arise in these situations:

  • Data cleaning and processing,
  • Managing metadata,
  • Style guide expectations that journals and theses often impose on the appearance and formatting of tables and figures,
  • The wide range of analyses that journal articles and theses often contain, which raises issues of workflow (i.e., how to sequence analyses) and processing time (e.g., caching analyses).

Seeing complete working examples could provide good instructional material for researchers starting out with reproducible research.

Best Answer

Frank Harrell has been beating the drum on reproducible research and reports for many, many years. You could start at this wiki page, which lists plenty of other resources, including published research, and also covers Charles Geyer's page.