class: center, middle, inverse, title-slide # Reproducible research ## What are we doing and how can we improve? ### Claus Thorn Ekstrøm
UCPH Biostatistics
.small[
ekstrom@sund.ku.dk
] ### May 16th 2017
.small[Slides @
biostatistics.dk/talks/
] --- background-image: url(pics/psycho.png) background-size: 100% auto ??? First slide: Reproducibility is key to science. If science is the best method that we have of figuring out how nature works, if our hypotheses and theories are to have any basis in reality, then the observations upon which those hypotheses and theories are based must be reproducible. Reproducibility helps demonstrates a key difference between science and pseudoscience. 2011 was a year of horror for **social psychologists**. Journal of personality and social psychologists. People can look into the future. Especially extrovert women confronted with erotic pictures. Rejected others - negative findings. **Diederik Stabel** - falsified, students could not see data --- background-image: url(pics/psycho2.png) background-size: 100% auto ??? Replication results from social sciences show that Publication bias. Chance findings. --- background-image: url(pics/lancet.png) background-size: auto 100% class: middle, center --- background-image: url(pics/penkowa.png) background-size: 100% auto class: middle, center # Hansen and Penkowa ??? Why reproducibility is important --- background-image: url(pics/soccer.png) background-size: 100% auto class: middle, center --- # The research process
??? The red line shows where peer review comes in. Total summary. We need to document the steps we did througout as shown in the soccer example. --- # Statistical analysis example > **Statistical analysis** > > .large[All of the data were analyzed with data processing software and figures with Microsoft excel 2007.] > > .pull-right[-- Tayefe *et al*, Advances in Bioresearch, 2014] --- background-image: url(pics/term.png) background-size: 100% auto class: middle, center --- # Terminology **Reproducibility** Given code/data/materials, can I get *the same* (=identical) numbers that you did? **Replicability** Given scientific protocol, can I get *the same* (=in agreement) result that you did in my own study? --- background-image: url(pics/proces.png) background-size: auto 100% --- class: middle, center # Nerd quiz ## 42 --- class: middle, center background-image: url(pics/claus.png) background-size: auto 100% ??? My most annoying collaborator. He has done a lot of previous work that I would like to use. PhD, 36000 lines of code. it's working ... but He does not answer back when I email him! --- class: center, middle # Reproducibility ## Tools Already there ## Desire ? --- background-image: url(pics/public.jpg) background-size: auto 100% # The public may not understand scientists ... --- background-image: url(pics/nejm-parasites.png) background-size: auto 100% ??? NEJM - Research parasites --- background-image: url(pics/election.png) background-size: auto 100% ??? EU parliament and European patent applications --- class: center, middle # Poential problems Incentives for individual success are focused on getting it **published**, not getting it **right**. Publications favour novel results and positive results No funding "Research parasites" --- class: center, middle # Replicability ## Tools ? ## Desire Great! --- class: inverse # Workshop - part I * Groups of 2-3 people. * Spend 3 minutes each explaining who you are/what you do * Goal 1: we want to hear **good** stories about what we actually do * Goal 2: where do you see **potential problems**, where we could do better.
--- # Tools for reproducibility <img src="repro_files/figure-html/unnamed-chunk-3-1.png" height="10%" />
R - markdown gitHub Docker Jupyter Notebooks OSF Dropbox, Google Drive, Amazon web services, evernote, --- class: center # R markdown <img src="pics/logo-R.png" width="60%" /> --- class: center # R GitHub <img src="pics/logo-gh.png" width="60%" /> --- class: center # Jupyter Notebooks <img src="pics/logo-jupyter.svg" width="50%" /> --- class: center # Open Science Framework <img src="pics/logo-osf.png" width="50%" /> --- class: middle # Teaching What are we doing - through supervision and courses - for * Ph.D.s and post docs? * Bachelor and master students? --- # Undergraduate courses <img src="pics/kursusoversigt.png" width="100%" /> --- # Ph.D. <img src="pics/rcr1.png" width="100%" /> --- # Ph.D. <img src="pics/rcr2.png" width="100%" /> --- # RCR RCR – A Danish textbook for courses in Responsible Conduct of Research .center[http://ifro.ku.dk/rcr.pdf] (See Chapter 5, in particular p. 54, Box 1, p. 63) --- class: inverse # Workshop - part II * Groups of 2-3 people. * Spend 2 minutes each explaining your role (student, teacher, courses, ...) and what you are involved in * Goal: Which curricula, courses, training programs should we focus on? * Goal 2: What should/could we change? Small changes are better than no changes! --- background-image: url(pics/teeth.jpg) background-size: 100% ??? Reprodicible research is like brushing you teeth: slightly annoying, takes some time, but it's good for you and we just need to get into the habit