Knowledge and attitudes among life scientists towards reproducibility within journal articles: a research survey

Evanthia Kaimaklioti Samota; Robert P. Davey

doi:10.1101/581033

Abstract

We constructed a survey to understand how authors and scientists view the issues around reproducibility, focusing on interactive elements such as interactive figures embedded within online publications, as a solution for enabling the reproducibility of experiments. We report the views of 251 researchers, comprising authors who have published in eLIFE Sciences, and those who work at the Norwich Biosciences Institutes (NBI). The survey also outlines to what extent researchers are occupied with reproducing experiments themselves. Currently, there is an increasing range of tools that attempt to address the production of reproducible research by making code, data, and analyses available to the community for reuse. We wanted to collect information about attitudes around the consumer end of the spectrum, where life scientists interact with research outputs to interpret scientific results. Static plots and figures within articles are a central part of this interpretation, and therefore we asked respondents to consider various features for an interactive figure within a research article that would allow them to better understand and reproduce a published analysis. The majority (91%) of respondents reported that when authors describe their research methodology (methods and analyses) in detail, published research can become more reproducible. The respondents believe that having interactive figures in published papers is a beneficial element to themselves, the papers they read as well as to their readers. Whilst interactive figures are one potential solution for consuming the results of research more effectively to enable reproducibility, we also review the equally pressing technical and cultural demands on researchers that need to be addressed to achieve greater success in reproducibility in the life sciences.

Introduction

Reproducibility is a defining principle of scientific research, and broadly refers to the ability of researchers, other than the original researchers, to achieve the same findings using the same data and analysis [1]. However, irreproducible experiments are common across all disciplines of life sciences [2]. A recent study showed that 88% of drug-discovery experiments could not be reproduced even by the original authors, in some cases forcing retraction of the original work [2]. Irreproducible genetic experiments with weak or wrong evidence can have negative implications for our healthcare [3]. For example, 27% of mutations linked to childhood genetic diseases cited in literature have later been discovered to be common polymorphisms or misannotations [4]. While irreproducibility is not confined to biology and medical sciences [5], irreproducible biomedical experiments pose a strong financial burden on society; an estimated $28 billion was spent on irreproducible biomedical science in 2015 in the USA alone [6].

Reproducibility should inevitably lead to robust science, relating to the way in which conclusions rely on specific analyses or procedures undertaken on experimental systems. Unfortunately, the community has yet to reach consensus on how we traverse the space of re-use, re-analysis and re-interpretation of scientific research to try to define suitable overarching definitions for reproducibility. Thus, there are different definitions of reproducibility used in the literature [7,8], some of which contradict one another. A recent exhaustive review has also documented this problem [9], so our survey and results do need to be contextualised somewhat by this lack of consensus. The terms repeatability, replicability and reproducibility are also occasionally confused [10,11], therefore it is important to differentiate these terms from each other.

Repeatability The original researchers using the same data, running precisely the same analysis and getting the same results, on multiple runs [8].
Replicability Different teams performing different experimental setups and using independent data, achieving the same result as the original researchers, on multiple trials [11-13].
Reproducibility Different teams re-running the same analysis with the same data and getting the same result [1,11-13].

It is argued that in many science disciplines replicability is more desirable than reproducibility because a result needs to be corroborated independently before it can be generally accepted by the scientific community [12,14]. However, reproducibility can serve as a cost-effective way of verifying results prior to replicating results [14].

Computational reproducibility, or reproducible computational research, refers to the reproducibility of computational experiments, where an independent team can produce the same result utilising the data and computational methods (code and workflow) provided by the original authors [9,13,15-17]. Computational reproducibility is influenced by both technical and cultural (social) factors [13,17,18]. Technical challenges to computational reproducibility include poorly written, incorrect, or unmaintained software, changes in software libraries on which tools are dependent, or incompatibility between older software and newer operating systems [19]. Cultural factors that challenge computational reproducibility include the attitudes and behaviours of authors when performing and reporting research. Examples include authors not providing sufficient descriptions of methods and being reluctant to publish original data and code under FAIR (Findable, Accessible, Interoperable, and Reusable) principles[13,20,21]. Other cultural factors include favouring of high prestige or high impact science publications over performing rigorous and reproducible science (which tends to be improved by open access policies) [22,23]. We refer to the cultural factors affecting computational reproducibility as the culture of reproducibility [12].

Several projects have attempted to address some of the technical aspects of reproducibility by making it easier for authors to disseminate fully reproducible workflows and data, and for readers to perform computations. For example: F1000 Living Figure [24] and re-executable publications [25-28] using Plotly (plot.ly) and Code Ocean widgets (codeocean.com); Whole Tale Project [29]; ReproZip project [30]; Python-compatible tools and widgets (interactive widgets for Jupyter Notebooks with Binder); Zenodo (zenodo.org) and FigShare (figshare.com) as examples of open access repositories for scientific content (including datasets, code, figures, reports); Galaxy [31]; CyVerse (formerly iPlant Collaborative) [32]; ^myExperiment [33]; UTOPIA [34,35]; GigaDB [36]; Taverna [37-39]; workflow description efforts such as the Common Workflow Language [40]; and Docker (docker.com), Singularity (singularity.lbl.gov) [41], and other container systems. Even though these tools are widely available, and seem to address many of the issues of technical reproducibility and the culture of reproducibility, they have not yet become a core part of the life sciences experimental and publication lifecycle. There is an apparent disconnection between the development of tools addressing reproducibility and their use by the wider scientific and publishing communities who might benefit from them.

This raises the question of “how do researchers view their role in the production and consumption of scientific outputs?” A common way for researchers to quickly provide information about their data, analysis and results is through a figure or graph. Scientific figures in publications are commonly presented as static images. Access to the data (including the raw, processed and/or aggregated data), analysis, code or description of how the software was used that produced the figure are not available within the static images [42-48]. This can be especially pertinent to figures that have thousands or millions of points of data to convey [42]. In order for readers to interrogate published results in more detail, examine the transparency and reproducibility of the data and research, they would need to download a complete copy of the data, code, and any associated analysis methodology (data pre-processing, filtering, cleaning, etc) and reproduce this locally, provided all those elements are available and accessible [49]. Computational analyses often require running particular software which might require configuration and parameterisation, as well as library dependencies and operating system prerequisites. This is a time-consuming task and achieving reproducibility of computational experiments is not always possible [49,50]. Thus, solutions that automatically reproduce computational analyses and allow the investigation of the data and code presented in the figure in detail would be advantageous [12,26,42].

Many solutions now exist that allow for the reproducibility of computational analyses outside the research paper, and are typically supplied as links within the research paper or journal website redirecting to many different types of computational systems, such as Galaxy workflows, Binder interactive workspaces converted by GitHub repositories with Jupyter notebooks [51], and myExperiment links [33]. The endpoint of these analyses are often graphical figures or plots, and these may well be interactive, thus allowing modification of plot type, axes, data filtering, regression lines, etc. Whilst these figures may well be interactive in that a user can modify some part of the visualisation, this does not implicitly make the data or code that produced that figure more available, and hence more reproducible.

Technologies that can expose code, data and interactive figures are now mature. For example, Jupyter notebooks are built up of executable “cells” of code which can encapsulate a link to a data file hosted on a cloud service, code to get and analyse this data file, and then produce an interactive figure to interpret the dataset. Again, this is somewhat disconnected from the actual research publication. However, as technology has progressed in terms of available storage for data, computational power on the web through cloud services, and the ability of these services to run research code, we are now coming to the point where the production of interactive figures within publications themselves is achievable. These interactive figures which would inherently have access to the underlying data and analytical process can provide users with unique functionality that can help increase the reproducible nature of the research. This combination of code, data, analysis, visualisation and paper are examples of “executable documents” [52,53].

Interactive figures within executable documents therefore have incorporated data, code and graphics so that when the user interacts with the figure, perhaps by selecting a cluster of data points within a graph, the user could then be presented with the data that underlies those data points. Similarly, a user could make changes to the underlying parameters of the analysis, for example modifying a filter threshold, which would ultimately make changes to the visualisation of the figure or the document itself [42-48]. By means of an example, an executable document could represent an interactive figure showing a heat map of gene expression under different stress conditions. In a traditional article, the user would be tasked with finding references to the datasets and downloading them, and subsequently finding the code or methodology used to analyse the data, and retrace the original authors steps (if the code and data were available at all). Within an interactive figure in an executable document, a user could select a particular gene of interest by clicking on the heatmap, and viewing the gene expression information within a pop-up browser window. Whilst this is useful for general interpretation, to achieve reproducibility this pop-up window would provide a button that allows the user to pull the sequencing read data that was the basis for the results into a computational system in order to re-run the differential expression analysis. This raises many questions around how this infrastructure is provided, what technologies would be used to package up all elements needed for reproducibility, the subsequent costs of running the analysis, and so on.

These caveats aside, interactive figures within executable documents can benefit the reader for the consumption of the research outputs in an interactive way, with easy access to the data and removing the need for installing and configuring code and parameters for reproducing the computational experiments presented in the figure within the publication [26]. The aforementioned solutions would not only be helpful to the readers of papers [54], but benefit the peer review process [42].

There have been efforts to make the connection between production and consumption of research outputs within online publications. One of the first interactive figures to have been published in a scholarly life sciences journal is the Living Figure by Björn Brembs and Julien Colomb which allowed readers to change parameters of a statistical computation underlying a figure [25]. F1000Research has now published more papers that include Plotly graphs and Code Ocean widgets in order to provide interactivity and data and code reproducibility from within the article figures [28,52]. The first prototype of eLIFE’s computationally reproducible article aims to convert manuscripts created in a specific format (using the Stencila Desktop, stenci.la, and saved as a Document Archive file) into interactive documents allowing the reader to “play” with the article and its figures when viewed in a web browser [53]. The Manifold platform (manifoldapp.org) allows researchers to show their research objects alongside their publication in an electronic reader, whilst including some dynamic elements. The Cell journal included interactive figures in a paper using Juicebox.js for 3D visualisation of Hi-C data (http://aidenlab.org/juicebox/) [55,56].

Whilst there are few incentives to promote the culture of reproducibility [57,58], efforts in most science domains are being made to establish a culture where there is an expectation to share data for all publications according to the FAIR principles. The implementation of these principles is grounded in the assumption that better reproducibility will benefit the scientific community and the general public [59,60]. Studies have suggested that reproducibility in science is a serious issue with costly repercussions to science and the public [61,62]. Whilst there have been survey studies canvassing the attitudes of researchers around reproducibility in other disciplines to some extent [20,63,64], fewer studies have investigated the attitudes and knowledge of researchers around reproducibility in the life sciences [20]. In particular, minimal research has been conducted into the frequency of difficulties experienced with reproducibility, the perception of its importance, and preferences with respect to potential solutions among the life sciences community.

This paper presents a survey that was designed to assess researcher understanding of the concepts of reproducibility and to inform future efforts in one specific area: to help researchers be able to reproduce research outputs in publications. The development of tools, one example of which are interactive figures within journal publications, may better meet the needs of producers and consumers of life science research. Our survey is limited in that we do not assess how open access tools for the production of reproducible research outputs compare, but how the consumption of research information through interactive means is regarded. We constructed the survey in order to understand how the following are experienced by the respondents:

Technical factors affecting computational reproducibility: issues with accessing data, code and methodology parameters, and how solutions such as interactive figures could promote reproducibility from within an article.
Culture of reproducibility: attitudes towards reproducibility, the social factors hindering reproducibility, and interest in how research outputs can be consumed via interactive figures and their feature preferences.

Methods

Population and sample

The data were analysed anonymously, nonetheless, we sought ethical approval. The University of East Anglia Computing Sciences Research Ethics Committee approved this study (CMPREC/1819/R/13). Our sample populations were selected to include all life sciences communities across levels of seniority, discipline and level of experience with the issues we wished to survey. The first survey was conducted in November 2016 and sent out to 750 researchers working in the Norwich Biosciences Institutes (NBI) at a post-doctoral level or above. We chose to survey scientists of post-doctoral level or above, as these scientists are more likely to have had at least one interaction with publishing in scientific journals. The NBI is a partnership of four UK research institutions: the Earlham Institute (formerly known as The Genome Analysis Centre), the John Innes Centre, the Sainsbury Centre, and the Institute of Food Research (now Quadram Institute Bioscience). Invitations to participate were distributed via email, with a link to the survey. The second survey, similar to the first but with amendments and additions, was distributed in February 2017 to a random sample of 1651 researchers who had published papers in the eLIFE journal. Further information about the eLIFE sample is found in Supplementary section 3. Invitations to participate were sent using email by eLIFE staff. We achieved a 15% (n=112) response rate from the NBI researchers and an 8% response rate from the eLIFE survey (n=139). Table 1 shows the survey questions. Questions were designed to give qualitative and quantitative answers on technical and cultural aspects of reproducibility. Questions assessed the frequency in difficulties encountered in accessing data, the reasons for these difficulties, and how respondents currently obtain data underlying published articles. They measured understanding of what constitutes reproducibility of experiments, interactive figures, and computationally reproducible data. Finally, we evaluated the perceived benefit of interactive figures and of reproducing computational experiments, and which features of interactive figures would be most desirable.

View this table:

Table 1: Questions used to survey the knowledge of respondents about research reproducibility.

Validation of the survey design

We undertook a two-step survey: firstly NBI, then eLIFE interactions leading to additional questions. We tested the initial survey on a small cohort of researchers local to the authors to determine question suitability and flow. We designed the surveys in accordance with the Standards for Reporting Qualitative Research (SRQR) [65]. Our study was not pre-registered, but we confirm that the reported analyses are the only analyses that were conducted and that the reported variables were the only variables that were measured.

Statistical analysis

Results are typically presented as proportions of those responding, stratified by the respondent’s area of work, training received, and version of the survey as appropriate. Chi-square tests for independence were used to test for relationships between responses to specific questions, or whether responses varied between samples. The analysis was conducted using R (version 3.5.2; R Core Team, 2018) and Microsoft Excel. All supplementary figures and data are available on Figshare (see Data Availability).

We assessed if there was a significant difference in the ability and willingness to reproduce published results between the cohort of eLIFE respondents who understand the term “computationally reproducible data” and those who do not and whether training received, had an effect. Given the free-text responses within the “unsure” group as to the understanding of this term, where many understood the term, we did not include those that replied, “unsure” (see Section “Understanding of reproducibility, training and successful replication” below). The respondents who chose “yes tried reproducing results, but unsuccessfully”, “have not tried to reproduce results” and “it is not important to reproduce results” were grouped together under “unsuccessfully”.

Results

Characteristics of the sample

Figure 1 shows the distribution of areas of work of our respondents, stratified by survey sample. Genomics (proportion in the whole sample = 22%), biochemistry (17%), and computational biology (15%) were the most common subject areas endorsed in both NBI and eLIFE samples. With regard to how often respondents use bioinformatics tools, 25% replied “never”, 39% “rarely”, and 36% “often”. Many (43%) received statistical training, (31%) bioinformatic training, (20%) computer science training.

Figure 1: Data types used by NBI and eLIFE respondents.

Responses were not mutually exclusive. Data type choices were the same as the article data types available in the eLIFE article categorisation system.

Access to data and bioinformatics tools

In both samples, 90% of those who responded, reported having tried to access data underlying a published research article (Fig. 2). Of those who had tried, few had found this “easy” (14%) or “very easy” (2%) with 41% reporting that the process was “difficult” and 5% “very difficult”. Reasons for difficulty were chiefly cultural (Fig. 2), in that the data was not made available alongside the publication (found by 75% of those who had tried to access data), or authors could not be contacted or did not respond to data requests (52%). Relatively few found data unavailable for technical reasons of data size (21%), commercial sensitivity (13%) or confidentiality (12%). With respect to data sources, 57% of the total sample have used open public databases, 48% reported data was available with a link in the paper, and 47% had needed to contact authors.

Figure 2. Left panel: Difficulty encountered accessing data underlying published research.

Whether respondents have attempted to access data underlying previous publications and the level of difficulty typically encountered in doing so. Right panel: Reasons given for difficulty accessing data. The reasons given by respondents for being unable to access data (restricted to those who have attempted to access data).

Very few of the respondents either “never” (2%) or “rarely” (8%) had problems with running, installing, configuring bioinformatics software. Problems with software were encountered “often” (29%) or “very often” (15%) suggesting that nearly half of respondents regularly encountered technical barriers to computational reproducibility.

Understanding of reproducibility, training and successful replication

The majority of respondents reported that they understood the term “reproducibility of experiments” and selected the explanation for the term as defined in the introduction above, which corresponds to the most established definitions of reproducibility [11-13]. It is important to state that for this question, we allowed for respondents to choose more than one answer, as we recognise the limitation that there is no standard and accepted definition for reproducibility, as well as the familiarity of the term between scientists from different backgrounds can differ. The first three definitions are plausible definitions for reproducibility. Given the results, we can assume that some of the respondents chose both correct and wrong definitions. The majority of the answers (77%) included the definition of reproducibility as we define it in the manuscript. However, by looking into the individual responses (n=54), 11.1% (n=6) of respondents chose only option A thus appeared to understand that this matched the definition of reproducibility, as we state in the manuscript. 5.5% (n=3) chose only option D, which is incorrect. The majority of people (57%, n=23) picked any of A, B, or C and did not pick D, which seems to suggest that they understand that replicability is not reproducibility, but they are still not clear on exact definitions, which matches the general lack of consensus [7,8,10]. Just over a third (37%, n=20) picked one or all of A, B and C, and picked D, which seems to suggest that they didn’t understand the difference between reproducibility and replicability at all and considered any form of repeating a process could be classed as “reproducibility of experiments (see Supplementary table 4).

Most (52%) participants provided a different interpretation of the term “computationally reproducible data” to our interpretation, while 26% did know and 22% were unsure. We received several explanations (free text responses) of the term of which the majority were accurate (Supplementary section 2, free responses to question 13). We assign meaning to the term as data as an output (result) in a computational context, which was generated when reproducing computational experiments. Although the term “computationally reproducible data” is not officially defined, other sources and studies have referred to the concept of data that contributes to computational reproducibility [26,66-70]. From the unsure responses (n=30), we categorised those that gave free-text responses (70%, n=21, see Supplementary section 2, free responses) into whether they did actually understand the term, those that did not understand the term, and those that did not give any free text. The majority of respondents that chose “unsure” and gave a free text response (71%, n=15) did understand the term “computationally reproducible data”. The remaining 29% (n=6) did not understand the term correctly.

Some (18%) reported not attempting to reproduce published research. Very few (n = 5; 6%) of the sample endorsed the option that “it is not important to reproduce other people’s published results” (Supplementary figure 1). Even though the majority (60%) reported successfully reproducing published results, almost a quarter of the respondents found that their efforts to reproduce any results were unsuccessful (23%). Table 2 shows the ability of respondents in reproducing experiments stratified by the understanding of the term “computationally reproducible data” and the training received (bioinformatics, computer science, statistics). We found a significant difference between the ability to reproduce published experiments and knowing the meaning of the term “computationally reproducible data”. Among the 25 respondents who understood the term “computationally reproducible data”, 18 (72%) had successfully reproduced previous work, compared to only 26 (52%) of the 50 who responded that they did not understand the term (Chi-square test for independence, p = 0.048). Taking their training background into account did not show any significant difference. However, when testing with the responses “yes tried reproducing results, but unsuccessfully”, “have not tried to reproduce results” and “it is not important to reproduce results” (not grouped together under “unsuccessfully” in order to get an indication of how willingness and success together differed between the training groups), we found a significant difference (see Supplementary Table 1). The distribution of the training variable with those who received computer science training and those without was significantly different (Fisher exact test for independence, p = 0.018). It appears that respondents with computer science training are less likely to have tried to reproduce an experiment but be more likely to succeed when they did try.

View this table:

Table 2:

Success in reproducing any published results stratified by their knowledge of the term “computationally reproducible data” and training received.

There was no evidence for a difference in the ability and willingness to reproduce published results between the respondents who use bioinformatics tools often, and those who use them rarely or never (Supplementary Table 2). The majority of the respondents who use bioinformatics tools often were coming from the scientific backgrounds of Biophysics, Biochemistry, Computational Biology and Genomics. Most of the respondents who answered “reproducibility is not important” and “haven’t tried reproducing experiments” were scientists coming from disciplines using computational or bioinformatics tools “rarely” or “never” (Supplementary Table 3).

Improving Reproducibility of Published Research

The majority (91%) of respondents stated that authors describing all methodology steps in detail, including any formulae analysing the data can make published science more reproducible. Around half (53%) endorsed the view that “authors should provide the source code of any custom software used to analyse the data and that the software code is well documented”, and that authors provide a link to the raw data (49%) (Supplementary figure 2). Two respondents suggested that achieving better science reproducibility would be easier if funding was more readily available for reproducing the results of others and if there were opportunities to publish the reproduced results (Supplementary section, free responses). Within the same context, some respondents recognised the current culture in science that there are not sufficient incentives in publishing reproducible (or indeed negative findings) papers, but rather being rewarded in publishing as many papers as possible in high Impact Factor journals (Supplementary section, free responses).

Interactive Figures

Participants ranked in terms of preference potential features for an interactive figure within an article. These included choices such as “easy to manipulate” as the most preferred, and have “easy to define parameters” (Fig. 3). Generally, the answers from both the eLIFE and NBI surveys followed similar trends. Furthermore, free-text responses were collected, and most respondents stated that mechanisms to allow them to better understand the data presented in the figure would be beneficial, e.g. by zooming in on data (Supplementary section, free responses).

Figure 3. Preferred features for the interactive figure.

Responses to question 9: Respondents were asked to rank in order of preference the above features, with 1 most preferred feature, to 11 the least preferred feature. The average score for each feature was calculated in order of preference as selected by the respondents from both NBI and eLIFE surveys. The lower the average score value (x-axis), the more preferred the feature (y-axis).

The majority of the respondents perceive a benefit in having interactive figures in published papers for both readers and authors (Fig. 4). Examples of insights included: the interactive figure would allow visualising further points on the plot from data in the supplementary section, as well as be able to alter the data that is presented in the figure; having an interactive figure like a movie or to display protein 3D structures, would be beneficial to readers. The remaining responses we categorised as software related, which included suggestions of software that could be used to produce a figure that can be interactive, such as R Shiny (shiny.studio.com). We received a total of 114 free-text responses about the respondents’ opinions on what interactive figures are and a proportion of those (25%) suggested that they had never seen or interacted with such a figure before, and no indication was given that an interactive figure would help their work (see Supplementary section, free responses).

Figure 4. The level of perception of benefit to having the ability to publish papers with interactive figures.

The benefit to the author, to the readers of the author’s papers and to the papers the author reads. Answers include the responses from both NBI and eLIFE surveys for question 11.

The majority of the respondents also said that they see a benefit in automatically reproducing computational experiments, and manipulating and interacting with parameters in computational analysis workflows. Equally favourable was to be able to computationally reproduce statistical analyses (Fig. 5). Despite this perceived benefit, most respondents (61%) indicated that the ability to include an interactive figure would not affect their choice of a journal when seeking to publish their research.

Figure 5. Assessment of perceived benefit for automatically reproducing computational experiments or other analyses (including statistical tests).

Responses from both NBI and eLIFE for question 14.

Limitations

The findings were collected using the self-reporting method which can be limited in certain ways, especially with regards to the reported reproducibility success or lack of success of the respondents. We do not know categorically that someone reproduced experiments successfully because they checked the box. Despite the potential for confusing the exact meaning of reproducibility, which could affect the answers to questions 5, 6 and 7, the general consensus among respondents showed that the questions were sufficiently phrased to help us divide people into two groups of assessment (successful vs not successful) for subsequent analysis.

Part of our survey sample were researchers from the NBI, and this population might not be representative of the life sciences research community as a whole. Researchers working in academic institutions may have attitudes, incentives or infrastructure to support reproducibility that may be different from those who work in the private sector or government agencies. In addition, as the population of the NBI researchers was solely UK based, the attitudes of these researchers might differ from those in the rest of the world, despite the fact that the NBI comprises scientists who are from multiple countries and have trained and worked in global institutions. eLIFE authors work across the breadth of scientific institutions, both private and public, from the international stage, thus we believe that both eLIFE and NBI participants to be sufficiently representative for the purposes of our survey study.

Although we do not have distinct evidence that the eLIFE authors cohort had a predisposition to reproducibility, and the authors we surveyed were randomly selected, we acknowledge that as eLIFE is a journal that requires data sharing and is also heavily involved in reproducibility efforts, such as the Reproducibility Project: Cancer Biology. In the absence of data to the contrary, thus it is reasonable to assume that some factors might have influenced the eLIFE respondents’ opinions about reproducibility. We do not think that this fact undermines our conclusion, but it is a factor that future studies should be aware of when drawing comparisons that can shed further light onto this issue.

We acknowledge that questions 8 and 9 were on the same page when the participants were taking the survey, and seeing the two questions together might have introduced bias into their answers. Nonetheless, free text answers to question 8 included answers which were not presented as options for question 9. Some respondents also declared that they were not aware of, or have not previously encountered, interactive figures (see Supplementary Section free text responses to question 8).

We have found that the response rate for studies of this nature is fairly typical and indeed, other studies [71-75] have experienced comparable or lower rates. Ideally, we would want to aim for a higher response rate for future studies, which could be achieved by providing monetary incentives, as well as sending email reminders to the same or bigger cohort of invited people to participate in the study [76-78].

Discussion

This study highlights the difficulties currently experienced in reproducing experiments and conveys positive attitudes of scientists towards enabling and promoting reproducibility of published experiments through interactive elements in online publications.The NBI cohort of respondents were active life sciences researchers at the time the survey was conducted, and the eLIFE cohort were researchers that have published at least once in the eLIFE journal, therefore we believe the opinions collected are representative of researchers in life sciences who are routinely reading and publishing research.

While progress has been made in publishing standards across all life science disciplines, the opinions of the respondents reflect previously published shortcomings of the publishing procedures [79-81]: lack of data and code provision; storage standards; not including or requiring a detailed description of the methods and code structure in the published papers. However, the level of interest and incentives in reproducing published research is in its infancy, or it is not the researchers’ priority [82,83]. A key outcome of our survey is the acknowledgement of the large majority who understand that science becomes implicitly more reproducible if methods (including data, analysis, and code) are well-described and available. Respondents also perceive the benefit of having tools that enable the availability of data, methods and code and being able to automatically reproduce computational experiments described within the paper. Interactive figures within publications and executable documents can be such tools that allow the automatic reproducibility of computational experiments, or other analyses described within the paper, interact and manipulate parameters within the computational analysis workflow and give further insights and detail view of the data in the figure. Despite technologies existing to aid reproducibility and authors knowing they are beneficial, many scientific publications do not meet basic standards of reproducibility.

Our findings are in accordance with the current literature [61,84] that highlight that the lack of data access at the publication stage is one of the major reasons leading to the irreproducibility of published studies. When data is difficult to obtain, the reproducibility problem is exacerbated. A study that examined the differences between clinical and non-clinical scientists, showed that the majority of respondents did not have experience with uploading biomedical data to a repository, stemming from different social reasons not to do so: concerns and motivation around data sharing; work necessary to prepare the data [73]. Even with current policies mandating data openness [59,60], authors still fail to include their data alongside their publication, and this can not only be attributed to technical complications, but also fear of being scooped, fear of mistakes being found in data or analyses, and fear of others using their data for their own research papers [64,73]. Making FAIR data practices standard, through public data deposition and subsequent publication and citation, could encourage individual researchers and communities to share and reuse data considering their individual requirements and needs [68]. Data accessibility issues are also compounded by data becoming less retrievable with every year passing after the publication [85]. This is supported by our findings where data is either not available upon publication (57%) or authors cannot be reached/are unresponsive to data provision requests (44%). This continues to be a cultural artefact of using a paper’s methods section as a description of steps to reproduce analysis, rather than a fully reproducible solution involving easy access to public data repositories, open source code, and comprehensive documentation.

As evidenced by the respondents, the lack of data availability is a common hurdle for researchers to encounter that prevents the reproducibility of published work. Thus, the reproducibility of experiments could be improved by increasing the availability of data. Datasets are becoming larger and more complex, especially in the area of genomics. Storage solutions for large data files and citing them within the publication document, especially those in the order of terabytes, can allow for their wider, more efficient and proper data reusability [86,87]. Despite the potential advantage, these services can provide for data availability and accessibility, they do not implicitly solve the problem of data reusability. This is most apparent when data is too large to be stored locally or transferred via slow internet connections, or there is no route to attach metadata that describes the datasets sufficiently for reuse or integration with other datasets. There is also the question of data repository longevity - who funds the repositories for decades into the future? Currently, some researchers now have to pay data egress charges for downloading data from cloud providers [88-90]. This method presumably saves the data producers money in terms of storing large datasets publicly, but the cost is somewhat now presented to the consumer. This raises complex questions around large data generation projects that also need to be studied extensively for future impact, especially with respect to reproducibility within publications. Moreover, access to the raw data might not be enough, if the steps and other artefacts involved in producing the processed data that was used in the analysis are not provided [68]. In addition, corresponding authors often move on from projects and institutions or the authors themselves can no longer access the data, meaning “data available on request” ceases to be a viable option to source data or explanations of methods. Restricted access to an article can also affect reproducibility by requiring paid subscriptions to read content from a publisher. Although there is precedent for requesting single articles within cross-library loan systems or contacting the corresponding author(s) directly, this, much like requesting access to data, is not without issues. Pre-print servers such as bioRxiv have been taken up rapidly [91], especially in the genomics and bioinformatics domains, and this has the potential to remove delays in publication whilst simultaneously providing a “line in the sand” with a Digital Object Identifier (DOI) and maintaining the requirements for FAIR data. In some cases, the sensitivity of data might discourage authors from data sharing [92,93], but this reason was only reported by a small proportion of our respondents. Whilst there are efforts that attempt to apply the FAIR principles to clinical data, such as in the case of the OpenTrials database [94], they are by no means ubiquitous.

Data within public repositories with specific deposition requirements (such as the EMBL-EBI European Nucleotide Archive, ebi.ac.uk/ena), might not be associated or annotated with standardised metadata that describes it accurately [95], rather the bare minimum for deposition. Training scientists to implement data management policies effectively is likely to increase data reuse through improved metadata. In a 2016 survey of 3987 National Science Foundation Directorate of Biological Sciences principal investigators (BIO PIs), expressed their greatest unmet training needs by their institutions [83]. These were in the areas of integration of multiple data (89%), data management and metadata (78%) and scaling analysis to cloud/high-performance computing (71%). The aforementioned data and computing elements are integral to the correct knowledge “how-to” for research reproducibility. Our findings indicated that those who stated they had experience in informatics also stated they are better able to attempt and reproduce results. Practical bioinformatics and data management training, rather than in specific tools, may be an effective way of reinforcing the notion that researchers’ contributions towards reproducibility are a responsibility that requires active planning and execution. This may be especially effective when considering the training requirements of wet-lab and field scientists, who are becoming increasingly responsible for larger and more complex computational datasets. Further research needs to be undertaken to better understand how researchers’ competence in computational reproducibility may be linked to their level of informatics training.

Furthermore, there remains a perception that researchers do not get credit for reproducing the work of others or publishing negative or null results [96,97]. Whilst some journals explicitly state that they welcome negative results articles (e.g. PLOS One “Missing Pieces” collection), this is by no means the norm in life science publishing as evidenced by low, and dropping publication rates of negative findings [96-98]. In addition, the perception that mostly positive results are publication-worthy might discourage researchers from providing enough details on their research methodology, such as reporting any negative findings. For transparent and reproducible science, both negative (or null) and positive results should be reported for others to examine the evidence [96,99,100]. Ideally, the publication system would enable checking of reproducibility by reviewers and editors at the peer-review stage, with authors providing all data (including raw data), a full description of methods including statistical analysis parameters, any negative findings based on previous work and open source software code [101]. These elements can all be included within the interactive figure, such as by zooming in on over data points to reveal more information on the data, pop up windows to give details on negative results and parameters and the figure offering re-running of the computational experiment in the case of executable documents. Peer reviewers would then be better able to check for anomalies, and editors could perform the final check to ensure that the science paper to be published is presenting true, valid, and reproducible research. Some respondents have suggested that if reviewers and/or editors were monetarily compensated, spending time to reproduce the computational experiments in manuscripts would become more feasible, and would aid the irreproducibility issue. However, paying reviewers does not necessarily ensure that they would be more diligent in checking or trying to reproduce results [102] and there must be optimal ways to ensure effective pressure is placed upon the authors and publishing journals to have better publication standards [103,104]. The increasing adoption by journals of reporting standards for experimental design and results, provide a framework for harmonising the description of scientific processes to enable reproducibility. However, these standards are not universally enforced [105]. Similarly, concrete funding within research grants for implementing reproducibility itself manifested as actionable Data Management Plans (dcc.ac.uk, 2019), rather than what is currently a by-product of the publishing process, could give a level of confidence to researchers who would want to reproduce previous work and incorporate that data in their own projects.

Respondents mentioned that there are word count restrictions in papers, and journals often ask authors to shorten methods sections and perhaps move some text to supplementary information, many times placed in an unorganised fashion or having to remove it altogether. This is a legacy product of the hard-copy publishing era and readability aside, word limits are not consequential for most internet journals. Even so, if the word count limit was only applicable to the introduction, results and discussion sections, then the authors could describe methods in more detail within the paper, without having to move that valuable information in the supplementary section. When methods are citing methodology techniques as described in other papers, where those original references are hard to obtain, typically through closed access practices or by request mechanisms as noted above, then this can be an additional barrier to the reproducibility of the experiment. This suggests that there are benefits to describing the methods in detail and stating that they are similar to certain (cited) references as well as document the laboratory’s expertise in a particular method [106]. However, multi-institutional or consortium papers are becoming more common with ever-increasing numbers of authors on papers, which adds complexity to how authors should describe every previous method available that underpins their research [100]. There is no obvious solution to this issue. Highly specialised methods (e.g. electrophysiology expertise, requirements for large computational resources or knowledge of complex bioinformatics algorithms) and specific reagents (e.g. different animal strains), might not be readily available to other research groups [83]. As stated by some respondents, in certain cases the effective reproducibility of experiments is obstructed by numerical issues with very small or very large matrices or datasets, or different versions of analysis software used, perhaps to address bugs in analytical code, will cause a variation in the reproduced results.

Previous studies have provided strong evidence that there is a need for better technical systems and platforms to enable and promote the reproducibility of experiments. We provide additional evidence that that paper authors and readers perceive a benefit from having an interactive figure that would allow for the reproducibility of the experiment shown in the figure. An article that gives access to the data, code and detailed data analysis steps would allow for in situ reproduction of computational experiments by re-running code including statistical analyses “live” within the paper [26]. Whilst our study did not concentrate on how these “executable papers” may be constructed, this is an active area of development and some examples of how this may be achieved have been provided [51,108]. We provide additional evidence that paper authors and readers perceive a benefit from having publication infrastructure available that would allow for the reproducibility of an experiment. As such, the findings of this survey helped the development of two prototypes of interactive figures (see Data and Code availability) and subsequently the creation of eLIFE’s first computationally reproducible document [52].

We also asked whether presenting published experiments through interactive figures elements in online publications might be beneficial to researchers, in order to better consume research outputs. Respondents stated they could see the benefit in having interactive figures for the readers of their papers and the papers they read and being able as authors to present their experiment analysis and data as interactive figures. Respondents endorsed articles which include interactive elements, where access to the processed and raw data, metadata, code, and detailed analysis steps, in the form of an interactive figure, would help article readers better understand the paper and the experimental design and methodology. This would, in turn, improve the reproducibility of the experiment presented in the interactive figure, especially computational experiments. The notion of data visualisation tools promoting interactivity and reproducibility in online publishing has also been discussed in the literature [42]. Other efforts have been exploring the availability of interactive figures for driving reproducibility in publishing in the form of executable documents [28,51,52,55]. Moreover, technologies such as Jupyter Notebooks, Binder, myExperiment, CodeOcean enable the reproducibility of computational experiments associated with publications, provided by the authors as links from the paper. However, the benefit of having the interactivity and availability of reproducing experiments from within the article itself in the form of interactive figures, is that the reader can stay within the article itself and explore all the details of the data presented in the figure, download the data, play with the code or analysis that produced the figure, interact with parameters in the computational analysis workflows and computationally reproduce the experiments presented in the figure. This can enable the reader to better understand the research done presented in the interactive figure. Despite the self-reported perceived benefits of including interactive figures, the availability of this facility would not affect the respondents’ decisions on where to publish. This contradiction suggests that cultural factors (incentives, concerns authors have with sharing their data, attitudes towards open research) [64,73] play an underestimated role in reproducibility.

Despite the benefits, the interactive documents and figures can provide to the publishing system for improved consumption of research outputs, and that those benefits are in demand by the scientific community, work is needed in order to promote and support their use. Given the diversity of biological datasets and ever-evolving methods for data generation and analysis, it is unlikely that a single interactive infrastructure type can support all types of data and analysis. More research into how different types of data can be supported and presented in papers with interactivity needs to be undertaken. Yet problems with data availability and data sizes will persist - many studies comprise datasets that are too large to upload and render within web browsers in a reasonable timescale. Even if the data are available through well-funded repositories with fast data transfers, e.g. the INSDC databases (insdc.org), are publishers ready to bear the extra costs of supporting the infrastructure and people required to develop or maintain such interactive systems in the long run? These are questions that need to be further investigated, particularly when considering any form of industry standardisation of such interactivity in the publishing system. It is clear that publishing online journal papers with embedded interactive figures requires alterations to infrastructure, authoring tools and editorial processes [42]. In some cases, the data underpinning the figures might need to be stored and managed by third parties and this means the data, as well as the figures, may not be persistent. The same argument is relevant to software availability and reuse - publishers would need to verify that any links to data and software were available and contained original unmodified datasets. As datasets become larger and more complex, and more software and infrastructure is needed to re-analyse published datasets, this will affect how infrastructure will need to be developed to underpin reproducible research. Incentives will need to be put in place to motivate investment in these efforts.

We show that providing tools to scientists who are not computationally aware also requires a change in research culture, as many aspects of computational reproducibility require a change in publishing behaviour and competence in the informatics domain. Encouraging and incentivising scientists to conduct robust, transparent, reproducible and replicable research, such as with badges to recognise open practices should be prioritised to help solve the irreproducibility issue [109]. Implementing hiring practices with open science at the core of research roles [110] will encourage attitudes to change across faculty departments and institutions.

Another potential solution to the reproducibility crisis is to identify quantifiable metrics of research reproducibility and its scientific impact, thus giving researchers a better understanding of how their work stands on a scale of measurable reproducibility. The current assessment of the impact of research articles is a set of quantifiable metrics that do not evaluate research reproducibility, but stakeholders are starting to request that checklists and tools are provided to improve these assessments [111]. It is harder to find a better approach that is based on a thoroughly informed analysis by unbiased experts in the field that would quantify the reproducibility level of the research article [112]. That said, top-down requirements from journals and funders to release reproducible data and code may go some way to improving computational reproducibility within the life sciences, but this will also rely on the availability of technical solutions that are accessible and useful to the majority of scientists.

Opinions are mixed regarding the extent and severity of the reproducibility crisis. Our study and previous studies are highlighting the need to find effective solutions towards solving the reproducibility issue. Steps towards modernising the publishing system by incorporating interactivity with interactive figures and by automatically reproducing computational experiments described within a paper are deemed desirable. This may be a good starting point for improving research reproducibility by reproducing experiments within research articles. This, however, does not come without its caveats, as we described above. From our findings, and given the ongoing release of tools and platforms for technical reproducibility, future efforts should be spent in tackling the cultural behaviour of scientists, especially when faced with the need to publish for career progression.

Data and Code Availability

All data files are available via this Url: https://doi.org/10.6084/m9.figshare.c.4436912. Prototypes of interactives figures developed by the corresponding author are available via these GitHub repositories: https://github.com/code56/nodeServerSimpleFig and https://github.com/code56/prototype_article_interactive_figure

Acknowledgements

This project is funded by a BBSRC iCASE Studentship (project reference: BB/M017176/1). We would like to thank all the respondents of the surveys for their time. We would also like to thank George Savva from the Quadram Institute (QIB, UK) for comments and suggestions for this manuscript; eLIFE Sciences Publications Ltd, with whom the corresponding author collaborates as an iCASE student; as well as Ian Mulvany, former eLIFE Head of Development, for his help in developing the survey questionnaire.

Footnotes

The abstract was updated to add some quantitative results. The introduction was updated to clarify the definition of reproducibility versus other terms; Methods were updated to include more detailed information on the methodologies including further analysis; Supplemental files updated include the statistical analysis scripts; Data Availability updated; Discussion was updated; References were updated.
https://doi.org/10.6084/m9.figshare.c.4436912

References

1.↵
Claerbout J, Karrenbach M. Electronic Documents Give Reproducible Research a New Meaning [Internet]. Sepstanford.edu. 1992 [cited 14 May 2020]. Available from: http://sepwww.stanford.edu/doku.php?id=sep:research:reproducible:seg92
2.↵
Begley C, Ellis L. Raise standards for preclinical cancer research. Nature. 2012;483(7391):531–533.
OpenUrl CrossRef PubMed Web of Science
3.↵
Yong E. Reproducibility problems in genetics research may be jeopardizing lives. 2015 Dec 17 [cited 12 February 2015].In: Genetic Literacy Project [Internet]. Pennsylvania (USA): Genetic Literacy Project 2012 -. [about 1 screen]. Available from: https://geneticliteracyproject.org/2015/12/17/reproducibility-problems-genetics-research-may-costing-lives/
4.↵
Bell C, Dinwiddie D, Miller N, Hateley S, Ganusova E, Mudge J et al. Carrier Testing for Severe Childhood Recessive Diseases by Next-Generation Sequencing. Science Translational Medicine. 2011;3(65):65ra4.
OpenUrl Abstract/FREE Full Text
5.↵
Ioannidis J, Doucouliagos C. What’s to know about the credibility of empirical economics?. Journal of Economic Surveys. 2013;27(5):997–1004. doi: 10.1111/joes.12032.
OpenUrl CrossRef
6.↵
Freedman L, Cockburn I, Simcoe T. The Economics of Reproducibility in Preclinical Research. PLOS Biology. 2015;13(6):e1002165. doi: 10.1371/journal.pbio.1002165.
OpenUrl CrossRef PubMed
7.↵
Plesser H. Reproducibility vs. Replicability: A brief History of a Confused Terminology. Frontiers in Neuroinformatics. 2018;11. doi: 10.3389/fninf.2017.00076.
OpenUrl CrossRef
8.↵
Drummond C. Replicability is not Reproducibility: Nor is it Good Science. Proceedings of the Evaluation Methods for Machine Learning Workshop at the 26^th ICML., 2009. Available from: http://cogprints.org/7691/7/ICMLws09.pdf.
9.↵
Leipzig J, Nüst D, Hoyt CT, Soiland-Reyes S, Ram K, Greenberg J. The role of metadata in reproducible computational research. arXiv preprint arXiv:2006.08589. 2020 Jun 15.
10.↵
Liberman M. Language Log » Replicability vs. reproducibility — or is it the other way around?. 2015 October 31 [cited 5 October 2019]. In: Language Log Blog [Internet]. [place unknown]: Language Log 2003 - . [about 5 screens]. Available from: https://languagelog.ldc.upenn.edu/nll/?p=21956.f
11.↵
Peng RD, Dominici F, Zeger SL. Reproducible epidemiologic research. American journal of epidemiology. 2006 Mar 1;163(9):783–9.
OpenUrl CrossRef PubMed Web of Science
12.↵
Peng RD. Reproducible research in computational science. Science. 2011 Dec 2;334(6060):1226–7.
OpenUrl Abstract/FREE Full Text
13.↵
Stodden V, Bailey DH, Borwein J, LeVeque RJ, Rider W, Stein W. Setting the default to reproducible: Reproducibility in computational and experimental mathematics. InICERM Workshop Report 2013 Feb (p. 737).
14.↵
Nuijten MB, Bakker M, Maassen E, Wicherts JM. Verify original results through reanalysis before replicating. Behavioral and Brain Sciences. Cambridge University Press; 2018;41:e143.
OpenUrl
15.↵
Donoho DL. An invitation to reproducible computational research. Biostatistics. 2010 Jul 1;11(3):385–8.
OpenUrl CrossRef PubMed Web of Science
16.
Stodden V, Miguez S. Best practices for computational science: Software infrastructure and environments for reproducible and extensible research. Available at SSRN 2322276. 2013 Sep 6.
17.↵
Stodden V, Seiler J, Ma Z. An empirical analysis of journal policy effectiveness for computational reproducibility. Proceedings of the National Academy of Sciences. 2018 Mar 13;115(11):2584–9.
OpenUrl Abstract/FREE Full Text
18.↵
Stodden V. Reproducible research for scientific computing: Tools and strategies for changing the culture. Computing in Science & Engineering. 2012 Jul 26;14(4):13.
OpenUrl
19.↵
Cataldo M, Mockus A, Roberts J, Herbsleb J. Software Dependencies, Work Dependencies, and Their Impact on Failures. IEEE Transactions on Software Engineering. 2009;35(6):864–878. doi: 10.1109/tse.2009.42.
OpenUrl CrossRef
20.↵
Baker M. Is there a reproducibility crisis? A Nature survey lifts the lid on how researchers view the crisis rocking science and what they think will help. Nature. 2016 May 26;533(7604):452–5.
OpenUrl CrossRef PubMed
21.↵
Munafò MR, Nosek BA, Bishop DV, Button KS, Chambers CD, Du Sert NP, Simonsohn U, Wagenmakers EJ, Ware JJ, Ioannidis JP. A manifesto for reproducible science. Nature human behaviour. 2017 Jan;1(1):0021.
OpenUrl
22.↵
Hardwicke TE, Mathur MB, MacDonald K, Nilsonne G, Banks GC, Kidwell MC, Hofelich Mohr A, Clayton E, Yoon EJ, Henry Tessler M, Lenne RL. Data availability, reusability, and analytic reproducibility: Evaluating the impact of a mandatory open data policy at the journal Cognition. Royal Society open science. 2018 Aug 15;5(8):180448.
OpenUrl CrossRef PubMed
23.↵
Eisner DA. Reproducibility of science: fraud, impact factors and carelessness. Journal of molecular and cellular cardiology. 2018 Jan 1;114:364–8.
OpenUrl
24.↵
Colomb J, Brembs B. Sub-strains of Drosophila Canton-S differ markedly in their locomotor behavior. F1000Research. 2015;3:176. doi: 10.12688/f1000research.4263.2.
OpenUrl CrossRef
25.↵
Ghosh SS, Poline JB, Keator DB, Halchenko YO, Thomas AG, Kessler DA, Kennedy DN. A very simple, re-executable neuroimaging publication. F1000Research. 2017;6.
26.↵
Perkel J. TechBlog: Interactive figures address data reproducibility. 2017 October 20 [cited 6 September 2019]. In Naturejobs Blog [Internet]. [place unknown]: Nature 1869 - . [about 2 screens]. Available from: http://blogs.nature.com/naturejobs/2017/10/20/techblog-interactive-figures-address-data-reproducibility/.
27.
Researchers can now publish interactive Plotly figures in F1000. 2017 July 19 [cited 2019 September 7]. In: Medium [Internet]. San Francisco (USA): Medium 2012 - . [about 4 screens]. Available from:https://medium.com/plotly/researchers-can-now-publish-interactive-plotly-figures-in-f1000-87827a1b5d94
28.↵
Ingraham T. Reanalyse(a)s: making reproducibility easier with Code Ocean widgets - F1000Research Blogs. 2017 April 20 [cited 2019 September 6]. In: F1000Research Blogs [Internet]. London (UK): F1000Research 2015 - . [about 3 screens]. Available from: https://blog.f1000.com/2017/04/20/reanaly-seas-making-reproducibility-easier-with-code-ocean-widgets/
29.↵
Brinckman A, Chard K, Gaffney N, Hategan M, Jones M, Kowalik K et al. Computing environments for reproducibility: Capturing the “Whole Tale”. Future Generation Computer Systems. 2018;94:854–867. doi: 10.1016/j.future.2017.12.029.
OpenUrl CrossRef
30.↵
Chirigati F, Rampin R, Shasha D, Freire J. Reprozip: Computational reproducibility with ease. InProceedings of the 2016 International Conference on Management of Data 2016 Jun 26 (pp. 2085-2088). ACM.
31.↵
Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Čech M et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Research. 2018;46(W1):W537–W544. doi: 10.1093/nar/gky379.
OpenUrl CrossRef PubMed
32.↵
Goff SA, Vaughn M, Mckay S, Lyons E, Stapleton AE, Gessler D, et al. The iPlant Collaborative: Cyberinfrastructure for Plant Biology. Frontiers in Plant Science. 2011;2. doi: 10.3389/fpls.2011.00034.
OpenUrl CrossRef PubMed
33.↵
Goble C, Bhagat J, Aleksejevs S, Cruickshank D, Michaelides D, Newman D et al. myExperiment: a repository and social network for the sharing of bioinformatics workflows. Nucleic Acids Research. 2010;38(uppl_2):W677–W682. doi: 10.1093/nar/gkq429.
OpenUrl CrossRef PubMed Web of Science
34.↵
Pettifer S, Sinnott J, Attwood T. UTOPIA—User-Friendly Tools for Operating Informatics Applications. Comparative and Functional Genomics. 2004;5(1):56–60. doi: 10.1002/cfg.359.
OpenUrl CrossRef PubMed Web of Science
35.↵
Pettifer S, Thorne D, McDermott P, Marsh J, Villéger A, Kell D et al. Visualising biological data: a semantic approach to tool and database integration. BMC Bioinformatics. 2009;10(S6). doi: 10.1186/1471-2105-10-s6-s19.
OpenUrl CrossRef
36.↵
Sneddon T, Li P, Edmunds S. GigaDB: announcing the GigaScience database. GigaScience. 2012;1(1). doi: 10.1186/2047-217x-1-11.
OpenUrl CrossRef
37.↵
Wolstencroft K, Haines R, Fellows D, Williams A, Withers D, Owen S et al. The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud. Nucleic Acids Research. 2013;41(W1):W557–W561. doi: 10.1093/nar/gkt328.
OpenUrl CrossRef PubMed Web of Science
38.
Oinn T, Addis M, Ferris J, Marvin D, Senger M, Greenwood M et al. Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics. 2004;20(17):3045–3054. doi: 10.1093/bioinformatics/bth361.
OpenUrl CrossRef PubMed Web of Science
39.↵
Hull D, Wolstencroft K, Stevens R, Goble C, Pocock M, Li P et al. Taverna: a tool for building and running workflows of services. Nucleic Acids Research. 2006;34(Web Server):W729–W732. doi: 10.1093/nar/gkl320.
OpenUrl CrossRef PubMed Web of Science
40.↵
Peter Amstutz, Michael R. Crusoe, Nebojša Tijanić (editors), Brad Chapman, John Chilton, Michael Heuer et al. Common Workflow Language, v1.0. Specification, Common Workflow Language working group. 2016. Available from: https://w3id.org/cwl/v1.0/ xdoi:10.6084/m9.figshare.3115156.v2
OpenUrl CrossRef
41.↵
Kurtzer G, Sochat V, Bauer M. Singularity: Scientific containers for mobility of compute. PLOS ONE. 2017;12(5):e0177459. doi: 10.1371/journal.pone.0177459.
OpenUrl CrossRef PubMed
42.↵
Perkel JM. Data visualization tools drive interactivity and reproducibility in online publishing. Nature. 2018 Jan 1;554(7690):133–4.
OpenUrl CrossRef
43.
Grossman T, Chevalier F, Kazi RH. Bringing research articles to life with animated figures. interactions. 2016 Jul;23(4):52–7.
OpenUrl
44.
Weissgerber TL, Garovic VD, Savic M, Winham SJ, Milic NM. From static to interactive: transforming data visualization to improve transparency.PLoS biology. 2016 Jun 22;14(6):e1002484.
OpenUrl PubMed
45.
Barnes DG, Fluke CJ. Incorporating interactive three-dimensional graphics in astronomy research papers. New Astronomy. 2008 Nov 1;13(8):599–605.
OpenUrl CrossRef
46.
Newe A. Enriching scientific publications with interactive 3D PDF: an integrated toolbox for creating ready-to-publish figures. PeerJ Computer Science. 2016 Jun 20;2:e64.
OpenUrl
47.
Barnes DG, Vidiassov M, Ruthensteiner B, Fluke CJ, Quayle MR, McHenry CR. Embedding and publishing interactive, 3-dimensional, scientific figures in Portable Document Format (PDF) files. PloS one. 2013 Sep 25;8(9):e69446.
OpenUrl CrossRef PubMed
48.↵
Rao SS, Huang SC, St Hilaire BG, Engreitz JM, Perez EM, Kieffer-Kwon KR, Sanborn AL, Johnstone SE, Bascom GD, Bochkov ID, Huang X. Cohesin loss eliminates all loop domains. Cell. 2017 Oct 5;171(2):305–20.
OpenUrl CrossRef PubMed
49.↵
Stodden V, McNutt M, Bailey DH, Deelman E, Gil Y, Hanson B, Heroux MA, Ioannidis JP, Taufer M. Enhancing reproducibility for computational methods. Science. 2016 Dec 9;354(6317):1240–1.
OpenUrl Abstract/FREE Full Text
50.↵
Kim YM, Poline JB, Dumas G. Experimenting with reproducibility in bioinformatics. BioRxiv. 2017 Jan 1:143503.
OpenUrl
51.↵
Jupyter et al., “Binder 2.0 - Reproducible, Interactive, Sharable Environments for Science at Scale.” Proceedings of the 17th Python in Science Conference. 2018. doi:10.25080/Majora-4af1f417-011.
OpenUrl CrossRef
52.↵
Ghosh SS, Poline JB, Keator DB, Halchenko YO, Thomas AG, Kessler DA, Kennedy DN. A very simple, re-executable neuroimaging publication. F1000Research. 2017;6.
53.↵
Maciocci G, Aufreiter M, Bentley N. Introducing eLife’s first computationally reproducible article.2019 Feb 20 [cited 20 February 2019]. In: eLife Labs Blog [Internet]. Cambridge (UK) 2012 – . [about 5 screens]. Available from: https://elifesciences.org/labs/ad58f08d/introducing-elife-s-first-computationally-reproducible-article
54.↵
Tang B, Li F, Li J, Zhao W, Zhang Z. Delta integrates 3D physical structure with topology and genomic data of chromosomes. bioRxiv. 2017 Jan xv1:199950.
55.↵
Rao SS, Huang SC, St Hilaire BG, Engreitz JM, Perez EM, Kieffer-Kwon KR, Sanborn AL, Johnstone SE, Bascom GD, Bochkov ID, Huang X. Cohesin loss eliminates all loop domains. Cell. 2017 Oct 5;171(2):305–20.
OpenUrl CrossRef PubMed
56.↵
Robinson JT, Turner D, Durand NC, Thorvaldsdottir H, Mesirov JP, Aiden EL. Juicebox. js provides a cloud-based visualization system for Hi-C data. Cell systems. 2018 Feb 28;6(2):256–8.
OpenUrl
57.↵
Higginson A, Munafò M. Current Incentives for Scientists Lead to Underpowered Studies with Erroneous Conclusions. PLOS Biology. 2016;14(11):e2000995. doi: 10.1371/journal.pbio.2000995.
OpenUrl CrossRef PubMed
58.↵
Pusztai L, Hatzis C, Andre F. Reproducibility of research and preclinical validation: problems and solutions.Nature Reviews Clinical Oncology. 2013 Dec;10(12):720.
OpenUrl
59.↵
National Institutes of Health Plan for increasing access to scientific publications and digital scientific data from NIH funded scientific research, [Internet].NIH. 2015 [cited 5 May 2017]. Available from: https://grants.nih.gov/grants/NIH-Public-Access-Plan.pdf.
60.↵
Wilkinson M, Dumontier M, Aalbersberg I, Appleton G, Axton M, Baak A et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data. 2016; 3:160018. doi: 10.1038/sdata.2016.18.
OpenUrl CrossRef PubMed
61.↵
Pulverer B. Reproducibility blues. The EMBO Journal. 2015;34(22):2721–2724. doi: 10.15252/embj.201570090.
OpenUrl Abstract/FREE Full Text
62.↵
Stodden V, Guo P, Ma Z. Toward Reproducible Computational Research: An Empirical Analysis of Data and Code Policy Adoption by Journals. PLOS ONE. 2013;8(6):e67111. doi: 10.1371/journal.pone.0067111.
OpenUrl CrossRef PubMed
63.↵
Feger SS, Dallmeier-Tiessen S, Wozniak PW, Schmidt A. Gamification in Science: A Study of Requirements in the Context of Reproducible Research. InProceedings of the 2019 CHI Conference on Human Factors in Computing Systems 2019 Apr 18 (p. 460). ACM.
64.↵
Stodden, V. (2010). The scientific method in practice: Reproducibility in the computational sciences (MIT Sloan Research Paper No. 4773-10). Cambridge, MA: Massachusetts Institute of Technology. doi:10.2139/ssrn.1550193
OpenUrl CrossRef
65.↵
O’Brien BC, Harris IB, Beckman TJ, Reed DA, Cook DA. Standards for reporting qualitative research: a synthesis of recommendations. Academic Medicine. 2014 Sep 1;89(9):1245–51.
OpenUrl CrossRef PubMed
66.↵
Labfolder. The significance of reproducible data. [no date] [cited 2019 September 1]. In: Labfolder Blog [Internet]. Berlin (Germany): Labfold 2012 - . [about 2 screens]. Available from: https://www.labfolder.com/blog/the-significance-of-reproducible-data/
67.
Tait A. 10 Rules for Creating Reproducible Results in Data Science. 2017 July 3 [cited 2019 September 1]. In: Dataconomy [Internet]. Berlin (Germany). [about 2 screens]. Available from: https://dataconomy.com/2017/07/10-rules-results-data-science/
68.↵
Pawlik M, Hütter T, Kocher D, Mann W, Augsten N. A Link is not Enough–Reproducibility of Data. Datenbank-Spektrum. 2019:1–9.
69.
Baranyi M, Greilhuber J. Genome size inAllium: in quest of reproducible data. Annals of Botany. 1999 Jun 1;83(6):687–95.
OpenUrl CrossRef
70.↵
Weinländer M, Lekovic V, Spadijer-Gostovic S, Milicic B, Krennmair G, Plenk Jr H. Gingivomorphometry–esthetic evaluation of the crown–mucogingival complex: a new method for collection and measurement of standardized and reproducible data in oral photography. Clinical oral implants research. 2009 May;20(5):526–30.
OpenUrl PubMed
71.↵
Koschke R. Software visualization in software maintenance, reverse engineering, and re-engineering: a research survey. Journal of Software Maintenance and Evolution: Research and Practice. 2003 Mar;15(2):87–109.
OpenUrl
72.
Snell, L. and Spencer, J. Reviewers’ perceptions of the peer review process for a medical education journal. Medical Education. 2005; 39(1), pp.90–97.
OpenUrl CrossRef PubMed
73.↵
Federer LM, Lu YL, Joubert DJ, Welsh J, Brandys B.Biomedical data sharing and reuse: Attitudes and practices of clinical and scientific research staff. PloS one. 2015 Jun 24;10(6):e0129506.
OpenUrl CrossRef PubMed
74.
Schneider, M.V., Flannery, M. and Griffin, P., Survey of Bioinformatics and Computational Needs in Australia 2016. pdf. Figshare,2016. Available from: https://figshare.com/articles/Survey_of_Bioinformatics_and_Computational_Needs_in_Australia_2016_pdf/4307768/1
75.↵
Barone L, Williams J, Micklos D. Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators. PLOS Comput Biol. 2017;13(10). doi: 10.1371/journal.pcbi.1005755.
OpenUrl CrossRef PubMed
76.↵
James, J.M. and Bolstein, R., 1990. The effect of monetary incentives and follow-up mailings on the response rate and response quality in mail surveys. Public opinion quarterly, 54(3), pp.346–361.
OpenUrl CrossRef Web of Science
77.
Shettle, C. and Mooney, G.. Monetary incentives in US government surveys. Journal of Official Statistics. 1999; 15(2), p.231.
OpenUrl
78.↵
Jobber, D., Saunders, J. and Mitchell, V.W. Prepaid monetary incentive effects on mail survey response. Journal of Business Research. 2004; 57(1), pp.21–25.
OpenUrl CrossRef Web of Science
79.↵
Müller H, Naumann F, Freytag J. Data Quality in Genome Databases.Proc Conf Inf Qual (IQ 03). 2003;269–284.
80.
Marx V. The big challenges of big data. Nature. 2013;498(7453):255–260. doi: 10.1038/498255a.
OpenUrl CrossRef PubMed Web of Science
81.↵
Stodden V. Reproducing Statistical Results. Annual Review of Statistics and Its Application. 2015;2(1):1–19. doi: 10.1146/annurev-statistics-010814-020127.
OpenUrl CrossRef
82.↵
B. A. Nosek et al., Promoting an open research culture. Science. 2015;348, 1422–1425. doi: 10.1126/science.aab2374.
OpenUrl Abstract/FREE Full Text
83.↵
Collins F, Tabak L. Policy: NIH plans to enhance reproducibility. Nature. 2014;505(7485):612–613. doi: 10.1038/505612a.
OpenUrl CrossRef PubMed Web of Science
84.↵
Berg J. Progress on reproducibility. Science. 2018;359(6371):9–9. doi: 10.1126/science.aar8654.
OpenUrl Abstract/FREE Full Text
85.↵
Vines TH, Albert AY, Andrew RL, Débarre F, Bock DG, Franklin MT, Gilbert KJ, Moore JS, Renaut S, Rennison DJ. The availability of research data declines rapidly with article age. Current biology. 2014 Jan 6;24(1):94–7.
OpenUrl CrossRef PubMed
86.↵
Poldrack RA, Gorgolewski KJ. Making big data open: Data sharing in neuroimaging. Nat Neurosci. 2014;17(11):1510–7. doi: 10.1038/nn.3818.
OpenUrl CrossRef PubMed
87.↵
Faniel IM, Zimmerman A. Beyond the Data Deluge: A Research Agenda for Large-Scale Data Sharing and Reuse. Int J Digit Curation. 2011;6(1):58–69. doi: 10.2218/ijdc.v6i1.172.
OpenUrl CrossRef
88.↵
Banditwattanawong T, Masdisornchote M, Uthayopas P. Economical and efficient big data sharing with i-Cloud. In2014 International Conference on Big Data and Smart Computing (BIGCOMP) 2014 Jan 15 (pp. 105-110). IEEE.
89.
Once You’re in the Cloud, How Expensive Is It to Get Out? [cited 2019 September 6]. In: NEF [Internet]. Newton Massachusetts (USA): NEF 2004 - .[about 2 screens]. Available from: https://www.nefiber.com/blog/cloud-egress-charges/.
90.↵
Linthicum D. Don’t get surprised by the cloud’s data-egress fees [Internet]. InfoWorld. 30 March 2018 [cited 6 September 2019]. In: InfoWorld [Internet]. Framingham Massachusetts (USA): InfoWorld 1978 - . [about 2 screens]. Available from: https://www.infoworld.com/article/3266676/dont-get-surprised-by-the-clouds-data-egress-fees.html.
91.↵
Abdill R, Blekhman R. Tracking the popularity and outcomes of all bioRxiv preprints; 2019. Preprint. Available from BioRxiv: 10.1101/515643. Cited 3 April 2019.
92.↵
Figueiredo AS. Data Sharing: Convert Challenges into Opportunities. Front Public Health.Frontiers Media SA;2017; 5:327.doi:10.3389/fpubh.2017.00327.
OpenUrl CrossRef
93.↵
Hollis KF. To Share or Not to Share: Ethical Acquisition and Use of Medical Data. AMIA Jt Summits Transl Sci proceedings AMIA Jt Summits Transl Sci [Internet]. 2016;2016:420–7.
OpenUrl
94.↵
Chen CP, Zhang CY. Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Information sciences. 2014 Aug 10;275:314–47.
OpenUrl
95.↵
Attwood TK, Kell DB, McDermott P, Marsh J, Pettifer SR, Thorne D. Calling International Rescue: knowledge lost in literature and data landslide! Biochem J. 2009;424:317–33. doi: 10.1042/BJ20091474.
OpenUrl Abstract/FREE Full Text
96.↵
Franco A, Malhotra N, Simonovits G. Publication bias in the social sciences: Unlocking the file drawer. Science. 2014;345(6203):1502–1505. doi: 10.1126/science.1255484.
OpenUrl Abstract/FREE Full Text
97.↵
Teixeira da Silva JA. Negative results: negative perceptions limit their potential for increasing reproducibility.Journal of negative results in biomedicine. 2015 Dec;14(1):12.
OpenUrl
98.↵
Fanelli D. Negative results are disappearing from most disciplines and countries. Scientometrics. 2011;90(3):891–904. doi: 10.1007/s11192-011-0494-7.
OpenUrl CrossRef Web of Science
99.↵
Prager EM, Chambers KE, Plotkin JL, McArthur DL, Bandrowski AE, Bansal N, Martone ME, Bergstrom HC, Bespalov A, Graf C. Improving transparency and scientific rigor in academic publishing. Journal of neuroscience research. 2019 Apr;97(4):377–90.
OpenUrl
100.↵
Miyakawa T. No raw data, no science: another possible source of the reproducibility crisis. Molecular Brain. 2020;13(1).
101.↵
Iqbal S, Wallach J, Khoury M, Schully S, Ioannidis J. Reproducible Research Practices and Transparency across the Biomedical Literature. PLOS Biology. 2016;14(1):e1002333. doi: 10.1371/journal.pbio.1002333.
OpenUrl CrossRef PubMed
102.↵
Hershey N. Compensation and Accountability: The Way to Improve Peer Review. Quality Assurance and Utilization Review. 1992;7(1):23–29. doi: 10.1177/106286069200700104.
OpenUrl CrossRef
103.↵
Announcement: Reducing our irreproducibility. Nature. 2013;496(7446):398–398. doi: 10.1038/496398a.
OpenUrl CrossRef Web of Science
104.↵
Pusztai L, Hatzis C, Andre F. Reproducibility of research and preclinical validation: problems and solutions. Nature Reviews Clinical Oncology. 2013;10(12):720–724. doi: 10.1038/nrclinonc.2013.171.
OpenUrl CrossRef PubMed
105.↵
Moher D. Reporting guidelines: doing better for readers. BMC Medicine. 2018;16(1). doi: 10.1186/s12916-018-1226-0.
OpenUrl CrossRef PubMed
106.↵
Moher D, Avey M, Antes G, Altman DG. The National Institutes of Health and guidance for reporting preclinical research. BMC medicine. 2015 Dec;13(1):34.
OpenUrl
107.
Gonsalves, A. Lessons learned on consortium-based research in climate change and development. CARIAA Working Paper no. 1. 2014 [cited 2019 Jan 1]. In: International Development Research Centre [Internet]. Ottawa, Canada and UK Aid, London, United Kingdom. Available from: www.idrc.ca/cariaa
108.↵
Somers J. The Scientific Paper Is Obsolete2018 April 5[cited 1 September 2019]. In: The Atlantic [Internet]. Boston, Massachusetts (USA) 1857 -. [about 15 screens]. Available from: https://www.theatlantic.com/science/archive/2018/04/the-scientific-paper-is-obsolete/556676/
109.↵
Kidwell MC, Lazarević LB, Baranski E, Hardwicke TE, Piechowski S, Falkenberg LS, Kennett C, Slowik A, Sonnleitner C, Hess-Holden C, Errington TM. Badges to acknowledge open practices: A simple, low-cost, effective method for increasing transparency. PLoS biology. 2016 May 12;14(5):e1002456.
OpenUrl CrossRef PubMed
110.↵
Schönbrodt F. Changing hiring practices towards research transparency: The first open science statement in a professorship advertisement. 2016 Jan 6 [cited 2016 Mar 16]. In: Felix Schönbrodt’s blog [Internet]. [place unknown]. [about 1 screen]. Available from: https://www.nicebread.de/open-science-hiring-practices/.
111.↵
Wellcome Trust. Request for Information (RFI) A software tool to assess the FAIRness of research outputs against a structured checklist of requirements [FAIRWare] [Internet]. Wellcome Trust; 2018 [cited 5 March 2019]. Available from: https://wellcome.ac.uk/sites/default/files/FAIR-checking-software-request-for-information.pdf.
112.↵
Flier JS. Irreproducibility of published bioscience research: Diagnosis, pathogenesis and therapy. Mol Metab. 2017 Nov 21;6(1):2–9. doi: 10.1016/j.molmet.2016.11.006.
OpenUrl CrossRef

View the discussion thread.

Posted December 09, 2020.

Download PDF

Data/Code

Citation Tools

Subject Area

Scientific Communication and Education

Subject Areas

All Articles

Animal Behavior and Cognition (5209)
Biochemistry (11730)
Bioengineering (8743)
Bioinformatics (29179)
Biophysics (14964)
Cancer Biology (12080)
Cell Biology (17399)
Clinical Trials (138)
Developmental Biology (9417)
Ecology (14174)
Epidemiology (2067)
Evolutionary Biology (18294)
Genetics (12233)
Genomics (16791)
Immunology (11858)
Microbiology (28051)
Molecular Biology (11575)
Neuroscience (60919)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4955)
Plant Biology (10422)
Scientific Communication and Education (1682)
Synthetic Biology (2881)
Systems Biology (7338)
Zoology (1650)

[1] 1.↵
Claerbout J, Karrenbach M. Electronic Documents Give Reproducible Research a New Meaning [Internet]. Sepstanford.edu. 1992 [cited 14 May 2020]. Available from: http://sepwww.stanford.edu/doku.php?id=sep:research:reproducible:seg92

[2] 2.↵
Begley C, Ellis L. Raise standards for preclinical cancer research. Nature. 2012;483(7391):531–533.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Yong E. Reproducibility problems in genetics research may be jeopardizing lives. 2015 Dec 17 [cited 12 February 2015].In: Genetic Literacy Project [Internet]. Pennsylvania (USA): Genetic Literacy Project 2012 -. [about 1 screen]. Available from: https://geneticliteracyproject.org/2015/12/17/reproducibility-problems-genetics-research-may-costing-lives/

[4] 4.↵
Bell C, Dinwiddie D, Miller N, Hateley S, Ganusova E, Mudge J et al. Carrier Testing for Severe Childhood Recessive Diseases by Next-Generation Sequencing. Science Translational Medicine. 2011;3(65):65ra4.
OpenUrl Abstract/FREE Full Text

[5] 5.↵
Ioannidis J, Doucouliagos C. What’s to know about the credibility of empirical economics?. Journal of Economic Surveys. 2013;27(5):997–1004. doi: 10.1111/joes.12032.
OpenUrl CrossRef

[6] 6.↵
Freedman L, Cockburn I, Simcoe T. The Economics of Reproducibility in Preclinical Research. PLOS Biology. 2015;13(6):e1002165. doi: 10.1371/journal.pbio.1002165.
OpenUrl CrossRef PubMed

[7] 7.↵
Plesser H. Reproducibility vs. Replicability: A brief History of a Confused Terminology. Frontiers in Neuroinformatics. 2018;11. doi: 10.3389/fninf.2017.00076.
OpenUrl CrossRef

[8] 8.↵
Drummond C. Replicability is not Reproducibility: Nor is it Good Science. Proceedings of the Evaluation Methods for Machine Learning Workshop at the 26^th ICML., 2009. Available from: http://cogprints.org/7691/7/ICMLws09.pdf.

[9] 9.↵
Leipzig J, Nüst D, Hoyt CT, Soiland-Reyes S, Ram K, Greenberg J. The role of metadata in reproducible computational research. arXiv preprint arXiv:2006.08589. 2020 Jun 15.

[10] 10.↵
Liberman M. Language Log » Replicability vs. reproducibility — or is it the other way around?. 2015 October 31 [cited 5 October 2019]. In: Language Log Blog [Internet]. [place unknown]: Language Log 2003 - . [about 5 screens]. Available from: https://languagelog.ldc.upenn.edu/nll/?p=21956.f

[11] 11.↵
Peng RD, Dominici F, Zeger SL. Reproducible epidemiologic research. American journal of epidemiology. 2006 Mar 1;163(9):783–9.
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Peng RD. Reproducible research in computational science. Science. 2011 Dec 2;334(6060):1226–7.
OpenUrl Abstract/FREE Full Text

[13] 13.↵
Stodden V, Bailey DH, Borwein J, LeVeque RJ, Rider W, Stein W. Setting the default to reproducible: Reproducibility in computational and experimental mathematics. InICERM Workshop Report 2013 Feb (p. 737).

[14] 14.↵
Nuijten MB, Bakker M, Maassen E, Wicherts JM. Verify original results through reanalysis before replicating. Behavioral and Brain Sciences. Cambridge University Press; 2018;41:e143.
OpenUrl

[15] 15.↵
Donoho DL. An invitation to reproducible computational research. Biostatistics. 2010 Jul 1;11(3):385–8.
OpenUrl CrossRef PubMed Web of Science

[16] 16.
Stodden V, Miguez S. Best practices for computational science: Software infrastructure and environments for reproducible and extensible research. Available at SSRN 2322276. 2013 Sep 6.

[17] 17.↵
Stodden V, Seiler J, Ma Z. An empirical analysis of journal policy effectiveness for computational reproducibility. Proceedings of the National Academy of Sciences. 2018 Mar 13;115(11):2584–9.
OpenUrl Abstract/FREE Full Text

[18] 18.↵
Stodden V. Reproducible research for scientific computing: Tools and strategies for changing the culture. Computing in Science & Engineering. 2012 Jul 26;14(4):13.
OpenUrl

[19] 19.↵
Cataldo M, Mockus A, Roberts J, Herbsleb J. Software Dependencies, Work Dependencies, and Their Impact on Failures. IEEE Transactions on Software Engineering. 2009;35(6):864–878. doi: 10.1109/tse.2009.42.
OpenUrl CrossRef

[20] 20.↵
Baker M. Is there a reproducibility crisis? A Nature survey lifts the lid on how researchers view the crisis rocking science and what they think will help. Nature. 2016 May 26;533(7604):452–5.
OpenUrl CrossRef PubMed

[21] 21.↵
Munafò MR, Nosek BA, Bishop DV, Button KS, Chambers CD, Du Sert NP, Simonsohn U, Wagenmakers EJ, Ware JJ, Ioannidis JP. A manifesto for reproducible science. Nature human behaviour. 2017 Jan;1(1):0021.
OpenUrl

[22] 22.↵
Hardwicke TE, Mathur MB, MacDonald K, Nilsonne G, Banks GC, Kidwell MC, Hofelich Mohr A, Clayton E, Yoon EJ, Henry Tessler M, Lenne RL. Data availability, reusability, and analytic reproducibility: Evaluating the impact of a mandatory open data policy at the journal Cognition. Royal Society open science. 2018 Aug 15;5(8):180448.
OpenUrl CrossRef PubMed

[23] 23.↵
Eisner DA. Reproducibility of science: fraud, impact factors and carelessness. Journal of molecular and cellular cardiology. 2018 Jan 1;114:364–8.
OpenUrl

[24] 24.↵
Colomb J, Brembs B. Sub-strains of Drosophila Canton-S differ markedly in their locomotor behavior. F1000Research. 2015;3:176. doi: 10.12688/f1000research.4263.2.
OpenUrl CrossRef

[25] 25.↵
Ghosh SS, Poline JB, Keator DB, Halchenko YO, Thomas AG, Kessler DA, Kennedy DN. A very simple, re-executable neuroimaging publication. F1000Research. 2017;6.

[26] 26.↵
Perkel J. TechBlog: Interactive figures address data reproducibility. 2017 October 20 [cited 6 September 2019]. In Naturejobs Blog [Internet]. [place unknown]: Nature 1869 - . [about 2 screens]. Available from: http://blogs.nature.com/naturejobs/2017/10/20/techblog-interactive-figures-address-data-reproducibility/.

[27] 27.
Researchers can now publish interactive Plotly figures in F1000. 2017 July 19 [cited 2019 September 7]. In: Medium [Internet]. San Francisco (USA): Medium 2012 - . [about 4 screens]. Available from:https://medium.com/plotly/researchers-can-now-publish-interactive-plotly-figures-in-f1000-87827a1b5d94

[28] 28.↵
Ingraham T. Reanalyse(a)s: making reproducibility easier with Code Ocean widgets - F1000Research Blogs. 2017 April 20 [cited 2019 September 6]. In: F1000Research Blogs [Internet]. London (UK): F1000Research 2015 - . [about 3 screens]. Available from: https://blog.f1000.com/2017/04/20/reanaly-seas-making-reproducibility-easier-with-code-ocean-widgets/

[29] 29.↵
Brinckman A, Chard K, Gaffney N, Hategan M, Jones M, Kowalik K et al. Computing environments for reproducibility: Capturing the “Whole Tale”. Future Generation Computer Systems. 2018;94:854–867. doi: 10.1016/j.future.2017.12.029.
OpenUrl CrossRef

[30] 30.↵
Chirigati F, Rampin R, Shasha D, Freire J. Reprozip: Computational reproducibility with ease. InProceedings of the 2016 International Conference on Management of Data 2016 Jun 26 (pp. 2085-2088). ACM.

[31] 31.↵
Afgan E, Baker D, Batut B, van den Beek M, Bouvier D, Čech M et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Research. 2018;46(W1):W537–W544. doi: 10.1093/nar/gky379.
OpenUrl CrossRef PubMed

[32] 32.↵
Goff SA, Vaughn M, Mckay S, Lyons E, Stapleton AE, Gessler D, et al. The iPlant Collaborative: Cyberinfrastructure for Plant Biology. Frontiers in Plant Science. 2011;2. doi: 10.3389/fpls.2011.00034.
OpenUrl CrossRef PubMed

[33] 33.↵
Goble C, Bhagat J, Aleksejevs S, Cruickshank D, Michaelides D, Newman D et al. myExperiment: a repository and social network for the sharing of bioinformatics workflows. Nucleic Acids Research. 2010;38(uppl_2):W677–W682. doi: 10.1093/nar/gkq429.
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Pettifer S, Sinnott J, Attwood T. UTOPIA—User-Friendly Tools for Operating Informatics Applications. Comparative and Functional Genomics. 2004;5(1):56–60. doi: 10.1002/cfg.359.
OpenUrl CrossRef PubMed Web of Science

[35] 35.↵
Pettifer S, Thorne D, McDermott P, Marsh J, Villéger A, Kell D et al. Visualising biological data: a semantic approach to tool and database integration. BMC Bioinformatics. 2009;10(S6). doi: 10.1186/1471-2105-10-s6-s19.
OpenUrl CrossRef

[36] 36.↵
Sneddon T, Li P, Edmunds S. GigaDB: announcing the GigaScience database. GigaScience. 2012;1(1). doi: 10.1186/2047-217x-1-11.
OpenUrl CrossRef

[37] 37.↵
Wolstencroft K, Haines R, Fellows D, Williams A, Withers D, Owen S et al. The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud. Nucleic Acids Research. 2013;41(W1):W557–W561. doi: 10.1093/nar/gkt328.
OpenUrl CrossRef PubMed Web of Science

[38] 38.
Oinn T, Addis M, Ferris J, Marvin D, Senger M, Greenwood M et al. Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics. 2004;20(17):3045–3054. doi: 10.1093/bioinformatics/bth361.
OpenUrl CrossRef PubMed Web of Science

[39] 39.↵
Hull D, Wolstencroft K, Stevens R, Goble C, Pocock M, Li P et al. Taverna: a tool for building and running workflows of services. Nucleic Acids Research. 2006;34(Web Server):W729–W732. doi: 10.1093/nar/gkl320.
OpenUrl CrossRef PubMed Web of Science

[40] 40.↵
Peter Amstutz, Michael R. Crusoe, Nebojša Tijanić (editors), Brad Chapman, John Chilton, Michael Heuer et al. Common Workflow Language, v1.0. Specification, Common Workflow Language working group. 2016. Available from: https://w3id.org/cwl/v1.0/ xdoi:10.6084/m9.figshare.3115156.v2
OpenUrl CrossRef

[41] 41.↵
Kurtzer G, Sochat V, Bauer M. Singularity: Scientific containers for mobility of compute. PLOS ONE. 2017;12(5):e0177459. doi: 10.1371/journal.pone.0177459.
OpenUrl CrossRef PubMed

[42] 42.↵
Perkel JM. Data visualization tools drive interactivity and reproducibility in online publishing. Nature. 2018 Jan 1;554(7690):133–4.
OpenUrl CrossRef

[43] 43.
Grossman T, Chevalier F, Kazi RH. Bringing research articles to life with animated figures. interactions. 2016 Jul;23(4):52–7.
OpenUrl

[44] 44.
Weissgerber TL, Garovic VD, Savic M, Winham SJ, Milic NM. From static to interactive: transforming data visualization to improve transparency.PLoS biology. 2016 Jun 22;14(6):e1002484.
OpenUrl PubMed

[45] 45.
Barnes DG, Fluke CJ. Incorporating interactive three-dimensional graphics in astronomy research papers. New Astronomy. 2008 Nov 1;13(8):599–605.
OpenUrl CrossRef

[46] 46.
Newe A. Enriching scientific publications with interactive 3D PDF: an integrated toolbox for creating ready-to-publish figures. PeerJ Computer Science. 2016 Jun 20;2:e64.
OpenUrl

[47] 47.
Barnes DG, Vidiassov M, Ruthensteiner B, Fluke CJ, Quayle MR, McHenry CR. Embedding and publishing interactive, 3-dimensional, scientific figures in Portable Document Format (PDF) files. PloS one. 2013 Sep 25;8(9):e69446.
OpenUrl CrossRef PubMed

[48] 48.↵
Rao SS, Huang SC, St Hilaire BG, Engreitz JM, Perez EM, Kieffer-Kwon KR, Sanborn AL, Johnstone SE, Bascom GD, Bochkov ID, Huang X. Cohesin loss eliminates all loop domains. Cell. 2017 Oct 5;171(2):305–20.
OpenUrl CrossRef PubMed

[49] 49.↵
Stodden V, McNutt M, Bailey DH, Deelman E, Gil Y, Hanson B, Heroux MA, Ioannidis JP, Taufer M. Enhancing reproducibility for computational methods. Science. 2016 Dec 9;354(6317):1240–1.
OpenUrl Abstract/FREE Full Text

[50] 50.↵
Kim YM, Poline JB, Dumas G. Experimenting with reproducibility in bioinformatics. BioRxiv. 2017 Jan 1:143503.
OpenUrl

[51] 51.↵
Jupyter et al., “Binder 2.0 - Reproducible, Interactive, Sharable Environments for Science at Scale.” Proceedings of the 17th Python in Science Conference. 2018. doi:10.25080/Majora-4af1f417-011.
OpenUrl CrossRef

[52] 52.↵
Ghosh SS, Poline JB, Keator DB, Halchenko YO, Thomas AG, Kessler DA, Kennedy DN. A very simple, re-executable neuroimaging publication. F1000Research. 2017;6.

[53] 53.↵
Maciocci G, Aufreiter M, Bentley N. Introducing eLife’s first computationally reproducible article.2019 Feb 20 [cited 20 February 2019]. In: eLife Labs Blog [Internet]. Cambridge (UK) 2012 – . [about 5 screens]. Available from: https://elifesciences.org/labs/ad58f08d/introducing-elife-s-first-computationally-reproducible-article

[54] 54.↵
Tang B, Li F, Li J, Zhao W, Zhang Z. Delta integrates 3D physical structure with topology and genomic data of chromosomes. bioRxiv. 2017 Jan xv1:199950.

[55] 55.↵
Rao SS, Huang SC, St Hilaire BG, Engreitz JM, Perez EM, Kieffer-Kwon KR, Sanborn AL, Johnstone SE, Bascom GD, Bochkov ID, Huang X. Cohesin loss eliminates all loop domains. Cell. 2017 Oct 5;171(2):305–20.
OpenUrl CrossRef PubMed

[56] 56.↵
Robinson JT, Turner D, Durand NC, Thorvaldsdottir H, Mesirov JP, Aiden EL. Juicebox. js provides a cloud-based visualization system for Hi-C data. Cell systems. 2018 Feb 28;6(2):256–8.
OpenUrl

[57] 57.↵
Higginson A, Munafò M. Current Incentives for Scientists Lead to Underpowered Studies with Erroneous Conclusions. PLOS Biology. 2016;14(11):e2000995. doi: 10.1371/journal.pbio.2000995.
OpenUrl CrossRef PubMed

[58] 58.↵
Pusztai L, Hatzis C, Andre F. Reproducibility of research and preclinical validation: problems and solutions.Nature Reviews Clinical Oncology. 2013 Dec;10(12):720.
OpenUrl

[59] 59.↵
National Institutes of Health Plan for increasing access to scientific publications and digital scientific data from NIH funded scientific research, [Internet].NIH. 2015 [cited 5 May 2017]. Available from: https://grants.nih.gov/grants/NIH-Public-Access-Plan.pdf.

[60] 60.↵
Wilkinson M, Dumontier M, Aalbersberg I, Appleton G, Axton M, Baak A et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data. 2016; 3:160018. doi: 10.1038/sdata.2016.18.
OpenUrl CrossRef PubMed

[61] 61.↵
Pulverer B. Reproducibility blues. The EMBO Journal. 2015;34(22):2721–2724. doi: 10.15252/embj.201570090.
OpenUrl Abstract/FREE Full Text

[62] 62.↵
Stodden V, Guo P, Ma Z. Toward Reproducible Computational Research: An Empirical Analysis of Data and Code Policy Adoption by Journals. PLOS ONE. 2013;8(6):e67111. doi: 10.1371/journal.pone.0067111.
OpenUrl CrossRef PubMed

[63] 63.↵
Feger SS, Dallmeier-Tiessen S, Wozniak PW, Schmidt A. Gamification in Science: A Study of Requirements in the Context of Reproducible Research. InProceedings of the 2019 CHI Conference on Human Factors in Computing Systems 2019 Apr 18 (p. 460). ACM.

[64] 64.↵
Stodden, V. (2010). The scientific method in practice: Reproducibility in the computational sciences (MIT Sloan Research Paper No. 4773-10). Cambridge, MA: Massachusetts Institute of Technology. doi:10.2139/ssrn.1550193
OpenUrl CrossRef

[65] 65.↵
O’Brien BC, Harris IB, Beckman TJ, Reed DA, Cook DA. Standards for reporting qualitative research: a synthesis of recommendations. Academic Medicine. 2014 Sep 1;89(9):1245–51.
OpenUrl CrossRef PubMed

[66] 66.↵
Labfolder. The significance of reproducible data. [no date] [cited 2019 September 1]. In: Labfolder Blog [Internet]. Berlin (Germany): Labfold 2012 - . [about 2 screens]. Available from: https://www.labfolder.com/blog/the-significance-of-reproducible-data/

[67] 67.
Tait A. 10 Rules for Creating Reproducible Results in Data Science. 2017 July 3 [cited 2019 September 1]. In: Dataconomy [Internet]. Berlin (Germany). [about 2 screens]. Available from: https://dataconomy.com/2017/07/10-rules-results-data-science/

[68] 68.↵
Pawlik M, Hütter T, Kocher D, Mann W, Augsten N. A Link is not Enough–Reproducibility of Data. Datenbank-Spektrum. 2019:1–9.

[69] 69.
Baranyi M, Greilhuber J. Genome size inAllium: in quest of reproducible data. Annals of Botany. 1999 Jun 1;83(6):687–95.
OpenUrl CrossRef

[70] 70.↵
Weinländer M, Lekovic V, Spadijer-Gostovic S, Milicic B, Krennmair G, Plenk Jr H. Gingivomorphometry–esthetic evaluation of the crown–mucogingival complex: a new method for collection and measurement of standardized and reproducible data in oral photography. Clinical oral implants research. 2009 May;20(5):526–30.
OpenUrl PubMed

[71] 71.↵
Koschke R. Software visualization in software maintenance, reverse engineering, and re-engineering: a research survey. Journal of Software Maintenance and Evolution: Research and Practice. 2003 Mar;15(2):87–109.
OpenUrl

[72] 72.
Snell, L. and Spencer, J. Reviewers’ perceptions of the peer review process for a medical education journal. Medical Education. 2005; 39(1), pp.90–97.
OpenUrl CrossRef PubMed

[73] 73.↵
Federer LM, Lu YL, Joubert DJ, Welsh J, Brandys B.Biomedical data sharing and reuse: Attitudes and practices of clinical and scientific research staff. PloS one. 2015 Jun 24;10(6):e0129506.
OpenUrl CrossRef PubMed

[74] 74.
Schneider, M.V., Flannery, M. and Griffin, P., Survey of Bioinformatics and Computational Needs in Australia 2016. pdf. Figshare,2016. Available from: https://figshare.com/articles/Survey_of_Bioinformatics_and_Computational_Needs_in_Australia_2016_pdf/4307768/1

[75] 75.↵
Barone L, Williams J, Micklos D. Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators. PLOS Comput Biol. 2017;13(10). doi: 10.1371/journal.pcbi.1005755.
OpenUrl CrossRef PubMed

[76] 76.↵
James, J.M. and Bolstein, R., 1990. The effect of monetary incentives and follow-up mailings on the response rate and response quality in mail surveys. Public opinion quarterly, 54(3), pp.346–361.
OpenUrl CrossRef Web of Science

[77] 77.
Shettle, C. and Mooney, G.. Monetary incentives in US government surveys. Journal of Official Statistics. 1999; 15(2), p.231.
OpenUrl

[78] 78.↵
Jobber, D., Saunders, J. and Mitchell, V.W. Prepaid monetary incentive effects on mail survey response. Journal of Business Research. 2004; 57(1), pp.21–25.
OpenUrl CrossRef Web of Science

[79] 79.↵
Müller H, Naumann F, Freytag J. Data Quality in Genome Databases.Proc Conf Inf Qual (IQ 03). 2003;269–284.

[80] 80.
Marx V. The big challenges of big data. Nature. 2013;498(7453):255–260. doi: 10.1038/498255a.
OpenUrl CrossRef PubMed Web of Science

[81] 81.↵
Stodden V. Reproducing Statistical Results. Annual Review of Statistics and Its Application. 2015;2(1):1–19. doi: 10.1146/annurev-statistics-010814-020127.
OpenUrl CrossRef

[82] 82.↵
B. A. Nosek et al., Promoting an open research culture. Science. 2015;348, 1422–1425. doi: 10.1126/science.aab2374.
OpenUrl Abstract/FREE Full Text

[83] 83.↵
Collins F, Tabak L. Policy: NIH plans to enhance reproducibility. Nature. 2014;505(7485):612–613. doi: 10.1038/505612a.
OpenUrl CrossRef PubMed Web of Science

[84] 84.↵
Berg J. Progress on reproducibility. Science. 2018;359(6371):9–9. doi: 10.1126/science.aar8654.
OpenUrl Abstract/FREE Full Text

[85] 85.↵
Vines TH, Albert AY, Andrew RL, Débarre F, Bock DG, Franklin MT, Gilbert KJ, Moore JS, Renaut S, Rennison DJ. The availability of research data declines rapidly with article age. Current biology. 2014 Jan 6;24(1):94–7.
OpenUrl CrossRef PubMed

[86] 86.↵
Poldrack RA, Gorgolewski KJ. Making big data open: Data sharing in neuroimaging. Nat Neurosci. 2014;17(11):1510–7. doi: 10.1038/nn.3818.
OpenUrl CrossRef PubMed

[87] 87.↵
Faniel IM, Zimmerman A. Beyond the Data Deluge: A Research Agenda for Large-Scale Data Sharing and Reuse. Int J Digit Curation. 2011;6(1):58–69. doi: 10.2218/ijdc.v6i1.172.
OpenUrl CrossRef

[88] 88.↵
Banditwattanawong T, Masdisornchote M, Uthayopas P. Economical and efficient big data sharing with i-Cloud. In2014 International Conference on Big Data and Smart Computing (BIGCOMP) 2014 Jan 15 (pp. 105-110). IEEE.

[89] 89.
Once You’re in the Cloud, How Expensive Is It to Get Out? [cited 2019 September 6]. In: NEF [Internet]. Newton Massachusetts (USA): NEF 2004 - .[about 2 screens]. Available from: https://www.nefiber.com/blog/cloud-egress-charges/.

[90] 90.↵
Linthicum D. Don’t get surprised by the cloud’s data-egress fees [Internet]. InfoWorld. 30 March 2018 [cited 6 September 2019]. In: InfoWorld [Internet]. Framingham Massachusetts (USA): InfoWorld 1978 - . [about 2 screens]. Available from: https://www.infoworld.com/article/3266676/dont-get-surprised-by-the-clouds-data-egress-fees.html.

[91] 91.↵
Abdill R, Blekhman R. Tracking the popularity and outcomes of all bioRxiv preprints; 2019. Preprint. Available from BioRxiv: 10.1101/515643. Cited 3 April 2019.

[92] 92.↵
Figueiredo AS. Data Sharing: Convert Challenges into Opportunities. Front Public Health.Frontiers Media SA;2017; 5:327.doi:10.3389/fpubh.2017.00327.
OpenUrl CrossRef

[93] 93.↵
Hollis KF. To Share or Not to Share: Ethical Acquisition and Use of Medical Data. AMIA Jt Summits Transl Sci proceedings AMIA Jt Summits Transl Sci [Internet]. 2016;2016:420–7.
OpenUrl

[94] 94.↵
Chen CP, Zhang CY. Data-intensive applications, challenges, techniques and technologies: A survey on Big Data. Information sciences. 2014 Aug 10;275:314–47.
OpenUrl

[95] 95.↵
Attwood TK, Kell DB, McDermott P, Marsh J, Pettifer SR, Thorne D. Calling International Rescue: knowledge lost in literature and data landslide! Biochem J. 2009;424:317–33. doi: 10.1042/BJ20091474.
OpenUrl Abstract/FREE Full Text

[96] 96.↵
Franco A, Malhotra N, Simonovits G. Publication bias in the social sciences: Unlocking the file drawer. Science. 2014;345(6203):1502–1505. doi: 10.1126/science.1255484.
OpenUrl Abstract/FREE Full Text

[97] 97.↵
Teixeira da Silva JA. Negative results: negative perceptions limit their potential for increasing reproducibility.Journal of negative results in biomedicine. 2015 Dec;14(1):12.
OpenUrl

[98] 98.↵
Fanelli D. Negative results are disappearing from most disciplines and countries. Scientometrics. 2011;90(3):891–904. doi: 10.1007/s11192-011-0494-7.
OpenUrl CrossRef Web of Science

[99] 99.↵
Prager EM, Chambers KE, Plotkin JL, McArthur DL, Bandrowski AE, Bansal N, Martone ME, Bergstrom HC, Bespalov A, Graf C. Improving transparency and scientific rigor in academic publishing. Journal of neuroscience research. 2019 Apr;97(4):377–90.
OpenUrl

[100] 100.↵
Miyakawa T. No raw data, no science: another possible source of the reproducibility crisis. Molecular Brain. 2020;13(1).

[101] 101.↵
Iqbal S, Wallach J, Khoury M, Schully S, Ioannidis J. Reproducible Research Practices and Transparency across the Biomedical Literature. PLOS Biology. 2016;14(1):e1002333. doi: 10.1371/journal.pbio.1002333.
OpenUrl CrossRef PubMed

[102] 102.↵
Hershey N. Compensation and Accountability: The Way to Improve Peer Review. Quality Assurance and Utilization Review. 1992;7(1):23–29. doi: 10.1177/106286069200700104.
OpenUrl CrossRef

[103] 103.↵
Announcement: Reducing our irreproducibility. Nature. 2013;496(7446):398–398. doi: 10.1038/496398a.
OpenUrl CrossRef Web of Science

[104] 104.↵
Pusztai L, Hatzis C, Andre F. Reproducibility of research and preclinical validation: problems and solutions. Nature Reviews Clinical Oncology. 2013;10(12):720–724. doi: 10.1038/nrclinonc.2013.171.
OpenUrl CrossRef PubMed

[105] 105.↵
Moher D. Reporting guidelines: doing better for readers. BMC Medicine. 2018;16(1). doi: 10.1186/s12916-018-1226-0.
OpenUrl CrossRef PubMed

[106] 106.↵
Moher D, Avey M, Antes G, Altman DG. The National Institutes of Health and guidance for reporting preclinical research. BMC medicine. 2015 Dec;13(1):34.
OpenUrl

[107] 107.
Gonsalves, A. Lessons learned on consortium-based research in climate change and development. CARIAA Working Paper no. 1. 2014 [cited 2019 Jan 1]. In: International Development Research Centre [Internet]. Ottawa, Canada and UK Aid, London, United Kingdom. Available from: www.idrc.ca/cariaa

[108] 108.↵
Somers J. The Scientific Paper Is Obsolete2018 April 5[cited 1 September 2019]. In: The Atlantic [Internet]. Boston, Massachusetts (USA) 1857 -. [about 15 screens]. Available from: https://www.theatlantic.com/science/archive/2018/04/the-scientific-paper-is-obsolete/556676/

[109] 109.↵
Kidwell MC, Lazarević LB, Baranski E, Hardwicke TE, Piechowski S, Falkenberg LS, Kennett C, Slowik A, Sonnleitner C, Hess-Holden C, Errington TM. Badges to acknowledge open practices: A simple, low-cost, effective method for increasing transparency. PLoS biology. 2016 May 12;14(5):e1002456.
OpenUrl CrossRef PubMed

[110] 110.↵
Schönbrodt F. Changing hiring practices towards research transparency: The first open science statement in a professorship advertisement. 2016 Jan 6 [cited 2016 Mar 16]. In: Felix Schönbrodt’s blog [Internet]. [place unknown]. [about 1 screen]. Available from: https://www.nicebread.de/open-science-hiring-practices/.

[111] 111.↵
Wellcome Trust. Request for Information (RFI) A software tool to assess the FAIRness of research outputs against a structured checklist of requirements [FAIRWare] [Internet]. Wellcome Trust; 2018 [cited 5 March 2019]. Available from: https://wellcome.ac.uk/sites/default/files/FAIR-checking-software-request-for-information.pdf.

[112] 112.↵
Flier JS. Irreproducibility of published bioscience research: Diagnosis, pathogenesis and therapy. Mol Metab. 2017 Nov 21;6(1):2–9. doi: 10.1016/j.molmet.2016.11.006.
OpenUrl CrossRef

Knowledge and attitudes among life scientists towards reproducibility within journal articles: a research survey

Abstract

Introduction

Methods

Population and sample

Validation of the survey design

Statistical analysis

Results

Characteristics of the sample

Access to data and bioinformatics tools

Understanding of reproducibility, training and successful replication

Improving Reproducibility of Published Research

Interactive Figures

Limitations

Discussion

Data and Code Availability

Acknowledgements

Footnotes

References

Citation Manager Formats

Subject Area