Data management and sharing in neuroimaging: Practices and perceptions of MRI researchers

John A. Borghi; Ana E. Van Gulick

doi:10.1101/266627

ABSTRACT

Neuroimaging methods such as magnetic resonance imaging (MRI) involve complex data collection and analysis protocols, which necessitate the establishment of good research data management (RDM). Despite efforts within the field to address issues related to rigor and reproducibility, information about the RDM-related practices and perceptions of neuroimaging researchers remains largely anecdotal. To inform such efforts, we conducted an online survey of active MRI researchers that covered a range of RDM-related topics. Survey questions addressed the type(s) of data collected, tools used for data storage, organization, and analysis, and the degree to which practices are defined and standardized within a research group. Our results demonstrate that neuroimaging data is acquired in multifarious forms, transformed and analyzed using a wide variety of software tools, and that RDM practices and perceptions vary considerably both within and between research groups, with trainees reporting less consistency than faculty. Ratings of the maturity of RDM practices from ad-hoc to refined were relatively high during the data collection and analysis phases of a project and significantly lower during the data sharing phase. Perceptions of emerging practices including open access publishing and preregistration were largely positive, but demonstrated little adoption into current practice.

Introduction

Magnetic resonance imaging (MRI) is a popular and powerful neuroimaging technique for investigating the structure and function of the human brain. Functional MRI (fMRI), which enables researchers to assess activity in specific brain areas over time by measuring changes in blood oxygenation¹, has been particularly influential in clinical and cognitive neuroscience². Like their peers in social psychology³ and other data-intensive disciplines⁴, neuroimaging researchers have grappled with questions related to the rigor and reproducibility of their methods. As a result, there has been a substantial amount of discussion within the field about the need to foster open science practices including the regular sharing and reuse of research data⁵. However, it is unclear to what extent such practices have been adopted by the active research community.

Outside the laboratory, research data has also become an increasing focus for academic libraries. Though issues of rigor and reproducibility have been explicitly addressed in some library activities⁶, the majority of data-related library services are focused on research data management (RDM). Providing a single comprehensive definition of RDM is difficult due to the number of stakeholders involved, but the term generally encompasses topics related to how data and other research materials are documented, curated, and preserved⁷. Though the configuration of services varies considerably between institutions, library RDM initiatives generally emphasize skills training and assisting researchers in complying with data-related policies and mandates⁸. In this role, academic librarians, some with extensive research backgrounds in addition to their information science training, are able to contribute their expertise to active research projects. Major challenges for library RDM initiatives include the degree to which RDM-related practices and perceptions vary between research disciplines⁹ and change over time¹⁰. Potentially significant differences between researchers and librarians in their perceptions and priorities surrounding data have been less explored, but likely also represent substantial challenges. As a highly interdisciplinary field currently grappling with issues closely related to RDM, neuroimaging research involvin MRI represents an ideal case study for thoroughly examining how active researchers are currently managing and sharing their data.

The complexities inherent in collecting, analyzing, and disseminating MRI data underscore the necessity of establishing well-defined RDM practices in neuroimaging research. Even a relatively straightforward project involving MRI requires the management of data in a variety of forms from a variety of sources. In addition to the data collected using an MRI scanner, this may include sensitive medical information (e.g. pregnancy status, psychiatric and medical diagnoses), task-related behavioral data (e.g. response accuracy, reaction time), and questionnaire responses. Assessing, using, and replicating this work also requires access to documentation pertaining to participant characteristics, image acquisition and other scanning parameters, preprocessing and analysis procedures, as well as research materials including stimuli and custom code sets.

However, the importance of RDM to neuroimaging research extends beyond the need to save and organize multifarious sets of materials. The BOLD (Blood Oxygen Level Dependant) signal, which underlies the majority of functional MRI studies, has a complex origin¹¹, is potentially confounded by a number of physical, physiological, and behavioral variables¹², and requires careful interpretation¹³. The process of analyzing MRI data is also highly flexible, iterative, and statistically challenging, with decisions made at an early stage having significant downstream effects¹⁴. Operating system type, software versions, and even hardware architecture have also been shown to significantly influence analytical results¹⁵. Thus, extensive documentation and justification of data acquisition and analysis parameters is essential. Unfortunately, despite the publication of best practice guidelines¹¹, many articles describing the results of projects involving MRI omit essential details related to experimental design, data acquisition, and analysis procedures^17,18.

The rigor and reproducibility of neuroimaging research has been questioned due to a number of interrelated issues including reporting and publication biases in the scholarly literature^19,20, low levels of statistical power^21,22, the use of suboptimal design and analytical methods^23,24 and the recent discovery of errors in widely used software tools²⁵. Open data sharing has long been proposed as a way to address these and other issues^26,27, though early attempts such as the fMRI Data Center (fMRIDC) were met with skepticism and hampered by the demands involved in curating such large and complex datasets, an absence of requirements or incentives to make data available, and a lack of formalized standards about how neuroimaging data should be organized²⁸. However, the view of the broader research community has since shifted considerably The majority of researchers now appear to support the concept of sharing data^10,29 and other data stakeholders including scholarly publishers³⁰ and federal funding agencies³¹ have adopted a heterogeneous mix of data-related policies, mandates, and best practice recommendations. In parallel, a wide variety of tools and platforms have been developed to allow neuroimaging researchers to more easily manage and share MRI data.

Reflecting the iterative and flexible nature of how it is analyzed, MRI data is currently disseminated in a variety of forms. For example, tools like Neurosynth (http://neurosynth.org/) enable researchers to examine and compare peak activation coordinates reported in the neuroimaging literature while platforms such as Neurovault (https://neurovault.org/) and OpenfMRI (https://openfmri.org/) allow researchers to share the data in the form of unthresholded statistical maps and raw images respectively. Projects such as the Alzheimer’s Disease Neuroimaging Initiative (ADNI)³², the International Neuroimaging Data Sharing Initiative (INDI)³³, and the Autism Brain Imaging Data Exchange (ABIDE)³⁴ host large datasets related to clinical conditions and the National Institutes of Health sponsors the Connectome Coordination Facility (CCF) (https://www.humanconnectome.org/), which makes carefully collected, large-scale multimodal data available for reuse. The development of standardized organizational schemes such as the Brain Imaging Data Structure (BIDS)³⁵ and tools for constructing and distributing analytical pipelines (e.g. LONI³⁶, Nipype³⁷, BIDS Apps³⁸) enable researchers to not only share their data but also ensure that it can be navigated, assessed, and reproduced by others. However, while these developments have provided important infrastructure, addressing concerns related to rigor and reproducibility requires more than simply the adoption of new technologies. It also requires the refinement of a broad spectrum of behaviors and practices³⁹.

Rigorous and reproducible science begins in the laboratory. For data to be effectively shared, evaluated, and re-used, it must first be effectively documented, organized, saved, and prepared. Such activities are encapsulated in the FAIR (Findable, Accessible, Interoperable, and Re-usable) Data Principles, which were developed by an international community of researchers, librarians, funders, and publishers as guidelines for enhancing the reusability of research data⁴⁰. Though similar principles have been incorporated into recent neuroimaging-specific best practice recommendations⁴¹, the extent to which they translate into the day-to-day activities of active researchers remains unclear. Therefore, to inform efforts within both the neuroimaging and academic library communities to address rigor and reproducibility, we designed and distributed a survey examining RDM-related practices and perceptions among neuroimaging researchers working with MRI data.

Results

Participant Characteristics

A total of 144 participants from 11 countries and 69 institutions participated in this study. The majority of participants were from the United States (72.31%), the United Kingdom (10.00%), and Canada (6.92%). As shown in Table 1, participants were affiliated with a variety of research disciplines with the most common being cognitive neuroscience. Participants consisted of a mix of trainees (e.g. graduate students, postdoctoral fellows) and faculty (e.g. associate, assistant, and full professors).

View this table:

Table 1.

Characteristics of study participants. A total of 144 neuroimaging researchers participated in this study, though not every participant gave a response for every question. Participants were split between A. trainees and faculty and B. cognitive neuroscience and other research areas. All values listed are percentages.

The majority of participants (64.06%, N = 128) indicated that they receive funding from the National Institutes of Health (NIH). Other common sources of funding include private foundations (12.5%), the National Science Foundation (NSF) (11.72%), internal grants (including startup funds) (11.72%), the Department of Defense (DOD) (3.90%), and international funding bodies (19.53%). Though most participants (53.47%, N = 144) indicated that they generate documentation about how data is to be collected, organized, and secured over the course of a project, the heterogeneous data policies of these funding bodies makes it highly unlikely that this refers exclusively to a data management plan (DMP).

Participants indicated that they received their training about how to collect and analyze neuroimaging data from a variety of sources, including individuals within their own lab (e.g. other students, post-docs) (57.64%, N = 144), online resources and documentation (e.g. self-taught) (51.39%), individuals outside their lab (e.g. other people in their department) (35.42%), and formal courses at their institution (20.14%) or another institution (27.08%). A relatively small percentage of participants indicated that they had taken advantage of local university services related to data management (27.8%) or scholarly publishing (14.6%), though engagement with services related to technical infrastructure was considerably higher (45.1%). Instead, the majority indicated that such services were unavailable, that they were unsure about their existence, or that they were aware of their existence but had not taken advantage of them.

RDM Maturity Ratings

As shown in Figure 1, participants rated the overall maturity of their data management practices during both the data collection [t(128) = 6.349, p < 0.001] and data analysis [t(116) = 7.403, p < 0.001] phases of a project as significantly higher than those of the field as a whole. A similar trend was observed for the data sharing phase, but the comparison did not reach statistical significance [t(115) = 1.677, p < 0.096]. Average maturity ratings for the data sharing phase were significantly lower than those of the other phases for both individual practices [F(2, 226) = 70.61, p < 0.001] and the field as a whole F(2, 226) = 34.44, p < 0.001].

Figure 1.

Average RDM maturity ratings between (A) and within (B-D) three phases of an MRI research project. A. Participants rated their own RDM practices as significantly more mature than those of the field as a whole during the data collection and analysis phases. Ratings of both individual and field maturity were significantly lower during the data sharing phase than during data collection and analysis. [Data collection: N = 131 (individual), 130 (field), data analysis: N = 118 (individual/field), data sharing: 116 (individual/field)]. Ratings of individual activities within each phase reflected a similar trend. B. Practices related to the backup of raw data and securing of sensitive data were rated as highly mature during the data collection phase while the documentation of file organization schemes (such as through a lab notebook or data dictionary) received the lowest rating. [N = 132] C. Similarly, during the data analysis phase, the backup of analyzed data received the highest rating, while the documentation of decisions related to analytical pipelines and the use of computational tools received the lowest. [N =120] D. Activities described in the data sharing phase received lower ratings than those in previous phases. [N = 116]

Ratings of individual practices within each phase followed a similar pattern. Overall, ratings for individual practices during the data sharing phase were substantially lower than those during the data collection and analysis phases. Maturity ratings were highest for practices involved in ensuring the security of sensitive data and backing up data and lowest for those involved making data available to researchers outside of their research group. This focus on practical concerns was also evident when participants were asked about what motivates and limits their RDM practices. As shown in Table 2, participants reported that they were primarily motivated by a desire to prevent the loss of data, ensure everyone in their research group has access to data, and a desire to foster openness and reproducibility and limited by time and a lack of available training and best practices.

View this table:

Table 2.

Factors that limit and motivate RDM during the data collection, analysis, and sharing/publishing phases of a research project. All values listed are percentages, more than one response could be selected. In terms of limits, “Other” responses included changes in personnel, differences in expertise within a lab, differences in preferences between lab members, lack of top-down leadership, and concerns about future cost. For motivations, “other” responses included ensuring continuity following personnel changes, keeping track of analyses, error prevention, and maximizing efficiency. [Data collection: N = 125 (limits/motivations), Data analysis: N = 115 (limits), 120 (motivations), Data sharing: N = 112 (limits/motivations)].

Data Collection Practices

For the purposes of this survey, “data collection” was defined as activities starting with the collection of neuroimaging data at a scanning facility and continuing through to the organization and storage of data within the participant’s laboratory. Questions in this section of the survey dealt primarily with the types of data collected as well as procedures for moving, saving, and organizing raw data.

As expected, participants indicated that they collect and manage a wide range of research materials over the course of an MRI project. As shown in Table 3, this includes multiple types of MRI images, additional “non-MRI” data, and a variety of documentation, code, and other research materials related to data collection and analysis. When it comes to moving data from the scanning facility, the majority of participants indicated that they use a server to transfer their MRI data (82.58%, N = 132) and a hard drive for non-MRI data (55.30%). Once data is in the lab, participants indicated that it is primarily organized using standardized file structures (70.45%, N = 132) and file names (67.42%). Less common were the use of formal lab notebooks (47.73%), databases (28.79%), or the admission that procedures are generally not documented (17.42%). The majority of participants indicated that practices related to data organization were consistent within their lab or research group. However, trainees were significantly less likely to endorse this consistency than faculty members [X²(2, N = 132) = 13.49, p < 0.01]. A similar trend was evident when participants were asked about backup procedures. The majority of participants (73.5%, N = 132) indicated that their scanning facility maintains backups of MRI data and that they themselves backup data using a wide variety of means including servers operated by their lab (41.67%, N =132) or institution (37.88%), manual (30.30%) and automatic (21.97%) backups of local machines, and external hard drives (25.76%). Though the majority of participants (54.2%) indicated that backup procedures were consistent within their lab or research group, trainees were less likely to endorse this than faculty [X²(2, N = 131) = 7.28, p < 0.05].

View this table:

Table 3.

Types of data collected fell into three categories: MRI data, non-MRI data, and study information. All values listed are percentages, multiple data types could be selected. A. For MRI data, common “other” responses included spectroscopy, diffusion, blood flow, and MRS. B. For non-MRI data, common “Other responses included motion tracking, neurophysiology measures, and hormones (saliva). C. For study information, common “other” responses included scanner quality assurance data, information about the scanner itself, and consent forms.

Data Analysis Practices

For the purposes of this survey, “data analysis” was defined as activities starting with preprocessing (co-registration, motion correction, etc) of MRI data and proceeding through first and second level analyses. Questions in this section of the survey dealt primarily with the use of software tools and the documentation of analytical decisions and parameters.

Overall, participants indicated that they use a wide variety of tools to analyze their MRI and non-MRI data. While there are several commonly used tools, as shown in Table 4, there is also a long tail of tools used by a relatively small number. Only 13.33% of participants (N = 120) indicated that they process data from each subject individually using a GUI. Instead, the majority of participants indicated that their preprocessing is scripted using their own scripts (64.17%), scripts adapted from others (58.33%), or scripts written by others without adaptation (15.0%). A majority of participants indicated that everyone in their lab or research group uses the same tools to analyze MRI data (40% same version, 25% different versions). However, trainees were significantly less likely to endorse this than faculty [X²(3, N = 120) = 25.4, p < 0.001]. Analysis of non-MRI software tools yielded similar results, though only 43.3% (23.3% same version, 20% different versions) of participants indicated that the application of these tools was consistent. Again, differences between trainees and faculty were statistically significant [X²(2, N = 120) = 14.90, p < 0.01]

View this table:

Table 4.

Software used for analysis: A. MRI-specific software (top 10 most popular) and B. non-MRI-specific software. All values listed are percentages, multiple software tools could be selected. “Other” MRI-specific software included Nipype (5.00%), custom code (4.17%), ANTS (4.17%), FMRIprep (2.50%), NiPy (1.17%), ITK-SNAP (1.17%), Connectome Workbench (1.17%), MRIQC (1.17%), CIVET, C-PAC, DPARSF, GIFT, ExploreDTI, CAT, SPHARM, TBSS, fidl, PLS, SamSrF, Vistasoft, and MedIRNIA. “Other” non-MRI-specific software included Acknowledge, CIGAL, Fscan, Data Desk, Mplus, Octave, Stan, and Bash.

Participants indicated that they generally document their activities (including quality checks, pre-processing parameters, and the results of first/higher level analysis) using a word processing program (e.g. Evernote, Microsoft Word) (56.67%, N = 120), readme files (42.5%), and, to a lesser extent, version control systems (e.g. Git) (25.83%), electronic lab notebooks (e.g. Jupyter) (19.17%), active data management plans (4.17%), and lab management tools (e.g. LabGuru, Open Science Framework) (2.5%). Unfortunately, 10.83% of participants indicated that they do not document their activities in any systematic way. The majority of participants (74.6%) acknowledged that not everyone uses the same system for documenting their activities and differences between trainees and faculty were again statistically significant [X²(2, N = 118) = 17.55, p < 0.001].

When asked if another researcher could recreate their work using only its documentation, 59.2% of participants (N = 120) indicated that they would be able to recreate both their preprocessing and analysis steps, 11.5% indicated that other researchers would be able to recreate one or the other, and 19.1% were either unsure or believed that they would need to be present.

Data sharing Practices

For the purpose of this survey, “data sharing” was defined as activities involving the communication or publication of conclusions drawn from neuroimaging data in a scholarly presentation or publication as well as the sharing of the underlying data itself through a general or discipline-specific repository. Questions in this section of the survey dealt primarily with the means and motivations for making data, code, and other materials available to other researchers.

As shown in Table 5, participants generally indicated that they were motivated to share data by a desire to foster research transparency and reproducibility rather than by professional incentives or the need to fulfill mandates. Whhen asked about reasons they may not be able to share their data, the most common responses were that it may contain additional findings to be discovered or published and that it contains confidential or sensitive information. Half (50.0%, N = 116) of participants indicated that they had not been required to share data or submit a data availability statement when publishing a journal article. However, significantly more faculty (50.88%) indicated that they had encountered such a requirement than trainees (28.81%) [X²(3, N = 116) = 10.52, p < 0.05]. Similarly, while the majority of participants indicated that they have not requested data from an author of a journal article (56.03%) or received such a request themselves (55.17%), significantly more faculty reported receiving a request for their data than trainees [X²(2, N = 116) = 21.62, p < 0.001].

View this table:

Table 5.

Reasons why data can and cannot be shared. All values listed are percentages, more than one reason could be selected. Other reasons given include: Consent (5), laziness, afraid of mishandling, projects that are haphazard.

Participants indicated that a large number of their research materials should be preserved over the long term (see Table 6) and generally reporting saving materials for eight years or more (29.31% maintained so that is always accessible, 40.51% saved in formats that may become obsolete). When asked if another researcher could recreate their work using only its description in a publication or scholarly report, 64.38% (N = 115) indicated that they would be able to recreate both their preprocessing and analysis steps, 9.57% indicated that other researchers would be able to recreate one or the other, and 26.09% were either unsure or believed that they would need to be present.

View this table:

Table 6.

Important parts of data to preserve long term. All values listed are percentages, multiple data types could be selected. Overall, researchers want to preserve nearly all data long term. Other data types indicated to preserve include: code for analysis (3) and hormone information.

Emerging Research Practices

The majority of participants (56.64%, N = 113) indicated that they currently regard data as a “first class” research product, meaning a product that should be assessed, valued, and considered as part of application and promotion decisions in the same way as a journal article. As shown in Figure 2, this is broadly indicative of that fact that the MRI research community is currently at a point of transition. While only a small percentage of researchers indicated that they have adopted emerging research practices such as pre-registering their studies, conducting replications, publishing preprints, or publishing research products such as code, datasets, and grant proposals, a substantial percentage indicated that they plan to in the future.

Figure 2.

Adoption of emerging research practices among neuroimaging researchers. [N = 100]

Discussion

In order to inform efforts within the neuroimaging and academic library communities to address issues of rigor and reproducibility, we surveyed the RDM-related practices and perceptions of researchers who use magnetic resonance imaging (MRI) to study human neuroscience. Overall, our results highlight the considerable challenges involved in properly managing and sharing neuroimaging data - the data is acquired in multifarious forms, transformed and analyzed using a wide variety of tools, and documented inconsistently. Our results also demonstrate that neuroimaging researchers generally receive informal training in data-related practices, have little interaction with institutional data support services, and presently encounter few expectations from data stakeholders such as scholarly publishers and research funding bodies.

Neuroimaging is not unique in facing challenges related to rigor and reproducibility. Issues such as publication bias^42,43 and low statistical power^44,45 have been discussed in the behavioral and biomedical sciences for decades and data stakeholders including scholarly publishers and federal funding agencies have instituted a range of reproducibility-related policies stipulating how the data underlying published work should be managed and shared. For example, while mandates requiring authors to share the data underlying publications have been shown to increase the degree to which data is made available⁴⁶, only a minority of biomedical journals have such requirements and even fewer provide specific guidance as to how to make data available and reusable⁴⁷. Federal funding bodies generally exercise their RDM-related policies by requiring that a data management plan (DMP), which outlines how data is going to be collected, organized, preserved, and shared, be submitted as part of any grant proposal31. The efficacy of DMPs in affecting how researchers actually manage and share their data in practice is unclear⁴⁸ and it is notable that the NIH, which was the most prevalent funder in our sample, does not currently have a DMP requirement. However, as evidenced by the imminent pilot of an NIH Data Commons and the recent controversy related to the reclassification of behavioral and imaging studies as clinical trials⁴⁹, funder data policies will likely soon begin affecting neuroimaging researchers.

Neuroimaging is also not unique in how it has addressed challenges related to rigor and reproducibility. Communities throughout neuroscience working with specific methodologies (e.g. neurophysiology⁵⁰, cell morphology⁵¹) or data from specific model systems (e.g. C. elegans⁵²) have begun to develop standards, tools, and community norms to facilitate the management, curation, and sharing of data and other research materials. Like complementary initiatives across other disciplines⁵³, these grassroots efforts have the potential to ensure that effective RDM practices are incorporated throughout the course of a research project rather than simply deployed at discrete points in response to mandates from a funder or publisher. By assessing current RDM practices in neuroimaging, the present study adds crucial context to efforts aimed at advancing rigor and reproducibility.

By collecting and analyzing quantitative ratings of RDM maturity, which we operationalized as the degree to which data-related procedures are defined and implemented, we were able to quantify how active neuroimaging researchers perceive their own practices and the practices of the field as a whole. There are several interpretations of our observation that participants generally rated their own practices as more mature than those of the field as a whole. It is possible that this observation reflects the well known phenomenon of participants rating themselves as better than average across a wide range of personal characteristics⁵⁴. Given that this study was primarily disseminated via social media and through a number of scholarly discussion groups where there is a great deal of discussion related to research methodology, open science, and reproducibility, it is also possible that our sample was indeed biased in favor of participants who incorporate RDM into their work to a greater degree than average. Our finding that maturity ratings were significantly lower for the data sharing/publishing phase is in line with the lack of existing data sharing requirements, the propensity of researchers to share data via personal communication, and the centering of data sharing as a way to address issues of rigor and reproducibility. However, it is also at odds with the fact that our results indicate that there is ample room for improving RDM practices during the data collection and analysis phases.

By asking participants about their RDM-related activities using language and terminology familiar to them, we were able to construct a comprehensive picture of how neuroimaging researchers handle their data over the course of a project. Given the preponderance of informal methodological training in neuroimaging and results of previous studies examining methods reporting in the extant literature19,20, we expected that participants would report applying a wide variety of practices and tools to their data. Our results bore this out and also revealed that trainees and faculty members had significantly different perspectives on the degree to which backup procedures, data structures, analytical tools, and documentation practices are consistent within their lab or research group. While our methods do not allow us to speculate about the cause of these differences, their ubiquity indicates that there is not an optimal amount of communication about the importance of RDM even within individual research groups or projects.

The spread of increasingly high resolution imaging hardware, the rapid evolution of experimental approaches and analytical techniques, and the community development of user-friendly software tools have enabled neuroimaging researchers using MRI to make significant contributions across the behavioral and biomedical sciences. However, our results demonstrate there is an outstanding need for training related to research data management. This issue is also not unique to neuroimaging. In a recent survey, PI’s from across the National Science Foundation (NSF)’s biological sciences directorate listed training on data management as one of their foremost unmet needs⁵⁵. Though topics related to RDM are often included in undergraduate and graduate level coursework, many educators report that they are not covered thoroughly due to a lack of time, expertise, and guidance⁵⁶. These trends highlight the need for greater collaboration between researchers who possess expertise in collecting, analyzing, and evaluating data and support providers in academic libraries who have expertise in its management and sharing. Because our results demonstrate that RDM activities among neuroimaging researchers are, at least at present, generally motivated by immediate considerations such as ensuring data is not lost and ensuring that it is accessible to the members of a particular research group, a potentially fruitful direction for such a collaboration could be the development of training materials that provide actionable information about how data could be effectively documented, organized, and saved throughout the course of a research project and also illustrate how such activities are an important component of addressing broader concerns related to rigor and reproducibility.

Though the present study offers unique insight into the data management practices of neuroimaging researchers, these results should not be interpreted as a criticism or singling-out of the field. Follow-up research will explore RDM practices and perceptions in cognate research areas such as psychology and biomedical science and it is likely that many of the same trends including informal education in effective RDM, inconsistency even within the same research group, and slow adoption of open science tools and practices will be observed.

Methods

Our survey consisted of 74 multiple choice questions and an optional open response question. Questions focused on a range of data management-related topics, including types of data collected (including MRI data, non-MRI data, and related documentation), tools used to manage and analyze data, and the degree to which data management practices are standardized within the participant’s research group. The survey was distributed using the Qualtrics platform (http://www.qualtrics.com) between June and September, 2017. Participants were able to skip questions while proceeding through the survey. All study procedures were approved by the institutional review board of Carnegie Mellon University (Study 2017 00000129). Data were analyzed using JASP⁵⁷.

Survey Design

The process of survey design drew upon expertise from both the academic library and neuroimaging communities. The structure of the survey, which generally follows the trajectory of a typical MRI project, drew from the research data lifecycle - a model that has been widely adopted by data support providers in academic libraries to organize activities related to the management of data over the course of a research project⁵⁸. Building on similarly-structured tools, such as the data curation profiles⁵⁹, survey questions were developed in consultation with researchers actively working in the field to ensure that each question was tailored to the specific terminology, practices, and tools currently employed by the MRI research community.

For questions referencing data management maturity, we drew upon the capability maturity model framework⁶⁰ which describes activities based on their degree of definition, standardization, and optimization. For the purposes of this study, RDM maturity was defined as the extent to which data management practices are clearly defined, implemented, and (if applicable) optimized. Introductory text also clarified that the survey was not designed to judge researchers who have different styles of data management or whose practices exhibit different levels of sophistication. Though capability models specific to the management of scientific data have been developed⁶¹, to our knowledge this is the first survey to apply this framework as a means to collect quantitative maturity ratings from the research community itself.

Because we believed that participants would come to our survey with different perspectives on RDM-related topics and terms, each section of the survey was preceded by a brief description of the specific activities and practices covered in that section as well as an operational definition of data management maturity.

Distribution and Filtering Criteria

Recruitment emails were sent to the directors of neuroimaging centers and other MRI facilities affiliated with universities and other research institutions. The survey was also advertised through social media and via psychology, neuroscience, and neuroimaging discussion groups. In order to capture a broad view of data management-related practices and perceptions inclusion criteria were only that potential participants be an active researcher using MRI, that they were over the age of 18, and that they consented to participate in our study. Data from participants who did not meet these criteria or who completed less than a single section of the survey were excluded from subsequent analyses.

Data Availability

The survey instrument⁶² and resulting dataset (excluding personally identifying information)⁶³ are both available via figshare.

Author contributions statement

J.B. and A.V. jointly conceived the study, designed the survey, analyzed the results, and wrote the manuscript.

Competing financial interests

The authors declare no competing financial interests.

Acknowledgements

This work was partially funded by a Berkman Faculty Development Grant awarded to A.V. by Carnegie Mellon University. While conducting the work described in this publication, J.B. was funded as both a CLIR Software Curation Fellow (Alfred P. Sloan Foundation #G-2015-14112) and an RDA Data Share Fellow (Alfred P. Sloan Foundation #G-2014-13746, National Science Foundation NSF ACI #1349002). The authors would like to thank Yael Isler and John Pyles for their helpful comments throughout the research process and J.B. Poline and Russ Poldrack for their suggestions related to the survey instrument.

Footnotes

↵* anavangulick{at}cmu.edu

Work Cited

1.↵
Logothetis, N. K. (2008). What we can do and what we cannot do with fMRI. Nature, 453(7197), 869–878. http://doi.org/10.1038/nature06976
OpenUrl CrossRef PubMed Web of Science
2.↵
Poldrack, R. A., & Farah, M. J. (2015). Progress and challenges in probing the human brain. Nature, 526(7573), 371–379. http://doi.org/10.1038/nature15692
OpenUrl CrossRef PubMed
3.↵
Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. http://doi.org/10.1126/science.aac4716
OpenUrl Abstract/FREE Full Text
4.↵
Ioannidis, J. P. A. (2005). Why most published research findings are false. PLOS Medicine, 2(8), 0696–0701. http://doi.org/10.1371/journal.pmed.0020124
OpenUrl
5.↵
Poldrack, R. A., Baker, C. I., Durnez, J., Gorgolewski, K. J., Matthews, P. M., Munafò, M. R., … Yarkoni, T. (2017). Scanning the horizon: towards transparent and reproducible neuroimaging research. Nature Reviews Neuroscience, 18(2), 115–126. http://doi.org/10.1038/nrn.2016.167
OpenUrl CrossRef PubMed
6.↵
Sayre, F., & Riegelman, A. (2018). The reproducibility crisis and academic libraries. College & Research Libraries, 79(1), 2–9. http://doi.org/10.5860/crl.79.1.2
OpenUrl
7.↵
Flores, J. R., Brodeur, J. J., Daniels, M. G., Nicholls, N., & Turnator, E. (2015). Libraries and the research data management landscape. The process of discovery. In J. C. Maclachlan, E. A. Waraksa, & C. Williford (Eds.) The CLIR Postdoctoral Fellowship Program and the Future of the Academy, (pp.82–102). Washington, DC: Council on Library and Information Resources. Retrieved from http://www.clir.org/pubs/reports/pub167/RDM.pdf
8.↵
Tenopir, C., Sandusky, R. J., Allard, S., & Birch, B. (2014). Research data management services in academic research libraries and perceptions of librarians. Library and Information Science Research, 36(2), 84–90. http://doi.org/10.1016/j.lisr.2013.11.003
OpenUrl CrossRef
9.↵
Parham, S. W., Carlson, J., Hswe, P., Westra, B., & Whitmire, A. (2016). Using data management plans to explore variability in research data management practices across domains. International Journal of Digital Curation, 11(1), 53–67. http://doi.org/10.2218/ijdc.v11i1.423
OpenUrl
10.↵
Tenopir, C., Dalton, E. D., Allard, S., Frame, M., Pjesivac, I., Birch, B., … Dorsett, K. (2015). Changes in data sharing and data reuse practices and perceptions among scientists worldwide. PLOS ONE, 10(8), e0134826. http://doi.org/10.1371/journal.pone.0134826
OpenUrl CrossRef PubMed
11.↵
Hillman, E. M. C. (2014). Coupling mechanisms and significance of BOLD signal: A status report. Annual Review of Neuroscience, 37, 161–181. http://doi.org/10.1146/annurev-neuro-071013-014111.Coupling
OpenUrl CrossRef PubMed
12.↵
Murphy, K., Birn, R. M., & Bandettini, P. A. (2013). Resting-state fMRI confounds and cleanup. NeuroImage, 80(Supplement C), 349–359. http://doi.org/10.1016/j.neuroimage.2013.04.001
OpenUrl CrossRef PubMed Web of Science
13.↵
Poldrack, R. A. (2006). Can cognitive processes be inferred from neuroimaging data? Trends in Cognitive Sciences, 10(2), 59–63. http://doi.org/10.1016/j.tics.2005.12.004
OpenUrl CrossRef PubMed Web of Science
14.↵
Carp, J. (2012). On the plurality of (methodological) worlds: Estimating the analytic flexibility of fmri experiments. Frontiers in Neuroscience, 6, 1–13. http://doi.org/10.3389/fnins.2012.00149
OpenUrl
15.↵
Gronenschild, E. H. B. M., Habets, P., Jacobs, H. I. L., Mengelers, R., Rozendaal, N., van Os, J., & Marcelis, M. (2012). The effects of FreeSurfer version, workstation type, and Macintosh operating system version on anatomical volume and cortical thickness measurements. PLOS ONE, 7 (6). e38234. http://doi.org/10.1371/journal.pone.0038234
OpenUrl CrossRef PubMed
16.
Poldrack, R. A., Fletcher, P. C., Henson, R. N., Worsley, K. J., Brett, M., & Nichols, T. E. (2008). Guidelines for reporting an fMRI study. NeuroImage, 40(2), 409–414. http://doi.org/10.1016/j.neuroimage.2007.11.048
OpenUrl CrossRef PubMed Web of Science
17.↵
Carp, J. (2012). The secret lives of experiments: Methods reporting in the fMRI literature. NeuroImage, 63(1), 289–300. http://doi.org/10.1016/j.neuroimage.2012.07.004
OpenUrl CrossRef PubMed Web of Science
18.↵
Guo, Q., Parlar, M., Truong, W., Hall, G., Thabane, L., McKinnon, M., … Pullenayegum, E. (2014). The reporting of observational clinical functional magnetic resonance imaging studies: A systematic review. PLOS ONE, 9(4), e94412. http://doi.org/10.1371/journal.pone.0094412
OpenUrl CrossRef PubMed
19.↵
David, S. P., Ware, J. J., Chu, I. M., Loftus, P. D., Fusar-Poli, P., Radua, J., … Ioannidis, J. P. A. (2013). Potential reporting bias in fMRI studies of the brain. PLOS ONE, 8(7), e70104. http://doi.org/10.1371/journal.pone.0070104
OpenUrl CrossRef PubMed
20.↵
Jennings, R. G., & Van Horn, J. D. (2012). Publication bias in neuroimaging research: Implications for meta-analyses. Neuroinformatics, 10(1), 67–80. http://doi.org/10.1007/s12021-011-9125-y
OpenUrl CrossRef PubMed Web of Science
21.↵
Button, K. S., Ioannidis, J. P. A., Mokrysz, C., Nosek, B. A., Flint, J., Robinson, E. S. J., & Munafò, M. R. (2013). Power failure: why small sample size undermines the reliability of neuroscience. Nature Reviews Neuroscience, 14(5), 365–376. http://doi.org/10.1038/nrn3475
OpenUrl CrossRef PubMed
22.↵
Cremers, H. R., Wager, T. D., & Yarkoni, T. (2017). The relation between statistical power and inference in fMRI. PLOS ONE, 12(11), e0184923. http://doi.org/10.1371/journal.pone.0184923
OpenUrl CrossRef
23.↵
Bennett, C. M., Miller, M. B., & Wolford, G. L. (2009). Neural correlates of interspecies perspective taking in the post-mortem Atlantic Salmon: an argument for multiple comparisons correction. NeuroImage, 47(Supplement 1), S125. http://doi.org/10.1016/S1053-8119(09)71202-9
OpenUrl
24.↵
Vul, E., Harris, C., Winkielman, P., & Pashler, H. (2009). Puzzlingly high correlations in fMRI studies of emotion, personality, and social cognition. Perspectives on Psychological Science, 4(3), 274–290. http://doi.org/10.1111/j.1745-6924.2009.01125.x
OpenUrl CrossRef PubMed Web of Science
25.↵
Eklund, A., Nichols, T. E., & Knutsson, H. (2016). Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates. Proceedings of the National Academy of Sciences, 113(28), 7900–7905. http://doi.org/10.1073/pnas.1602413113
OpenUrl Abstract/FREE Full Text
26.↵
Koslow, S. H. (2000). Should the neuroscience community make a paradigm shift to sharing primary data? Nature Neuroscience, 3(9), 863–865. http://doi.org/10.1038/78760
OpenUrl CrossRef PubMed Web of Science
27.↵
Van Horn, J. D., Grafton, S. T., Rockmore, D., & Gazzaniga, M. S. (2004). Sharing neuroimaging studies of human cognition. Nature Neuroscience, 7(5), 473–481. http://doi.org/10.1038/nn1231
OpenUrl CrossRef PubMed Web of Science
28.↵
Van Horn, J. D., & Gazzaniga, M. S. (2013). Why share data? Lessons learned from the fMRIDC. NeuroImage, 82(Supplement C), 677–682. http://doi.org/10.1016/j.neuroimage.2012.11.010
OpenUrl CrossRef PubMed Web of Science
29.↵
Federer, L. M., Lu, Y.-L., Joubert, D. J., Welsh, J., & Brandys, B. (2015). Biomedical data sharing and reuse: Attitudes and practices of clinical and scientific research staff. PLOS ONE, 10(6), e0129506. http://doi.org/10.1371/journal.pone.0129506
OpenUrl CrossRef PubMed
30.↵
Piwowar, H. A., & Chapman, W. W. (2008). Identifying data sharing in biomedical literature. AMIA Annual Symposium Proceedings, 2008, 596–600. Retrieved from http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2655927
OpenUrl
31.↵
Kriesberg, A., Huller, K., Punzalan, R., & Parr, C. (2017). An analysis of federal policy on public access to scientific research data. Data Science Journal, 16, 27. http://doi.org/10.5334/dsj-2017-027
OpenUrl
32.↵
Mueller, S. G., Weiner, M. W., Thal, L. J., Petersen, R. C., Jack, C. R., Jagust, W., … Beckett, L. (2005). Ways toward an early diagnosis in Alzheimer’s disease: The Alzheimer’s Disease Neuroimaging Initiative (ADNI). Alzheimer’s and Dementia, 1(1), 55–66. http://doi.org/10.1016/j.jalz.2005.06.003
OpenUrl CrossRef
33.↵
Mennes, M., Biswal, B., Castellanos, F. X., & Milham, M. P. (2013). Making data sharing work: The FCP/INDI experience. NeuroImage, 82, 683–691. http://doi.org/10.1016/j.neuroimage.2012.10.064
OpenUrl
34.↵
Di Martino, A., Yan, C. G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., … Milham, M. P. (2014). The autism brain imaging data exchange: Towards a large-scale evaluation of the intrinsic brain architecture in autism. Molecular Psychiatry, 19(6), 659–667. http://doi.org/10.1038/mp.2013.78
OpenUrl CrossRef PubMed Web of Science
35.↵
Gorgolewski, K. J., Auer, T., Calhoun, V. D., Craddock, R. C., Das, S., Duff, E. P., … Poldrack, R. A. (2016). The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Scientific Data, 3, 160044. http://doi.org/10.1038/sdata.2016.44
OpenUrl CrossRef
36.↵
Rex, D. E., Ma, J. Q., & Toga, A. W. (2003). The LONI pipeline processing environment. NeuroImage, 19(3), 1033–1048. http://doi.org/10.1016/S1053-8119(03)00185-X
OpenUrl CrossRef PubMed Web of Science
37.↵
Gorgolewski, K., Burns, C. D., Madison, C., Clark, D., Halchenko, Y. O., Waskom, M. L., & Ghosh, S. S. (2011). Nipype: A flexible, lightweight and extensible neuroimaging data processing framework in Python. Frontiers in Neuroinformatics, 5. http://doi.org/10.3389/fninf.2011.00013
38.↵
Gorgolewski, K. J., Alfaro-Almagro, F., Auer, T., Bellec, P., Capotă, M., Chakravarty, M. M., … Poldrack, R. A. (2017). BIDS apps: Improving ease of use, accessibility, and reproducibility of neuroimaging data analysis methods. PLOS Computational Biology, 13(3), e1005209. http://doi.org/10.1371/journal.pcbi.1005209
OpenUrl CrossRef
39.↵
Ioannidis, J. P. A. (2014). How to make more published research true. PLOS Medicine, 11(10). e1001747. http://doi.org/10.1371/journal.pmed.1001747
OpenUrl
40.↵
Wilkinson, M. D., Dumontier, M., Aalbersberg, Ij. J., Appleton, G., Axton, M., Baak, A., … Mons, B. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3, 160018. http://doi.org/10.1038/sdata.2016.18
OpenUrl CrossRef
41.↵
Nichols, T. E., Das, S., Eickhoff, S. B., Evans, A. C., Glatard, T., Hanke, M., … Yeo, B. T. T. (2017). Best practices in data analysis and sharing in neuroimaging using MRI. Nature Neuroscience, 20(3), 299–303. http://doi.org/10.1038/nn.4500
OpenUrl CrossRef PubMed
42.↵
Dickersin, K., Chan, S., Chalmers, T. C., Sacks, H. S., & Smith, H. (1987). Publication bias and clinical trials. Controlled Clinical Trials, 8, 343–353. http://doi.org/10.1016/0197-2456(87)90155-3
OpenUrl CrossRef PubMed Web of Science
43.↵
Sterling, T. D. (1959). Publication decisions and their possible effects on inferences drawn from tests of significance—or vice versa. Journal of the American Statistical Association, 54(285), 30–34. http://doi.org/10.1080/01621459.1959.10501497
OpenUrl CrossRef Web of Science
44.↵
Cohen, J. (1962). The statistical power of abnormal-social psychological research: A review. The Journal of Abnormal and Social Psychology, 65(3), 145–153. http://doi.org/10.1037/h0045186
OpenUrl
45.↵
Freiman, J. A., Chalmers, T. C., Smith, H., & Kuebler, R. R. (1978). The importance of beta, the type II error and sample size in the design and interpretation of the randomized control trial. New England Journal of Medicine, 299(13), 690–694. http://doi.org/10.1056/NEJM197809282991304
OpenUrl CrossRef PubMed Web of Science
46.↵
Vines, T., Andrew, R., Bock, D., Franklin, M., Gilbert, K., Kane, N., … Yeaman, S. (2013). Mandated data archiving grately improves access to research data. The FASEB Journal, 27(4), 1304-1308. http://doi.org/10.5061/dryad.6bs31
OpenUrl CrossRef PubMed
47.↵
Vasilevsky, N. A., Minnier, J., Haendel, M. A., & Champieux, R. E. (2017). Reproducible and reusable research: are journal data sharing policies meeting the mark? PeerJ, 5, e3208. http://doi.org/10.7717/peerj.3208
OpenUrl CrossRef
48.↵
Van Tuyl, S., & Whitmire, A. L. (2016). Water, water, everywhere: Defining and assessing data sharing in academia. PLOS ONE, 11(2), e0147942. http://doi.org/10.1371/journal.pone.0147942
OpenUrl CrossRef
49.↵
Wolfe, J. M., & Kanwisher, N. G. (2018). Not your parent’s NIH clinical trial. Nature Human Behaviour, 2, 107-109. http://doi.org/10.1038/s41562-017-0262-7
OpenUrl
50.↵
Teeters, J. L., Godfrey, K., Young, R., Dang, C., Friedsam, C., Wark, B., … Sommer, F. T. (2015). Neurodata Without Borders: Creating a common data format for neurophysiology. Neuron, 88(4), 629–634. http://doi.org/10.1016/j.neuron.2015.10.025
OpenUrl CrossRef PubMed
51.↵
Ascoli, G. A., Donohue, D. E., & Halavi, M. (2007). NeuroMorpho.Org: A central resource for neuronal morphologies. Journal of Neuroscience, 27(35), 9247–9251. http://doi.org/10.1523/JNEUROSCI.2055-07.2007
OpenUrl FREE Full Text
52.↵
Lee, R. Y. N., Howe, K. L., Harris, T. W., Arnaboldi, V., Cain, S., Chan, J., … Sternberg, P. W. (2017). WormBase 2017: molting into a new stage. Nucleic Acids Research, 46(D1), D869–D874. http://doi.org/10.1093/nar/gkx998
OpenUrl
53.↵
Munafò, M. R., Nosek, B. A., Bishop, D. V. M., Button, K. S., Chambers, C. D., Percie du Sert, N., … Ioannidis, J. P. A. (2017). A manifesto for reproducible science. Nature Human Behaviour, 1(1), 0021. http://doi.org/10.1038/s41562-016-0021
OpenUrl
54.↵
Dunning, D., Heath, C., & Suls, J. M. (2004). Flawed self-assessment implications for health, education, and the workplace. Psychological Science in the Public Interest, 5(3), 69–106. http://doi.org/10.1111/j.1529-1006.2004.00018.x
OpenUrl CrossRef PubMed
55.↵
Barone, L., Williams, J., & Micklos, D. (2017). Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators. PLOS Computational Biology, 13(10), e1005755. http://doi.org/10.1371/journal.pcbi.1005755
OpenUrl
56.↵
Tenopir, C., Allard, S., Sinha, P., Pollock, D., Newman, J., Dalton, E., … Baird, L. (2016). Data Management Education from the Perspective of Science Educators. International Journal of Digital Curation, 11(1), 232–251. article. http://doi.org/10.2218/ijdc.v11i1.389
OpenUrl CrossRef
57.↵
JASP Team. (2017). JASP (Version 0.8.2).
58.↵
Carlson, J. (2014). The use of lifecycle models in developing and supporting data services. In J. M. Ray (Ed.), Research data management: Practical strategies for information professionals (pp. 63–86). West Lafayette, Indiana: Purdue University Press.
59.↵
Witt, M., Carlson, J., Brandt, D. S., & Cragin, M. H. (2009). Constructing data curation profiles. International Journal of Digital Curation, 4(3), 93-103 http://doi.org/10.2218/ijdc.v4i3.117
OpenUrl
60.↵
Paulk, M. C., Curtis, B., Chrissis, M. B., & Weber, C. V. (1993). Capability maturity model, version 1.1. IEEE Software, 10(4), 18–27. http://doi.org/10.1109/52.219617
OpenUrl CrossRef Web of Science
61.↵
Crowston, K., & Qin, J. (2011). A capability maturity model for scientific data management: Evidence from the literature. Proceedings of the American Society for Information Science and Technology, 48(1), 1–9. http://doi.org/10.1002/meet.2011.14504801036
OpenUrl CrossRef
62.↵
Borghi, J., & Van Gulick, A.. (2018). Survey instrument to assess the research data management practices and perceptions of MRI researchers (Version 1). figshare. https://doi.org/10.1184/R1/5845212.v1
63.↵
Borghi, J., & Van Gulick, A.. (2018). Survey data on research data management practices and perceptions of MRI researchers (Version 1). figshare. https://doi.org/10.1184/R1/5845656.v1

View the discussion thread.

Posted February 18, 2018.

Download PDF

Citation Tools

Subject Area

Neuroscience

Subject Areas

All Articles

Animal Behavior and Cognition (5210)
Biochemistry (11739)
Bioengineering (8750)
Bioinformatics (29189)
Biophysics (14967)
Cancer Biology (12093)
Cell Biology (17409)
Clinical Trials (138)
Developmental Biology (9419)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18301)
Genetics (12238)
Genomics (16797)
Immunology (11865)
Microbiology (28068)
Molecular Biology (11583)
Neuroscience (60953)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4957)
Plant Biology (10425)
Scientific Communication and Education (1683)
Synthetic Biology (2884)
Systems Biology (7338)
Zoology (1651)

[1] 1.↵
Logothetis, N. K. (2008). What we can do and what we cannot do with fMRI. Nature, 453(7197), 869–878. http://doi.org/10.1038/nature06976
OpenUrl CrossRef PubMed Web of Science

[2] 2.↵
Poldrack, R. A., & Farah, M. J. (2015). Progress and challenges in probing the human brain. Nature, 526(7573), 371–379. http://doi.org/10.1038/nature15692
OpenUrl CrossRef PubMed

[3] 3.↵
Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. http://doi.org/10.1126/science.aac4716
OpenUrl Abstract/FREE Full Text

[4] 4.↵
Ioannidis, J. P. A. (2005). Why most published research findings are false. PLOS Medicine, 2(8), 0696–0701. http://doi.org/10.1371/journal.pmed.0020124
OpenUrl

[5] 5.↵
Poldrack, R. A., Baker, C. I., Durnez, J., Gorgolewski, K. J., Matthews, P. M., Munafò, M. R., … Yarkoni, T. (2017). Scanning the horizon: towards transparent and reproducible neuroimaging research. Nature Reviews Neuroscience, 18(2), 115–126. http://doi.org/10.1038/nrn.2016.167
OpenUrl CrossRef PubMed

[6] 6.↵
Sayre, F., & Riegelman, A. (2018). The reproducibility crisis and academic libraries. College & Research Libraries, 79(1), 2–9. http://doi.org/10.5860/crl.79.1.2
OpenUrl

[7] 7.↵
Flores, J. R., Brodeur, J. J., Daniels, M. G., Nicholls, N., & Turnator, E. (2015). Libraries and the research data management landscape. The process of discovery. In J. C. Maclachlan, E. A. Waraksa, & C. Williford (Eds.) The CLIR Postdoctoral Fellowship Program and the Future of the Academy, (pp.82–102). Washington, DC: Council on Library and Information Resources. Retrieved from http://www.clir.org/pubs/reports/pub167/RDM.pdf

[8] 8.↵
Tenopir, C., Sandusky, R. J., Allard, S., & Birch, B. (2014). Research data management services in academic research libraries and perceptions of librarians. Library and Information Science Research, 36(2), 84–90. http://doi.org/10.1016/j.lisr.2013.11.003
OpenUrl CrossRef

[9] 9.↵
Parham, S. W., Carlson, J., Hswe, P., Westra, B., & Whitmire, A. (2016). Using data management plans to explore variability in research data management practices across domains. International Journal of Digital Curation, 11(1), 53–67. http://doi.org/10.2218/ijdc.v11i1.423
OpenUrl

[10] 10.↵
Tenopir, C., Dalton, E. D., Allard, S., Frame, M., Pjesivac, I., Birch, B., … Dorsett, K. (2015). Changes in data sharing and data reuse practices and perceptions among scientists worldwide. PLOS ONE, 10(8), e0134826. http://doi.org/10.1371/journal.pone.0134826
OpenUrl CrossRef PubMed

[11] 11.↵
Hillman, E. M. C. (2014). Coupling mechanisms and significance of BOLD signal: A status report. Annual Review of Neuroscience, 37, 161–181. http://doi.org/10.1146/annurev-neuro-071013-014111.Coupling
OpenUrl CrossRef PubMed

[12] 12.↵
Murphy, K., Birn, R. M., & Bandettini, P. A. (2013). Resting-state fMRI confounds and cleanup. NeuroImage, 80(Supplement C), 349–359. http://doi.org/10.1016/j.neuroimage.2013.04.001
OpenUrl CrossRef PubMed Web of Science

[13] 13.↵
Poldrack, R. A. (2006). Can cognitive processes be inferred from neuroimaging data? Trends in Cognitive Sciences, 10(2), 59–63. http://doi.org/10.1016/j.tics.2005.12.004
OpenUrl CrossRef PubMed Web of Science

[14] 14.↵
Carp, J. (2012). On the plurality of (methodological) worlds: Estimating the analytic flexibility of fmri experiments. Frontiers in Neuroscience, 6, 1–13. http://doi.org/10.3389/fnins.2012.00149
OpenUrl

[15] 15.↵
Gronenschild, E. H. B. M., Habets, P., Jacobs, H. I. L., Mengelers, R., Rozendaal, N., van Os, J., & Marcelis, M. (2012). The effects of FreeSurfer version, workstation type, and Macintosh operating system version on anatomical volume and cortical thickness measurements. PLOS ONE, 7 (6). e38234. http://doi.org/10.1371/journal.pone.0038234
OpenUrl CrossRef PubMed

[16] 16.
Poldrack, R. A., Fletcher, P. C., Henson, R. N., Worsley, K. J., Brett, M., & Nichols, T. E. (2008). Guidelines for reporting an fMRI study. NeuroImage, 40(2), 409–414. http://doi.org/10.1016/j.neuroimage.2007.11.048
OpenUrl CrossRef PubMed Web of Science

[17] 17.↵
Carp, J. (2012). The secret lives of experiments: Methods reporting in the fMRI literature. NeuroImage, 63(1), 289–300. http://doi.org/10.1016/j.neuroimage.2012.07.004
OpenUrl CrossRef PubMed Web of Science

[18] 18.↵
Guo, Q., Parlar, M., Truong, W., Hall, G., Thabane, L., McKinnon, M., … Pullenayegum, E. (2014). The reporting of observational clinical functional magnetic resonance imaging studies: A systematic review. PLOS ONE, 9(4), e94412. http://doi.org/10.1371/journal.pone.0094412
OpenUrl CrossRef PubMed

[19] 19.↵
David, S. P., Ware, J. J., Chu, I. M., Loftus, P. D., Fusar-Poli, P., Radua, J., … Ioannidis, J. P. A. (2013). Potential reporting bias in fMRI studies of the brain. PLOS ONE, 8(7), e70104. http://doi.org/10.1371/journal.pone.0070104
OpenUrl CrossRef PubMed

[20] 20.↵
Jennings, R. G., & Van Horn, J. D. (2012). Publication bias in neuroimaging research: Implications for meta-analyses. Neuroinformatics, 10(1), 67–80. http://doi.org/10.1007/s12021-011-9125-y
OpenUrl CrossRef PubMed Web of Science

[21] 21.↵
Button, K. S., Ioannidis, J. P. A., Mokrysz, C., Nosek, B. A., Flint, J., Robinson, E. S. J., & Munafò, M. R. (2013). Power failure: why small sample size undermines the reliability of neuroscience. Nature Reviews Neuroscience, 14(5), 365–376. http://doi.org/10.1038/nrn3475
OpenUrl CrossRef PubMed

[22] 22.↵
Cremers, H. R., Wager, T. D., & Yarkoni, T. (2017). The relation between statistical power and inference in fMRI. PLOS ONE, 12(11), e0184923. http://doi.org/10.1371/journal.pone.0184923
OpenUrl CrossRef

[23] 23.↵
Bennett, C. M., Miller, M. B., & Wolford, G. L. (2009). Neural correlates of interspecies perspective taking in the post-mortem Atlantic Salmon: an argument for multiple comparisons correction. NeuroImage, 47(Supplement 1), S125. http://doi.org/10.1016/S1053-8119(09)71202-9
OpenUrl

[24] 24.↵
Vul, E., Harris, C., Winkielman, P., & Pashler, H. (2009). Puzzlingly high correlations in fMRI studies of emotion, personality, and social cognition. Perspectives on Psychological Science, 4(3), 274–290. http://doi.org/10.1111/j.1745-6924.2009.01125.x
OpenUrl CrossRef PubMed Web of Science

[25] 25.↵
Eklund, A., Nichols, T. E., & Knutsson, H. (2016). Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates. Proceedings of the National Academy of Sciences, 113(28), 7900–7905. http://doi.org/10.1073/pnas.1602413113
OpenUrl Abstract/FREE Full Text

[26] 26.↵
Koslow, S. H. (2000). Should the neuroscience community make a paradigm shift to sharing primary data? Nature Neuroscience, 3(9), 863–865. http://doi.org/10.1038/78760
OpenUrl CrossRef PubMed Web of Science

[27] 27.↵
Van Horn, J. D., Grafton, S. T., Rockmore, D., & Gazzaniga, M. S. (2004). Sharing neuroimaging studies of human cognition. Nature Neuroscience, 7(5), 473–481. http://doi.org/10.1038/nn1231
OpenUrl CrossRef PubMed Web of Science

[28] 28.↵
Van Horn, J. D., & Gazzaniga, M. S. (2013). Why share data? Lessons learned from the fMRIDC. NeuroImage, 82(Supplement C), 677–682. http://doi.org/10.1016/j.neuroimage.2012.11.010
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
Federer, L. M., Lu, Y.-L., Joubert, D. J., Welsh, J., & Brandys, B. (2015). Biomedical data sharing and reuse: Attitudes and practices of clinical and scientific research staff. PLOS ONE, 10(6), e0129506. http://doi.org/10.1371/journal.pone.0129506
OpenUrl CrossRef PubMed

[30] 30.↵
Piwowar, H. A., & Chapman, W. W. (2008). Identifying data sharing in biomedical literature. AMIA Annual Symposium Proceedings, 2008, 596–600. Retrieved from http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2655927
OpenUrl

[31] 31.↵
Kriesberg, A., Huller, K., Punzalan, R., & Parr, C. (2017). An analysis of federal policy on public access to scientific research data. Data Science Journal, 16, 27. http://doi.org/10.5334/dsj-2017-027
OpenUrl

[32] 32.↵
Mueller, S. G., Weiner, M. W., Thal, L. J., Petersen, R. C., Jack, C. R., Jagust, W., … Beckett, L. (2005). Ways toward an early diagnosis in Alzheimer’s disease: The Alzheimer’s Disease Neuroimaging Initiative (ADNI). Alzheimer’s and Dementia, 1(1), 55–66. http://doi.org/10.1016/j.jalz.2005.06.003
OpenUrl CrossRef

[33] 33.↵
Mennes, M., Biswal, B., Castellanos, F. X., & Milham, M. P. (2013). Making data sharing work: The FCP/INDI experience. NeuroImage, 82, 683–691. http://doi.org/10.1016/j.neuroimage.2012.10.064
OpenUrl

[34] 34.↵
Di Martino, A., Yan, C. G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., … Milham, M. P. (2014). The autism brain imaging data exchange: Towards a large-scale evaluation of the intrinsic brain architecture in autism. Molecular Psychiatry, 19(6), 659–667. http://doi.org/10.1038/mp.2013.78
OpenUrl CrossRef PubMed Web of Science

[35] 35.↵
Gorgolewski, K. J., Auer, T., Calhoun, V. D., Craddock, R. C., Das, S., Duff, E. P., … Poldrack, R. A. (2016). The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Scientific Data, 3, 160044. http://doi.org/10.1038/sdata.2016.44
OpenUrl CrossRef

[36] 36.↵
Rex, D. E., Ma, J. Q., & Toga, A. W. (2003). The LONI pipeline processing environment. NeuroImage, 19(3), 1033–1048. http://doi.org/10.1016/S1053-8119(03)00185-X
OpenUrl CrossRef PubMed Web of Science

[37] 37.↵
Gorgolewski, K., Burns, C. D., Madison, C., Clark, D., Halchenko, Y. O., Waskom, M. L., & Ghosh, S. S. (2011). Nipype: A flexible, lightweight and extensible neuroimaging data processing framework in Python. Frontiers in Neuroinformatics, 5. http://doi.org/10.3389/fninf.2011.00013

[38] 38.↵
Gorgolewski, K. J., Alfaro-Almagro, F., Auer, T., Bellec, P., Capotă, M., Chakravarty, M. M., … Poldrack, R. A. (2017). BIDS apps: Improving ease of use, accessibility, and reproducibility of neuroimaging data analysis methods. PLOS Computational Biology, 13(3), e1005209. http://doi.org/10.1371/journal.pcbi.1005209
OpenUrl CrossRef

[39] 39.↵
Ioannidis, J. P. A. (2014). How to make more published research true. PLOS Medicine, 11(10). e1001747. http://doi.org/10.1371/journal.pmed.1001747
OpenUrl

[40] 40.↵
Wilkinson, M. D., Dumontier, M., Aalbersberg, Ij. J., Appleton, G., Axton, M., Baak, A., … Mons, B. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3, 160018. http://doi.org/10.1038/sdata.2016.18
OpenUrl CrossRef

[41] 41.↵
Nichols, T. E., Das, S., Eickhoff, S. B., Evans, A. C., Glatard, T., Hanke, M., … Yeo, B. T. T. (2017). Best practices in data analysis and sharing in neuroimaging using MRI. Nature Neuroscience, 20(3), 299–303. http://doi.org/10.1038/nn.4500
OpenUrl CrossRef PubMed

[42] 42.↵
Dickersin, K., Chan, S., Chalmers, T. C., Sacks, H. S., & Smith, H. (1987). Publication bias and clinical trials. Controlled Clinical Trials, 8, 343–353. http://doi.org/10.1016/0197-2456(87)90155-3
OpenUrl CrossRef PubMed Web of Science

[43] 43.↵
Sterling, T. D. (1959). Publication decisions and their possible effects on inferences drawn from tests of significance—or vice versa. Journal of the American Statistical Association, 54(285), 30–34. http://doi.org/10.1080/01621459.1959.10501497
OpenUrl CrossRef Web of Science

[44] 44.↵
Cohen, J. (1962). The statistical power of abnormal-social psychological research: A review. The Journal of Abnormal and Social Psychology, 65(3), 145–153. http://doi.org/10.1037/h0045186
OpenUrl

[45] 45.↵
Freiman, J. A., Chalmers, T. C., Smith, H., & Kuebler, R. R. (1978). The importance of beta, the type II error and sample size in the design and interpretation of the randomized control trial. New England Journal of Medicine, 299(13), 690–694. http://doi.org/10.1056/NEJM197809282991304
OpenUrl CrossRef PubMed Web of Science

[46] 46.↵
Vines, T., Andrew, R., Bock, D., Franklin, M., Gilbert, K., Kane, N., … Yeaman, S. (2013). Mandated data archiving grately improves access to research data. The FASEB Journal, 27(4), 1304-1308. http://doi.org/10.5061/dryad.6bs31
OpenUrl CrossRef PubMed

[47] 47.↵
Vasilevsky, N. A., Minnier, J., Haendel, M. A., & Champieux, R. E. (2017). Reproducible and reusable research: are journal data sharing policies meeting the mark? PeerJ, 5, e3208. http://doi.org/10.7717/peerj.3208
OpenUrl CrossRef

[48] 48.↵
Van Tuyl, S., & Whitmire, A. L. (2016). Water, water, everywhere: Defining and assessing data sharing in academia. PLOS ONE, 11(2), e0147942. http://doi.org/10.1371/journal.pone.0147942
OpenUrl CrossRef

[49] 49.↵
Wolfe, J. M., & Kanwisher, N. G. (2018). Not your parent’s NIH clinical trial. Nature Human Behaviour, 2, 107-109. http://doi.org/10.1038/s41562-017-0262-7
OpenUrl

[50] 50.↵
Teeters, J. L., Godfrey, K., Young, R., Dang, C., Friedsam, C., Wark, B., … Sommer, F. T. (2015). Neurodata Without Borders: Creating a common data format for neurophysiology. Neuron, 88(4), 629–634. http://doi.org/10.1016/j.neuron.2015.10.025
OpenUrl CrossRef PubMed

[51] 51.↵
Ascoli, G. A., Donohue, D. E., & Halavi, M. (2007). NeuroMorpho.Org: A central resource for neuronal morphologies. Journal of Neuroscience, 27(35), 9247–9251. http://doi.org/10.1523/JNEUROSCI.2055-07.2007
OpenUrl FREE Full Text

[52] 52.↵
Lee, R. Y. N., Howe, K. L., Harris, T. W., Arnaboldi, V., Cain, S., Chan, J., … Sternberg, P. W. (2017). WormBase 2017: molting into a new stage. Nucleic Acids Research, 46(D1), D869–D874. http://doi.org/10.1093/nar/gkx998
OpenUrl

[53] 53.↵
Munafò, M. R., Nosek, B. A., Bishop, D. V. M., Button, K. S., Chambers, C. D., Percie du Sert, N., … Ioannidis, J. P. A. (2017). A manifesto for reproducible science. Nature Human Behaviour, 1(1), 0021. http://doi.org/10.1038/s41562-016-0021
OpenUrl

[54] 54.↵
Dunning, D., Heath, C., & Suls, J. M. (2004). Flawed self-assessment implications for health, education, and the workplace. Psychological Science in the Public Interest, 5(3), 69–106. http://doi.org/10.1111/j.1529-1006.2004.00018.x
OpenUrl CrossRef PubMed

[55] 55.↵
Barone, L., Williams, J., & Micklos, D. (2017). Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators. PLOS Computational Biology, 13(10), e1005755. http://doi.org/10.1371/journal.pcbi.1005755
OpenUrl

[56] 56.↵
Tenopir, C., Allard, S., Sinha, P., Pollock, D., Newman, J., Dalton, E., … Baird, L. (2016). Data Management Education from the Perspective of Science Educators. International Journal of Digital Curation, 11(1), 232–251. article. http://doi.org/10.2218/ijdc.v11i1.389
OpenUrl CrossRef

[57] 57.↵
JASP Team. (2017). JASP (Version 0.8.2).

[58] 58.↵
Carlson, J. (2014). The use of lifecycle models in developing and supporting data services. In J. M. Ray (Ed.), Research data management: Practical strategies for information professionals (pp. 63–86). West Lafayette, Indiana: Purdue University Press.

[59] 59.↵
Witt, M., Carlson, J., Brandt, D. S., & Cragin, M. H. (2009). Constructing data curation profiles. International Journal of Digital Curation, 4(3), 93-103 http://doi.org/10.2218/ijdc.v4i3.117
OpenUrl

[60] 60.↵
Paulk, M. C., Curtis, B., Chrissis, M. B., & Weber, C. V. (1993). Capability maturity model, version 1.1. IEEE Software, 10(4), 18–27. http://doi.org/10.1109/52.219617
OpenUrl CrossRef Web of Science

[61] 61.↵
Crowston, K., & Qin, J. (2011). A capability maturity model for scientific data management: Evidence from the literature. Proceedings of the American Society for Information Science and Technology, 48(1), 1–9. http://doi.org/10.1002/meet.2011.14504801036
OpenUrl CrossRef

[62] 62.↵
Borghi, J., & Van Gulick, A.. (2018). Survey instrument to assess the research data management practices and perceptions of MRI researchers (Version 1). figshare. https://doi.org/10.1184/R1/5845212.v1

[63] 63.↵
Borghi, J., & Van Gulick, A.. (2018). Survey data on research data management practices and perceptions of MRI researchers (Version 1). figshare. https://doi.org/10.1184/R1/5845656.v1

Data management and sharing in neuroimaging: Practices and perceptions of MRI researchers

ABSTRACT

Introduction

Results

Participant Characteristics

RDM Maturity Ratings

Data Collection Practices

Data Analysis Practices

Data sharing Practices

Emerging Research Practices

Discussion

Methods

Survey Design

Distribution and Filtering Criteria

Data Availability

Author contributions statement

Competing financial interests

Acknowledgements

Footnotes

Work Cited

Citation Manager Formats

Subject Area