Big data from small data: data-sharing in the 'long tail' of neuroscience

feature-image

Play all audios:

Loading...

ABSTRACT The launch of the US BRAIN and European Human Brain Projects coincides with growing international efforts toward transparency and increased access to publicly funded research in the


neurosciences. The need for data-sharing standards and neuroinformatics infrastructure is more pressing than ever. However, 'big science' efforts are not the only drivers of


data-sharing needs, as neuroscientists across the full spectrum of research grapple with the overwhelming volume of data being generated daily and a scientific environment that is


increasingly focused on collaboration. In this commentary, we consider the issue of sharing of the richly diverse and heterogeneous small data sets produced by individual neuroscientists,


so-called long-tail data. We consider the utility of these data, the diversity of repositories and options available for sharing such data, and emerging best practices. We provide use cases


in which aggregating and mining diverse long-tail data convert numerous small data sources into big data for improved knowledge about neuroscience-related disorders. Access through your


institution Buy or subscribe This is a preview of subscription content, access via your institution RELEVANT ARTICLES Open Access articles citing this article. * EZBIDS: GUIDED


STANDARDIZATION OF NEUROIMAGING DATA INTEROPERABLE WITH MAJOR DATA ARCHIVES AND PLATFORMS * Daniel Levitas * , Soichi Hayashi *  … Franco Pestilli _Scientific Data_ Open Access 08 February


2024 * DIAGNOSIS OF AUTISM SPECTRUM DISORDER BASED ON FUNCTIONAL BRAIN NETWORKS AND MACHINE LEARNING * Caroline L. Alves * , Thaise G. L. de O. Toutain *  … Francisco A. Rodrigues


_Scientific Reports_ Open Access 18 May 2023 * IS NEUROSCIENCE FAIR? A CALL FOR COLLABORATIVE STANDARDISATION OF NEUROSCIENCE DATA * Jean-Baptiste Poline * , David N. Kennedy *  … Maryann E.


Martone _Neuroinformatics_ Open Access 21 January 2022 ACCESS OPTIONS Access through your institution Subscribe to this journal Receive 12 print issues and online access $209.00 per year


only $17.42 per issue Learn more Buy this article * Purchase on SpringerLink * Instant access to full article PDF Buy now Prices may be subject to local taxes which are calculated during


checkout ADDITIONAL ACCESS OPTIONS: * Log in * Learn about institutional subscriptions * Read our FAQs * Contact customer support REFERENCES * Huerta, M.F., Koslow, S.H. & Leshner, A.I.


_Trends Neurosci._ 16, 436–438 (1993). Article  CAS  PubMed  Google Scholar  * Roysam, B., Shain, W. & Ascoli, G.A. _Neuroinformatics_ 7, 1–5 (2009). Article  PubMed  PubMed Central 


Google Scholar  * National Institutes of Health. NIH Program Announcement NOT-MH-05–014, http://grants.nih.gov/grants/guide/notice-files/NOT-MH-05-014.html (2005). * Shepherd, G.M. et al.


_Trends Neurosci._ 21, 460–468 (1998). Article  CAS  PubMed  Google Scholar  * Weinberg, A.M. _Science_ 134, 161–164 (1961). Article  CAS  PubMed  Google Scholar  * Wallis, J.C., Rolando, E.


& Borgman, C.L. _PLoS ONE_ 8, e67332 (2013). Article  CAS  PubMed  PubMed Central  Google Scholar  * Chan, A.W. et al. _Lancet_ 383, 257–266 (2014). Article  PubMed  PubMed Central 


Google Scholar  * Ascoli, G.A., Donohue, D.E. & Halavi, M. _J. Neurosci._ 27, 9247–9251 (2007). Article  CAS  PubMed  PubMed Central  Google Scholar  * Gardner, D. et al.


_Neuroinformatics_ 6, 149–160 (2008). Article  PubMed  PubMed Central  Google Scholar  * Gardner, D. et al. _Neuroinformatics_ 1, 289–295 (2003). Article  PubMed  Google Scholar  * Boline,


J., Lee, E.F. & Toga, A.W. _Front. Neurosci._ 2, 100–106 (2008). Article  PubMed  PubMed Central  Google Scholar  * Van Horn, J.D. & Gazzaniga, M.S. _Neuroimage_ 82, 677–682 (2013).


Article  PubMed  Google Scholar  * Perrino, T. et al. _Perspect. Psychol. Sci._ 8, 433–444 (2013). Article  PubMed  Google Scholar  * Poline, J.B. & Poldrack, R.A. _Front. Neurosci._ 6,


96 (2012). Article  PubMed  PubMed Central  Google Scholar  * Poldrack, R.A. et al. _Front. Neuroinform._ 7, 12 (2013). Article  PubMed  PubMed Central  Google Scholar  * Steward, O.,


Popovich, P.G., Dietrich, W.D. & Kleitman, N. _Exp. Neurol._ 233, 597–605 (2012). Article  PubMed  Google Scholar  * Wicherts, J.M., Bakker, M. & Molenaar, D. _PLoS ONE_ 6, e26828


(2011). Article  CAS  PubMed  PubMed Central  Google Scholar  * Heidorn, P.B. _Libr. Trends_ 57, 280–299 (2008). Article  Google Scholar  * Mueck, L. _Nat. Nanotechnol._ 8, 693–695 (2013).


Article  CAS  PubMed  Google Scholar  * Sena, E.S., van der Worp, H.B., Bath, P.M., Howells, D.W. & Macleod, M.R. _PLoS Biol._ 8, e1000344 (2010). Article  PubMed  PubMed Central  CAS 


Google Scholar  * Fawcett, J.W. et al. _Spinal Cord_ 45, 190–205 (2007). Article  CAS  PubMed  Google Scholar  * Lemmon, V.P. et al. _J. Neurotrauma_ 31, 1354–1361 (2014). Article  PubMed 


PubMed Central  Google Scholar  * Nielson, J.L. et al. _J. Neurotrauma_ doi:10.1089/neu.2014.3399 (31 July 2014). * Fisher, M. et al. _Stroke_ 40, 2244–2250 (2009). Article  PubMed  PubMed


Central  Google Scholar  * Kwon, B.K., Hillyer, J. & Tetzlaff, W. _J. Neurotrauma_ 27, 21–33 (2010). Article  PubMed  Google Scholar  * Marmarou, A. et al. _J. Neurotrauma_ 24, 239–250


(2007). Article  PubMed  Google Scholar  * Maas, A.I. et al. _J. Neurotrauma_ 28, 177–187 (2011). Article  PubMed  PubMed Central  Google Scholar  * Manley, G.T. & Maas, A.I. _J. Am.


Med. Assoc._ 310, 473–474 (2013). Article  CAS  Google Scholar  * Yue, J.K. et al. _J. Neurotrauma_ 30, 1831–1844 (2013). Article  PubMed  PubMed Central  Google Scholar  * Steyerberg, E.W.


et al. _PLoS Med._ 5, e165 (2008). Article  PubMed  PubMed Central  Google Scholar  * Yuh, E.L. et al. _Ann. Neurol._ 73, 224–235 (2013). Article  PubMed  Google Scholar  * Ferguson, A.R. et


al. _PLoS ONE_ 8, e59712 (2013). Article  CAS  PubMed  PubMed Central  Google Scholar  * Turner, C.F. et al. _Database (Oxford)_ 2011, bar043 (2011). Article  Google Scholar  * Turner, J.A.


et al. _Front. Neuroinform._ 4, 10 (2010). PubMed  PubMed Central  Google Scholar  * Tenopir, C. et al. _PLoS ONE_ 6, e21101 (2011). Article  CAS  PubMed  PubMed Central  Google Scholar  *


Roche, D.G. et al. _PLoS Biol._ 12, e1001779 (2014). Article  PubMed  PubMed Central  Google Scholar  * Boulton, G., Rawlins, M., Vallance, P. & Walport, M. _Lancet_ 377, 1633–1635


(2011). Article  PubMed  Google Scholar  * Bohannon, J. _Science_ 344, 788–789 (2014). Article  CAS  PubMed  Google Scholar  * Agarwal, G. et al. _Science_ 344, 626–630 (2014). Article  CAS


  PubMed  PubMed Central  Google Scholar  * Cragin, M.H., Palmer, C.L., Carlson, J.R. & Witt, M. _Philos. Trans. A Math. Phys. Eng. Sci._ 368, 4023–4038 (2010). Article  CAS  PubMed 


Google Scholar  * Halavi, M., Hamilton, K.A., Parekh, R. & Ascoli, G.A. _Front. Neurosci._ 6, 49 (2012). Article  PubMed  PubMed Central  Google Scholar  * Martone, M.E. et al. _J.


Struct. Biol._ 138, 145–155 (2002). Article  CAS  PubMed  Google Scholar  * Fernandez, J.J. _BMC Bioinformatics_ 10, 178 (2009). Article  PubMed  PubMed Central  Google Scholar  * Goodman,


A. et al. _PLoS Comput. Biol._ 10, e1003542 (2014). Article  PubMed  PubMed Central  CAS  Google Scholar  * Gorgolewski, K.J., Margulies, D.S. & Milham, M.P. _Front. Neurosci._ 7, 9


(2013). Article  PubMed  PubMed Central  Google Scholar  * Gorgolewski, K.J. et al. _Gigascience_ 2, 6 (2013). Article  PubMed  PubMed Central  Google Scholar  * Klein, T. et al. _Data Sci.


J._ 12, 1–9 (2013). Article  Google Scholar  * The Future of Research Communications and e-Scholarship (FORCE11). Joint Declaration of Data Citation Principles–FINAL,


https://www.force11.org/datacitation (2013). * Research Data Alliance. Research data sharing without barriers, https://rd-alliance.org/group/data-citation-wg.html (2014). * Van Essen, D.C.


et al. _Neuroimage_ 80, 62–79 (2013). Article  PubMed  Google Scholar  * Mennes, M., Biswal, B.B., Castellanos, F.X. & Milham, M.P. _Neuroimage_ 82, 683–691 (2013). Article  PubMed 


Google Scholar  * The Royal Society. Science as an open enterprise, https://royalsociety.org/policy/projects/science-public-enterprise/Report/ (2012). * Kennedy, D.N. _Neuroinformatics_ 12,


361–363 (2014). Article  PubMed  PubMed Central  Google Scholar  * Costa L.F., Zawadzki, K., Miazaki, M., Viana, M.P. & Taraskin, S.N. _Front. Comput. Neurosci._ 4, 150 (2010). Article 


PubMed Central  Google Scholar  * Hansen, M.B., Jespersen, S.N., Leigland, L.A. & Kroenke, C.D. _Front. Integr. Neurosci._ 7, 31 (2013). Article  PubMed  PubMed Central  Google Scholar 


* Martone, M.E., Gupta, A. & Ellisman, M.H. _Nat. Neurosci._ 7, 467–472 (2004). Article  CAS  PubMed  Google Scholar  * Maas, A.I. et al. _Lancet Neurol._ 12, 1200–1210 (2013). Article 


PubMed  PubMed Central  Google Scholar  Download references ACKNOWLEDGEMENTS We thank the NIF staff, especially B. Ozyurt for his text mining expertise and tools that contributed


substantially to Supplementary Table 1. The Neuroscience Information Framework is supported by a contract from the NIH Neuroscience Blueprint HHSN271200800035C via the National Institute on


Drug Abuse. VISION-SCI is supported by NIH grants NS067092 (A.R.F.) and NS079030 (J.L.N.), and the Craig H. Neilsen foundation (A.R.F.) and Wings for Life foundation (A.R.F). This material


is based on (M.H.C.) work supported while serving at the National Science Foundation. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the


author(s) and do not reflect the views of the National Science Foundation. AUTHOR INFORMATION AUTHORS AND AFFILIATIONS * Department of Neurological Surgery, Brain and Spinal Injury Center,


University of California at San Francisco, San Francisco, California, USA Adam R Ferguson & Jessica L Nielson * Directorate for Biological Sciences, National Science Foundation,


Arlington, Virginia, USA Melissa H Cragin * Center for Research in Biological Structure, University of California at San Diego, San Diego, California, USA Anita E Bandrowski & Maryann E


Martone * Department of Neuroscience, University of California at San Diego, San Diego, California, USA Maryann E Martone Authors * Adam R Ferguson View author publications You can also


search for this author inPubMed Google Scholar * Jessica L Nielson View author publications You can also search for this author inPubMed Google Scholar * Melissa H Cragin View author


publications You can also search for this author inPubMed Google Scholar * Anita E Bandrowski View author publications You can also search for this author inPubMed Google Scholar * Maryann E


Martone View author publications You can also search for this author inPubMed Google Scholar CORRESPONDING AUTHOR Correspondence to Maryann E Martone. ETHICS DECLARATIONS COMPETING


INTERESTS M.E. Martone is the principal investigator of the Neuroscience Information Framework. A.E. Bandrowski is the NIF Project Leader. A.R. Ferguson, J.L. Nielson and M.H. Cragin are not


affiliated with NIF. SUPPLEMENTARY INFORMATION SUPPLEMENTARY TABLE A sample of Neuroscience-centered data repositories available to the community. (PDF 327 kb) RIGHTS AND PERMISSIONS


Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Ferguson, A., Nielson, J., Cragin, M. _et al._ Big data from small data: data-sharing in the 'long tail' of


neuroscience. _Nat Neurosci_ 17, 1442–1447 (2014). https://doi.org/10.1038/nn.3838 Download citation * Received: 12 May 2014 * Accepted: 17 September 2014 * Published: 28 October 2014 *


Issue Date: November 2014 * DOI: https://doi.org/10.1038/nn.3838 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a


shareable link is not currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative