PT - JOURNAL ARTICLE AU - Florian Ganglberger AU - Joanna Kaczanowska AU - Wulf Haubensak AU - Katja Bühler TI - A Data Structure for real-time Aggregation Queries of Big Brain Networks AID - 10.1101/346338 DP - 2018 Jan 01 TA - bioRxiv PG - 346338 4099 - http://biorxiv.org/content/early/2018/08/20/346338.short 4100 - http://biorxiv.org/content/early/2018/08/20/346338.full AB - Recent advances in neuro-imaging allowed big brain-initiatives and consortia to create vast resources of brain data that can be mined by researchers for their individual projects. Exploring the relationship between genes, brain circuitry, and behavior is one of key elements of neuroscience research. This requires fusion of spatial connectivity data at varying scales, such as whole brain correlated gene expression, structural and functional connectivity. With ever-increasing resolution, those exceed the past state-of-the art in several orders of magnitude in size and complexity. Current analytical workflows in neuroscience involve time-consuming manual aggregation of the data and only sparsely incorporate spatial context to operate continuously on multiple scales. Incorporating techniques for handling big connectivity data is therefore a necessity.We propose a data structure to explore heterogeneous neurobiological connectivity data for integrated visual analytics workflows. Aggregation Queries, i.e. the aggregated connectivity from, to or between brain areas allow experts the comparison of multimodal networks residing at different scales, or levels of hierarchically organized anatomical atlases. Executed on-demand on volumetric gene expression and connectivity data, they enable an interactive dissection of networks, with billions of edges, in real-time, and based on their spatial context. The data structure is optimized to be accessed directly from the hard disk, since connectivity of large-scale networks typically exceed the memory size of current consumer level PCs. This allows experts to embed and explore their own experimental data in the framework of public data resources without large-scale infrastructure.Our novel data structure outperforms state-of-the-art graph engines in retrieving connectivity of local brain areas experimentally. We demonstrate the application of our approach for neuroscience by analyzing fear-related functional neuroanatomy in mice. Further, we show its versatility by comparing multimodal brain networks linked to autism. Importantly, we achieve cross-species congruence in retrieving human psychiatric traits networks, which facilitates selection of neural substrates to be further studied in mouse models.