Instant Clue: A Software Suite for Interactive Data Visualization and Analysis

Nolte, Hendrik; MacVicar, Thomas D.; Tellkamp, Frederik; Krüger, Marcus

doi:10.1038/s41598-018-31154-6

Published: 23 August 2018

Scientific Reports volume 8, Article number: 12648 (2018)

Abstract

The development of modern high-throughput instrumentation and improved core facility infrastructures leads to an accumulation of large amounts of scientific data. However, for a majority of scientists the comprehensive analysis and visualization of their data goes beyond their expertise. To reduce this hurdle, we developed a software suite called Instant Clue that helps scientists to visually analyze data and to gain insights into biological processes from their high-dimensional datasets. Instant Clue combines the power of visual and statistical analytics using a straightforward drag & drop approach, making the software highly intuitive. Additionally, it offers a comprehensive portfolio of statistical tools for systematic analysis such as dimensional reduction, (un)supervised learning, clustering, multi-block (omics) integration and curve fitting. Charts can be combined with high flexibility into a main figure template for direct usage in scientific publications. Even though Instant Clue was developed with the omics sciences in mind, users can analyze any kind of data, from low- to high-dimensional data sets. The open-source software is available for Windows and Mac OS (http://www.instantclue.uni-koeln.de) and is accompanied by a detailed video tutorial series.

Introduction

The development of fast and robust instruments for unbiased high-throughput experiments allows scientists to routinely generate large-scale datasets. Numerous techniques to generate omics data have reached a level that makes them broadly applicable and powerful technologies for medical and biological researchers. For example, with the latest generation of mass spectrometry instruments, it is possible to quantify a global proteome of more than 12,000 proteins within a day. In addition, the enrichment of post-translational modifications, including phosphorylation and ubiquitylation, can yield more than 40,000 modification sites with moderate laboratory effort^1,2,3,4. Such experiments demand cooperative work between biologists with specialized knowledge in their field and computational researchers to unravel meaningful findings. In many cases, standard statistical analysis such as the identification of differentially expressed genes, metabolites or proteins is performed by a dedicated bioinformatics core unit or via easy-to-use desktop^5,6,7 or web interface tools⁸ that have been developed to partly overcome this problem. However, the visualization, exploration and statistical analysis of these datasets can still be challenging for a majority of biological or medical researchers.

Here, we address this gap by developing a software suite that enables scientists, independent of their computational background, to analyze their own complex data. We aimed to develop a tool that meets the following requirements: (i) Applicability to a broad range of data inputs from various experiments, such as densitometric data from immunoblots, quantitative PCR, high-dimensional molecular data sets like proteomic or transcriptomic data, as well as complex time series data. (ii) Highly intuitive data analysis and visualization by drag & drop that allows new users to rapidly draw conclusions from their data. The underlying concept and Graphical User Interface (GUI) design have been inspired by the software Tableau/Polaris⁹. (iii) A repertoire of statistical tests and methods such as ANOVA, regressions, curve fitting and principal component analysis, as well as interactive calculation of metrics like the area under the curve (AUC). Importantly, the application of a statistical test requires the visual inspection of the data by the user. This workflow was implemented to help the user interpret the test results. (iv) Availability of functions to filter, annotate and select data from complex datasets. (v) Easy collection of charts to generate figures that need little post-processing prior to publication.

To address these aims, we have created a software suite called Instant Clue, which is available for Mac OS and Windows at http://www.instantclue.uni-koeln.de, including detailed tutorials and working examples. In addition, the Python source code is freely available, making Instant Clue editable and usable on any platform with a working Python version (>=3.4).

Methods

Software design

We took advantage of the constantly growing number of scientific packages in Python and wrote Instant Clue in pure Python (>=3.4). Moreover, Python is known for a design philosophy that emphasizes code readability, so user-specific adjustments can be made by a broad range of researchers. The presented software is open-source and relies mainly on the following packages: tkinter for the graphical user interface, pandas and pandastable for data management^10,11, matplotlib¹² and seaborn¹³ for generating charts, numpy¹⁴ for calculations as well as statsmodels¹⁵, scikit-learn¹⁶, and Cython¹⁷ for statistical tests. The Graphical User Interface (GUI) (Fig. S1) is built with Python’s standard toolkit tkinter/ttk. The GUI represents a scaffold to use the implemented library modules, which are grouped by their function. For example, there is a module to provide easy data management and manipulation (data.py), separated from the plotter module that handles plotting events (plotter.py). The general software architecture is illustrated in Fig. S2.

Download and Maintenance

The software can be downloaded from http://www.instantclue.uni-koeln.de for Windows and Mac OS as a zip file. Mac OS users need to install Active Tcl Version 8.5.18 before usage (https://www.activestate.com/activetcl/downloads). Instant Clue will be updated continuously and users are alerted upon new version releases. Support for users is provided by our detailed tutorial series at http://www.instantclue.uni-koeln.de/tutorials.html in written and commented video format.

Bug Report and Feature Requests

Users who detect unexpected behavior or missing features are highly encouraged to report this to us via GitHub (https://github.com/hnolCol/instantclue/issues) or by direct mail contact.

Example Data

To facilitate a comfortable start for users, we have included several example data sets that can be found in the compressed file (folder: examples). In light of the versatile application of Instant Clue, we have included fully documented step-by-step data analysis procedures in the tutorial (http://www.instantclue.uni-koeln.de/tutorials.html) for various types of data sets: (i) Body weight measurements of people of different health condition and age. (ii) Mass spectrometry based immunoprecipitation data published recently to identify interaction partners of a protease-dead mutant of Presenilin-associated rhomboid-like protein (PARL)^18,19. (iii) Optical recordings of Pro-opiomelanocortin (POMC) neuron activity (time series data)²⁰. (iv) The iris data set²¹. (v) The wine quality data set for supervised learning²². As the tutorial will be extended continuously, we will also add more example data.

Results and Discussion

Throughout the software, the general concept is that the analysis of data is driven by visual inspection. Thus, we employed drag & drop events as the central means of action to plot charts as well as to apply statistical tests and techniques. This compels the user to inspect data visually, which helps to interpret, verify and judge results. In the following, we explain general aspects and give an overview of the presented software. In addition, we have uploaded several video tutorials that help users to become familiar with Instant Clue (http://www.instantclue.uni-koeln.de/videos.html). In Instant Clue, all activities are initiated via the Graphical User Interface (GUI) that is explained in Figs S1 and S2.

Data organization and plotting

Data can be uploaded from various file types including Excel, tab delimited text (.txt) and csv files (.csv), Extensible Markup Language (.xml) files as well as compressed files (.gz, .zip). Once uploaded, the data columns are automatically separated by their data type. The four available data types are Numeric Floats (example: 1.345), Integers (1, 1922), Categories (Time, Genotype, Gene names) and Boolean (True, False) (Fig. 1a). Because several functions require certain data types, the type of a column can be changed retroactively. Users can also upload several files and merge them.
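The separation of columns by data type can be sketched with pandas, which Instant Clue uses for data management. The following is only a minimal illustration and not the software's actual code: the helper `group_columns_by_type` and the example table are hypothetical.

```python
import pandas as pd


def group_columns_by_type(df):
    """Group column names into the four data types named in the text.

    Hypothetical helper for illustration; not part of Instant Clue.
    """
    groups = {"float": [], "integer": [], "category": [], "boolean": []}
    for name, dtype in df.dtypes.items():
        # Check booleans first so they are never treated as numbers.
        if pd.api.types.is_bool_dtype(dtype):
            groups["boolean"].append(name)
        elif pd.api.types.is_integer_dtype(dtype):
            groups["integer"].append(name)
        elif pd.api.types.is_float_dtype(dtype):
            groups["float"].append(name)
        else:
            # Everything else (e.g. text) is handled as a category.
            groups["category"].append(name)
    return groups


# Hypothetical example table; real data would come from a file reader
# such as pd.read_csv or pd.read_excel.
df = pd.DataFrame({
    "Body weight": [71.3, 82.0, 65.4],
    "Age": [34, 51, 28],
    "Condition": ["healthy", "diabetic", "healthy"],
    "Treated": [True, False, True],
})
groups = group_columns_by_type(df)

# Changing a column type retroactively, here integer -> category.
df["Age"] = df["Age"].astype(str)
regrouped = group_columns_by_type(df)
```

After the type change, `Age` is listed under `category` instead of `integer`, which mirrors how a column can be moved between data types when a function requires a specific one.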

Charts are generated instantaneously by drag & drop of column headers to one of the two receiver boxes (Figs 1a and S1). The categorical box is used to split data according to the present categories. For example, numeric data represent measurements such as body weight or gene expression values. Categorical columns contain categorical values such as the state of a disease, genotype or experimental setup (treatment, no treatment). As an example, Fig. 1b depicts the raw on-the-fly output of Instant Clue after loading the accompanying “Tutorial_Data_01.xlsx” and adding the Body weight column to the numerical data receiver box and the Condition column to the categorical receiver box by drag & drop. The chart type can be chosen from numerous available options, each of which is specialized for a certain type of data and way of visualization (Fig. 1c). Advantages of each chart type are summarized in the online tutorial. Users can easily modify chart margins, font size and axis limits in an interactive way and export charts to numerous file types.

Computational activities and data filtering

Instant Clue offers a diverse portfolio of computational activities to assist the visual exploration of multivariate data. Activities are applied on columns in the dataset using the context menu and cover basic steps such as sorting, string splitting and replacement, normalization and transformation, imputation of missing values, smoothing and rolling window calculations. Additionally, the data format can be changed between long and wide formats and numerous calculations such as Z-Score, mean and standard deviation row-wise or kernel density estimations column-wise are implemented. A detailed description of each activity is presented in the pdf tutorial at http://www.instantclue.uni-koeln.de/tutorials.html.
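A few of these activities can be sketched with pandas. This is an illustrative example under assumptions, not Instant Clue's implementation: the table, the column names and the window size are hypothetical, and the sample standard deviation (ddof=1) is an arbitrary choice.

```python
import numpy as np
import pandas as pd

# Hypothetical wide-format table: one row per protein, one column per sample.
wide = pd.DataFrame({
    "Protein": ["A", "B"],
    "s1": [1.0, 10.0],
    "s2": [2.0, 20.0],
    "s3": [3.0, 30.0],
})
samples = ["s1", "s2", "s3"]

# Row-wise Z-score: subtract the row mean, divide by the row standard deviation.
row_mean = wide[samples].mean(axis=1)
row_std = wide[samples].std(axis=1, ddof=1)
zscores = wide[samples].sub(row_mean, axis=0).div(row_std, axis=0)

# Wide -> long format: one row per (protein, sample) pair.
long = wide.melt(id_vars="Protein", var_name="Sample", value_name="Intensity")

# Long -> wide format again.
back = long.pivot(index="Protein", columns="Sample", values="Intensity")

# Rolling-window mean over three points to smooth a signal.
signal = pd.Series([1.0, 2.0, 6.0, 2.0, 1.0])
smoothed = signal.rolling(window=3, center=True).mean()
```

Both rows have the same Z-scores here because they differ only by a scale factor, which is the point of the normalization. The first and last smoothed values are missing because a centered window of three points is incomplete at the edges.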

To systematically evaluate differences between biological samples, researchers aim to subset their data by certain criteria such as cellular localization or signaling pathway, based on annotation terms derived from sources such as the gene ontology²³, GSEA²⁴, MitoCarta²⁵ or PFAM²⁶ databases. Therefore, we have implemented numerous categorical filters that allow for quick but complex filtering. There are three different categorical filters: (i) ‘Find Category & Annotate’, (ii) ‘Find String(s) & Annotate’ and (iii) ‘Custom Categorical Filter’. A summary of all filtering steps, advantages and example results is displayed in Fig. S3. To visualize applied filters, the “Slice and Marks Frame” option (Fig. S1) allows for color/size encoding via drag & drop, which might be used to highlight significantly differentially expressed proteins or genes (Fig. 2 top-right). The tooltip and label activities (Fig. 2 bottom) facilitate fast and efficient screening through the dataset. For instance, these activities can be used to annotate and identify interesting candidates in a scatter plot or hierarchical clustering.
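The idea behind the first two filters can be sketched with pandas. This is a hedged illustration and not the software's code: the helpers `annotate_category` and `annotate_strings`, the example table and the ";" separator between annotation terms are all assumptions.

```python
import re

import pandas as pd

# Hypothetical annotation table; terms are assumed to be separated by ";".
df = pd.DataFrame({
    "Gene names": ["PARL", "YME1L1", "ACTB", "STOML2"],
    "GO localization": [
        "mitochondrion;membrane",
        "mitochondrion;inner membrane",
        "cytoskeleton",
        "mitochondrion",
    ],
})


def annotate_category(df, column, category, sep=";"):
    """Mark rows whose annotation contains `category` as an exact term."""
    terms = df[column].fillna("").str.split(sep)
    return terms.apply(lambda items: category in items)


def annotate_strings(df, column, strings):
    """Mark rows whose text contains any of the given substrings."""
    pattern = "|".join(re.escape(s) for s in strings)
    return df[column].str.contains(pattern, case=False, na=False)


# New boolean columns annotate the table without removing any rows.
df["is_mito"] = annotate_category(df, "GO localization", "mitochondrion")
df["membrane_hit"] = annotate_strings(df, "GO localization", ["membrane"])

# The annotation can then be used to subset the data.
mito_only = df[df["is_mito"]]
```

Annotating with a new column rather than dropping rows keeps the full dataset available, so the same column can later drive color or size encoding in a chart.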

Instant Clue comprises a Statistical Toolbox for multivariate data analysis

Instant Clue promotes the visual analysis of data, but also offers several statistical tests that are applied in an interactive way. In line with the idea that researchers should inspect data visually as a first step, statistical tests are enabled by a drag & drop event from the Analysis toolbox onto generated charts. Several tests are performed automatically and do not require further action by the user. For comparing two groups via t-test or U-test, however, the statistical assessment is only enabled after a drag & drop action. When the user clicks on the groups that should be compared, the test is calculated automatically and the p-value is indicated in the chart above lines between the tested groups (Fig. 3 – top right). In addition, performed tests are stored and can be exported at any time. Notably, if an activity (each test is an activity) cannot handle missing values, the data are automatically filtered before submission to the specified activity, without changing the source data. The toolbox covers numerous techniques, including supervised learning, clustering, dimensional reduction, time series analysis and curve fitting. In the following, we describe and present results of supervised learning, time series analysis and curve fitting to illustrate the functionality and ease of use of the presented software.
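The two-group comparison with automatic filtering of missing values can be sketched as follows. The example is an assumption-laden illustration: it uses SciPy, which is not among the packages listed in the Methods, the helper `compare_groups` and the data are hypothetical, and the choice of Welch's variant of the t-test is ours rather than the software's documented behavior.

```python
import numpy as np
import pandas as pd
from scipy import stats

# Hypothetical measurements with one missing value in the control group.
df = pd.DataFrame({
    "Condition": ["control"] * 5 + ["treated"] * 5,
    "Body weight": [70.1, 68.4, 72.3, np.nan, 69.8,
                    78.2, 80.5, 77.9, 81.1, 79.4],
})


def compare_groups(df, value_col, group_col, group_a, group_b):
    """Two-sided Welch t-test and Mann-Whitney U-test between two groups."""
    # Missing values are dropped on a copy; the source table is untouched.
    clean = df[[value_col, group_col]].dropna()
    a = clean.loc[clean[group_col] == group_a, value_col]
    b = clean.loc[clean[group_col] == group_b, value_col]
    t_res = stats.ttest_ind(a, b, equal_var=False)
    u_res = stats.mannwhitneyu(a, b, alternative="two-sided")
    return {"n": (len(a), len(b)), "t_p": t_res.pvalue, "u_p": u_res.pvalue}


result = compare_groups(df, "Body weight", "Condition", "control", "treated")
```

The returned group sizes make the filtering visible: the control group contributes four values instead of five, while `df` still contains the missing entry.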

(Un-)Supervised Learning for data classification

The ability to generate high-dimensional data with moderate effort, together with the rapidly increasing knowledge in science, facilitates the application of supervised learning techniques. In general, these methods are utilized to predict class memberships based on a learning process. In this step, a training dataset is used to build an inferred function that is used to classify new, unseen samples. The training dataset consists of n samples and m features as well as the class labels. For example, a training dataset could encompass several thousand human subjects (samples) that were screened in a hospital measuring several parameters (features) such as blood pressure, weight, or the number/location of single nucleotide polymorphisms (SNPs) found in the genome, together with the subject’s health condition (class labels: healthy, cancerous). These data can be used to train an estimator, which in turn is able to predict the health condition of uncharacterized subjects based on the same features. Such classification tasks were successfully used to predict new kinase-substrate relationships³ and in many other applications in biological and medical science^27,28. Instant Clue offers several functions to establish an estimator for prediction, based on the scikit-learn library¹⁶. Users can optimize pre-processing of data, feature selection/reduction and estimator parameters using an exhaustive grid search over given parameters. Fig. S4 shows the dialog window to interactively construct a prediction pipeline. A pre-processing step might be used to scale/normalize the input data. To increase the generalization ability, accuracy and prediction speed of an estimator, it is often useful to select the most important features (feature selection) or to apply a dimensional reduction technique before training an estimator. These steps can be defined using an interactive drag & drop dialog window (Fig. S4). Established pipelines can be saved and subsequently used to predict class memberships of unseen data. Thus, Instant Clue provides a convenient way to accomplish classification tasks.
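A prediction pipeline of this kind can be sketched directly with scikit-learn. The example below is a minimal illustration on the iris data set; the specific steps (standard scaling, univariate feature selection, a support vector classifier) and the parameter grid are our assumptions and do not reflect the defaults of Instant Clue's dialog.

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# n samples x m features plus class labels.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

# Pre-processing -> feature selection -> estimator.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("select", SelectKBest(score_func=f_classif)),
    ("estimator", SVC()),
])

# Exhaustive grid search over parameters of several pipeline steps.
param_grid = {
    "select__k": [2, 3, 4],
    "estimator__C": [0.1, 1.0, 10.0],
    "estimator__kernel": ["linear", "rbf"],
}
search = GridSearchCV(pipeline, param_grid, cv=5)
search.fit(X_train, y_train)

# Evaluate on held-out samples and predict class labels for unseen data.
accuracy = search.score(X_test, y_test)
predicted = search.predict(X_test[:5])
```

Because scaling and feature selection are part of the pipeline, they are refit inside every cross-validation fold, which avoids leaking information from the validation samples into the pre-processing steps.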

Moreover, the software offers several functions to analyze data in an unsupervised fashion such as Principal Component Analysis (PCA), k-means or Density-based spatial clustering of applications with noise (DBSCAN) clustering allowing users to identify underlying patterns. Fig. S5 illustrates the raw output of a PCA and k-means clustering analysis. Clustering algorithms can also be utilized to predict cluster membership of unseen data.
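The combination of PCA and k-means clustering can be sketched with scikit-learn as follows. This is an illustration under assumptions: the iris data set, the standard scaling step, two principal components and three clusters are our choices, not settings taken from Fig. S5.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X, _ = load_iris(return_X_y=True)  # class labels are ignored (unsupervised)
X_scaled = StandardScaler().fit_transform(X)

# Project the four features onto the first two principal components.
pca = PCA(n_components=2)
scores = pca.fit_transform(X_scaled)
explained = pca.explained_variance_ratio_

# Cluster the samples in the reduced space.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(scores)

# Predict the cluster membership of a sample using the fitted models.
new_sample = pca.transform(X_scaled[:1])
new_label = kmeans.predict(new_sample)
```

The fitted `pca` and `kmeans` objects are reused for the last step, which is what makes it possible to assign unseen data to existing clusters.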

Time series analysis

Instant Clue offers the possibility to explore time series data. The software is currently limited to continuous time data such as an increasing number of seconds or minutes. The activities to analyze time series data aim to smooth data such as an intensity along a time axis. Example Data 03 (see Methods) contains optical recordings of Pro-opiomelanocortin (POMC) neuron activity. Signal measurements over time can be baseline corrected and the area under the curve (AUC) can be determined in an interactive way (Fig. 3 top-left and Fig. S6). Notably, even though these activities are limited to the time series chart type (Fig. 1c), the x axis can be any continuous data array such as m/z or scan number.
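Baseline correction and AUC calculation can be sketched with numpy. The example is a minimal illustration on synthetic data: the helpers `baseline_correct` and `area_under_curve` are hypothetical, the baseline is assumed to be the mean signal before a chosen time point, and the AUC is computed with the trapezoidal rule, which may differ from the method used in the software.

```python
import numpy as np


def baseline_correct(signal, time, baseline_end):
    """Subtract the mean signal of the period before `baseline_end`."""
    baseline = signal[time < baseline_end].mean()
    return signal - baseline


def area_under_curve(time, signal, start, end):
    """Trapezoidal area under the curve within [start, end]."""
    mask = (time >= start) & (time <= end)
    t, s = time[mask], signal[mask]
    return float(np.sum((s[1:] + s[:-1]) / 2.0 * np.diff(t)))


# Synthetic recording: constant offset of 2.0 plus a triangular response.
time = np.arange(0.0, 11.0, 1.0)  # seconds 0..10
response = np.array([0, 0, 0, 0, 1, 2, 3, 2, 1, 0, 0], dtype=float)
signal = 2.0 + response

corrected = baseline_correct(signal, time, baseline_end=4.0)
auc = area_under_curve(time, corrected, start=3.0, end=9.0)
```

For this synthetic trace the corrected signal equals the triangular response, and the area between 3 s and 9 s is 9.0 (a triangle of width 6 and height 3). Since the functions only need a monotonically increasing x array, the same code works when `time` is replaced by another continuous axis such as scan number.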

Curve fitting and correlation analysis

Curve fitting and correlation analysis are efficient ways to connect phenotype-characterizing data such as blood glucose levels, body mass index, blood pressure or fitness to expression data of proteomic or genomic experiments. This fundamental principle was first described by Linus Pauling and colleagues in 1949, when they observed that a molecular change in hemoglobin alters the properties of the protein, which eventually results in the development of sickle cell anemia²⁹. Today, scientists are able to create causal networks on a more comprehensive scale, mostly driven by correlation analysis. It has recently been demonstrated how three distinct omics levels provide in-depth insights into molecular mechanisms and how they correlate with the characterized phenotype³⁰. Therefore, we have added a toolbox to perform curve fitting and correlation analysis in an intuitive way. Several functions are implemented, such as polynomial or linear fits, enzymatic reaction models (Michaelis-Menten), and periodic functions to identify genes/proteins that follow a circadian rhythm (Fig. S7).
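Fitting the Michaelis-Menten model, v = Vmax · [S] / (Km + [S]), can be sketched with a nonlinear least-squares fit. The example uses `scipy.optimize.curve_fit`, which is an assumption on our side (SciPy is not listed in the Methods), and the data are synthetic values generated with Vmax = 10 and Km = 3.

```python
import numpy as np
from scipy.optimize import curve_fit


def michaelis_menten(s, vmax, km):
    """Reaction rate as a function of substrate concentration s."""
    return vmax * s / (km + s)


# Synthetic measurements with a small amount of Gaussian noise.
substrate = np.array([0.5, 1.0, 2.0, 4.0, 8.0, 16.0, 32.0])
rng = np.random.default_rng(0)
rate = michaelis_menten(substrate, 10.0, 3.0) + rng.normal(0.0, 0.05, substrate.size)

# Initial guesses: the largest observed rate and the median concentration.
initial = [rate.max(), np.median(substrate)]
params, covariance = curve_fit(michaelis_menten, substrate, rate, p0=initial)
vmax_fit, km_fit = params

# Coefficient of determination as a simple measure of fit quality.
residuals = rate - michaelis_menten(substrate, *params)
r_squared = 1.0 - np.sum(residuals**2) / np.sum((rate - rate.mean()) ** 2)
```

The same pattern applies to the other model families mentioned above: only the model function and the initial guesses change, for example a cosine with a 24 h period for circadian data.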

The Main Figure Template promotes structured collection of charts

Scientists often seek to integrate several charts into figures containing multiple subplots. Plots can be easily exported as vector graphics directly from the main window and further processed in suitable vector graphics software. In addition, we provide the possibility to combine several charts and images in so-called main figure templates (Fig. 4). To this end, we have generated activities to: (i) create multiple main figure templates, (ii) add labeled subplots to these figure templates and (iii) incorporate charts from Instant Clue’s main window or add image files from the user’s documents (Fig. 4a). Users can delete, move and modify elements of a chart, define subplot labels and add text or formulas, resulting in a publication-ready figure without further software tools (Fig. 4b). In practice, the main figure template ensures the same format between subplots, helps to generate a uniform figure presentation, and clearly reduces the processing time. Main figures can be exported to numerous file types including pdf, svg or png.
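The concept of a figure template with labeled subplots can be sketched with matplotlib, the plotting library underlying Instant Clue. This is only an illustration of the idea and not the software's template code: the grid layout, the panel labels and the plotted data are hypothetical.

```python
import io

import matplotlib

matplotlib.use("Agg")  # render off-screen; no display is needed
import matplotlib.pyplot as plt
import numpy as np

# A template: one figure with a 2 x 2 grid; the bottom panel spans both columns.
fig = plt.figure(figsize=(8.27, 5.0))
grid = fig.add_gridspec(nrows=2, ncols=2)
axes = {
    "a": fig.add_subplot(grid[0, 0]),
    "b": fig.add_subplot(grid[0, 1]),
    "c": fig.add_subplot(grid[1, :]),
}

# Fill the panels with placeholder charts.
x = np.linspace(0, 10, 50)
axes["a"].scatter(x, np.sin(x), s=8)
axes["b"].hist(np.cos(x), bins=10)
axes["c"].plot(x, np.sin(x) * np.exp(-x / 5))

# Add a bold subplot label at the same relative position in every panel.
for label, ax in axes.items():
    ax.text(-0.1, 1.05, label, transform=ax.transAxes,
            fontweight="bold", va="bottom", ha="right")

fig.tight_layout()

# Export as vector graphics; here into memory instead of a file.
buffer = io.BytesIO()
fig.savefig(buffer, format="pdf")
```

Placing labels in axes coordinates keeps them aligned across panels of different sizes. Passing a file name and `format="svg"` or `format="png"` to `savefig` would write the other file types mentioned above.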

Comparison with other tools

To highlight Instant Clue’s advantages and its contribution to the field, we have compared the presented tool with other published and free-of-charge software suites (Table 1). Each software tool has its own strengths and weaknesses, since they were developed to address different user needs. Instant Clue’s functionality covers a unique variety of scientific tools, from time series analysis and curve fitting to multi-omics data analysis. In addition, Instant Clue aims to combine a rich statistical toolbox with visualizations that are suitable for usage in scientific publications with little post-processing. While sophisticated tools such as KNIME³¹, Orange³², Voyager 2³³ and GProX³⁴ offer functionality for data analysis and visualization that overlaps with the presented tool, Instant Clue has a unique and wide-ranging combination of features within one software suite: (i) interactive live filtering and masking of data, (ii) analysis and visualization activities that are broadly applicable across various fields of the life sciences, (iii) comprehensive categorical filtering to subset and annotate data without a single line of code, (iv) highly adjustable charts, including annotated text labels, and (v) intuitive main figure templates to collect multiple charts for publication with minimal post-processing. However, software such as Orange and KNIME is based on constructing data analysis pipelines, which provide a clear overview and can be reapplied to other data sets; this is currently not implemented in Instant Clue. Tools that are specialized in visual analytics, such as Voyager 2, include a sophisticated algorithm that infers graphical representations, which might help users to gain more insights into their data. Though not as advanced, we have implemented algorithms in Instant Clue that infer graphical representations commonly used in life science data plotting. Overall, we are confident that the unique combination of a comprehensive statistical toolbox, interactive and dynamic live filtering, and flexibility in chart generation offered by Instant Clue will be a helpful and complementary approach for scientists from interdisciplinary areas to analyze complex data sets.

Table 1 Features and Requirements of the current plethora of tools with overlap to Instant Clue’s functionality.

Conclusion

The routine generation of high-dimensional datasets demands cooperative work between bioinformaticians and biological researchers. To equip scientists who face the challenge of visualizing and analyzing multifactorial data with a straightforward tool, we have developed Instant Clue. Due to its simplicity, attractive design and intuitive drag & drop interface, the software can assist in the fast and comprehensive analysis of various datasets. The wide-ranging functionality of Instant Clue covers numerous charts; however, we aim to extend the panel of statistical tests and will add further activities that will be beneficial for systematic data interpretation.

Moreover, advanced users are not limited to the portfolio of activities and can modify the source code to adjust the software to their needs. We encourage computer experts to contribute to the development of Instant Clue, sharing their adjustments with us, and thereby accelerate the continuous improvement process. We are confident that this software will facilitate the communication between interdisciplinary scientists.

References

1. Akimov, V. et al. StUbEx PLUS-A Modified Stable Tagged Ubiquitin Exchange System for Peptide Level Purification and In-Depth Mapping of Ubiquitination Sites. Journal of proteome research 17, 296–304, https://doi.org/10.1021/acs.jproteome.7b00566 (2018).
2. Bekker-Jensen, D. B. et al. An Optimized Shotgun Strategy for the Rapid Generation of Comprehensive Human Proteomes. Cell Syst 4, 587–599 e584, https://doi.org/10.1016/j.cels.2017.05.009 (2017).
3. Krishnan, R. K. et al. Quantitative analysis of the TNF-alpha-induced phosphoproteome reveals AEG-1/MTDH/LYRIC as an IKKbeta substrate. Nature communications 6, 6658, https://doi.org/10.1038/ncomms7658 (2015).
4. Sharma, K. et al. Ultradeep human phosphoproteome reveals a distinct regulatory nature of Tyr and Ser/Thr-based signaling. Cell reports 8, 1583–1594, https://doi.org/10.1016/j.celrep.2014.07.036 (2014).
5. Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nature methods 13, 731–740, https://doi.org/10.1038/nmeth.3901 (2016).
6. Colaert, N., Helsens, K., Impens, F., Vandekerckhove, J. & Gevaert, K. Rover: a tool to visualize and validate quantitative proteomics data from different sources. Proteomics 10, 1226–1229, https://doi.org/10.1002/pmic.200900379 (2010).
7. Polpitiya, A. D. et al. DAnTE: a statistical tool for quantitative analysis of -omics data. Bioinformatics 24, 1556–1558, https://doi.org/10.1093/bioinformatics/btn217 (2008).
8. Efstathiou, G. et al. ProteoSign: an end-user online differential proteomics statistical analysis platform. Nucleic acids research https://doi.org/10.1093/nar/gkx444 (2017).
9. Stolte, C., Tang, D. & Hanrahan, P. Polaris: a system for query, analysis, and visualization of multidimensional relational databases. IEEE Transactions on Visualization and Computer Graphics 8, 52–65, https://doi.org/10.1109/2945.981851 (2002).
10. McKinney, W. Data Structures for Statistical Computing in Python. Proceedings of the 9th Python in Science Conference 51–56 (2010).
11. Farrell, D. DataExplore: An Application for General Data Analysis in Research and Education. Journal of Open Research Software 4, 9, https://doi.org/10.5334/jors.94 (2016).
12. Hunter, J. D. Matplotlib: A 2D Graphics Environment. Computing in Science & Engineering 9, 90–95, https://doi.org/10.1109/mcse.2007.55 (2007).
13. Waskom, M. et al. mwaskom/seaborn: v0.8.1 (September 2017). https://doi.org/10.5281/zenodo.883859 (2017).
14. Walt, S. V. D., Colbert, S. C. & Varoquaux, G. The NumPy Array: A Structure for Efficient Numerical Computation. Computing in Science & Engineering 13, 22–30, https://doi.org/10.1109/MCSE.2011.37 (2011).
15. Seabold, S. & Perktold, J. Statsmodels: Econometric and Statistical Modeling with Python. Proceedings of the 9th Python in Science Conference 57–61 (2010).
16. Pedregosa, F. et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12, 2825–2830 (2011).
17. Behnel, S. et al. Cython: The Best of Both Worlds. Computing in Science & Engineering 13, 31–39, https://doi.org/10.1109/MCSE.2010.118 (2011).
18. Wai, T. et al. The membrane scaffold SLP2 anchors a proteolytic hub in mitochondria containing PARL and the i-AAA protease YME1L. EMBO reports 17, 1844–1856, https://doi.org/10.15252/embr.201642698 (2016).
19. Saita, S. et al. PARL mediates Smac proteolytic maturation in mitochondria to promote apoptosis. Nature cell biology 19, 318–328, https://doi.org/10.1038/ncb3488 (2017).
20. Chen, Y., Lin, Y. C., Kuo, T. W. & Knight, Z. A. Sensory detection of food rapidly modulates arcuate feeding circuits. Cell 160, 829–841, https://doi.org/10.1016/j.cell.2015.01.033 (2015).
21. Fisher, R. A. The Use of Multiple Measurements in Taxonomic Problems. Annals of Eugenics 7, 179–188, https://doi.org/10.1111/j.1469-1809.1936.tb02137.x (1936).
22. Cortez, P., Cerdeira, A., Almeida, F., Matos, T. & Reis, J. Modeling wine preferences by data mining from physicochemical properties. Decision Support Systems 47, 547–553, https://doi.org/10.1016/j.dss.2009.05.016 (2009).
23. Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature genetics 25, 25–29, https://doi.org/10.1038/75556 (2000).
24. Mootha, V. K. et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nature genetics 34, 267–273, https://doi.org/10.1038/ng1180 (2003).
25. Calvo, S. E., Clauser, K. R. & Mootha, V. K. MitoCarta2.0: an updated inventory of mammalian mitochondrial proteins. Nucleic acids research 44, D1251–1257, https://doi.org/10.1093/nar/gkv1003 (2016).
26. Finn, R. D. et al. The Pfam protein families database: towards a more sustainable future. Nucleic acids research 44, D279–285, https://doi.org/10.1093/nar/gkv1344 (2016).
27. Sommer, C. & Gerlich, D. W. Machine learning in cell biology - teaching computers to recognize phenotypes. Journal of cell science 126, 5529–5539, https://doi.org/10.1242/jcs.123604 (2013).
28. Liu, C., Che, D., Liu, X. & Song, Y. Applications of machine learning in genomics and systems biology. Computational and mathematical methods in medicine 2013, 587492, https://doi.org/10.1155/2013/587492 (2013).
29. Pauling, L. et al. Sickle cell anemia, a molecular disease. Science 109, 443 (1949).
30. Williams, E. G. et al. Systems proteomics of liver mitochondria function. Science 352, aad0189, https://doi.org/10.1126/science.aad0189 (2016).
31. Berthold, M. R. et al. 319–326 (Springer Berlin Heidelberg).
32. Demšar, J. & Zupan, B. Orange: Data Mining Fruitful and Fun - A Historical Perspective. Informatica 37, 55–60 (2013).
33. Wongsuphasawat, K. et al. Voyager 2: Augmenting Visual Analysis with Partial View Specifications. ACM Human Factors in Computing Systems (CHI) (2017).
34. Rigbolt, K. T., Vanselow, J. T. & Blagoev, B. GProX, a user-friendly platform for bioinformatics analysis and visualization of quantitative proteomics data. Molecular & cellular proteomics: MCP 10, O110 007450, https://doi.org/10.1074/mcp.O110.007450 (2011).
35. Tenenhaus, A. et al. Variable selection for generalized canonical correlation analysis. Biostatistics 15, 569–583, https://doi.org/10.1093/biostatistics/kxu001 (2014).

Acknowledgements

We would like to thank all researchers at the University of Cologne, the Max Planck Institute for Metabolism and the Max Planck Institute for Ageing who helped to develop and improve Instant Clue. We acknowledge the members of the CECAD proteomics facility for helpful comments during the development of Instant Clue. This work was supported by the Cologne Cluster of Excellence in Cellular Stress Responses in Aging-associated Diseases (CECAD, EXC 229/2), the Collaborative Research Center - Molecular Mechanisms Regulating Skin Homeostasis (CRC 829; DFG SFB 829 A1), and the DFG NIE-1234/6-1.

Author information

Authors and Affiliations

Institute for Genetics and Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne, Joseph-Stelzmann-Strasse 26, 50931, Cologne, Germany
Hendrik Nolte, Thomas D. MacVicar, Frederik Tellkamp & Marcus Krüger
Department of Dermatology, Center for Molecular Medicine Cologne, University of Cologne, 50931, Cologne, Germany
Frederik Tellkamp
Center for Molecular Medicine (CMMC), University of Cologne, 50931, Cologne, Germany
Frederik Tellkamp & Marcus Krüger

Contributions

H.N. engineered the software. F.T. tested the software extensively. H.N. and T.D.M. recorded and wrote tutorials. M.K. supervised the project. The manuscript was written through contributions of all authors. All authors have given approval to the final version of the manuscript.

Corresponding authors

Correspondence to Hendrik Nolte or Marcus Krüger.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supporting Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Nolte, H., MacVicar, T.D., Tellkamp, F. et al. Instant Clue: A Software Suite for Interactive Data Visualization and Analysis. Sci Rep 8, 12648 (2018). https://doi.org/10.1038/s41598-018-31154-6

Received: 15 May 2018
Accepted: 13 August 2018
Published: 23 August 2018
DOI: https://doi.org/10.1038/s41598-018-31154-6
