PT  - JOURNAL ARTICLE
AU  - Mohieddin Jafari
AU  - Cheng Chen
AU  - Mehdi Mirzaie
AU  - Jing Tang
TI  - NIMAA: an R/CRAN package to accomplish NomInal data Mining AnAlysis
AID  - 10.1101/2022.01.13.475835
DP  - 2022 Jan 01
TA  - bioRxiv
PG  - 2022.01.13.475835
4099  - http://biorxiv.org/content/early/2022/01/18/2022.01.13.475835.short
4100  - http://biorxiv.org/content/early/2022/01/18/2022.01.13.475835.full
AB  - Summary Nominal data is data that has been “labeled” and can be designated into a number of non-overlapping unordered groups. The analysis of this type of data is often superficial or trivial because it is not feasible to conduct extensive numerical methods on this type of data. Graphs or networks, on the other hand, are comprised of sets of nodes and edges that can also be considered as nominal variables. By integrating graph theory and data mining approaches, we offer the R package NIMAA to define a nominal data-mining pipeline to explore more information. Using nominal variables in a dataset, NIMAA provides functions for constructing weighted and unweighted bipartite graphs, analysing the similarity of labels in nominal variables, clustering labels or categories to super-labels, validating clustering results, predicting bipartite edges by missing weight imputation, and providing a variety of visualization tools. Here, we also indicated the application of nominal data mining in a biological dataset with well-riched nominal variables.Availability NIMAA’s official release and the beta update are available on CRAN and Github, respectively. URLs: https://CRAN.R-project.org/package=NIMAA and https://github.com/jafarilab/NIMAAContact mohieddin.jafari{at}helsinki.fi; jing.tang{at}helisnki.fiContributions MJ conceived the study and developed the models, MJ and CC adopted and implemented the methods, MM improved the methods, JT provided the funding, MJ, CC, MM and JT wrote the paper.Competing Interest StatementThe authors have declared no competing interest.