PT - JOURNAL ARTICLE
AU - Katherine Redfield Chang
AU - Xinghua Lou
AU - Theofanis Karaletsos
AU - Christopher Crosbie
AU - Stuart Gardos
AU - David Artz
AU - Gunnar Rätsch
TI - An Empirical Analysis of Topic Modeling for Mining Cancer Clinical Notes
AID - 10.1101/062307
DP - 2016 Jan 01
TA - bioRxiv
PG - 062307
4099 - http://biorxiv.org/content/early/2016/07/06/062307.short
4100 - http://biorxiv.org/content/early/2016/07/06/062307.full
AB - Using a variety of techniques including Topic Modeling, Principal Component Analysis and Bi-clustering, we explore electronic patient records in the form of unstructured clinical notes and genetic mutation test results. Our ultimate goal is to gain insight into a unique body of clinical data, specifically regarding the topics discussed within the note content and relationships between patient clinical notes and their underlying genetics.