%0 Journal Article %A Agnieszka Danek %A Sebastian Deorowicz %T GTC: an attempt to maintenance of huge genome collections compressed %D 2017 %R 10.1101/131649 %J bioRxiv %P 131649 %X We present GTC, a new compressed data structure for representation of huge collections of genetic variation data. GTC significantly outperforms existing solutions in terms of compression ratio and time of answering various types of queries. We show that the largest of publicly available database of about 60 thousand haplotypes at about 40 million SNPs can be stored in less than 4 Gbytes, while the queries related to variants are answered in a fraction of a second. GTC can be downloaded from http://sun.aei.polsl.pl/REFRESH/gtc. %U https://www.biorxiv.org/content/biorxiv/early/2017/04/28/131649.full.pdf