User profiles for Jouni Siren
Jouni Sirén - UC Santa Cruz Genomics Institute - Verified email at ucsc.edu - Cited by 2813
Pangenomics enables genotyping of known structural variants in 5202 diverse genomes
INTRODUCTION Modern genomics depends on inexpensive short-read sequencing.
Sequenced reads up to a few hundred base pairs in length are computationally mapped to …
Pangenome graphs
JM Eizenga, AM Novak, JA Sibbesen… - Annual review of …, 2020 - annualreviews.org
Low-cost whole-genome assembly has enabled the collection of haplotype-resolved
pangenomes for numerous organisms. In turn, this technological change is encouraging the …
Variation graph toolkit improves read mapping by representing genetic variation in the reference
Reference genomes guide our interpretation of DNA sequence data. However, conventional
linear references represent only one version of each locus, ignoring variation in the …
[HTML] Wheeler graphs: A framework for BWT-based data structures
The famous Burrows–Wheeler Transform (BWT) was originally defined for a single string but
variations have been developed for sets of strings, labeled trees, de Bruijn graphs, etc. In …
Indexing variation graphs
J Sirén - 2017 Proceedings of the nineteenth workshop on …, 2017 - SIAM
Variation graphs, which represent genetic variation within a population, are replacing
sequences as reference genomes. Path indexes are one of the most important tools for working …
Haplotype-aware graph indexes
Motivation The variation graph toolkit (VG) represents genetic variation as a graph. Although
each path in the graph is a potential haplotype, most paths are non-biological, unlikely …
[HTML] A draft human pangenome reference
Here the Human Pangenome Reference Consortium presents a first draft of the human
pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of …
[HTML] Computational graph pangenomics: a tutorial on data structures and their applications
Computational pangenomics is an emerging research field that is changing the way
computer scientists are facing challenges in biological sequence analysis. In past decades, …
Indexing graphs for path queries with applications in genome research
We propose a generic approach to replace the canonical sequence representation of
genomes with graph representations, and study several applications of such extensions. We …
Storage and retrieval of highly repetitive sequence collections
A repetitive sequence collection is a set of sequences which are small variations of each other.
A prominent example are genome sequences of individuals of the same or close species, …