RT Journal Article
SR Electronic
T1 Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold
JF bioRxiv
FD Cold Spring Harbor Laboratory
SP 386110
DO 10.1101/386110
A1 Steinegger, Martin
A1 Mirdita, Milot
A1 Söding, Johannes
YR 2018
UL http://biorxiv.org/content/early/2018/08/07/386110.abstract
AB The open-source de-novo Protein-Level ASSembler Plass (https://plass.mmseqs.org) assembles six-frame-translated sequencing reads into protein sequences. It recovers 2 to 10 times more protein sequences from complex metagenomes and can assemble huge datasets. We assembled two redundancy-filtered reference protein catalogs, 2 billion sequences from 640 soil samples (SRC) and 292 million sequences from 775 marine eukaryotic metatranscriptomes (MERC), the largest free collections of protein sequences.