Myle Ott

Character AI
Verified email at character.ai
Cited by 46149

RoBERTa: A robustly optimized BERT pretraining approach

Y Liu, M Ott, N Goyal, J Du, M Joshi, D Chen… - arXiv preprint arXiv …, 2019 - arxiv.org
Language model pretraining has led to significant performance gains but careful comparison
between different approaches is challenging. Training is computationally expensive, often …

fairseq: A fast, extensible toolkit for sequence modeling

M Ott, S Edunov, A Baevski, A Fan, S Gross… - arXiv preprint arXiv …, 2019 - arxiv.org
fairseq is an open-source sequence modeling toolkit that allows researchers and developers
to train custom models for translation, summarization, language modeling, and other text …

Finding deceptive opinion spam by any stretch of the imagination

M Ott, Y Choi, C Cardie, JT Hancock - arXiv preprint arXiv:1107.4557, 2011 - arxiv.org
Consumers increasingly rate, review and research products online. Consequently, websites
containing consumer reviews are becoming targets of opinion spam. While recent work has …

Recipes for building an open-domain chatbot

…, N Goyal, D Ju, M Williamson, Y Liu, J Xu, M Ott… - arXiv preprint arXiv …, 2020 - arxiv.org
Building open-domain chatbots is a challenging area for machine learning research. While
prior work has shown that scaling neural models in the number of parameters and the size of …

Unsupervised cross-lingual representation learning at scale

…, G Wenzek, F Guzmán, E Grave, M Ott… - arXiv preprint arXiv …, 2019 - arxiv.org
This paper shows that pretraining multilingual language models at scale leads to significant
performance gains for a wide range of cross-lingual transfer tasks. We train a Transformer-…

OPT: Open pre-trained transformer language models

…, M Diab, X Li, XV Lin, T Mihaylov, M Ott… - arXiv preprint arXiv …, 2022 - arxiv.org
Large language models, which are often trained for hundreds of thousands of compute days,
have shown remarkable capabilities for zero- and few-shot learning. Given their …

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

…, S Goyal, Z Lin, J Liu, D Guo, M Ott… - Proceedings of the …, 2021 - National Acad Sciences
In the field of artificial intelligence, a combination of scale in data and model capacity
enabled by unsupervised learning has led to major advances in representation learning and …

Understanding back-translation at scale

S Edunov, M Ott, M Auli, D Grangier - arXiv preprint arXiv:1808.09381, 2018 - arxiv.org
An effective method to improve neural machine translation with monolingual data is to
augment the parallel training corpus with back-translations of target language sentences. This …

Sustainable AI: Environmental implications, challenges and opportunities

…, C Bai, M Gschwind, A Gupta, M Ott… - Proceedings of …, 2022 - proceedings.mlsys.org
This paper explores the environmental impact of the super-linear growth trends for AI from a
holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the …

Phrase-based & neural unsupervised machine translation

G Lample, M Ott, A Conneau, L Denoyer… - arXiv preprint arXiv …, 2018 - arxiv.org
Machine translation systems achieve near human-level performance on some languages, yet
their effectiveness strongly relies on the availability of large amounts of parallel sentences, …