Google Scholar

User profiles for Scott Novotney

Scott Novotney

Senior Applied Scientist, Amazon

Verified email at amazon.com

Cited by 441

Cheap, fast and good enough: Automatic speech recognition with non-expert transcription

S Novotney, C Callison-Burch - … of the North American Chapter of …, 2010 - aclanthology.org

Deploying an automatic speech recognition system with reasonable performance requires
expensive and time-consuming in-domain transcription. Previous work demonstrated that non-…

Cited by 211

Unsupervised acoustic and language model training with small amounts of labelled data

S Novotney, R Schwartz, J Ma - 2009 IEEE International …, 2009 - ieeexplore.ieee.org

We measure the effects of a weak language model, estimated from as little as 100k words of
text, on unsupervised acoustic model training and then explore the best method of using …

Cited by 85

Large-scale design and refinement of stable proteins using sequence-only models

JM Singer, S Novotney, D Strickland, HK Haddox… - PloS one, 2022 - journals.plos.org

Engineered proteins generally must possess a stable structure in order to achieve their
designed function. Stable designs, however, are astronomically rare within the space of all …

Cited by 24

Analysis of low-resource acoustic model self-training

S Novotney, R Schwartz - Tenth annual conference of the …, 2009 - isca-archive.org

Previous work on self-training of acoustic models using unlabeled data reported significant
reductions in WER assuming a large phonetic dictionary was available. We now assume only …

Cited by 31

Crowdsourced accessibility: Elicitation of Wikipedia articles

S Novotney, C Callison-Burch - Proceedings of the NAACL HLT …, 2010 - aclanthology.org

Mechanical Turk is useful for generating complex speech resources like conversational
speech transcription. In this work, we explore the next step of eliciting narrations of Wikipedia …

Cited by 22

Cue vectors: Modular training of language models conditioned on diverse contextual signals

S Novotney, S Mukherjee, Z Ahmed… - arXiv preprint arXiv …, 2022 - arxiv.org

We propose a framework to modularize the training of neural language models that use diverse
forms of sentence-external context (including metadata) by eliminating the need to jointly …

Cited by 4

Improving accuracy of rare words for RNN-transducer through unigram shallow fusion

…, A Rastrow, L Liu, D Filimonov, S Novotney… - arXiv preprint arXiv …, 2020 - arxiv.org

End-to-end automatic speech recognition (ASR) systems, such as recurrent neural network
transducer (RNN-T), have become popular, but rare word remains a challenge. In this paper, …

Cited by 7

Attention-based contextual language model adaptation for speech recognition

RD Martinez, S Novotney, I Bulyko, A Rastrow… - arXiv preprint arXiv …, 2021 - arxiv.org

Language modeling (LM) for automatic speech recognition (ASR) does not usually incorporate
utterance level contextual information. For some domains like voice assistants, however, …

Cited by 8

Robust keystroke transcription from the acoustic side-channel

D Slater, S Novotney, J Moore, S Morgan… - Proceedings of the 35th …, 2019 - dl.acm.org

The acoustic emanations from keyboards provide a side-channel attack from which an
attacker can recover sensitive user information, such as passwords and personally identifiable …

Cited by 9

Unsupervised Arabic Dialect Adaptation with Self-Training

S Novotney, RM Schwartz, S Khudanpur - InterSpeech, 2011 - researchgate.net

Useful training data for automatic speech recognition systems of colloquial speech is usually
limited to expensive in-domain transcription. Broadcast news is an appealing source of …

Cited by 13
