Publications and Tutorials
Books
-
Brian Roark, Richard Sproat and Suyoun Yoon. The Tools of the
Scribe: How writing systems, technology, and human factors interact to
affect the act of writing. Cham,
SpringerNature, Forthcoming, 2025.
-
Richard Sproat. Symbols: An Evolutionary History from the Stone
Age to the Future. SpringerNature, Cham, Switzerland, 2023.
-
Kyle Gorman and Richard Sproat. Finite-State Text Processing. Williston,
VT, Morgan Claypool, 2021.
- Fabian Brackhane, Richard Sproat and Jürgen Trouvain
(editors). Wolfgang
von Kempelens
Mechanismus der menschlichen
Sprache. New edition with English translation, and commentary. Technische
Universität Dresden Press, 2017.
-
Richard Sproat.
Language, Technology and Society. Oxford,
Oxford University Press, 2010.
-
Brian Roark, Richard Sproat.
Computational Approaches to Morphology
and Syntax. Oxford, Oxford University Press, 2007.
-
Martin Neef, Anneke Neijt and Richard Sproat (editors)
The Relation of Writing to Spoken Language. Tübingen:
Niemeyer (= Linguistische Arbeiten 460), 2002.
-
Richard Sproat,
A Computational Theory of Writing Systems, (ACL Studies
in Natural Language Processing Series), Cambridge, Cambridge
University Press, 2000.
List of known errata.
A prepublication version can be downloaded from here. This does not correct the errata listed
above nor is the pagination the same as the printed version.
-
Richard Sproat (editor),
Multilingual Text-to-Speech Synthesis: The Bell Labs
Approach, Boston, MA, Kluwer Academic Publishers, 1997.
-
Jan van Santen, Richard Sproat, Joseph Olive and Julia Hirschberg
(editors),
Progress in Speech Synthesis, New York, Springer,
1997.
-
Richard Sproat,
Morphology and Computation, Cambridge, MA,
MIT Press, 1992.
Videos
- Richard Sproat.
Introduction to Writing Systems: How Writing Encodes Language, SpringerNature,
2024.
Refereed Journals
-
Llion Jones, Richard Sproat, Haruko Ishikawa and Alexander
Gutkin. "Helpful Neighbors: Leveraging Neighbors in Geographic Feature
Pronunciation." Transactions of the Association for Computational
Linguistics, To appear.
-
Richard Sproat. "Last Words: Boring Problems are Sometimes the Most
Interesting." Computational Linguistics, 48(2): 483–490.
-
Richard Sproat and Alexander Gutkin. "The Taxonomy of Writing
Systems: How to Measure how Logographic a System is."
Computational Linguistics, 47(3), 477–528, 2021.
-
Richard Sproat. "Review of Zev Handel. 2019. Sinography: The Borrowing and Adaptation of the Chinese Script."
Asiatische Studien/Études Asiatiques, 74(4): 1103–1115, 2021.
-
Juliette Blevins, Richard Sproat. "Statistical Evidence for the
Proto-Indo-European-Euskarian Hypothesis: A word-list approach integrating
phonotactics". Diachronica, 2021.
-
Hao Zhang, Richard Sproat, Axel Ng, Felix Stahlberg, Xiaochang Peng, Kyle
Gorman, Brian Roark. "Neural models of text normalization for speech
applications." Computational Linguistics, 45(2), 2019: 293-337.
- Richard Sproat. "Review of Daniels An Exploration of Writing",
Written Language and Literacy, 21(2), 2018: 269–278.
- Richard Sproat. "A Computational Model of the Discovery of Writing",
Written Language and Literacy, 20(2), 2017: 194-226.
-
Richard Sproat, "Language typology in speech and language technology",
Linguistic Typology, 20(3), 2016.
-
Kyle Gorman and Richard Sproat. "Minimally supervised models for number
normalization." Transactions of the Association for
Computational Linguistics. 2016.
-
Richard Sproat. "On misunderstandings and misrepresentations: A reply to Rao et
al." Language, 91(4), 2015.
-
Peter Ebden and Richard Sproat. "The Kestrel TTS Text Normalization System."
Journal of Natural Language Engineering, 2014.
-
Richard Sproat, Mahsa Yarmohammadi, Izhak Shafran and Brian Roark. "Applications
of Lexicographic Semirings to
Problems in Speech and Language Processing," Computational Linguistics,
2014.
-
Richard Sproat, "A statistical comparison of written language and
non-linguistic symbol systems," Language,
2014.
-
Richard Sproat, "A note on Unger’s 'What linguistic units do Chinese
characters represent?'", Written Language & Literacy, 16(1), 107–111,
2013.
-
Jan van Santen, Richard Sproat and Alison Presmanes Hill, "Quantifying
Repetitive Speech in Autism Spectrum Disorders and Language Impairment,"
Autism Research, 2013.
-
Padmapriya Kandhadai and Richard Sproat,
"Impact of spatial ordering of graphemes in alphasyllabic scripts on
phonemic awareness in Indic languages,"
Writing Systems Research, 2(2), 105-116, 2010.
-
Richard Sproat. "Reply to Rao et al. and Lee et al." Computational
Linguistics, 36:4, 2010.
- Richard Sproat. "Last Words: Ancient
symbols, computational linguistics, and the reviewing practices of the
general science journals." Computational Linguistics,
36:3, 2010.
-
Aamir Wali, Richard Sproat, Prakash Padakannaya and Bhuvaneshwari,
B. "Model for phonemic awareness in readers of Indian
script". Written Language and Literacy, 12:2, 161-169, 2009.
-
Su-Youn Yoon, Lisa Pierce, Amanda Huensch, Eric Juul, Samantha
Perkins, Richard Sproat, and Mark Hasegawa-Johnson, "Construction of a
rated speech corpus of L2 learners' speech," CALICO Journal, 2009
-
Richard Sproat. "Brahmi-Derived Scripts, Script Layout, and
Phonological Awareness." Written Language and Literacy,
9(1), 45--66, 2006.
- Michiel Bacchiani, Brian Roark, Michael Riley and Richard Sproat
"MAP Adaptation of Stochastic Grammars".
Computer Speech and Language, 20(1):41-68, 2006.
-
Steve Farmer, Richard Sproat and Michael Witzel.
"The Collapse of the
Indus-Script Thesis: The Myth of a Literate Harappan
Civilization". Electronic Journal of Vedic Studies, 11(2), 2004
-
Narasimhan, Bhuvana, Sproat, Richard, and Kiraz, George.
"Schwa-deletion in Hindi Text-to-Speech Synthesis," International
Journal of Speech Technology, 7(4), 319-333, 2004.
-
Richard Sproat, Alan Black, Stanley Chen, Shankar Kumar, Mari
Ostendorf, and Christopher Richards. "Normalization of non-standard
words." Computer Speech and Language, 15(3), 287-333, 2001.
-
Richard Sproat. "Multilingual Text Analysis for Text-to-Speech
Synthesis", Natural Language Engineering, 2(4), 369-380,
1996.
-
Chilin Shih and Richard Sproat, "Issues in
Text-to-Speech Conversion for Mandarin," Computational
Linguistics and Chinese Language Processing, 1996.
-
Richard Sproat, Chilin Shih, William Gale and Nancy Chang, "A
Stochastic Finite-State Word-Segmentation Algorithm for Chinese,"
Computational Linguistics, 22(3), 1996.
-
Harald Baayen and Richard Sproat,
"Estimating Lexical Priors for
Low-Frequency Morphologically Ambiguous Forms," Computational
Linguistics, 22(2), 1996.
-
Richard Sproat and Chilin Shih. "A Corpus-Based Analysis of Mandarin
Nominal Root Compounds." Journal of East Asian Linguistics,
5, 49-71, 1996.
-
Richard Sproat and Joseph Olive, "Text to Speech Synthesis,"
AT&T Technical Journal, 74(2), 35-44, 1995.
-
Richard Sproat, "English Noun-Phrase Accent Prediction for
Text-to-Speech." Computer Speech and Language, 8, 79-94,
1994. The list of about 7900 nominals with hand-assigned stress
used in this paper can be found here.
-
Richard Sproat and Chilin Shih, "Why Mandarin Morphology is not
Stratum-Ordered." Yearbook of Morphology, 185-217, 1993.
-
Richard Sproat and Osamu Fujimura, "Allophonic variation in English
/l/ and its implications for phonetic implementation," Journal of
Phonetics, 21, 291-311, 1993.
-
Gail McKoon, Gregory Ward, Roger Ratcliff and Richard Sproat,
"Morphosyntactic and pragmatic factors affecting the accessibility of
discourse entities," Journal of Memory and Language, 32,
1-20, 1993.
-
David Roe, Pedro Moreno, Richard Sproat, Fernando Pereira, Michael
Riley and Alejandro Macarrón. "A Spoken Language Translator for
Restricted-Domain Context-Free Languages," Speech
Communication, 11, 1-9, 1992.
-
Richard Sproat, "Unhappier is not a Bracketing Paradox,"
Linguistic Inquiry, 23, 347-352, 1992.
-
Gregory Ward, Richard Sproat and Gail McKoon, "A Pragmatic Analysis of
So-Called Anaphoric Islands," Language, 67, 439-474, 1991.
-
Richard Sproat and Chilin Shih, "A Statistical
Method for Finding Word Boundaries in Chinese Text," Computer
Processing of Chinese and Oriental Languages, 4, 336-351, 1990.
-
Richard Sproat, "Welsh Syntax and VSO Structure," Natural Language
and Linguistic Theory, 3, 173-216, 1985.
-
Richard Sproat, "Southern California Reflexives: An Example of
Translation Borrowing?" Journal of California and Great Basin
Anthropology: Papers in Linguistics, 3, 77-94, 1981.
Refereed Collections
- Christo Kirov and Richard Sproat. "Computational
Morphology". Probabilistic Linguistics, CSLI, to appear.
-
Richard Sproat. "Writing Systems". Oxford Handbook of the History of
Phonology, to appear.
-
Gregory Ward and Richard Sproat. "Anaphoric Islands: A History of a Misleading
Geographical Metaphor". The Wiley-Blackwell Companion to Morphology.
Oxford: Wiley-Blackwell, to appear.
-
Sproat, Richard. "English among the writing systems of the world", in Vivian
Cook and Des Ryan, Handbook of the English Writing System,
Routledge.
-
Sproat, Richard and Prakash Padakannaya. "Script Indices". in
N. Srinivasan, A.K. Gupta and J. Pandey, Advances in Cognitive
Science, Volume 1. New Delhi, Sage: 2008.
-
Sproat, Richard. "The Consistency of the Orthographically Relevant
Level in Dutch." in Martin Neef, Anneke Neijt and Richard Sproat (editors). The
Relation of Writing to Spoken Language. Tübingen: Niemeyer (=
Linguistische Arbeiten 460), 2002.
-
Jianying Hu, Richard Sproat and Hao Chen. "Electronic Mail", in
McGraw-Hill Yearbook of Science and Technology, 2001,
144-146, McGraw-Hill, New York, 2000.
-
R. Harald Baayen, Robert Schreuder and Richard Sproat. "Morphology in
the Mental Lexicon: A Computational Model for Visual Word
Recognition", in F. Van Eynde and D. Gibbon (eds.) Lexicon
Development for Speech and Language Processing, 267-293,
Dordrecht, Kluwer Academic Publishers, 2000.
-
Richard Sproat, Christer Samuelsson, Jennifer Chu-Carroll and Bob
Carpenter. "Computational Linguistics", in Mark Aronoff and Janie
Rees-Miller (eds.) The Blackwell Handbook of Linguistics,
Oxford, Blackwell, 2000.
-
Richard Sproat, "Lexical Analysis", in Robert Dale, Hermann Moisl, and
Harold Somers (eds.) Handbook of Natural Language
Processing, 37-58, New York, Dekker, 2000.
-
Richard Sproat and Joseph Olive, "Text-to-Speech Synthesis," in
L. Rabiner and R. Cox (eds.), IEEE/CRC Press Handbook of Signal
Processing, Cleveland, CRC Press, 1997.
-
Richard Sproat, "Text
Interpretation for TtS Synthesis", in Ron Cole (ed.) Survey of
the State of the Art in Human Language Technology,
http://www.cse.ogi.edu/CSLU/HLTsurvey/HLTsurvey.html, 1996.
-
Richard Sproat and Joseph Olive, "A Modular Architecture for
Multilingual Text-to-Speech," in J. van Santen, R. Sproat, J. Olive
and J. Hirschberg (eds.), Progress in Speech Synthesis, New
York, Springer, 1997.
-
Richard Sproat and Joseph Olive, "An Approach to Text-to-Speech
Synthesis," in W. Kleijn and K. Paliwal (eds.), Speech Coding and
Synthesis, 611-633, Amsterdam, Elsevier, 1995.
-
Julia Hirschberg and Richard Sproat, "Pitch Accent Prediction from
Text Analysis," in J. Cole, G. Green and J. Morgan (eds.),
Linguistics and Computation, 281-296, CSLI Publications,
Chicago, University of Chicago Press, 1995.
-
Richard Sproat, "Looking into Words," in S. Hargus and E. Kaisse
(eds.), Studies in Lexical Phonology, 173-195, San Diego,
Academic Press, 1993.
-
Mark Liberman and Richard Sproat, "The Stress and Structure of
Modified Noun Phrases in English," in I. Sag (ed.), Lexical
Matters, 131-181, CSLI Publications, Chicago, University of
Chicago Press, 1992.
-
Richard Sproat and Chilin Shih, "The Cross-Linguistic Distribution of
Adjective Ordering Restrictions," in C. Georgopoulos and R. Ishihara
(eds.), Interdisciplinary Approaches to Language: Essays in Honor
of S.-Y. Kuroda, 565-593, Dordrecht, Kluwer Academic
Publishers, 1990.
-
Richard Sproat, "Bracketing Paradoxes, Cliticization and Other Topics:
The Mapping Between Syntactic and Phonological Structure," in
M. Everaert, A. Evers, R. Huybregts and M. Trommelen (eds.),
Morphology and Modularity: In Honour of Henk Schultink, 339-360,
Dordrecht, Foris, 1988.
-
Richard Sproat, "On Anaphoric Islandhood," in M. Hammond and M.
Noonan (eds.), Theoretical Morphology, 291-301, San Diego,
Academic Press, 1988.
Conference Proceedings, Presentations
-
Shigeki Karita, Richard Sproat and Haruko Ishikawa.
"Lenient Evaluation of Japanese Speech Recognition: Modeling
Naturally Occurring Spelling Inconsistency." First ACL Workshop on
Computation and Written Language, Toronto, July 14, 2023.
-
Abhirut Gupta, Ananya B. Sai, Richard Sproat, Yuri Vasilevski, James
S. Ren, Ambarish Jash, Sukhdeep S. Sodhi and Aravindan Raghuveer.
"Bi-Phone: Modeling Inter Language Phonetic Influences in Text."
Association for Computational Linguistics, Toronto, July 9-14, 2023.
-
Richard Sproat. "A Computational Model of the Invention of Writing."
Anchoring the Invention of Writing. Leiden, December 15-17, 2022.
-
Alexander Gutkin, Cibu Johny, Raiomond Doctor, Brian Roark and Richard
Sproat. "Beyond Arabic: Software for Perso-Arabic Script
Manipulation," WANLP, 2022.
-
Christo Kirov, Richard Sproat and Alexander Gutkin. "Mockingbird at the SIGTYP
2022 Shared Task: Two Types of Models for the Prediction of Cognate
Reflexes," ACL SIGTYP, July, 2022.
-
Raiomond Doctor, Alexander Gutkin, Cibu Johny, Brian Roark and Richard Sproat
"Graphemic Normalization of the Perso-Arabic Script," Grapholinguistics
in the 21st Century, June 2022.
-
Kyle Gorman and Richard Sproat. "On the Persistent Conflation of Writing and
Language," Grapholinguistics in the 21st Century, June 2022.
-
Richard Sproat. "Computational Methods in the Analysis of Graphical Symbol
Systems," Keynote Presentation, Grapholinguistics in the 21st Century,
June 2022.
-
Kyle Gorman, Christo Kirov, Brian Roark and Richard Sproat. "Structured
abbreviation expansion in context", Findings of EMNLP, 2021.
-
Jae Ro, Hao Zhang and Richard Sproat.
"Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on
Knowledge Graph Entities". COLING, 2020.
-
Alexander Gutkin and Richard Sproat, "NEMO: Frequentist Inference Approach to
Constrained Linguistic Typology Feature Prediction in SIGTYP 2020 Shared
Task". SIGTYP Workshop, 2020.
-
Alena Butryna, Shan-Hui Cathy Chu, Isin Demirsahin, Alexander Gutkin, Linne Ha,
Fei He, Martin Jansche, Cibu Johny, Anna Katanova, Oddur Kjartansson, Chenfang
Li, Tatiana Merkulova, Yin May Oo, Knot Pipatsrisawat, Clara Rivera,
Supheakmungkol Sarin, Pasindu de Silva, Keshan Sodimana, Richard Sproat,
Theeraphol Wattanavekin and Jaka Aris Eko Wibawa. "Google Crowdsourced Speech
Corpora and Related Open-Source Resources for Low-Resource Languages and
Dialects: An Overview". 2019 UNESCO International Conference Language
Technologies for All (LT4All): Enabling Linguistic Diversity and
Multilingualism Worldwide, 4-6 December, Paris, France, 2019.
https://arxiv.org/abs/2010.06778
-
Ajda Gokcen, Hao Zhang and Richard Sproat. "Dual Encoder Classifier Models as
Constraints in Neural Text Normalization."
Interspeech, 2019.
-
Sandy Ritchie, Richard Sproat, Kyle Gorman, Daan van Esch, Christian Schallhart, Nikos
Bampounis, Benoît Brard, Jonas Fromseier Mortensen, Millie Holt, and Eoin Mahon.
"Unified Verbalization for Speech Recognition \& Synthesis Across Languages."
Interspeech, 2019.
-
Keshan Sodimana, Pasindu De Silva, Richard Sproat, Theeraphol
Wattanavekin, Alexander Gutkin and Knot Pipatsrisawat. "Text Normalization
for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech
Systems." SLTU. 2018.
-
Hao Zhang, Ke Wu and Richard Sproat "Fast and Accurate Reordering with ITG
Transition RNN." COLING. 2018.
-
Ke Wu, Kyle Gorman and Richard Sproat. "Minimally supervised
written-to-spoken text normalization." ASRU. 2017.
- Fabian Brackhane, Richard Sproat and Jürgen Trouvain. "Editing
Kempelen’s Mechanismus der menschlichen Sprache: Experiences and Findings,"
History of Speech Communication Research, 2017.
-
Richard Sproat and Navdeep Jaitly "An RNN Model of Text Normalization,"
Proceedings of Interspeech, 2017.
-
Alexander Gutkin and Richard Sproat. "Areal and Phylogenetic Features for
Multilingual Speech Synthesis," Proceedings of Interspeech, 2017.
-
Daan van Esch and Richard Sproat. "An Expanded Taxonomy of Semiotic Classes for
Text Normalization," Proceedings of Interspeech, 2017.
-
Alexander Gutkin, Linne Ha, Martin Jansche, Oddur Kjartansson, Knot
Pipatsrisawat, Richard Sproat, "Building statistical parametric multi-speaker
synthesis for Bangladeshi Bangla", IEEE Workshop on Spoken Language
Technology (SLT), 2016.
-
Alexander Gutkin, Linne Ha, Martin Jansche, Knot Pipatsrisawat and Richard
Sproat, "TTS for Low Resource Languages: A Bangla Synthesizer." LREC
2016.
-
Masoud Rouhizadeh, Emily Prud’hommeaux, Jan van Santen, Richard Sproat.
"Measuring idiosyncratic interests in children with autism spectrum
disorder". Association for Computational Linguistics, 2015, Beijing.
-
Masoud Rouhizadeh, Jan van Santen, Richard Sproat. "Similarity Measures for
Quantifying Restrictive and Repetitive Behavior in Conversations of Autistic
Children". CLPsych, 2015, Denver.
-
Emily Prud'hommeaux, Eric Morley, Masoud Rouhizadeh, Laura Silverman, Jan van
Santen, Brian Roark, Richard Sproat, Sarah Kauper, Rachel DeLaHunta.
"Computational analysis of trajectories of
linguistic development in autism". IEEE Workshop on Spoken Language
Technology, 2014, South Lake Tahoe.
-
Richard Sproat and Keith Hall. "Applications of Maximum Entropy Rankers to Problems in
Spoken Language Processing." Interspeech 2014.
-
Brian Roark and Richard Sproat. "Hippocratic abbreviation expansion." ACL 2014.
-
Masoud Rouhizadeh, Emily Prud'hommeaux, Jan van Santen, Richard Sproat
"Detecting linguistic restricted interests in autism
using distributional semantic models," CLPsych, 2014, Baltimore, MD
-
Richard Sproat, Bruno Cartoni, HyunJeong Choi, Linne Ha, David Huynh, Ravindran
Rajakumar and Evelyn Wenzel-Grondie. "A Database for Measuring Linguistic
Information Content." LREC 2014.
-
Keith Hall and Richard Sproat. "Russian Stress Prediction using Maximum Entropy
Ranking", EMNLP 2013
-
Bob Coyne, Alex Klapheke, Masoud Rouhizadeh, Richard Sproat and Daniel
Bauer. 2012. "Annotation Tools and Knowledge Representation for a
Text-To-Scene System." COLING, Mumbai, India, December, 2012.
- Steven Bedrick, Russell Beckley, Brian Roark and Richard Sproat. "Robust
kaomoji detection in Twitter". Language and Social Media,
Montreal, Canada, June, 2012.
- Margaret Mitchell and Richard Sproat. "Discourse-Based Modeling for AAC".
Third Workshop on Speech and Language Processing for Assistive
Technologies, Montreal, Canada, June, 2012.
-
Brian Roark, Michael Riley, Cyril Allauzen, Terry Tai and Richard Sproat.
"The OpenGrm open-source finite-state grammar software libraries".
ACL 2012, Jeju Island, Korea, July.
-
Katherine Wu, Jennifer Solman, Ruth Linehan and Richard Sproat. "Corpora of
Non-Linguistic Symbol Systems." Linguistic Society of America,
Portland, OR, January 2012. PDF.
-
Terry Tai, Wojciech Skut and Richard Sproat. "Thrax: An Open Source
Grammar Compiler Built on OpenFst". ASRU 2011, Waikoloa Resort,
Hawaii, December.
-
Masoud Rouhizadeh, Bob Coyne, and Richard Sproat, "Collecting Semantic
Information for Locations in the Scenario-Based Lexical Knowledge Resource of a
Text-to-Scene Conversion System", 15th International Conference on
Knowledge-Based and Intelligent Information & Engineering Systems (KES
2011), Kaiserslautern, Germany, September, 2011.
- Izhak Shafran, Richard Sproat, Mahsa Yarmohammadi and Brian
Roark. "Efficient Determinization of Tagged Word Lattices using Categorial
and Lexicographic Semirings". ASRU 2011, Waikoloa Resort, Hawaii,
December.
-
Brian Roark, Andrew Fowler, Richard Sproat, Christopher Gibbons and
Melanie Fried-Oken, "Towards technology-assisted co-construction with
communication partners." SLPAT 2011, Edinburgh. 2011.
-
Masoud Rouhizadeh, Daniel Bauer, Bob Coyne, Owen Rambow and Richard Sproat. 2011.
"Collecting Spatial Information for Locations in a Text-to-Scene Conversion
System." CoSLI 2011 Workshop, Boston, MA. 2011.
-
Brian Roark, Richard Sproat and Izhak Shafran,
"Lexicographic Semirings for Exact Automata Encoding of
Sequence Models". ACL-HLT 2011, Portland, OR. 2011. Best short paper award
-
Alexander Kotov, ChengXiang Zhai and Richard Sproat,
"Mining Named Entities with Temporally Correlated Bursts from Multilingual Web
News Streams", WSDM 2011, 2011.
-
Masoud Rouhizadeh, Margit Bowler, Richard Sproat and Bob
Coyne. "Collecting Semantic Data from Amazon's Mechanical Turk for a
Lexical Knowledge Resource in a Text to Picture Generating System."
International Conference on Computational Semantics (IWCS
2011), Oxford, January, 2011.
- Aaron M. Cohen, Kyle Ambert, Jianji Yang, Robert Felder, Richard Sproat,
Brian Roark, Kristy Hollingshead and Kari Baker. "OHSU/Portland VAMC Team
Participation in the 2010 i2b2/VA Challenge Tasks," Fourth i2b2/VA NLP
Workshop, Washington, December 2010.
-
Masoud Rouhizadeh, Margit Bowler, Richard Sproat and Bob Coyne. "Data
Collection and Normalization for Building the Scenario-Based Lexical
Knowledge Resource of a Text-to-Scene Conversion System," SMAP 2010:
5th International Workshop on Semantic Media Adaptation and
Personalization. Limassol, Cyprus, 2010.
-
Richard Sproat, "Lightly Supervised Learning of Text Normalization:
Russian Number Names," IEEE Workshop on Spoken Language
Technology, Berkeley, CA, 2010.
-
Su-Youn Yoon, Mark Hasegawa-Johnson, and Richard Sproat,
"Landmark-based Automated Pronunciation Error
Detection," Proceedings of Interspeech 2010, Makuhari, Japan,
2010.
-
Suma Bhat, Mark Hasegawa-Johnson, Richard Sproat. "Automatic Fluency
Assessment by Signal-Level Measurement of Spontaneous Speech".
2010 INTERSPEECH Satellite Workshop on Second Language Studies:
Acquisition, Learning, Education and Technology., Makuhari,
Japan, 2010.
-
Bob Coyne, Richard Sproat, Julia Hirschberg. "Spatial Relations in
Text-to-Scene Conversion". Computational Models of Spatial Language
Interpretation, Workshop at Spatial Cognition 2010, Mt. Hood, Oregon.
-
Bob Coyne, Owen Rambow, Julia Hirschberg, and Richard Sproat,
"Frame Semantics in Text-to-Scene Generation".
14th International Conference on Knowledge-Based and Intelligent
Information & Engineering Systems, Cardiff, Wales, 2010.
-
Ting Qian, Kristy Hollingshead, Su-youn Yoon, Kyoung-young Kim,
and Richard Sproat. "A Python Toolkit for Universal Transliteration".
LREC, Malta, 2010.
-
Suma Bhat, Richard Sproat, Mark Hasegawa-Johnson and Fred Davidson,
"Automatic fluency assessment using thin-slices of spontaneous speech," LTRC,
2010.
-
Su-youn Yoon, Mark Hasegawa-Johnson and Richard Sproat. "Automated
Pronunciation Scoring using Confidence Scoring and Landmark-based
SVM", InterSpeech, 2009.
-
Martin Jansche, Richard Sproat, "Named Entity Transcription with Pair
n-Gram Models", Named Entities Workshop: Shared Task on Transliteration
(NEWS 2009), ACL--IJCNLP 2009, pp. 32--35.
- Suma Bhat and Richard Sproat. "Knowing the Unseen: Estimating
Vocabulary Size over Unseen Samples",ACL, 2009.
-
Xuanhui Wang, ChengXiang Zhai, Xiao Hu, and Richard Sproat, "Mining
Correlated Bursty Topic Patterns from Coordinated Text Streams",
Proceedings of the 2007 ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining (KDD'07 ), pages 784-793.
-
Su-Youn Yoon, Kyoung-young Kim and Richard Sproat. "Multilingual
Transliteration Using Feature based Phonetic Method". ACL,
June, 2007, Prague.
-
Alla Rozovskaya, Richard Sproat, "Multilingual Word Sense
Discrimination: A Comparative Cross-Linguistic Study",
Balto-Slavonic Natural Language Processing. June, 2007,
Prague.
-
Boon Pang Lim, Richard Sproat: Using Latent Semantics for NE
Translation. ICCPOL 2006: 457-465
-
Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat and ChengXiang
Zhai. "Unsupervised Named Entity Transliteration Using Temporal and
Phonetic Correlation." EMNLP,
July 22-23, 2006, Sydney, Australia.
-
Richard Sproat, Tao Tao and ChengXiang Zhai.
"Named Entity Transliteration with Comparable Corpora".
ACL 2006, July 17-21,
2006, Sydney, Australia.
-
Alla Rozovskaya, Richard Sproat and Elabbas Benmamoun. "Language
Modeling of Arabic Dialects". Colloquium on Arabic Language
Processing, June 5-7, Rabat, Morocco.
-
Cecilia Ovesdotter Alm, Dan Roth and Richard Sproat "Emotions from
text: machine learning for text-based emotion prediction." HLT/EMNLP
2005. October 6-8, 2005, Vancouver.
-
Cecilia Alm and Richard Sproat. "Emotional sequencing and
development in fairy tales." First
International Conference on Affective Computing and Intelligent
Interaction, Beijing, China, Oct. 22-24, 2005.
-
Cecilia Alm and Richard Sproat. "Perceptions of emotions in expressive
storytelling." InterSpeech 2005, Lisbon, Portugal, Sep. 4-8,
2005.
-
Yanli Zheng, Richard Sproat, Liang Gu, Izhak Shafran,
Haolang Zhou, Yi Su, Dan Jurafsky, Rebecca Starr,
Su-Youn Yoon. "Accent Detection and Speech Recognition for
Shanghai-Accented Mandarin." InterSpeech 2005, Lisbon, Portugal,
Sep. 4-8, 2005.
-
Richard Sproat, Prakash Padakannaya. "Script
Indices". International Conference
on Cognitive Science, Allahabad, India, December 2004.
-
Sameer Maskey, Michiel Bacchiani, Brian Roark, and Richard
Sproat. "Improved Name Recognition with Meta-Data Dependent Name
Networks". ICASSP 2004, Montréal, May 2004.
-
Murat Saraclar, Richard Sproat. "Lattice-Based Search for Spoken
Utterance Retrieval." HLT-NAACL 04, Boston, May 2004.
-
Michael Riley, Brian Roark and Richard Sproat. "Good-Turing Estimation
from Word Lattices for Unsupervised Language Model Adaptation." IEEE
Automatic Speech Recognition and Understanding Workshop. Fisherman's
Reef, U.S. Virgin Islands. December 2003.
-
Richard Sproat,
"A Formal Computational Analysis of Indic Scripts", International Symposium
on Indic Scripts: Past and Future, Tokyo, December 2003.
-
Jason Zhang, Alan Black and Richard Sproat, "Identifying Speakers in
Children's Stories for Speech Synthesis," Proceedings of
EUROSPEECH 2003, Geneva, September 2003.
-
Richard Sproat and Thomas Emerson, "The First
International Chinese Word Segmentation Bakeoff", The Second
SIGHAN Workshop on Chinese Language Processing, Sapporo, Japan,
July 2003.
-
Srinivas Bangalore, Owen Rambow, Tahir Butt, Alexis Nasr and Richard
Sproat, "Creating a Finite-State Parser with Application Semantics",
COLING, Taipei, Taiwan, August 2002.
-
Richard Sproat,
"Inferring the Environment in a Text-to-Scene Conversion System",
First International Conference on Knowledge Capture (K-CAP
'01), Victoria, BC, Canada, 2001.
-
Richard Sproat,
"Pmtools: A Pronunciation Modeling Toolkit",
Proceedings of the Fourth ISCA Tutorial and Research Workshop on
Speech Synthesis, Blair Atholl, Scotland, 2001.
-
Bob Coyne and Richard Sproat,
"WordsEye: An Automatic Text-to-Scene Conversion system",
SIGGRAPH 2001, Los Angeles, CA, 2001.
-
Richard Sproat, "Corpus-Based Methods and Hand-Built Methods",
International Conference on Spoken Language Processing, 2000.
-
Bhuvana Narasimhan, Richard Sproat and George Kiraz. "Schwa deletion
in Hindi Text-to-Speech Synthesis."
Workshop on Computational Linguistics in South Asian Languages, 21st
SALA, October 2001, Konstanz.
-
Jan van Santen and Richard Sproat. "High-accuracy automatic segmentation".
Proceedings of EUROSPEECH 99, Budapest, Hungary, 1999.
-
Richard Sproat, Jianying Hu, Hao Chen,
"EMU: An E-mail Preprocessor for Text-to-Speech," IEEE Signal
Processing Society 1998 Workshop on Multimedia Signal Processing, Los
Angeles, CA.
-
Hao Chen, Jianying Hu and Richard Sproat. "E-mail Signature Block
Analysis", ICPR'98, Brisbane, Australia.
-
Richard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan Black,
Kevin Lenzo, Mike Edgington, "SABLE: A
Standard for TTS Markup", International Conference on Spoken
Language Processing, 1998. Also presented at the ESCA/COCOSDA
Speech Synthesis Workshop. Jenolan Caves, Australia; and at the W3C
Meeting on Voice Browsers, Cambridge, MA, October 13, 1998.
-
Richard Sproat, Jan van Santen, "Automatic Ambiguity Detection",
International Conference on Spoken Language Processing, 1998.
-
Richard Sproat, Paul Taylor, Michael Tanenblatt and Amy Isard, "A
Markup Language for Speech Synthesis," Proceedings of
EUROSPEECH, 1997.
-
Bernd Möbius, Richard Sproat, Jan van Santen and Joseph Olive, "A
German Text-to-Speech System," Proceedings of EUROSPEECH,
1997.
-
Elena Pavlova, Chilin Shih and Richard Sproat, "A Text-to-Speech
System for Russian," Proceedings of EUROSPEECH, 1997.
-
Chilin Shih and Richard Sproat, "Mandarin Text-to-Speech Synthesis,"
Proceedings of the Eighth North American Conference on Chinese
Linguistics, 1997.
-
Richard Sproat. "Multilingual Text Analysis for Text-to-Speech
Synthesis", International Conference on Spoken Language
Processing, 1996.
-
Richard Sproat. "Multilingual Text
Analysis for Text-to-Speech Synthesis", ECAI '96 Workshop on
Extended Finite State Models of Language, Budapest, Hungary,
75-81, 1996.
-
Richard Sproat and Michael Riley. "Compilation of Weighted
Finite-State Transducers from Decision Trees", 34th Annual
Meeting of the Association for Computational Linguistics: Proceedings
of the Conference, 1996.
-
Mehryar Mohri and Richard Sproat. "An Efficient Compiler
for Weighted Rewrite Rules", 34th Annual Meeting of the
Association for Computational Linguistics: Proceedings of the
Conference, 1996.
-
Richard Sproat. "A Finite-State Architecture for Tokenization and
Grapheme-to-Phoneme Conversion for Multilingual Text Analysis,"
Proceedings of the EACL SIGDAT Workshop, 65-72, Dublin,
Ireland. Association for Computational Linguistics, 1995.
-
Richard Sproat and Chilin Shih, "A Corpus-Based Analysis of Mandarin
Nominal Root Compounds," Proceedings of the Sixth North American
Conference on Chinese Linguistics, 1995.
-
Benjamin Ao, Chilin Shih and Richard Sproat, "A Corpus-Based Mandarin
Text-to-Speech Synthesizer," International Conference on Spoken
Language Processing, 1771-1774, 1994.
-
Richard Sproat and Joseph Olive, "A Modular Architecture for
Multilingual Text-to-Speech," Proceedings of The Second ESCA/IEEE
Workshop on Speech Synthesis, 187-190, 1994.
-
Richard Sproat, Chilin Shih, William Gale and Nancy Chang, "A
Stochastic Finite-State Word-Segmentation Algorithm for Chinese,"
32nd Annual Meeting of the Association for Computational
Linguistics: Proceedings of the Conference, 66-73, 1994.
-
Fernando Pereira, Michael Riley and Richard Sproat, "Weighted Rational
Transductions and their Application to Human Language Processing,"
ARPA Workshop on Human Language Technology, 249-254,
1994.
-
Richard Sproat, Julia Hirschberg and David Yarowsky, "A corpus-based
synthesizer," International Conference on Spoken Language
Processing, 563-566, 1992.
-
Richard Sproat and Chilin Shih, "On the sources of some constraints on
Mandarin morphology," Proceedings of the Third International
Symposium on Chinese Languages and Linguistics, 20-37, 1992.
-
David Roe, Fernando Pereira, Richard Sproat, Michael Riley, Pedro
Moreno and Alejandro Macarrón, "Efficient Grammar Processing for a
Spoken Language Translation System," ICASSP-92(1), 213-216,
1992.
-
David Roe, Fernando Pereira, Richard Sproat, Michael Riley, Pedro
Moreno and Alejandro Macarrón, "Toward a Spoken Language Translator
for Restricted-Domain Context-Free Languages," EUROSPEECH
91(3), 1063-1066, 1991.
-
Richard Sproat, "An Application of Statistical Optimization with
Dynamic Programming to Phonemic-Input-to-Character Conversion for
Chinese," Proceedings of R.O.C. Computational Linguistics
Conference III, 379-390, 1990.
-
Richard Sproat, "Stress Assignment in Complex Nominals for English
Text-to-Speech," Proceedings of ESCA Workshop on Speech
Synthesis, 129-132.
-
Richard Sproat and Chilin Shih, "Prenominal Adjectival Ordering in
English and Mandarin," Proceedings of the 18th Annual Meeting of
the North East Linguistics Society, 465-489, 1988.
-
Richard Sproat and Barbara Brunson, "Constituent-Based Morphological
Parsing: A New Approach to the Problem of Word-Recognition," 25th
Annual Meeting of the Association for Computational Linguistics:
Proceedings of the Conference, 65-72, 1987.
-
Richard Sproat and Mark Liberman, "Toward Treating English Nominals
Correctly," 25th Annual Meeting of the Association for
Computational Linguistics: Proceedings of the Conference,
140-146, 1987.
-
Richard Sproat and Gregory Ward, "Pragmatic Considerations in
Anaphoric Island Phenomena," Papers from the 23rd Annual Meeting
of the Chicago Linguistic Society, 321-335, 1987.
-
Richard Sproat, "Malayalam Compounding: A Non-Stratum-Ordered
Account," Proceedings of the Fifth West Coast Conference on Formal
Linguistics, 268-288, 1986.
-
Richard Sproat, "The Projection Principle and the Syntax of Synthetic
Compounds," Proceedings of the 16th Annual meeting of the North
East Linguistics Society, 462-475, 1986.
-
Richard Sproat, "INFL and the Configurationality of VSO Languages,"
Proceedings of the 14th Annual meeting of the North East
Linguistics Society, 418-431, 1984.
Invited Review Articles and Commentaries
-
Amalia Gnanadesikan and Richard Sproat. "Writing Systems".
Oxford
Bibliographies, 2018.
-
Richard Sproat, "Phonemic diversity and the out-of-Africa theory". Linguistic
Typology, 15, 199-206, 2011.
-
Richard Sproat, "Linguistic Processing for Speech Synthesis",
in Jacob Benesty, Yiteng (Arden) Huang, M. Mohan Sondhi,
Springer Handbook of Speech Processing and Speech
Communication, Springer, 2006.
-
Richard Sproat, "Morphology as Component or Module," invited review
article in A. Spencer and A. Zwicky (eds.), Handbook of
Morphology, Oxford, Basil Blackwell Ltd., 2000.
-
Richard Sproat, "Text Interpretation for Text-to-Speech Synthesis," in
R. Cole, J. Mariani, H. Uszkoreit, A. Zaenen and V. Zue (eds.),
Survey of the State of the Art in Human Language Technology,
Dordrecht, Kluwer, 1996.
-
Richard Sproat, "Computational Interpretations of Neurolinguistic
Observations," in M. Gazzaniga (ed.), The Cognitive
Neurosciences, Cambridge, MIT Press, 1994, 931-942.
-
Richard Sproat, "Commentary on Bird and Klein." Computational
Linguistics, 493, 1994.
-
Richard Sproat, "The Lexicon in Generative Grammar," invited review
article in W. Bright (ed.), Oxford International Encyclopedia of
Linguistics, New York, Oxford University Press, 1991.
-
Richard Sproat, "Competence, Performance and Agrammatism: A Reply to
Grodzinsky," Brain and Language, 27, 160-167, 1986.
Invited Book Reviews
-
Richard Sproat, "Review of Shouhui Zhao \& Richard B. Baldauf, Jr., Planning Chinese
Characters: Reaction, Evolution or Revolution. (Language Policy, 9.)", in
Written Language and Literacy, 11(2), 2008.
-
Chilin Shih, Richard Sproat, "Review of Horne (ed.) Prosody:
Theory and Experiment. Studies Presented to Gösta Bruce,
Computational Linguistics", 27(3), 2001, to appear.
-
Richard Sproat, "Review of Packard (ed.) New Approaches to Chinese
Word Formation." Yearbook of Morphology, 1999.
-
Richard Sproat, "Review of Daniels and Bright, The World's Writing
Systems," Written Language and Literacy, 1(1), 129-137,
1998.
-
Richard Sproat, "Review of Beard, Lexeme-Morpheme Base
Morphology," Journal of Linguistics, 1997.
-
Richard Sproat, "Review of Miller Complex Verb Formation,"
Language, 70, 361-365, 1994.
-
Richard Sproat, "Review of Ritchie et al. Computational
Morphology," Language, 69, 152-155, 1993.
-
Richard Sproat, "Review of Carstairs: Current Morphology,"
Phonology Yearbook, 9, 353-357, 1993.
-
Richard Sproat, "Morphological Non-Separation Revisited: a Review of
R. Lieber's Deconstructing Morphology." Yearbook of
Morphology, 235-258, 1992.
-
Richard Sproat, "Review of The Syntax of the Modern Celtic
Languages." Lingua, 347-370, 1992.
-
Richard Sproat, "Review of PC-KIMMO," Computational
Linguistics, 17, 229-231, 1991.
Other Book Reviews
-
Richard Sproat, "Review of Packard: The Morphology of Chinese: A
Linguistic and Cognitive Approach", LINGUIST List: Vol-12-11.
-
Richard Sproat, "Review of Bright and Daniels: The World's Writing
Systems", LINGUIST List: Vol-7-400.
Other Publications
- Kyle Gorman and Richard Sproat.
How
to get superior text processing in Python with Pynini. O'Reilly. 2016.
- M. Rouhizadeh, J. van Santen, R. Sproat. "Computational Semantic Analysis of
Restrictive and Repetitive Behavior in Language Samples of Children with
Autism" IMFAR, 2015, Salt Lake City.
- M. Rouhizadeh, J. van Santen, R. Sproat, K. Gorman, P. Heeman,
A. P. Hill, S. Bedrick, E. T. Prud'hommeaux and G. Kiss. "Children's Differing
Patterns of Discourse Marker Use in ASD and Typical Development" IMFAR,
2014, Atlanta.
-
Richard Sproat, Lois Black, Emily Tucker-Prud'hommeaux, Jan van
Santen, Brian Roark. "Automated Analysis of Natural Language Samples:
Comparison of Children with Autism Spectrum Disorders, Developmental
Language Disorders, and Typical Development." IMFAR, 2011,
San Diego.
-
Richard Sproat, Dan Roth, ChengXiang Zhai, Elabbas Benmamoun, Andrew
Fister, Nadia Karlinsky, Alex Klementiev, Chongwon Park, Vasin
Punyakanok, Tao Tao, Su-youn Yoon. "Named
Entity Recognition and Transliteration for 50 Languages.".
Keynote Address at the Second
Midwest Computational Linguistics Colloquium, May 14-15, 2005, The
Ohio State University.
-
Bernd Möbius and Richard Sproat, "Linguistische Analyse in einem
multilingualen Sprachsynthesesystem". Spektrum der Wissenschaften,
1996, (German Edition of Scientific American)
-
Richard Sproat, On Deriving the Lexicon. MIT Working Papers
in Linguistics, Cambridge, MA. 1985. (Published version of
Ph.D. dissertation)
-
Richard Sproat, "On Deriving the Lexicon," The Linguistic
Review (Dissertation Abstracts), 5, 185-196, 1985.
-
Richard Sproat, "A Note on Rebracketing in Morphology," in D.
Archangeli, A. Barss and R. Sproat (eds.), MIT Working Papers in
Linguistics, 6, 199-205, 1985.
-
Richard Sproat, "Review of Schank's `Identification of
Conceptualizations Underlying Natural Language'," In B. Levin (ed.),
Lexical Semantics in Review, Lexicon Project Working Papers,
number 1, Cambridge, MA, Center for Cognitive Science, Massachusetts
Institute of Technology, 1985.
-
Richard Sproat, "A Scientific Curriculum in Linguistics for Use In
Secondary Schools." In Doing Science: Constructing Scientific
Theories as an Introduction to Scientific Method, Technical
Report, Cambridge, MA, Educational Technology Center, Harvard Graduate
School of Education, 1985.
-
Richard Sproat, "On Bracketing Paradoxes," in M. Speas and R. Sproat
(eds.), MIT Working Papers in Linguistics, 7, 110-130, 1984.
-
Richard Sproat, "VSO Languages and Welsh Configurationality," in I.
Haik and D. Massam (eds.), MIT Working Papers in Linguistics,
5, 243-276, 1983.
-
Richard Sproat, letter to the editor, December issue of
Scientific American: a critique of Derek Bickerton's paper
entitled "Creole Languages," Scientific American, December,
1983.
-
John Newman, with the assistance of Richard Sproat, Old High
German Reader, Advocate Publishing Group, Reynoldsburg, OH, 1981.
Tutorials and Courses
-
"Corpus-Based Methods in Chinese Morphology", COLING 2002, Taipei,
Taiwan.
Here are the exercises
associated with this tutorial.
For those of you who are inclined to doubt my example mei3
hua4 `Americanize', see Li, Charles and Sandra Thompson. 1981.
Mandarin Chinese: A Functional Reference Grammar. University
of California Press, Berkeley, CA, page 42, example (34).
-
"Corpus-Based Methods in Chinese Morphology and Phonology", with
Chilin Shih, 2001 LSA Institute, Santa Barbara, CA.
Slides for first three lectures:
Lecture 1,
Lecture 2,
Lecture 3.
Lecture
notes.
-
"Finite-state Methods in Morphology, Text Analysis and the Analysis of
Writing Systems", ROCLING X, Taipei, Taiwan.
-
"Algorithms for Speech
Recognition and Language Processing", with Mehryar Mohri and
Michael Riley. COLING 1996, Copenhagen.
-
"Computational Morphology", Australian Linguistic Institute, 1994, Melbourne.
-
"Text Analysis Tools in
Spoken Language Processing", with Michael Riley, ACL 1994, Las
Cruces.
-
"Morphology and Computational Morphology", ACL 1989, Vancouver.