Publications and Tutorials

Books Videos Refereed Journals Refereed Collections Conference Proceedings
Invited Review Articles Invited Book Reviews Other Book Reviews Other Publications
Tutorials

Books

  1. Brian Roark, Richard Sproat and Suyoun Yoon. The Tools of the Scribe: How writing systems, technology, and human factors interact to affect the act of writing. Cham, SpringerNature, Forthcoming, 2025.
  2. Richard Sproat. Symbols: An Evolutionary History from the Stone Age to the Future. SpringerNature, Cham, Switzerland, 2023.
  3. Kyle Gorman and Richard Sproat. Finite-State Text Processing. Williston, VT, Morgan Claypool, 2021.
  4. Fabian Brackhane, Richard Sproat and Jürgen Trouvain (editors). Wolfgang von Kempelens Mechanismus der menschlichen Sprache. New edition with English translation, and commentary. Technische Universität Dresden Press, 2017.
  5. Richard Sproat. Language, Technology and Society. Oxford, Oxford University Press, 2010.
  6. Brian Roark, Richard Sproat. Computational Approaches to Morphology and Syntax. Oxford, Oxford University Press, 2007.
  7. Martin Neef, Anneke Neijt and Richard Sproat (editors) The Relation of Writing to Spoken Language. Tübingen: Niemeyer (= Linguistische Arbeiten 460), 2002.
  8. Richard Sproat, A Computational Theory of Writing Systems, (ACL Studies in Natural Language Processing Series), Cambridge, Cambridge University Press, 2000.
    List of known errata.
    A prepublication version can be downloaded from here. This does not correct the errata listed above nor is the pagination the same as the printed version.
  9. Richard Sproat (editor), Multilingual Text-to-Speech Synthesis: The Bell Labs Approach, Boston, MA, Kluwer Academic Publishers, 1997.
  10. Jan van Santen, Richard Sproat, Joseph Olive and Julia Hirschberg (editors), Progress in Speech Synthesis, New York, Springer, 1997.
  11. Richard Sproat, Morphology and Computation, Cambridge, MA, MIT Press, 1992.

Videos

  1. Richard Sproat. Introduction to Writing Systems: How Writing Encodes Language, SpringerNature, 2024.

Refereed Journals

  1. Llion Jones, Richard Sproat, Haruko Ishikawa and Alexander Gutkin. "Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation." Transactions of the Association for Computational Linguistics, To appear.
  2. Richard Sproat. "Last Words: Boring Problems are Sometimes the Most Interesting." Computational Linguistics, 48(2): 483–490.
  3. Richard Sproat and Alexander Gutkin. "The Taxonomy of Writing Systems: How to Measure how Logographic a System is." Computational Linguistics, 47(3), 477–528, 2021.
  4. Richard Sproat. "Review of Zev Handel. 2019. Sinography: The Borrowing and Adaptation of the Chinese Script." Asiatische Studien/Études Asiatiques, 74(4): 1103–1115, 2021.
  5. Juliette Blevins, Richard Sproat. "Statistical Evidence for the Proto-Indo-European-Euskarian Hypothesis: A word-list approach integrating phonotactics". Diachronica, 2021.
  6. Hao Zhang, Richard Sproat, Axel Ng, Felix Stahlberg, Xiaochang Peng, Kyle Gorman, Brian Roark. "Neural models of text normalization for speech applications." Computational Linguistics, 45(2), 2019: 293-337.
  7. Richard Sproat. "Review of Daniels An Exploration of Writing", Written Language and Literacy, 21(2), 2018: 269–278.
  8. Richard Sproat. "A Computational Model of the Discovery of Writing", Written Language and Literacy, 20(2), 2017: 194-226.
  9. Richard Sproat, "Language typology in speech and language technology", Linguistic Typology, 20(3), 2016.
  10. Kyle Gorman and Richard Sproat. "Minimally supervised models for number normalization." Transactions of the Association for Computational Linguistics. 2016.
  11. Richard Sproat. "On misunderstandings and misrepresentations: A reply to Rao et al." Language, 91(4), 2015.
  12. Peter Ebden and Richard Sproat. "The Kestrel TTS Text Normalization System." Journal of Natural Language Engineering, 2014.
  13. Richard Sproat, Mahsa Yarmohammadi, Izhak Shafran and Brian Roark. "Applications of Lexicographic Semirings to Problems in Speech and Language Processing," Computational Linguistics, 2014.
  14. Richard Sproat, "A statistical comparison of written language and non-linguistic symbol systems," Language, 2014.
  15. Richard Sproat, "A note on Unger’s 'What linguistic units do Chinese characters represent?'", Written Language & Literacy, 16(1), 107–111, 2013.
  16. Jan van Santen, Richard Sproat and Alison Presmanes Hill, "Quantifying Repetitive Speech in Autism Spectrum Disorders and Language Impairment," Autism Research, 2013.
  17. Padmapriya Kandhadai and Richard Sproat, "Impact of spatial ordering of graphemes in alphasyllabic scripts on phonemic awareness in Indic languages," Writing Systems Research, 2(2), 105-116, 2010.
  18. Richard Sproat. "Reply to Rao et al. and Lee et al." Computational Linguistics, 36:4, 2010.
  19. Richard Sproat. "Last Words: Ancient symbols, computational linguistics, and the reviewing practices of the general science journals." Computational Linguistics, 36:3, 2010.
  20. Aamir Wali, Richard Sproat, Prakash Padakannaya and Bhuvaneshwari, B. "Model for phonemic awareness in readers of Indian script". Written Language and Literacy, 12:2, 161-169, 2009.
  21. Su-Youn Yoon, Lisa Pierce, Amanda Huensch, Eric Juul, Samantha Perkins, Richard Sproat, and Mark Hasegawa-Johnson, "Construction of a rated speech corpus of L2 learners' speech," CALICO Journal, 2009
  22. Richard Sproat. "Brahmi-Derived Scripts, Script Layout, and Phonological Awareness." Written Language and Literacy, 9(1), 45--66, 2006.
  23. Michiel Bacchiani, Brian Roark, Michael Riley and Richard Sproat "MAP Adaptation of Stochastic Grammars". Computer Speech and Language, 20(1):41-68, 2006.
  24. Steve Farmer, Richard Sproat and Michael Witzel. "The Collapse of the Indus-Script Thesis: The Myth of a Literate Harappan Civilization". Electronic Journal of Vedic Studies, 11(2), 2004
  25. Narasimhan, Bhuvana, Sproat, Richard, and Kiraz, George. "Schwa-deletion in Hindi Text-to-Speech Synthesis," International Journal of Speech Technology, 7(4), 319-333, 2004.
  26. Richard Sproat, Alan Black, Stanley Chen, Shankar Kumar, Mari Ostendorf, and Christopher Richards. "Normalization of non-standard words." Computer Speech and Language, 15(3), 287-333, 2001.
  27. Richard Sproat. "Multilingual Text Analysis for Text-to-Speech Synthesis", Natural Language Engineering, 2(4), 369-380, 1996.
  28. Chilin Shih and Richard Sproat, "Issues in Text-to-Speech Conversion for Mandarin," Computational Linguistics and Chinese Language Processing, 1996.
  29. Richard Sproat, Chilin Shih, William Gale and Nancy Chang, "A Stochastic Finite-State Word-Segmentation Algorithm for Chinese," Computational Linguistics, 22(3), 1996.
  30. Harald Baayen and Richard Sproat, "Estimating Lexical Priors for Low-Frequency Morphologically Ambiguous Forms," Computational Linguistics, 22(2), 1996.
  31. Richard Sproat and Chilin Shih. "A Corpus-Based Analysis of Mandarin Nominal Root Compounds." Journal of East Asian Linguistics, 5, 49-71, 1996.
  32. Richard Sproat and Joseph Olive, "Text to Speech Synthesis," AT&T Technical Journal, 74(2), 35-44, 1995.
  33. Richard Sproat, "English Noun-Phrase Accent Prediction for Text-to-Speech." Computer Speech and Language, 8, 79-94, 1994. The list of about 7900 nominals with hand-assigned stress used in this paper can be found here.
  34. Richard Sproat and Chilin Shih, "Why Mandarin Morphology is not Stratum-Ordered." Yearbook of Morphology, 185-217, 1993.
  35. Richard Sproat and Osamu Fujimura, "Allophonic variation in English /l/ and its implications for phonetic implementation," Journal of Phonetics, 21, 291-311, 1993.
  36. Gail McKoon, Gregory Ward, Roger Ratcliff and Richard Sproat, "Morphosyntactic and pragmatic factors affecting the accessibility of discourse entities," Journal of Memory and Language, 32, 1-20, 1993.
  37. David Roe, Pedro Moreno, Richard Sproat, Fernando Pereira, Michael Riley and Alejandro Macarrón. "A Spoken Language Translator for Restricted-Domain Context-Free Languages," Speech Communication, 11, 1-9, 1992.
  38. Richard Sproat, "Unhappier is not a Bracketing Paradox," Linguistic Inquiry, 23, 347-352, 1992.
  39. Gregory Ward, Richard Sproat and Gail McKoon, "A Pragmatic Analysis of So-Called Anaphoric Islands," Language, 67, 439-474, 1991.
  40. Richard Sproat and Chilin Shih, "A Statistical Method for Finding Word Boundaries in Chinese Text," Computer Processing of Chinese and Oriental Languages, 4, 336-351, 1990.
  41. Richard Sproat, "Welsh Syntax and VSO Structure," Natural Language and Linguistic Theory, 3, 173-216, 1985.
  42. Richard Sproat, "Southern California Reflexives: An Example of Translation Borrowing?" Journal of California and Great Basin Anthropology: Papers in Linguistics, 3, 77-94, 1981.

Refereed Collections

  1. Christo Kirov and Richard Sproat. "Computational Morphology". Probabilistic Linguistics, CSLI, to appear.
  2. Richard Sproat. "Writing Systems". Oxford Handbook of the History of Phonology, to appear.
  3. Gregory Ward and Richard Sproat. "Anaphoric Islands: A History of a Misleading Geographical Metaphor". The Wiley-Blackwell Companion to Morphology. Oxford: Wiley-Blackwell, to appear.
  4. Sproat, Richard. "English among the writing systems of the world", in Vivian Cook and Des Ryan, Handbook of the English Writing System, Routledge.
  5. Sproat, Richard and Prakash Padakannaya. "Script Indices". in N. Srinivasan, A.K. Gupta and J. Pandey, Advances in Cognitive Science, Volume 1. New Delhi, Sage: 2008.
  6. Sproat, Richard. "The Consistency of the Orthographically Relevant Level in Dutch." in Martin Neef, Anneke Neijt and Richard Sproat (editors). The Relation of Writing to Spoken Language. Tübingen: Niemeyer (= Linguistische Arbeiten 460), 2002.
  7. Jianying Hu, Richard Sproat and Hao Chen. "Electronic Mail", in McGraw-Hill Yearbook of Science and Technology, 2001, 144-146, McGraw-Hill, New York, 2000.
  8. R. Harald Baayen, Robert Schreuder and Richard Sproat. "Morphology in the Mental Lexicon: A Computational Model for Visual Word Recognition", in F. Van Eynde and D. Gibbon (eds.) Lexicon Development for Speech and Language Processing, 267-293, Dordrecht, Kluwer Academic Publishers, 2000.
  9. Richard Sproat, Christer Samuelsson, Jennifer Chu-Carroll and Bob Carpenter. "Computational Linguistics", in Mark Aronoff and Janie Rees-Miller (eds.) The Blackwell Handbook of Linguistics, Oxford, Blackwell, 2000.
  10. Richard Sproat, "Lexical Analysis", in Robert Dale, Hermann Moisl, and Harold Somers (eds.) Handbook of Natural Language Processing, 37-58, New York, Dekker, 2000.
  11. Richard Sproat and Joseph Olive, "Text-to-Speech Synthesis," in L. Rabiner and R. Cox (eds.), IEEE/CRC Press Handbook of Signal Processing, Cleveland, CRC Press, 1997.
  12. Richard Sproat, "Text Interpretation for TtS Synthesis", in Ron Cole (ed.) Survey of the State of the Art in Human Language Technology, http://www.cse.ogi.edu/CSLU/HLTsurvey/HLTsurvey.html, 1996.
  13. Richard Sproat and Joseph Olive, "A Modular Architecture for Multilingual Text-to-Speech," in J. van Santen, R. Sproat, J. Olive and J. Hirschberg (eds.), Progress in Speech Synthesis, New York, Springer, 1997.
  14. Richard Sproat and Joseph Olive, "An Approach to Text-to-Speech Synthesis," in W. Kleijn and K. Paliwal (eds.), Speech Coding and Synthesis, 611-633, Amsterdam, Elsevier, 1995.
  15. Julia Hirschberg and Richard Sproat, "Pitch Accent Prediction from Text Analysis," in J. Cole, G. Green and J. Morgan (eds.), Linguistics and Computation, 281-296, CSLI Publications, Chicago, University of Chicago Press, 1995.
  16. Richard Sproat, "Looking into Words," in S. Hargus and E. Kaisse (eds.), Studies in Lexical Phonology, 173-195, San Diego, Academic Press, 1993.
  17. Mark Liberman and Richard Sproat, "The Stress and Structure of Modified Noun Phrases in English," in I. Sag (ed.), Lexical Matters, 131-181, CSLI Publications, Chicago, University of Chicago Press, 1992.
  18. Richard Sproat and Chilin Shih, "The Cross-Linguistic Distribution of Adjective Ordering Restrictions," in C. Georgopoulos and R. Ishihara (eds.), Interdisciplinary Approaches to Language: Essays in Honor of S.-Y. Kuroda, 565-593, Dordrecht, Kluwer Academic Publishers, 1990.
  19. Richard Sproat, "Bracketing Paradoxes, Cliticization and Other Topics: The Mapping Between Syntactic and Phonological Structure," in M. Everaert, A. Evers, R. Huybregts and M. Trommelen (eds.), Morphology and Modularity: In Honour of Henk Schultink, 339-360, Dordrecht, Foris, 1988.
  20. Richard Sproat, "On Anaphoric Islandhood," in M. Hammond and M. Noonan (eds.), Theoretical Morphology, 291-301, San Diego, Academic Press, 1988.

Conference Proceedings, Presentations

  1. Shigeki Karita, Richard Sproat and Haruko Ishikawa. "Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency." First ACL Workshop on Computation and Written Language, Toronto, July 14, 2023.
  2. Abhirut Gupta, Ananya B. Sai, Richard Sproat, Yuri Vasilevski, James S. Ren, Ambarish Jash, Sukhdeep S. Sodhi and Aravindan Raghuveer. "Bi-Phone: Modeling Inter Language Phonetic Influences in Text." Association for Computational Linguistics, Toronto, July 9-14, 2023.
  3. Richard Sproat. "A Computational Model of the Invention of Writing." Anchoring the Invention of Writing. Leiden, December 15-17, 2022.
  4. Alexander Gutkin, Cibu Johny, Raiomond Doctor, Brian Roark and Richard Sproat. "Beyond Arabic: Software for Perso-Arabic Script Manipulation," WANLP, 2022.
  5. Christo Kirov, Richard Sproat and Alexander Gutkin. "Mockingbird at the SIGTYP 2022 Shared Task: Two Types of Models for the Prediction of Cognate Reflexes," ACL SIGTYP, July, 2022.
  6. Raiomond Doctor, Alexander Gutkin, Cibu Johny, Brian Roark and Richard Sproat "Graphemic Normalization of the Perso-Arabic Script," Grapholinguistics in the 21st Century, June 2022.
  7. Kyle Gorman and Richard Sproat. "On the Persistent Conflation of Writing and Language," Grapholinguistics in the 21st Century, June 2022.
  8. Richard Sproat. "Computational Methods in the Analysis of Graphical Symbol Systems," Keynote Presentation, Grapholinguistics in the 21st Century, June 2022.
  9. Kyle Gorman, Christo Kirov, Brian Roark and Richard Sproat. "Structured abbreviation expansion in context", Findings of EMNLP, 2021.
  10. Jae Ro, Hao Zhang and Richard Sproat. "Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph Entities". COLING, 2020.
  11. Alexander Gutkin and Richard Sproat, "NEMO: Frequentist Inference Approach to Constrained Linguistic Typology Feature Prediction in SIGTYP 2020 Shared Task". SIGTYP Workshop, 2020.
  12. Alena Butryna, Shan-Hui Cathy Chu, Isin Demirsahin, Alexander Gutkin, Linne Ha, Fei He, Martin Jansche, Cibu Johny, Anna Katanova, Oddur Kjartansson, Chenfang Li, Tatiana Merkulova, Yin May Oo, Knot Pipatsrisawat, Clara Rivera, Supheakmungkol Sarin, Pasindu de Silva, Keshan Sodimana, Richard Sproat, Theeraphol Wattanavekin and Jaka Aris Eko Wibawa. "Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview". 2019 UNESCO International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide, 4-6 December, Paris, France, 2019. https://arxiv.org/abs/2010.06778
  13. Ajda Gokcen, Hao Zhang and Richard Sproat. "Dual Encoder Classifier Models as Constraints in Neural Text Normalization." Interspeech, 2019.
  14. Sandy Ritchie, Richard Sproat, Kyle Gorman, Daan van Esch, Christian Schallhart, Nikos Bampounis, Benoît Brard, Jonas Fromseier Mortensen, Millie Holt, and Eoin Mahon. "Unified Verbalization for Speech Recognition \& Synthesis Across Languages." Interspeech, 2019.
  15. Keshan Sodimana, Pasindu De Silva, Richard Sproat, Theeraphol Wattanavekin, Alexander Gutkin and Knot Pipatsrisawat. "Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech Systems." SLTU. 2018.
  16. Hao Zhang, Ke Wu and Richard Sproat "Fast and Accurate Reordering with ITG Transition RNN." COLING. 2018.
  17. Ke Wu, Kyle Gorman and Richard Sproat. "Minimally supervised written-to-spoken text normalization." ASRU. 2017.
  18. Fabian Brackhane, Richard Sproat and Jürgen Trouvain. "Editing Kempelen’s Mechanismus der menschlichen Sprache: Experiences and Findings," History of Speech Communication Research, 2017.
  19. Richard Sproat and Navdeep Jaitly "An RNN Model of Text Normalization," Proceedings of Interspeech, 2017.
  20. Alexander Gutkin and Richard Sproat. "Areal and Phylogenetic Features for Multilingual Speech Synthesis," Proceedings of Interspeech, 2017.
  21. Daan van Esch and Richard Sproat. "An Expanded Taxonomy of Semiotic Classes for Text Normalization," Proceedings of Interspeech, 2017.
  22. Alexander Gutkin, Linne Ha, Martin Jansche, Oddur Kjartansson, Knot Pipatsrisawat, Richard Sproat, "Building statistical parametric multi-speaker synthesis for Bangladeshi Bangla", IEEE Workshop on Spoken Language Technology (SLT), 2016.
  23. Alexander Gutkin, Linne Ha, Martin Jansche, Knot Pipatsrisawat and Richard Sproat, "TTS for Low Resource Languages: A Bangla Synthesizer." LREC 2016.
  24. Masoud Rouhizadeh, Emily Prud’hommeaux, Jan van Santen, Richard Sproat. "Measuring idiosyncratic interests in children with autism spectrum disorder". Association for Computational Linguistics, 2015, Beijing.
  25. Masoud Rouhizadeh, Jan van Santen, Richard Sproat. "Similarity Measures for Quantifying Restrictive and Repetitive Behavior in Conversations of Autistic Children". CLPsych, 2015, Denver.
  26. Emily Prud'hommeaux, Eric Morley, Masoud Rouhizadeh, Laura Silverman, Jan van Santen, Brian Roark, Richard Sproat, Sarah Kauper, Rachel DeLaHunta. "Computational analysis of trajectories of linguistic development in autism". IEEE Workshop on Spoken Language Technology, 2014, South Lake Tahoe.
  27. Richard Sproat and Keith Hall. "Applications of Maximum Entropy Rankers to Problems in Spoken Language Processing." Interspeech 2014.
  28. Brian Roark and Richard Sproat. "Hippocratic abbreviation expansion." ACL 2014.
  29. Masoud Rouhizadeh, Emily Prud'hommeaux, Jan van Santen, Richard Sproat "Detecting linguistic restricted interests in autism using distributional semantic models," CLPsych, 2014, Baltimore, MD
  30. Richard Sproat, Bruno Cartoni, HyunJeong Choi, Linne Ha, David Huynh, Ravindran Rajakumar and Evelyn Wenzel-Grondie. "A Database for Measuring Linguistic Information Content." LREC 2014.
  31. Keith Hall and Richard Sproat. "Russian Stress Prediction using Maximum Entropy Ranking", EMNLP 2013
  32. Bob Coyne, Alex Klapheke, Masoud Rouhizadeh, Richard Sproat and Daniel Bauer. 2012. "Annotation Tools and Knowledge Representation for a Text-To-Scene System." COLING, Mumbai, India, December, 2012.
  33. Steven Bedrick, Russell Beckley, Brian Roark and Richard Sproat. "Robust kaomoji detection in Twitter". Language and Social Media, Montreal, Canada, June, 2012.
  34. Margaret Mitchell and Richard Sproat. "Discourse-Based Modeling for AAC". Third Workshop on Speech and Language Processing for Assistive Technologies, Montreal, Canada, June, 2012.
  35. Brian Roark, Michael Riley, Cyril Allauzen, Terry Tai and Richard Sproat. "The OpenGrm open-source finite-state grammar software libraries". ACL 2012, Jeju Island, Korea, July.
  36. Katherine Wu, Jennifer Solman, Ruth Linehan and Richard Sproat. "Corpora of Non-Linguistic Symbol Systems." Linguistic Society of America, Portland, OR, January 2012. PDF.
  37. Terry Tai, Wojciech Skut and Richard Sproat. "Thrax: An Open Source Grammar Compiler Built on OpenFst". ASRU 2011, Waikoloa Resort, Hawaii, December.
  38. Masoud Rouhizadeh, Bob Coyne, and Richard Sproat, "Collecting Semantic Information for Locations in the Scenario-Based Lexical Knowledge Resource of a Text-to-Scene Conversion System", 15th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2011), Kaiserslautern, Germany, September, 2011.
  39. Izhak Shafran, Richard Sproat, Mahsa Yarmohammadi and Brian Roark. "Efficient Determinization of Tagged Word Lattices using Categorial and Lexicographic Semirings". ASRU 2011, Waikoloa Resort, Hawaii, December.
  40. Brian Roark, Andrew Fowler, Richard Sproat, Christopher Gibbons and Melanie Fried-Oken, "Towards technology-assisted co-construction with communication partners." SLPAT 2011, Edinburgh. 2011.
  41. Masoud Rouhizadeh, Daniel Bauer, Bob Coyne, Owen Rambow and Richard Sproat. 2011. "Collecting Spatial Information for Locations in a Text-to-Scene Conversion System." CoSLI 2011 Workshop, Boston, MA. 2011.
  42. Brian Roark, Richard Sproat and Izhak Shafran, "Lexicographic Semirings for Exact Automata Encoding of Sequence Models". ACL-HLT 2011, Portland, OR. 2011. Best short paper award
  43. Alexander Kotov, ChengXiang Zhai and Richard Sproat, "Mining Named Entities with Temporally Correlated Bursts from Multilingual Web News Streams", WSDM 2011, 2011.
  44. Masoud Rouhizadeh, Margit Bowler, Richard Sproat and Bob Coyne. "Collecting Semantic Data from Amazon's Mechanical Turk for a Lexical Knowledge Resource in a Text to Picture Generating System." International Conference on Computational Semantics (IWCS 2011), Oxford, January, 2011.
  45. Aaron M. Cohen, Kyle Ambert, Jianji Yang, Robert Felder, Richard Sproat, Brian Roark, Kristy Hollingshead and Kari Baker. "OHSU/Portland VAMC Team Participation in the 2010 i2b2/VA Challenge Tasks," Fourth i2b2/VA NLP Workshop, Washington, December 2010.
  46. Masoud Rouhizadeh, Margit Bowler, Richard Sproat and Bob Coyne. "Data Collection and Normalization for Building the Scenario-Based Lexical Knowledge Resource of a Text-to-Scene Conversion System," SMAP 2010: 5th International Workshop on Semantic Media Adaptation and Personalization. Limassol, Cyprus, 2010.
  47. Richard Sproat, "Lightly Supervised Learning of Text Normalization: Russian Number Names," IEEE Workshop on Spoken Language Technology, Berkeley, CA, 2010.
  48. Su-Youn Yoon, Mark Hasegawa-Johnson, and Richard Sproat, "Landmark-based Automated Pronunciation Error Detection," Proceedings of Interspeech 2010, Makuhari, Japan, 2010.
  49. Suma Bhat, Mark Hasegawa-Johnson, Richard Sproat. "Automatic Fluency Assessment by Signal-Level Measurement of Spontaneous Speech". 2010 INTERSPEECH Satellite Workshop on Second Language Studies: Acquisition, Learning, Education and Technology., Makuhari, Japan, 2010.
  50. Bob Coyne, Richard Sproat, Julia Hirschberg. "Spatial Relations in Text-to-Scene Conversion". Computational Models of Spatial Language Interpretation, Workshop at Spatial Cognition 2010, Mt. Hood, Oregon.
  51. Bob Coyne, Owen Rambow, Julia Hirschberg, and Richard Sproat, "Frame Semantics in Text-to-Scene Generation". 14th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, Cardiff, Wales, 2010.
  52. Ting Qian, Kristy Hollingshead, Su-youn Yoon, Kyoung-young Kim, and Richard Sproat. "A Python Toolkit for Universal Transliteration". LREC, Malta, 2010.
  53. Suma Bhat, Richard Sproat, Mark Hasegawa-Johnson and Fred Davidson, "Automatic fluency assessment using thin-slices of spontaneous speech," LTRC, 2010.
  54. Su-youn Yoon, Mark Hasegawa-Johnson and Richard Sproat. "Automated Pronunciation Scoring using Confidence Scoring and Landmark-based SVM", InterSpeech, 2009.
  55. Martin Jansche, Richard Sproat, "Named Entity Transcription with Pair n-Gram Models", Named Entities Workshop: Shared Task on Transliteration (NEWS 2009), ACL--IJCNLP 2009, pp. 32--35.
  56. Suma Bhat and Richard Sproat. "Knowing the Unseen: Estimating Vocabulary Size over Unseen Samples",ACL, 2009.
  57. Xuanhui Wang, ChengXiang Zhai, Xiao Hu, and Richard Sproat, "Mining Correlated Bursty Topic Patterns from Coordinated Text Streams", Proceedings of the 2007 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07 ), pages 784-793.
  58. Su-Youn Yoon, Kyoung-young Kim and Richard Sproat. "Multilingual Transliteration Using Feature based Phonetic Method". ACL, June, 2007, Prague.
  59. Alla Rozovskaya, Richard Sproat, "Multilingual Word Sense Discrimination: A Comparative Cross-Linguistic Study", Balto-Slavonic Natural Language Processing. June, 2007, Prague.
  60. Boon Pang Lim, Richard Sproat: Using Latent Semantics for NE Translation. ICCPOL 2006: 457-465
  61. Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat and ChengXiang Zhai. "Unsupervised Named Entity Transliteration Using Temporal and Phonetic Correlation." EMNLP, July 22-23, 2006, Sydney, Australia.
  62. Richard Sproat, Tao Tao and ChengXiang Zhai. "Named Entity Transliteration with Comparable Corpora". ACL 2006, July 17-21, 2006, Sydney, Australia.
  63. Alla Rozovskaya, Richard Sproat and Elabbas Benmamoun. "Language Modeling of Arabic Dialects". Colloquium on Arabic Language Processing, June 5-7, Rabat, Morocco.
  64. Cecilia Ovesdotter Alm, Dan Roth and Richard Sproat "Emotions from text: machine learning for text-based emotion prediction." HLT/EMNLP 2005. October 6-8, 2005, Vancouver.
  65. Cecilia Alm and Richard Sproat. "Emotional sequencing and development in fairy tales." First International Conference on Affective Computing and Intelligent Interaction, Beijing, China, Oct. 22-24, 2005.
  66. Cecilia Alm and Richard Sproat. "Perceptions of emotions in expressive storytelling." InterSpeech 2005, Lisbon, Portugal, Sep. 4-8, 2005.
  67. Yanli Zheng, Richard Sproat, Liang Gu, Izhak Shafran, Haolang Zhou, Yi Su, Dan Jurafsky, Rebecca Starr, Su-Youn Yoon. "Accent Detection and Speech Recognition for Shanghai-Accented Mandarin." InterSpeech 2005, Lisbon, Portugal, Sep. 4-8, 2005.
  68. Richard Sproat, Prakash Padakannaya. "Script Indices". International Conference on Cognitive Science, Allahabad, India, December 2004.
  69. Sameer Maskey, Michiel Bacchiani, Brian Roark, and Richard Sproat. "Improved Name Recognition with Meta-Data Dependent Name Networks". ICASSP 2004, Montréal, May 2004.
  70. Murat Saraclar, Richard Sproat. "Lattice-Based Search for Spoken Utterance Retrieval." HLT-NAACL 04, Boston, May 2004.
  71. Michael Riley, Brian Roark and Richard Sproat. "Good-Turing Estimation from Word Lattices for Unsupervised Language Model Adaptation." IEEE Automatic Speech Recognition and Understanding Workshop. Fisherman's Reef, U.S. Virgin Islands. December 2003.
  72. Richard Sproat, "A Formal Computational Analysis of Indic Scripts", International Symposium on Indic Scripts: Past and Future, Tokyo, December 2003.
  73. Jason Zhang, Alan Black and Richard Sproat, "Identifying Speakers in Children's Stories for Speech Synthesis," Proceedings of EUROSPEECH 2003, Geneva, September 2003.
  74. Richard Sproat and Thomas Emerson, "The First International Chinese Word Segmentation Bakeoff", The Second SIGHAN Workshop on Chinese Language Processing, Sapporo, Japan, July 2003.
  75. Srinivas Bangalore, Owen Rambow, Tahir Butt, Alexis Nasr and Richard Sproat, "Creating a Finite-State Parser with Application Semantics", COLING, Taipei, Taiwan, August 2002.
  76. Richard Sproat, "Inferring the Environment in a Text-to-Scene Conversion System", First International Conference on Knowledge Capture (K-CAP '01), Victoria, BC, Canada, 2001.
  77. Richard Sproat, "Pmtools: A Pronunciation Modeling Toolkit", Proceedings of the Fourth ISCA Tutorial and Research Workshop on Speech Synthesis, Blair Atholl, Scotland, 2001.
  78. Bob Coyne and Richard Sproat, "WordsEye: An Automatic Text-to-Scene Conversion system", SIGGRAPH 2001, Los Angeles, CA, 2001.
  79. Richard Sproat, "Corpus-Based Methods and Hand-Built Methods", International Conference on Spoken Language Processing, 2000.
  80. Bhuvana Narasimhan, Richard Sproat and George Kiraz. "Schwa deletion in Hindi Text-to-Speech Synthesis." Workshop on Computational Linguistics in South Asian Languages, 21st SALA, October 2001, Konstanz.
  81. Jan van Santen and Richard Sproat. "High-accuracy automatic segmentation". Proceedings of EUROSPEECH 99, Budapest, Hungary, 1999.
  82. Richard Sproat, Jianying Hu, Hao Chen, "EMU: An E-mail Preprocessor for Text-to-Speech," IEEE Signal Processing Society 1998 Workshop on Multimedia Signal Processing, Los Angeles, CA.
  83. Hao Chen, Jianying Hu and Richard Sproat. "E-mail Signature Block Analysis", ICPR'98, Brisbane, Australia.
  84. Richard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan Black, Kevin Lenzo, Mike Edgington, "SABLE: A Standard for TTS Markup", International Conference on Spoken Language Processing, 1998. Also presented at the ESCA/COCOSDA Speech Synthesis Workshop. Jenolan Caves, Australia; and at the W3C Meeting on Voice Browsers, Cambridge, MA, October 13, 1998.
  85. Richard Sproat, Jan van Santen, "Automatic Ambiguity Detection", International Conference on Spoken Language Processing, 1998.
  86. Richard Sproat, Paul Taylor, Michael Tanenblatt and Amy Isard, "A Markup Language for Speech Synthesis," Proceedings of EUROSPEECH, 1997.
  87. Bernd Möbius, Richard Sproat, Jan van Santen and Joseph Olive, "A German Text-to-Speech System," Proceedings of EUROSPEECH, 1997.
  88. Elena Pavlova, Chilin Shih and Richard Sproat, "A Text-to-Speech System for Russian," Proceedings of EUROSPEECH, 1997.
  89. Chilin Shih and Richard Sproat, "Mandarin Text-to-Speech Synthesis," Proceedings of the Eighth North American Conference on Chinese Linguistics, 1997.
  90. Richard Sproat. "Multilingual Text Analysis for Text-to-Speech Synthesis", International Conference on Spoken Language Processing, 1996.
  91. Richard Sproat. "Multilingual Text Analysis for Text-to-Speech Synthesis", ECAI '96 Workshop on Extended Finite State Models of Language, Budapest, Hungary, 75-81, 1996.
  92. Richard Sproat and Michael Riley. "Compilation of Weighted Finite-State Transducers from Decision Trees", 34th Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, 1996.
  93. Mehryar Mohri and Richard Sproat. "An Efficient Compiler for Weighted Rewrite Rules", 34th Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, 1996.
  94. Richard Sproat. "A Finite-State Architecture for Tokenization and Grapheme-to-Phoneme Conversion for Multilingual Text Analysis," Proceedings of the EACL SIGDAT Workshop, 65-72, Dublin, Ireland. Association for Computational Linguistics, 1995.
  95. Richard Sproat and Chilin Shih, "A Corpus-Based Analysis of Mandarin Nominal Root Compounds," Proceedings of the Sixth North American Conference on Chinese Linguistics, 1995.
  96. Benjamin Ao, Chilin Shih and Richard Sproat, "A Corpus-Based Mandarin Text-to-Speech Synthesizer," International Conference on Spoken Language Processing, 1771-1774, 1994.
  97. Richard Sproat and Joseph Olive, "A Modular Architecture for Multilingual Text-to-Speech," Proceedings of The Second ESCA/IEEE Workshop on Speech Synthesis, 187-190, 1994.
  98. Richard Sproat, Chilin Shih, William Gale and Nancy Chang, "A Stochastic Finite-State Word-Segmentation Algorithm for Chinese," 32nd Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, 66-73, 1994.
  99. Fernando Pereira, Michael Riley and Richard Sproat, "Weighted Rational Transductions and their Application to Human Language Processing," ARPA Workshop on Human Language Technology, 249-254, 1994.
  100. Richard Sproat, Julia Hirschberg and David Yarowsky, "A corpus-based synthesizer," International Conference on Spoken Language Processing, 563-566, 1992.
  101. Richard Sproat and Chilin Shih, "On the sources of some constraints on Mandarin morphology," Proceedings of the Third International Symposium on Chinese Languages and Linguistics, 20-37, 1992.
  102. David Roe, Fernando Pereira, Richard Sproat, Michael Riley, Pedro Moreno and Alejandro Macarrón, "Efficient Grammar Processing for a Spoken Language Translation System," ICASSP-92(1), 213-216, 1992.
  103. David Roe, Fernando Pereira, Richard Sproat, Michael Riley, Pedro Moreno and Alejandro Macarrón, "Toward a Spoken Language Translator for Restricted-Domain Context-Free Languages," EUROSPEECH 91(3), 1063-1066, 1991.
  104. Richard Sproat, "An Application of Statistical Optimization with Dynamic Programming to Phonemic-Input-to-Character Conversion for Chinese," Proceedings of R.O.C. Computational Linguistics Conference III, 379-390, 1990.
  105. Richard Sproat, "Stress Assignment in Complex Nominals for English Text-to-Speech," Proceedings of ESCA Workshop on Speech Synthesis, 129-132.
  106. Richard Sproat and Chilin Shih, "Prenominal Adjectival Ordering in English and Mandarin," Proceedings of the 18th Annual Meeting of the North East Linguistics Society, 465-489, 1988.
  107. Richard Sproat and Barbara Brunson, "Constituent-Based Morphological Parsing: A New Approach to the Problem of Word-Recognition," 25th Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, 65-72, 1987.
  108. Richard Sproat and Mark Liberman, "Toward Treating English Nominals Correctly," 25th Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, 140-146, 1987.
  109. Richard Sproat and Gregory Ward, "Pragmatic Considerations in Anaphoric Island Phenomena," Papers from the 23rd Annual Meeting of the Chicago Linguistic Society, 321-335, 1987.
  110. Richard Sproat, "Malayalam Compounding: A Non-Stratum-Ordered Account," Proceedings of the Fifth West Coast Conference on Formal Linguistics, 268-288, 1986.
  111. Richard Sproat, "The Projection Principle and the Syntax of Synthetic Compounds," Proceedings of the 16th Annual meeting of the North East Linguistics Society, 462-475, 1986.
  112. Richard Sproat, "INFL and the Configurationality of VSO Languages," Proceedings of the 14th Annual meeting of the North East Linguistics Society, 418-431, 1984.

Invited Review Articles and Commentaries

  1. Amalia Gnanadesikan and Richard Sproat. "Writing Systems". Oxford Bibliographies, 2018.
  2. Richard Sproat, "Phonemic diversity and the out-of-Africa theory". Linguistic Typology, 15, 199-206, 2011.
  3. Richard Sproat, "Linguistic Processing for Speech Synthesis", in Jacob Benesty, Yiteng (Arden) Huang, M. Mohan Sondhi, Springer Handbook of Speech Processing and Speech Communication, Springer, 2006.
  4. Richard Sproat, "Morphology as Component or Module," invited review article in A. Spencer and A. Zwicky (eds.), Handbook of Morphology, Oxford, Basil Blackwell Ltd., 2000.
  5. Richard Sproat, "Text Interpretation for Text-to-Speech Synthesis," in R. Cole, J. Mariani, H. Uszkoreit, A. Zaenen and V. Zue (eds.), Survey of the State of the Art in Human Language Technology, Dordrecht, Kluwer, 1996.
  6. Richard Sproat, "Computational Interpretations of Neurolinguistic Observations," in M. Gazzaniga (ed.), The Cognitive Neurosciences, Cambridge, MIT Press, 1994, 931-942.
  7. Richard Sproat, "Commentary on Bird and Klein." Computational Linguistics, 493, 1994.
  8. Richard Sproat, "The Lexicon in Generative Grammar," invited review article in W. Bright (ed.), Oxford International Encyclopedia of Linguistics, New York, Oxford University Press, 1991.
  9. Richard Sproat, "Competence, Performance and Agrammatism: A Reply to Grodzinsky," Brain and Language, 27, 160-167, 1986.

Invited Book Reviews

  1. Richard Sproat, "Review of Shouhui Zhao \& Richard B. Baldauf, Jr., Planning Chinese Characters: Reaction, Evolution or Revolution. (Language Policy, 9.)", in Written Language and Literacy, 11(2), 2008.
  2. Chilin Shih, Richard Sproat, "Review of Horne (ed.) Prosody: Theory and Experiment. Studies Presented to Gösta Bruce, Computational Linguistics", 27(3), 2001, to appear.
  3. Richard Sproat, "Review of Packard (ed.) New Approaches to Chinese Word Formation." Yearbook of Morphology, 1999.
  4. Richard Sproat, "Review of Daniels and Bright, The World's Writing Systems," Written Language and Literacy, 1(1), 129-137, 1998.
  5. Richard Sproat, "Review of Beard, Lexeme-Morpheme Base Morphology," Journal of Linguistics, 1997.
  6. Richard Sproat, "Review of Miller Complex Verb Formation," Language, 70, 361-365, 1994.
  7. Richard Sproat, "Review of Ritchie et al. Computational Morphology," Language, 69, 152-155, 1993.
  8. Richard Sproat, "Review of Carstairs: Current Morphology," Phonology Yearbook, 9, 353-357, 1993.
  9. Richard Sproat, "Morphological Non-Separation Revisited: a Review of R. Lieber's Deconstructing Morphology." Yearbook of Morphology, 235-258, 1992.
  10. Richard Sproat, "Review of The Syntax of the Modern Celtic Languages." Lingua, 347-370, 1992.
  11. Richard Sproat, "Review of PC-KIMMO," Computational Linguistics, 17, 229-231, 1991.

Other Book Reviews

  1. Richard Sproat, "Review of Packard: The Morphology of Chinese: A Linguistic and Cognitive Approach", LINGUIST List: Vol-12-11.
  2. Richard Sproat, "Review of Bright and Daniels: The World's Writing Systems", LINGUIST List: Vol-7-400.

Other Publications

  1. Kyle Gorman and Richard Sproat. How to get superior text processing in Python with Pynini. O'Reilly. 2016.
  2. M. Rouhizadeh, J. van Santen, R. Sproat. "Computational Semantic Analysis of Restrictive and Repetitive Behavior in Language Samples of Children with Autism" IMFAR, 2015, Salt Lake City.
  3. M. Rouhizadeh, J. van Santen, R. Sproat, K. Gorman, P. Heeman, A. P. Hill, S. Bedrick, E. T. Prud'hommeaux and G. Kiss. "Children's Differing Patterns of Discourse Marker Use in ASD and Typical Development" IMFAR, 2014, Atlanta.
  4. Richard Sproat, Lois Black, Emily Tucker-Prud'hommeaux, Jan van Santen, Brian Roark. "Automated Analysis of Natural Language Samples: Comparison of Children with Autism Spectrum Disorders, Developmental Language Disorders, and Typical Development." IMFAR, 2011, San Diego.
  5. Richard Sproat, Dan Roth, ChengXiang Zhai, Elabbas Benmamoun, Andrew Fister, Nadia Karlinsky, Alex Klementiev, Chongwon Park, Vasin Punyakanok, Tao Tao, Su-youn Yoon. "Named Entity Recognition and Transliteration for 50 Languages.". Keynote Address at the Second Midwest Computational Linguistics Colloquium, May 14-15, 2005, The Ohio State University.
  6. Bernd Möbius and Richard Sproat, "Linguistische Analyse in einem multilingualen Sprachsynthesesystem". Spektrum der Wissenschaften, 1996, (German Edition of Scientific American)
  7. Richard Sproat, On Deriving the Lexicon. MIT Working Papers in Linguistics, Cambridge, MA. 1985. (Published version of Ph.D. dissertation)
  8. Richard Sproat, "On Deriving the Lexicon," The Linguistic Review (Dissertation Abstracts), 5, 185-196, 1985.
  9. Richard Sproat, "A Note on Rebracketing in Morphology," in D. Archangeli, A. Barss and R. Sproat (eds.), MIT Working Papers in Linguistics, 6, 199-205, 1985.
  10. Richard Sproat, "Review of Schank's `Identification of Conceptualizations Underlying Natural Language'," In B. Levin (ed.), Lexical Semantics in Review, Lexicon Project Working Papers, number 1, Cambridge, MA, Center for Cognitive Science, Massachusetts Institute of Technology, 1985.
  11. Richard Sproat, "A Scientific Curriculum in Linguistics for Use In Secondary Schools." In Doing Science: Constructing Scientific Theories as an Introduction to Scientific Method, Technical Report, Cambridge, MA, Educational Technology Center, Harvard Graduate School of Education, 1985.
  12. Richard Sproat, "On Bracketing Paradoxes," in M. Speas and R. Sproat (eds.), MIT Working Papers in Linguistics, 7, 110-130, 1984.
  13. Richard Sproat, "VSO Languages and Welsh Configurationality," in I. Haik and D. Massam (eds.), MIT Working Papers in Linguistics, 5, 243-276, 1983.
  14. Richard Sproat, letter to the editor, December issue of Scientific American: a critique of Derek Bickerton's paper entitled "Creole Languages," Scientific American, December, 1983.
  15. John Newman, with the assistance of Richard Sproat, Old High German Reader, Advocate Publishing Group, Reynoldsburg, OH, 1981.

Tutorials and Courses

  1. "Corpus-Based Methods in Chinese Morphology", COLING 2002, Taipei, Taiwan.
    Here are the exercises associated with this tutorial.
    For those of you who are inclined to doubt my example mei3 hua4 `Americanize', see Li, Charles and Sandra Thompson. 1981. Mandarin Chinese: A Functional Reference Grammar. University of California Press, Berkeley, CA, page 42, example (34).
  2. "Corpus-Based Methods in Chinese Morphology and Phonology", with Chilin Shih, 2001 LSA Institute, Santa Barbara, CA. Slides for first three lectures: Lecture 1, Lecture 2, Lecture 3.
    Lecture notes.
  3. "Finite-state Methods in Morphology, Text Analysis and the Analysis of Writing Systems", ROCLING X, Taipei, Taiwan.
  4. "Algorithms for Speech Recognition and Language Processing", with Mehryar Mohri and Michael Riley. COLING 1996, Copenhagen.
  5. "Computational Morphology", Australian Linguistic Institute, 1994, Melbourne.
  6. "Text Analysis Tools in Spoken Language Processing", with Michael Riley, ACL 1994, Las Cruces.
  7. "Morphology and Computational Morphology", ACL 1989, Vancouver.

Main Page