Computational Linguist // Research Scientist
〒105-6415、東京都港区虎ノ門1丁目17‐1、虎ノ門ヒルズビジネスタワー15階
Toranomon Hills Business Tower 15F, 1-17-1 Toranomon, Minato-ku, Tokyo 105-6415, JAPAN
I am a computational linguist (which means that I have some things in common with grapefruit).
I am a Research Scientist at Sakana.ai in Tokyo. My work at Sakana focuses on natural language processing, image processing, and AI.
Prior to that I was a Research Scientist at Google, formerly in New York, then in Tokyo.
At Sakana.ai, I work on various projects relating to Natural Language Processing and Artificial Intelligence. Some recent and/or ongoing projects:
At Google I was mostly working on text normalization, where my former group had been developing various machine learning approaches to the problem of normalizing non-standard words in text and I have been particularly interested in the promise (and limitations) of approaches using recurrent neural nets. In September 2019, I moved to Google Tokyo, and worked on end-to-end speech understanding.
I continued to maintain some "side-bar interests" including computational models of the early evolution of writing, the statistical properties of non-linguistic symbol systems, and collaborating on a translation of Wolfgang von Kempelen's Mechanismus der menschlichen Sprache, which was published in 2017.
Prior to joining Google I was involved in a number of projects:
Once again, I have long been interested in writing systems; see, for example, some work I was doing on approximate string matching in the Easter Island rongorongo script. I also ran (with Jerry Packard) a reading group centered around Hannas' controversial thesis relating Asia's supposed technological creativity gap, with the Chinese writing system.
Before joining the faculty at UIUC I worked in the Information Systems and Analysis Research Department headed by Ken Church at AT&T Labs—Research, where I worked on Speech and Text Data Mining: extracting potentially useful information from large speech or text databases using a combination of speech/NLP technology and data mining techniques.
Before joining Ken's department I worked in the Human/Computer Interaction Research Department headed by Candy Kamm. My most recent project in that department was WordsEye, an automatic text-to-scene conversion system. The WordsEye technology is now being developed at WordsEye.com. WordsEye was particularly good for creating surrealistic images that I can easily conceive of but are well beyond my artistic ability to execute. And we were doing this 20 years before the "AI revolution"! All of the following images were generated from text descriptions of the scene:
|
|
|
|
Prior to joining AT&T Labs in 1999 I worked on Text-to-Speech Synthesis at Bell Labs, Lucent Technologies. Among other things, I was responsible for the multilingual text processing module of the Bell Labs Multilingual TTS System.