Motorola NN Speech Synthesizer Article

Orhan Karaali karaali at ukraine.corp.mot.com
Tue Feb 11 12:34:16 EST 1997


FTP-host: archive.cis.ohio-state.edu
FTP-filename: /pub/neuroprose/karaali.synthesis_wcnn96.ps.Z



Motorola Neural Network Speech Synthesizer Article 

A new neural network based speech synthesizer has been developed here at
Motorola Chicago Corporate Research Laboratories by the Speech Synthesis and
Machine Learning Group.  We believe that the quality of the synthesized
speech it produces surpasses the current state of the art, particularly in
naturalness.

An invited paper describing this neural network speech synthesizer
was presented in the Speech Session of the World Congress on Neural
Networks 96 in San Diego.  This paper is now available in the NEUROPROSE
archive as karaali.synthesis_wcnn96.ps.Z.

If you have a problem getting the paper from NEUROPROSE, I can email
it to you.

Orhan Karaali

email: karaali at mot.com


---------------------------------------------------------------------

Speech Synthesis with Neural Networks
Orhan Karaali, Gerald Corrigan, and Ira Gerson
Motorola, Inc., 1301 E. Algonquin Road, Schaumburg, IL 60196
karaali at mot.com, corrigan at mot.com, gerson at mot.com

ABSTRACT

Text-to-speech conversion has traditionally been performed either by
concatenating short samples of speech or by using rule-based systems to convert
a phonetic representation of speech into an acoustic representation, which is
then converted into speech. This paper describes a system that uses a
time-delay neural network (TDNN) to perform this phonetic-to-acoustic mapping,
with another neural network to control the timing of the generated speech.
The neural network system requires less memory than a concatenation system,
and performed well in tests comparing it to commercial systems using other
technologies.



----- End Included Message -----



More information about the Connectionists mailing list