TIP: Click on subject to list as thread! ANSI
echo: electronics
to: MIKE ROSS
from: ROBERT SAYRE
date: 2002-12-14 20:48:00
subject: TALKING BOOKS

JE> I wonder why, if it is so easy. is it so hard for voice
JE> recognition programs to function reasonably accurately???

MR> My guess why speech recognition is so difficult is basically that there
MR> is a lot of information in the audio signal. There may be any number of
MR> quirks in the voice which can make it unrecognizable for the program
MR> comparing between two individuals. The program has to filter for pitch,
MR> tonal quality, speed, inflection, fricatives, and that's just a start.

MR> One can't simply match up a waveform with another but rather there is a
MR> lot of computation done on-the-fly in analyzing it to extract the
MR> information sought from a forest of waveform information. Text to speech
MR> in contrast is almost child's play since the information is already
MR> known and data is simply output from pre-existing lookup tables.

 I'm sorry. What did you say?  

 Robert

 * SLMR 2.1a * Danger!  Attention Span Exceeded!
--- Platinum Xpress/Win/WINServer v3.0pr5
* Origin: Try Our Web Based QWK: DOCSPLACE.ORG (1:123/140)
SEEN-BY: 633/267 270
@PATH: 123/140 500 106/1 379/1 633/267

SOURCE: echomail via fidonet.ozzmosis.com

Email questions or comments to sysop@ipingthereforeiam.com
All parts of this website painstakingly hand-crafted in the U.S.A.!
IPTIA BBS/MUD/Terminal/Game Server List, © 2025 IPTIA Consulting™.