Time delay neural network: Difference between revisions

Content deleted Content added
top: | Add: pages, issue, volume, journal, date, title, authors 1-5. | Use this tool. Report bugs. | #UCB_Gadget
Citation bot (talk | contribs)
Altered pages. Formatted dashes. | Use this bot. Report bugs. | Suggested by Headbomb | #UCB_toolbar
Line 37:
 
=== Large vocabulary speech recognition ===
Large vocabulary speech recognition requires recognizing sequences of phonemes that make up words subject to the constraints of a large pronunciation vocabulary. Integration of TDNNs into large vocabulary speech recognizers is possible by introducing state transitions and search between phonemes that make up a word. The resulting Multi-State Time-Delay Neural Network (MS-TDNN) can be trained discriminative from the word level, thereby optimizing the entire arrangement toward word recognition instead of phoneme classification.<ref name=":6" /><ref name=":7">{{cite book | doi=10.1109/ICASSP.1993.319179 | chapter=Improving connected letter recognition by lipreading | title=IEEE International Conference on Acoustics Speech and Signal Processing | date=1993 | last1=Bregler | first1=C. | last2=Hild | first2=H. | last3=Manke | first3=S. | last4=Waibel | first4=A. | pages=557-560557–560 |volume=1 | isbn=0-7803-0946-4 }}</ref><ref name=":2" />
 
=== Speaker independence ===