Time delay neural network: Difference between revisions

Content deleted Content added
free
Large vocabulary speech recognition: | Altered template type. Add: isbn, pages, date, title, chapter, doi, authors 1-4. Changed bare reference to CS1/2. Removed parameters. | Use this tool. Report bugs. | #UCB_Gadget
Line 37:
 
=== Large vocabulary speech recognition ===
Large vocabulary speech recognition requires recognizing sequences of phonemes that make up words subject to the constraints of a large pronunciation vocabulary. Integration of TDNNs into large vocabulary speech recognizers is possible by introducing state transitions and search between phonemes that make up a word. The resulting Multi-State Time-Delay Neural Network (MS-TDNN) can be trained discriminative from the word level, thereby optimizing the entire arrangement toward word recognition instead of phoneme classification.<ref name=":6" /><ref name=":7">[https://ieeexplore.ieee.org/document/319179{{cite C.book Bregler,| Hdoi=10. Hild, S1109/ICASSP. Manke and A1993.319179 Waibel,| "chapter=Improving connected letter recognition by lipreading," 1993| title=IEEE International Conference on Acoustics, Speech, and Signal Processing, Minneapolis,| MN,date=1993 USA,| 1993,last1=Bregler pp| first1=C. | last2=Hild | first2=H. | last3=Manke | first3=S. | last4=Waibel | first4=A. | pages=557-560 vol.|volume=1, doi:| 10.1109/ICASSP.1993.319179.]isbn=0-7803-0946-4 }}</ref><ref name=":2" />
 
=== Speaker independence ===