Revision as of 14:00, 10 May 2025 edit Headbomb (talk \| contribs) Edit filter managers, Autopatrolled, Extended confirmed users, Page movers, File movers, New page reviewers, Pending changes reviewers, Rollbackers, Template editors 473,387 edits →top: \| Add: pages, issue, volume, journal, date, title, authors 1-5. \| Use this tool. Report bugs. \| #UCB_Gadget ← Previous edit		Revision as of 14:07, 10 May 2025 edit undo Citation bot (talk \| contribs) Bots 5,868,227 edits Altered pages. Formatted dashes. \| Use this bot. Report bugs. \| Suggested by Headbomb \| #UCB_toolbar Next edit →
Line 37: === Large vocabulary speech recognition === Large vocabulary speech recognition requires recognizing sequences of phonemes that make up words subject to the constraints of a large pronunciation vocabulary. Integration of TDNNs into large vocabulary speech recognizers is possible by introducing state transitions and search between phonemes that make up a word. The resulting Multi-State Time-Delay Neural Network (MS-TDNN) can be trained discriminative from the word level, thereby optimizing the entire arrangement toward word recognition instead of phoneme classification.<ref name=":6" /><ref name=":7">{{cite book \| doi=10.1109/ICASSP.1993.319179 \| chapter=Improving connected letter recognition by lipreading \| title=IEEE International Conference on Acoustics Speech and Signal Processing \| date=1993 \| last1=Bregler \| first1=C. \| last2=Hild \| first2=H. \| last3=Manke \| first3=S. \| last4=Waibel \| first4=A. \| pages=~~557-560~~557–560 \|volume=1 \| isbn=0-7803-0946-4 }}</ref><ref name=":2" /> === Speaker independence ===

Time delay neural network: Difference between revisions