Windows Speech Recognition: Difference between revisions

| caption = The tutorial for Windows Speech Recognition in [[Windows Vista]] depicting the selection of text in [[WordPad]] for deletion.
| developer = [[Microsoft]]
| operating system = [[Windows Vista]] and later<br/>[[Windows Server 2008]] and later
| genre = [[Speech recognition]]
}}
'''Windows Speech Recognition''' ('''WSR''') is [[speech recognition]] developed by [[Microsoft]] for [[Windows Vista]] that enables [[hands-free computing|voice commands]] to control the [[desktop metaphor|desktop]] [[user interface]]; [[transcription (linguistics)|dictate]] text in [[electronic document]]s and [[email]]; navigate [[website]]s; perform [[keyboard shortcut]]s; and operate the [[cursor (computing)|mouse cursor]]. It supports custom [[macro (computer science)|macro]]s to perform additional or supplementary tasks.
 
WSR is a locally processed speech recognition platform; it does not rely on [[cloud computing]] for accuracy, dictation, or recognition, but adapts based on contexts, grammars, speech samples, training sessions, and vocabularies. It provides a personal dictionary that allows users to include or exclude words or expressions from dictation and to record pronunciations to increase recognition accuracy. With [[Windows Search]],<ref name="ThurrottAllchin">{{cite web |url=http://www.itprotoday.com/jim-allchin-talks-windows-vista |title=Jim Allchin Talks Windows Vista |last=Thurrott |first=Paul |authorlink=Paul Thurrott |date=October 6, 2010 |publisher=[[Penton (company)|Penton]] |work=[[Windows IT Pro]] |accessdate=March 29, 2018}}</ref> it can analyze and collect text in documents and email, as well as [[handwritten]] [[tablet PC]] input, to contextualize and disambiguate terms.<ref name="Privacy">{{cite web |url=http://download.microsoft.com/download/7/9/4/7945a146-fc32-48c2-8c14-83b1b36696e5/Windows%20Vista%20Privacy%20Statement.rtf |title=Windows Vista Privacy Statement |author=[[Microsoft]] |date=2006 |format=RTF |accessdate=July 1, 2015}}</ref> Custom language models that adapt the recognizer to the specific contexts, phonetics, and terminologies of users in particular occupational fields such as legal or medical are also supported.<ref name="CustomizedVocabularies">{{cite web |url=https://blogs.msdn.microsoft.com/robch/2005/09/20/customized-speech-vocabularies-in-windows-vista/ |title=Customized speech vocabularies in Windows Vista |last=Chambers |first=Rob |date=September 20, 2005 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=March 29, 2018}}</ref>
 
WSR was developed to be integrated into Windows Vista, as Windows previously only supported speech recognition exclusive to applications such as [[Windows Media Player]]. [[Microsoft Office XP]] introduced speech recognition for [[Internet Explorer]] and [[Microsoft Office|Office]]. With the release of Windows Vista, [[Microsoft Office 2007|Office 2007]] and later versions of Office rely on WSR, replacing the separate Office speech recognition.<ref name="Office2007SR">{{cite web |url=https://support.office.com/en-us/article/What-happened-to-speech-recognition-c6541b32-82df-4c18-bfa5-c411f45337d3 |title=What happened to speech recognition? |publisher=[[Microsoft]] |work=Office Support |accessdate=November 9, 2016}}</ref> The majority of integrated applications in Windows Vista can be controlled through speech.<ref name="Guide">{{cite web |url=https://msdn.microsoft.com/en-us/library/bb530325.aspx |title=Windows Vista Speech Recognition Step-by-Step Guide |last=Phillips |first=Todd |date=2007 |publisher=[[Microsoft]] |work=[[MSDN]] |accessdate=June 30, 2015}}</ref> WSR is present in [[Windows 7]],<ref name="SpeechRecognitionWindows7">{{cite web |url=http://windows.microsoft.com/en-us/windows/what-can-do-speech-recognition#1TC=windows-7 |title=What can I do with Speech Recognition? 
|author=[[Microsoft]] |work=Windows How-to |accessdate=June 26, 2015}}</ref> [[Windows 8]],<ref name="Windows8SR">{{cite web |url=http://windows.microsoft.com//en-US//windows-8//using-speech-recognition |title=How to use Speech Recognition |publisher=[[Microsoft]] |work=Support |archiveurl=https://web.archive.org/web/20121025193813/http://windows.microsoft.com//en-US//windows-8//using-speech-recognition |archivedate=October 25, 2012 |accessdate=December 24, 2018}}</ref> [[Windows 8.1]],<ref name="UpdatedGuidelines">{{cite web |url=https://support.microsoft.com/en-us/help/14213/windows-how-to-use-speech-recognition |title=How to use Speech Recognition in Windows |date=August 31, 2016 |publisher=[[Microsoft]] |work=Support |accessdate=December 24, 2018}}</ref> [[Windows RT]],<ref name="UpdatedGuidelines"/> and [[Windows 10]].<ref name="Windows10">{{cite web |url=http://windows.microsoft.com/en-us/windows-10/use-voice-recognition-in-windows-10 |title=Use Voice Recognition in Windows 10 |author=[[Microsoft]] |work=Support |accessdate=August 24, 2015}}</ref>
 
==History==
Microsoft was involved in speech recognition and [[speech synthesis]] research for many years before WSR. In 1993, Microsoft hired [[Xuedong Huang]] from [[Carnegie Mellon University]] to lead its speech development efforts; the company's research led to the development of the [[Speech Application Programming Interface|Speech API]] introduced in 1994.<ref name="TalkingWindowsVista">{{cite web |url=http://msdn2.microsoft.com/en-us/magazine/cc163663.aspx |title=Exploring New Speech Recognition And Synthesis APIs In Windows Vista |last=Brown |first=Robert |publisher=[[Microsoft]] |work=MSDN Magazine |archiveurl=https://web.archive.org/web/20080307054756/http://msdn2.microsoft.com/en-us/magazine/cc163663.aspx |archivedate=March 7, 2008 |accessdate=June 26, 2015}}</ref> Speech recognition had also been used in previous Microsoft products. [[Office XP]] and [[Microsoft Office 2003|Office 2003]] provided speech recognition capabilities among [[Internet Explorer]] and [[Microsoft Office]] applications;<ref name="SpeechXP">{{cite web |url=https://support.microsoft.com/en-us/kb/306901 |title=How To Use Speech Recognition in Windows XP |author=[[Microsoft]] |work=Support |accessdate=June 26, 2015}}</ref> it also enabled limited speech functionality in [[Windows 98]], [[Windows ME]], [[Windows NT 4.0]], and [[Windows 2000]].<ref name="Description">{{cite web |url=https://support.microsoft.com/en-us/kb/278927 |title=Description of the speech recognition and handwriting recognition methods in Word 2002 |author=[[Microsoft]] |work=Support |archiveurl=https://web.archive.org/web/20150703125056/https://support.microsoft.com/en-us/kb/278927 |archivedate=July 3, 2015 |accessdate=March 26, 2018}}</ref> [[Windows XP]] [[Windows XP editions#Tablet PC Edition|Tablet PC Edition]] 2002 included speech recognition capabilities with the Tablet PC Input Panel,<ref name="WindowsXPTabletPCEdition">{{cite web 
|url=http://winsupersite.com/article/windows-xp2/windows-xp-tablet-pc-edition-reviewed-127413 |title=Windows XP Tablet PC Edition Review |last=Thurrott |first=Paul |authorlink=Paul Thurrott |date=June 25, 2002 |publisher=[[Penton (company)|Penton]] |work=[[Windows IT Pro]] |accessdate=June 26, 2015}}</ref><ref name="Natural">{{cite web |url=http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt |title=Natural Input On Mobile PC Systems |last=Dresevic |first=Bodin |date=2005 |publisher=[[Microsoft]] |format=PPT |accessdate=June 26, 2015}}</ref> and the [[Microsoft Plus!#Microsoft Plus! for Windows XP|Microsoft Plus! for Windows XP]] expansion package enabled voice commands to be used in [[Windows Media Player]].<ref name="VoiceCommand">{{cite web |url=http://winsupersite.com/article/product-review/plus-for-windows-xp-review |title=Plus! for Windows XP Review |last=Thurrott |first=Paul |authorlink=Paul Thurrott |date=October 6, 2010 |publisher=[[Penton (company)|Penton]] |work=[[Windows IT Pro]] |accessdate=June 30, 2015}}</ref> However, these all required installation of speech recognition as an additional component; before Windows Vista, Windows did not include integrated or extensive speech recognition capabilities.<ref name="Natural"/> [[Microsoft Office 2007|Office 2007]] and later versions rely on WSR for speech recognition services.<ref name="Office2007SR">{{cite web |url=https://support.office.com/en-us/article/What-happened-to-speech-recognition-c6541b32-82df-4c18-bfa5-c411f45337d3 |title=What happened to speech recognition? |publisher=[[Microsoft]] |work=Office Support |accessdate=November 9, 2016}}</ref>
 
===Windows Vista===
[[File:WindowsVistaPreliminaryWSR.PNG|160px|thumb|right|A prototype speech recognition [[Windows Aero#Aero Wizards|Aero Wizard]] in [[Windows Vista]] (then known as "Longhorn") [[Development of Windows Vista#Milestone 7|build 4093]].]]
At [[Windows Hardware Engineering Conference|WinHEC 2002]] Microsoft announced that Windows Vista (codenamed "Longhorn") would include advances in speech recognition and in features such as [[microphone array]] support<ref name="WinHEC2002">{{cite web |url=https://www.pcmag.com/article2/0,2817,1183143,00.asp |title=WinHEC: The Pregame Show |last=Stam |first=Nick |date=April 16, 2002 |publisher=[[Ziff Davis Media]] |work=[[PC Magazine]] |accessdate=June 26, 2015}}</ref> as part of an effort to "provide a consistent quality audio infrastructure for natural (continuous) speech recognition and (discrete) command and control."<ref name="AudioConsiderations">{{cite web |url=http://download.microsoft.com/download/whistler/WHP/1.0/WXP/EN-US/WH02_AV01.exe |title=Audio Considerations for Voice-Enabled Applications |last=Flandern Van |first=Mike |date=2002 |publisher=[[Microsoft]] |work=[[Windows Hardware Engineering Conference]] |format=EXE |archiveurl=https://web.archive.org/web/20020506020208/http://download.microsoft.com/download/whistler/WHP/1.0/WXP/EN-US/WH02_AV01.exe |archivedate=May 6, 2002 |accessdate=March 30, 2018}}</ref> [[Bill Gates]] stated during [[Professional Developers Conference|PDC 2003]] that Microsoft would "build speech capabilities into the system -- a big advance for that in 'Longhorn,' in both recognition and synthesis, real-time";<ref name="SpeechCapabilities">{{cite web |url=http://www.microsoft.com/billgates/speeches/2003/10-27PDC2003.asp |title=Bill Gates' Web Site - Speech Transcript, Microsoft Professional Developers Conference 2003 |author=[[Microsoft]] |date=October 27, 2003 |archiveurl=https://web.archive.org/web/20040203152133/http://www.microsoft.com/billgates/speeches/2003/10-27PDC2003.asp |archivedate=February 3, 2004 |accessdate=June 26, 2015}}</ref><ref name="SpeechPDC2003">{{cite web |url=http://windowsitpro.com/windows-server-2008/live-pdc-2003-day-1-monday |title=Live from PDC 2003: Day 1, Monday |last2=Furman |first2=Keith 
|last=Thurrott |first=Paul |date=October 26, 2003 |publisher=[[Penton (company)|Penton]] |work=[[Windows IT Pro]] |accessdate=June 26, 2015}}</ref> and pre-release builds during [[development of Windows Vista]] included a speech engine with training features.<ref name="Windows2006">{{cite web |url=http://www.techhive.com/article/113631/article.html |title=Your Next OS: Windows 2006? |last=Spanbauer |first=Scott |date=December 4, 2003 |publisher=[[International Data Group|IDG]] |work=TechHive |accessdate=June 25, 2015}}</ref> A PDC 2003 developer presentation stated Windows Vista would also include a user interface for microphone feedback and control, and user configuration and training features.<ref name="UserInputPDC2003">{{cite web |url=http://download.microsoft.com/download/6/6/9/669C56E3-12AF-48C5-AB2A-E7705F1BE37F/CLI351.ppt |title=Keyboard, Speech, and Pen Input in Your Controls |last2=Chambers |first2=Rob |last1=Gjerstad |first=Kevin |date=2003 |publisher=[[Microsoft]] |work=[[Professional Developers Conference]] |format=PPT |archiveurl=https://web.archive.org/web/20121219161523/http://download.microsoft.com/download/6/6/9/669C56E3-12AF-48C5-AB2A-E7705F1BE37F/CLI351.ppt |archivedate=December 19, 2012 |accessdate=March 30, 2018}}</ref> Microsoft clarified the extent to which speech recognition would be integrated when it stated in a pre-release [[software development kit]] that "the common speech scenarios, like speech-enabling menus and buttons, will be enabled system-wide."<ref name="SpeechRecognitionLonghorn">{{cite web |url=http://longhorn.msdn.microsoft.com/lhsdk/speech/speechconcepts.aspx |title=Interacting with the Computer using Speech Input and Speech Output |author=[[Microsoft]] |date=2003 |work=[[MSDN]] |archiveurl=https://web.archive.org/web/20040104193115/http://longhorn.msdn.microsoft.com/lhsdk/speech/speechconcepts.aspx |archivedate=January 4, 2004 |accessdate=June 28, 2015}}</ref>
 
 
During WinHEC 2004 Microsoft included WSR as part of a strategy to improve productivity on mobile PCs.<ref name="MobilePCs">{{cite web |url=http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04023_WINHEC2004.ppt |title=Windows For Mobile PCs And Tablet PCs - CY05 And Beyond |last=Suokko |first=Matti |date=2004 |publisher=[[Microsoft]] |archiveurl=https://web.archive.org/web/20051214170817/http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04023_WINHEC2004.ppt |archivedate=December 14, 2005 |format=PPT |accessdate=July 15, 2015}}</ref><ref name="MobilePCs04">{{cite web |url=http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04022_WINHEC2004.ppt |title=Windows For Mobile PCs and Tablet PCs - CY04 |last=Fish |first=Darrin |date=2004 |publisher=[[Microsoft]] |archiveurl=https://web.archive.org/web/20051214170759/http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04022_WINHEC2004.ppt |archivedate=December 14, 2005 |format=PPT |accessdate=July 15, 2015}}</ref> Microsoft later emphasized [[accessibility]], new mobility scenarios, support for additional languages, and improvements to the speech user experience at WinHEC 2005. 
Unlike the speech support included in Windows XP, which was integrated with the Tablet PC Input Panel and required switching between separate Commanding and Dictation modes, Windows Vista would introduce a dedicated interface for speech input on the desktop and would unify the separate speech modes;<ref name="NaturalInput">{{cite web |url=http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt |title=Natural Input on Mobile PC Systems |last=Dresevic |first=Bodin |date=2005 |publisher=[[Microsoft]] |format=PPT |archiveurl=https://web.archive.org/web/20051214132222/http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt |archivedate=December 14, 2005 |accessdate=March 29, 2018}}</ref> users previously could not speak a command after dictating or vice versa without first switching between these two modes.<ref name="CommandingandDictation">{{cite web |url=http://blogs.msdn.com/b/robch/archive/2005/08/01/446131.aspx |title=Commanding and Dictation - One mode or two in Windows Vista? 
|last=Chambers |first=Rob |date=August 1, 2005 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=June 30, 2015}}</ref> Windows Vista Beta 1 included integrated speech recognition.<ref name="WindowsVistaBeta1">{{cite web |url=http://winsupersite.com/product-review/windows-vista-beta-1-review-part-3 |title=Windows Vista Beta 1 Review (Part 3) |last=Thurrott |first=Paul |authorlink=Paul Thurrott |date=October 6, 2010 |publisher=[[Penton (company)|Penton]] |work=[[Windows IT Pro]] |accessdate=June 26, 2015}}</ref> To incentivize company employees to analyze WSR for software [[software bug|glitch]]es and to provide feedback, Microsoft offered an opportunity for its testers to win a Premium model of the [[Xbox 360]].<ref name="MicrosoftWSRPoster">{{cite web |url=http://www.brian.levy3.net/proj_msft_poster1.html |title=Microsoft Speech Recognition poster |last=Levy |first=Brian |date=2006 |archiveurl=https://web.archive.org/web/20061011080004/http://brian.levy3.net/proj_msft_poster1.html |archivedate=October 11, 2006 |accessdate=March 17, 2016}}</ref>
During a demonstration by Microsoft on July 27, 2006—before Windows Vista's [[release to manufacturing]] (RTM)—a notable incident involving WSR occurred that resulted in an unintended output of "Dear aunt, let's set so double the killer delete select all" when several attempts to dictate led to consecutive output errors;<ref name="GoodDemos">{{cite web |url=http://blogs.reuters.com/blog/archives/1991 |title=UPDATED-When good demos go (very, very) bad |last=Auchard |first=Eric |date=July 28, 2006 |publisher=[[Thomson Reuters]] |archiveurl=https://web.archive.org/web/20110521230956/http://blogs.reuters.com/blog/archives/1991 |archivedate=May 21, 2011 |accessdate=March 29, 2018}}</ref><ref name="MSNBC">{{cite web|url=http://www.nbcnews.com/id/14158843 |title=Software glitch foils Microsoft demo |author=[[NBC News]] |date=August 2, 2006 |publisher=[[Associated Press]] |accessdate=June 30, 2015 }}</ref> the incident was a subject of significant derision among analysts and journalists in the audience.<ref name="NeedsWork">{{cite web |url=http://www.infoworld.com/article/06/07/31/HNvoicevista_1.html |title=Vista voice-recognition feature needs work |last=Montalbano |first=Elizabeth |date=July 31, 2006 |publisher=[[International Data Group|IDG]] |work=[[InfoWorld]] |archiveurl=https://web.archive.org/web/20060805091528/http://www.infoworld.com/article/06/07/31/HNvoicevista_1.html |archivedate=August 5, 2006 |accessdate=June 26, 2015}}</ref><ref name="Stammers">{{cite web |url=http://www.techhive.com/article/126613/article.html |title=Vista's Voice Recognition Stammers |last=Montalbano |first=Elizabeth |date=July 31, 2006 |publisher=[[International Data Group|IDG]] |work=TechHive |accessdate=July 1, 2015}}</ref> Microsoft later revealed that these issues were due to an audio [[Gain (electronics)|gain]] glitch that caused the speech recognizer to distort the dictated words;<ref name="FAM">{{cite web |url=http://blogs.msdn.com/b/robch/archive/2006/07/29/682479.aspx 
|title=FAM: Vista SR Demo failure -- And now you know the rest of the story ... |last=Chambers |first=Rob |date=July 29, 2006 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=June 26, 2015}}</ref> the glitch was fixed before Windows Vista's release.<ref name="FAM"/>
 
Reports from early 2007 indicated that WSR is vulnerable to attackers using speech recognition for malicious operations by playing certain audio commands through a target's speakers;<ref name="SpeechRecognitionHole">{{cite web |url=http://news.bbc.co.uk/2/hi/technology/6320865.stm |title=Vista has speech recognition hole |date=February 1, 2007 |publisher=[[British Broadcasting Corporation|BBC]] |work=[[BBC News]] |accessdate=March 29, 2018}}</ref><ref name="RemoteExploit">{{cite web |url=https://www.engadget.com/2007/02/01/remote-exploit-of-vista-speech-reveals-fatal-flaw/ |title=Remote 'exploit' of Vista Speech reveals fatal flaw |last=Miller |first=Paul |date=February 1, 2007 |publisher=[[AOL]] |work=[[Engadget]] |accessdate=June 28, 2015}}</ref> it was the first vulnerability discovered after Windows Vista's [[Software release life cycle#General availability|general availability]].<ref name="PCWorld">{{cite web |url=http://www.pcworld.com/article/id,128737-c,vistalonghorn/article.html |title=Honeymoon's Over: First Windows Vista Flaw |last=Roberts |first=Paul |date=February 1, 2007 |publisher=[[International Data Group|IDG]] |work=[[PCWorld]] |archiveurl=https://web.archive.org/web/20070204030144/http://www.pcworld.com/article/id,128737-c,vistalonghorn/article.html |archivedate=February 4, 2007 |accessdate=June 28, 2015}}</ref> Microsoft stated that although such an attack is theoretically possible, a number of mitigating factors and prerequisites would limit its effectiveness or prevent it altogether: a target would need the recognizer to be active and configured to properly interpret such commands; microphones and speakers would both need to be enabled and at sufficient volume levels; and an attack would require the computer to perform visible operations and produce audible feedback without users noticing. [[User Account Control]] would also prohibit the occurrence of privileged operations.<ref name="SpeechIssue">{{cite web |url=https://blogs.technet.microsoft.com/msrc/2007/01/31/issue-regarding-windows-vista-speech-recognition/ |title=Issue regarding Windows Vista Speech Recognition |date=January 31, 2007 |publisher=[[Microsoft]] |work=[[Microsoft TechNet|TechNet]] |archive-url=https://web.archive.org/web/20160520045703/https://blogs.technet.microsoft.com/msrc/2007/01/31/issue-regarding-windows-vista-speech-recognition/ |url-status=dead |archivedate=May 20, 2016 |accessdate=March 31, 2018}}</ref>
 
====Windows 7====
[[File:DictationScratchpad.png|thumb|200px|The dictation scratchpad in Windows 7 replaces the "enable dictation everywhere" option of Windows Vista.]]
With Windows 7, the recognizer was updated to use [[Microsoft UI Automation]], which substantially enhanced its performance, and its engine now uses the [[Technical features new to Windows Vista#Audio stack architecture|WASAPI]] audio stack, enabling support for [[echo suppression and cancellation|echo cancellation]]. The document harvester, which can analyze and collect text in email and documents to contextualize user terms, has improved performance and now runs periodically in the background instead of only after recognizer startup. Sleep mode has also seen performance improvements and, to address security issues, the recognizer is turned off by default after users speak "stop listening" instead of being suspended. Windows 7 also introduces an option to submit speech training data to Microsoft to improve future recognizer versions.<ref name="SRWindows7">{{cite web |url=http://blogs.msdn.com/b/tsfaware/archive/2009/01/29/what-s-new-in-windows-speech-recognition.aspx |title=What's new in Windows Speech Recognition? |last=Brown |first=Eric |date=January 29, 2009 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=March 28, 2018}}</ref>
 
Windows 7 introduced an optional dictation scratchpad interface that functions as a temporary document into which users can dictate or type text for insertion into applications that are not compatible with the [[Text Services Framework]].<ref name="SRWindows7"/> Windows Vista previously provided an "enable dictation everywhere" option for such applications.<ref name="DictationWSR">{{cite web |url=https://blogs.msdn.microsoft.com/speech/2007/10/24/where-does-dictation-work-in-windows-speech-recognition/ |title=Where does dictation work in Windows Speech Recognition? |last=Brown |first=Eric |date=October 24, 2007 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=March 28, 2018}}</ref>
 
====Windows 8.x and Windows RT====
WSR can be used to control the [[Metro (design language)|Metro]] user interface in Windows 8, Windows 8.1, and Windows RT with commands to open the [[Windows shell#Charms|Charms bar]] ("Press Windows C"); to dictate or display commands in [[Universal Windows Platform apps#In Windows 8.x|Metro-style apps]] ("Press Windows Z"); to perform tasks in apps (e.g., "Change to Celsius" in [[MSN#Weather|MSN Weather]]); and to display all installed apps listed by the [[Start menu#Third version|Start screen]] ("Apps").<ref name="Windows8SR">{{cite web |url=http://windows.microsoft.com//en-US//windows-8//using-speech-recognition |title=How to use Speech Recognition |publisher=[[Microsoft]] |work=Support |archiveurl=https://web.archive.org/web/20121025193813/http://windows.microsoft.com//en-US//windows-8//using-speech-recognition |archivedate=October 25, 2012 |accessdate=December 24, 2018}}</ref><ref name="UpdatedGuidelines">{{cite web |url=https://support.microsoft.com/en-us/help/14213/windows-how-to-use-speech-recognition |title=How to use Speech Recognition in Windows |date=August 31, 2016 |publisher=[[Microsoft]] |work=Support |accessdate=December 24, 2018}}</ref>
 
====Windows 10====
WSR is featured in the [[Settings (Windows)|Settings]] application starting with the Windows 10 April 2018 Update ([[Windows 10 version history|Version 1803]]); the change first appeared in [[Windows Insider|Insider]] Preview Build 17083.<ref name="WSRInsider">{{cite web |url=https://blogs.windows.com/windowsexperience/2018/01/24/announcing-windows-10-insider-preview-build-17083-for-pc/ |title=Announcing Windows 10 Insider Preview Build 17083 for PC |last=Sarkar |first=Dona |date=January 24, 2018 |publisher=[[Microsoft]] |work=Windows Blogs |accessdate=January 8, 2019}}</ref> The April 2018 Update also introduces a new {{keypress|Win}}+{{keypress|Ctrl}}+{{keypress|S}} keyboard shortcut to activate WSR.<ref name="KeyboardShortcutsAccessibility">{{cite web |url=https://support.microsoft.com/en-us/help/13810/windows-keyboard-shortcuts-accessibility |title=Windows keyboard shortcuts for accessibility |publisher=[[Microsoft]] |work=Support |archiveurl=https://web.archive.org/web/20181012161947/https://support.microsoft.com/en-us/help/13810/windows-keyboard-shortcuts-accessibility |archivedate=October 12, 2018 |accessdate=January 8, 2019}}</ref>
 
==Overview and features==
WSR allows a user to control a computer, including the Windows [[desktop metaphor|desktop]] [[user interface]], through voice commands. Applications, including most of those bundled with Windows, can also be controlled through voice commands.<ref name="Guide"/> Users can dictate text within documents, email, and forms; control the operating system user interface; perform [[keyboard shortcut]]s; and move the [[cursor (computing)|mouse cursor]].<ref name="CommonCommands">{{cite web |url=http://windows.microsoft.com/en-us/windows/common-speech-recognition-commands#1TC=windows-vista |title=Common commands in Speech Recognition |author=[[Microsoft]] |work=Windows How-to |accessdate=June 30, 2015}}</ref> WSR relies on the Speech API developed by Microsoft,<ref name="TalkingWindowsVista"/> and third-party applications must support the Text Services Framework; the majority of integrated applications in Windows Vista can be controlled through speech.<ref name="Guide">{{cite web |url=https://msdn.microsoft.com/en-us/library/bb530325.aspx |title=Windows Vista Speech Recognition Step-by-Step Guide |last=Phillips |first=Todd |date=2007 |publisher=[[Microsoft]] |work=[[MSDN]] |accessdate=June 30, 2015}}</ref> The following languages are supported: [[traditional Chinese characters|Chinese (Traditional)]], [[simplified Chinese characters|Chinese (Simplified)]], [[American English|English (U.S.)]], [[British English|English (U.K.)]], [[French language|French]], [[German language|German]], [[Japanese language|Japanese]], and [[Spanish language|Spanish]].<ref name="SpeechRecognition"/>
 
When WSR is started for the first time, it presents an optional interactive step-by-step tutorial that users can commence to learn basic commands while adapting the recognizer to their specific voice characteristics.<ref name="Guide"/> Accuracy of speech recognition increases through use, which adapts the recognizer to contexts, grammars, speech patterns, and vocabularies.<ref name="SpeechRecognition">{{cite web |url=https://www.microsoft.com/enable/products/windowsvista/speech.aspx |title=Windows Speech Recognition |author=[[Microsoft]] |work=Microsoft Accessibility |accessdate=June 26, 2015}}</ref><ref name="Privacy"/> Individual speech profiles can be created on a per-user basis.<ref name="Privacy"/> Custom language models that adapt the recognizer to the specific contexts, phonetics, and terminologies of users in particular occupational fields such as legal or medical are also supported.<ref name="CustomizedVocabularies">{{cite web |url=https://blogs.msdn.microsoft.com/robch/2005/09/20/customized-speech-vocabularies-in-windows-vista/ |title=Customized speech vocabularies in Windows Vista |last=Chambers |first=Rob |date=September 20, 2005 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=March 29, 2018}}</ref> With [[Windows Search]],<ref name="ThurrottAllchin">{{cite web |url=http://www.itprotoday.com/jim-allchin-talks-windows-vista |title=Jim Allchin Talks Windows Vista |last=Thurrott |first=Paul |authorlink=Paul Thurrott |date=October 6, 2010 |publisher=[[Penton (company)|Penton]] |work=[[Windows IT Pro]] |accessdate=March 29, 2018}}</ref> the recognizer can also optionally analyze and collect text in documents and email, as well as handwritten [[tablet PC]] input, to contextualize and disambiguate terms and thereby improve accuracy.<ref name="Privacy">{{cite web |url=http://download.microsoft.com/download/7/9/4/7945a146-fc32-48c2-8c14-83b1b36696e5/Windows%20Vista%20Privacy%20Statement.rtf |title=Windows Vista Privacy Statement |author=[[Microsoft]] |date=2006 |format=RTF |accessdate=July 1, 2015}}</ref>
 
WSR is a locally processed speech recognition platform; it does not rely on cloud computing for accuracy, dictation, or recognition.<ref name="MicrosoftPrivacyStatement">{{cite web |url=https://privacy.microsoft.com/en-us/privacystatement |title=Microsoft Privacy Statement |publisher=[[Microsoft]] |accessdate=May 12, 2020}}</ref> Speech profiles that store information about users are retained locally.<ref name="Privacy"/> Backups and transfers of profiles can be performed via [[Windows Easy Transfer]].<ref name="Transfer">{{cite web |url=http://blogs.msdn.com/b/robch/archive/2007/02/15/transferring-windows-speech-recognition-profiles-from-one-machine-to-another.aspx |title=Transferring Windows Speech Recognition profiles from one machine to another |last=Chambers |first=Rob |date=February 15, 2007 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=June 28, 2015}}</ref>
 
===Interface===
Colors of the recognizer listening mode button denote its various modes of operation: blue when listening; blue-gray when sleeping; gray when turned off; and yellow when the user switches context (e.g., from the desktop to the taskbar) or when a voice command is misinterpreted. The status area can also display custom user information as part of [[Windows Speech Recognition#Macros|Windows Speech Recognition Macros]].<ref name="WSRMacrosPreview">{{cite web |url=http://kurtsh.com/2008/04/29/beta-windows-speech-recognition-macros-technology-preview/ |title=BETA: 'Windows Speech Recognition Macros' Technology Preview |last=Shintaku |first=Kurt |date=April 29, 2008 |accessdate=March 17, 2016}}</ref><ref name="ControlPCLifehacker">{{cite web |url=http://lifehacker.com/391884/control-your-pc-with-your-voice |title=Control Your PC with Your Voice |last=Pash |first=Adam |date=May 20, 2008 |publisher=[[Gawker Media]] |work=[[Lifehacker]] |accessdate=March 17, 2016}}</ref>
 
====Alternates panel====
[[File:WSR-AlternatesPanel.png|thumb|200px|right|The alternates panel displaying suggestions for a phrase.]]
An ''alternates panel'' disambiguation interface displays a list of items interpreted as being relevant to a user's spoken word(s); if the word or phrase that a user desired to insert into an application is listed among results, a user can speak the corresponding number of the word or phrase in the results and confirm this choice by speaking "OK" to insert it within the application.<ref name="Modes">{{cite web |url=http://blogs.msdn.com/b/robch/archive/2007/11/19/speech-macros-typing-mode-and-spelling-mode-in-windows-speech-recognition.aspx |title=Speech Macros, Typing Mode and Spelling Mode in Windows Speech Recognition |last=Chambers |first=Rob |date=November 19, 2007 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=August 25, 2015}}</ref> The alternates panel will also appear when launching applications or speaking commands that refer to more than one item (e.g., speaking "Start Internet Explorer" may list the web browser and a version of it with browser add-ons disabled). However, an ''ExactMatchOverPartialMatch'' [[Windows Registry]] entry can limit commands to items with exact names if there is more than one instance included in results.<ref name="Clarification">{{cite web |url=http://blogs.msdn.com/b/robch/archive/2007/05/07/windows-speech-recognition-exactmatchoverpartialmatch.aspx |title=Windows Speech Recognition - ExactMatchOverPartialMatch |last=Chambers |first=Rob |date=May 7, 2007 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=August 24, 2015}}</ref>
 
===Common commands===
:: '''Window management commands:''' "Close (alternatively maximize, minimize, or restore) window"; "Close ''that''"; "Close ''name of open application''"; "Switch applications"; "Switch to ''name of open application''"; "Scroll ''direction''"; "Scroll ''direction'' in ''number of pages''"; "Show desktop"; "[[Windows Speech Recognition#Show numbers|Show numbers]]."<ref name="CommonCommands"/>
: '''Speech recognition commands:''' "Start listening"; "Stop listening"; "Show speech options"; "Open speech dictionary"; "Move speech recognition"; "Minimize speech recognition."<ref name="CommonCommands"/> In the English language, applicable commands can be shown by speaking "What can I say?"<ref name="SpeechRecognition"/> Users can also query the recognizer about tasks in Windows by speaking "How can I ''task name''," which opens related help documentation.<ref name="General Commands">{{cite web |url=https://blogs.msdn.microsoft.com/robch/2007/03/12/windows-speech-recognition-general-commands/ |title=Windows Speech Recognition: General commands |last=Chambers |first=Rob |date=March 12, 2007 |publisher=[[Microsoft]] |work=[[Microsoft Developer Network|MSDN]] |accessdate=May 1, 2017}}</ref>
 
====''MouseGrid''====
[[File:Mousegrid.png|thumb|160px|right|The mousegrid on the Windows Vista desktop.]]
''MouseGrid'' enables users to control the mouse cursor by overlaying numbers across nine regions on the screen; these regions gradually narrow as a user speaks the number(s) of the region on which to focus until the desired interface element is reached. Users can then issue commands including "Click ''number of region''," which moves the mouse cursor to the desired region and then clicks it; and "Mark ''number of region''", which allows an item (such as a [[icon (computing)|computer icon]]) in a region to be selected, which can then be clicked with the previous ''click'' command. Users also can interact with multiple regions of the mousegrid at once.<ref name="CommonCommands"/>
 
====''Show Numbers''====
[[File:Show numbers.png|thumb|160px|left|The show numbers command overlaying numbers in the [[Games for Windows#Games Explorer|Games Explorer]].]]
Applications and interface elements that do not present identifiable commands can still be controlled by asking the system to overlay numbers on top of them through a ''show numbers'' command. Once active, speaking the overlaid number selects that item so a user can open it or perform other operations.<ref name="CommonCommands"/> ''Show numbers'' was designed so that users could interact with items that are not readily identifiable.<ref name="US7742923">{{Cite patent|US|7742923| title=Graphic user interface schemes for supporting speech recognition input systems |status=patent |assign1=Microsoft Corporation |invent5=Scholz, Oliver |invent4=Chambers, Robert |invent3=Mowatt, David |invent2=Murillo, Oscar |invent1=Bickel, Ryan}}</ref>
 
===Dictation===
WSR enables dictation of text in the operating system and applications. If a dictation mistake occurs, it can be corrected by speaking "Correct ''word''" or "Correct that"; the alternates panel then appears with suggestions for correction, which can be selected by speaking the number of the suggestion in the list and then "OK." If the desired item is not listed among suggestions, a user can speak it so that it might appear. Alternatively, users can speak "Spell it" or "I'll spell it myself" to speak the desired item on a per-letter basis; users can use their personal alphabet or the [[NATO phonetic alphabet]] when spelling. Multiple words in a sentence can be corrected simultaneously (for example, if a user speaks "dictating" but the recognizer interprets this word as "the thing," a user can state "correct the thing" to correct both words). In the English language, over 100,000 words are recognized by default.<ref name="CustomizedVocabularies"/>
 
====Speech dictionary====
A personal dictionary allows users to include or exclude certain words or expressions from dictation.<ref name="CustomizedVocabularies"/> When a user adds a word beginning with a capital letter to the dictionary, the user can specify whether it should always be capitalized or whether capitalization depends on the context in which the word is spoken. Users can also record pronunciations for words added to the dictionary to increase recognition accuracy; words written via a [[stylus]] on a [[tablet PC]] for the Windows [[handwriting recognition]] feature are also stored. Information stored within a dictionary is included as part of a user's speech profile.<ref name="Privacy"/> Users can open the speech dictionary by speaking the "show speech dictionary" command.
 
===Macros===
 
==Performance==
{{As of|2017}}, WSR uses Microsoft Speech Recognizer 8.0, the version introduced in Windows Vista. Mark Hachman, a Senior Editor of ''[[PC World]]'', found it to be 93.6% accurate for dictation without training, a rate lower than that of competing software. According to Microsoft, the rate of accuracy when trained is 99%. Hachman opined that Microsoft does not publicly discuss the feature because of the 2006 incident during development of Windows Vista, with the result being that few users knew that documents could be dictated within Windows before the introduction of [[Cortana]].<ref name="MSR8">{{cite web |url=http://www.pcworld.com/article/3124761/windows/the-windows-weakness-no-one-mentions-speech-recognition.html |title=The Windows weakness no one mentions: Speech recognition |last=Hachman |first=Mark |date=May 10, 2017 |publisher=[[International Data Group|IDG]] |work=[[PC World]] |accessdate=March 28, 2018}}</ref>
 
==See also==
* [[Microsoft Voice Command]]
* [[Technical features new to Windows Vista]]
* [[Features new to Windows Vista#Windows HotStart|Windows HotStart]]
* [[Windows Mobility Center]]
* [[Windows SideShow]]
 
==References==
{{Reflist|3}}
 
==External links==
 
[[Category:2006 software]]
[[Category:Speech processing]]
[[Category:Speech processing software]]
[[Category:Speech recognition]]
[[Category:Speech recognition software]]
[[Category:Speech synthesis]]
[[Category:Windows components]]
[[Category:Windows Vista]]