Windows Speech Recognition: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 08:31, 15 May 2020 edit The Rambling Man (talk \| contribs) Autopatrolled, Extended confirmed users, Page movers, IP block exemptions, New page reviewers, Pending changes reviewers, Rollbackers, Template editors 286,464 edits tw ← Previous edit		Latest revision as of 04:23, 14 September 2024 edit undo JWiki2151 (talk \| contribs) Extended confirmed users 7,690 edits Removed a paragraph from the lead Tags: Mobile edit Mobile web edit Advanced mobile edit
(32 intermediate revisions by 23 users not shown)
Line 1: {{Short description\|Speech recognition software}} {{good article}} {{Infobox software \| logo = \| name = Windows Speech Recognition▼ \| logo size = 64px ~~\| screenshot = Windows Vista Speech Recognition Tutorial.PNG~~ ▲\| name = Windows Speech Recognition ~~\| screenshot_size = 300px~~ \| ~~caption~~screenshot = Windows Vista Speech Recognition Tutorial.PNG \| screenshot_size = 300px \| caption = The tutorial for Windows Speech Recognition in [[Windows Vista]] depicting the selection of text in [[WordPad]] for deletion. \| developer = [[Microsoft]] \| released = {{Start date and age\|~~2007~~2006\|0111\|30}} \| operating system = [[Windows Vista]] and later \| genre = [[Speech recognition]] }} '''Windows Speech Recognition''' ('''WSR''') is [[speech recognition]] developed by [[Microsoft]] for [[Windows Vista]] that enables [[hands-free computing\|voice commands]] to control the [[desktop metaphor\|desktop]] [[user interface]];, [[transcription (linguistics)\|dictate]] text in [[electronic document]]s and [[email]];, navigate [[website]]s;, perform [[keyboard shortcut]]s;, and to operate the [[cursor (computing)\|mouse cursor]]. It supports custom [[macro (computer science)\|macro]]s to perform additional or supplementary tasks. WSR is a locally processed speech recognition platform; it does not rely on [[cloud computing]] for accuracy, dictation, or recognition, but adapts based on contexts, grammars, speech samples, training sessions, and vocabularies. It provides a personal dictionary that allows users to include or exclude words or expressions from dictation and to record pronunciations to increase recognition accuracy. Custom language models are also supported. With Windows Vista, WSR was developed to be part of Windows, as speech recognition was previously exclusive to applications such as [[Windows Media Player]]. It is present in [[Windows 7]], [[Windows 8]], [[Windows 8.1]], [[Windows RT]], [[Windows 10]], and [[Windows 1011]]. ==History== Microsoft was involved in speech recognition and [[speech synthesis]] research for many years before WSR. In 1993, Microsoft hired [[Xuedong Huang]] from [[Carnegie Mellon University]] to lead its speech development efforts; the company's research led to the development of the [[Speech Application Programming Interface\|Speech API]] (SAPI) introduced in 1994.<ref name="TalkingWindowsVista">{{cite web \|url=http://msdn2.microsoft.com/en-us/magazine/cc163663.aspx \|title=Exploring New Speech Recognition And Synthesis APIs In Windows Vista \|last=Brown \|first=Robert \|publisher=[[Microsoft]] \|work=MSDN Magazine \|~~archiveurl~~archive-url=https://web.archive.org/web/20080307054756/http://msdn2.microsoft.com/en-us/magazine/cc163663.aspx \|~~archivedate~~archive-date=March 7, 2008 \|~~accessdate~~access-date=June 26, 2015}}</ref> Speech recognition had also been used in previous Microsoft products. [[Office XP]] and [[Microsoft Office 2003\|Office 2003]] provided speech recognition capabilities among [[Internet Explorer]] and [[Microsoft Office]] applications;<ref name="SpeechXP">{{cite web \|url=https://support.microsoft.com/en-us/kb/306901 \|title=How To Use Speech Recognition in Windows XP \|~~author~~publisher=[[Microsoft]] \|work=Windows Support \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20150314222444/https://support.microsoft.com/en-us/kb/306901 26\|archive-date=March 14, 2015 \|access-date=May 15, 2020}}</ref> it also enabled limited speech functionality in [[Windows 98]], [[Windows MEMe]], [[Windows NT 4.0]], and [[Windows 2000]].<ref name="Description">{{cite web \|url=https://support.microsoft.com/en-us/kb/278927 \|title=Description of the speech recognition and handwriting recognition methods in Word 2002 \|~~author~~publisher=[[Microsoft]] \|work=Windows Support \|~~archiveurl~~archive-url=https://web.archive.org/web/20150703125056/https://support.microsoft.com/en-us/kb/278927 \|~~archivedate~~archive-date=July 3, 2015 \|~~accessdate~~access-date=March 26, 2018}}</ref> [[Windows XP]] [[Windows XP editions#Tablet PC Edition\|Tablet PC Edition]] 2002 included speech recognition capabilities with the Tablet PC Input Panel,<ref name="WindowsXPTabletPCEdition">{{cite web \|url=http://winsupersite.com/article/windows-xp2/windows-xp-tablet-pc-edition-reviewed-127413 \|title=Windows XP Tablet PC Edition Review \|last=Thurrott \|first=Paul ~~\|authorlink=Paul Thurrott~~ \|date=June 25, 2002 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20110719201607/http://winsupersite.com/article/windows-xp2/windows-xp-tablet-pc-edition-reviewed-127413 26\|archive-date=July 19, ~~2015~~2011 \|access-date=May 15, 2020}}</ref><ref name="Natural">{{cite web \|url=http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt \|title=Natural Input On Mobile PC Systems \|last=Dresevic \|first=Bodin \|date=2005 \|publisher=[[Microsoft]] \|format=PPT \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20051214132222/http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt 26\|archive-date=December 14, ~~2015~~2005 \|access-date=May 15, 2020}}</ref> and [[Microsoft Plus!#Microsoft Plus! for Windows XP\|Microsoft Plus! for Windows XP]] enabled voice commands for Windows Media Player.<ref name="VoiceCommand">{{cite web \|url=http://winsupersite.com/article/product-review/plus-for-windows-xp-review \|title=Plus! for Windows XP Review \|last=Thurrott \|first=Paul ~~\|authorlink=Paul Thurrott~~ \|date=October 6, 2010 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20110705102849/http://winsupersite.com/article/product-review/plus-for-windows-xp-review 30\|archive-date=July 5, ~~2015~~2011 \|access-date=May 15, 2020}}</ref> However, these all required installation of speech recognition as a separate component; before Windows Vista, Windows did not include integrated or extensive speech recognition.<ref name="Natural"/> [[Microsoft Office 2007\|Office 2007]] and later versions rely on WSR for speech recognition services.<ref name="Office2007SR">{{cite web \|url=https://support.office.com/en-us/article/What-happened-to-speech-recognition-c6541b32-82df-4c18-bfa5-c411f45337d3 \|title=What happened to speech recognition? \|publisher=[[Microsoft]] \|work=Office Support \|~~accessdate~~archive-url=https://web.archive.org/web/20161110044211/https://support.office.com/en-us/article/What-happened-to-speech-recognition-c6541b32-82df-4c18-bfa5-c411f45337d3 \|archive-date=November 910, 2016 \|access-date=May 15, 2020}}</ref> ===Windows Vista=== [[File:WindowsVistaPreliminaryWSR.PNG\|\|thumb\|right\|A prototype speech recognition [[Windows Aero#Aero Wizards\|Aero Wizard]] in [[Windows Vista]] (then known as "Longhorn") [[Development of Windows Vista#Milestone 7\|build 4093]].]] At [[Windows Hardware Engineering Conference\|WinHEC 2002]] Microsoft announced that Windows Vista (codenamed "Longhorn") would include advances in speech recognition and in features such as [[microphone array]] support<ref name="WinHEC2002">{{cite web \|url=https://www.pcmag.com/article2/0,2817,1183143,00.asp \|title=WinHEC: The Pregame Show \|last=Stam \|first=Nick \|date=April 16, 2002 \|publisher=[[Ziff Davis Media]] \|work=[[PC Magazine]] \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20150703193044/https://www.pcmag.com/article2/0,2817,1183143,00.asp 26\|archive-date=July 3, 2015 \|access-date=May 15, 2020}}</ref> as part of an effort to "provide a consistent quality audio infrastructure for natural (continuous) speech recognition and (discrete) command and control."<ref name="AudioConsiderations">{{cite web \|url=http://download.microsoft.com/download/whistler/WHP/1.0/WXP/EN-US/WH02_AV01.exe \|title=Audio Considerations for Voice-Enabled Applications \|last=Flandern Van \|first=Mike \|date=2002 \|publisher=[[Microsoft]] \|work=[[Windows Hardware Engineering Conference]] \|format=EXE \|~~archiveurl~~archive-url=https://web.archive.org/web/20020506020208/http://download.microsoft.com/download/whistler/WHP/1.0/WXP/EN-US/WH02_AV01.exe \|~~archivedate~~archive-date=May 6, 2002 \|~~accessdate~~access-date=March 30, 2018}}</ref> [[Bill Gates]] stated during [[Professional Developers Conference\|PDC 2003]] that Microsoft would "build speech capabilities into the system — a big advance for that in 'Longhorn,' in both recognition and synthesis, real-time";<ref name="SpeechCapabilities">{{cite web \|url=http://www.microsoft.com/billgates/speeches/2003/10-27PDC2003.asp \|title=Bill Gates' Web Site — Speech Transcript, Microsoft Professional Developers Conference 2003 \|~~author~~publisher=[[Microsoft]] \|date=October 27, 2003 \|~~archiveurl~~archive-url=https://web.archive.org/web/20040203152133/http://www.microsoft.com/billgates/speeches/2003/10-27PDC2003.asp \|~~archivedate~~archive-date=February 3, 2004 \|~~accessdate~~access-date=~~June~~May 2615, ~~2015~~2020}}</ref><ref name="SpeechPDC2003">{{cite web \|url=http://windowsitpro.com/windows-server-2008/live-pdc-2003-day-1-monday \|title=Live from PDC 2003: Day 1, Monday \|last2=Furman \|first2=Keith \|~~last~~last1=Thurrott \|~~first~~first1=Paul \|date=October 26, 2003 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20130911021829/http://windowsitpro.com/windows-server-2008/live-pdc-2003-day-1-monday 26\|archive-date=September 11, ~~2015~~2013 \|access-date=May 15, 2020}}</ref> and pre-release builds during the [[development of Windows Vista]] included a speech engine with training features.<ref name="Windows2006">{{cite web \|url=http://www.techhive.com/article/113631/article.html \|title=Your Next OS: Windows 2006? \|last=Spanbauer \|first=Scott \|date=December 4, 2003 \|publisher=[[International Data Group\|IDG]] \|work=TechHive \|~~accessdate~~access-date=June 25, 2015}}</ref> A PDC 2003 developer presentation stated Windows Vista would also include a user interface for microphone feedback and control, and user configuration and training features.<ref name="UserInputPDC2003">{{cite web \|url=http://download.microsoft.com/download/6/6/9/669C56E3-12AF-48C5-AB2A-E7705F1BE37F/CLI351.ppt \|title=Keyboard, Speech, and Pen Input in Your Controls \|last2=Chambers \|first2=Rob \|last1=Gjerstad \|~~first~~first1=Kevin \|date=2003 \|publisher=[[Microsoft]] \|work=[[Professional Developers Conference]] \|format=PPT \|~~archiveurl~~archive-url=https://web.archive.org/web/20121219161523/http://download.microsoft.com/download/6/6/9/669C56E3-12AF-48C5-AB2A-E7705F1BE37F/CLI351.ppt \|~~archivedate~~archive-date=December 19, 2012 \|~~accessdate~~access-date=March 30, 2018}}</ref> Microsoft clarified the extent to which speech recognition would be integrated when it stated in a pre-release [[software development kit]] that "the common speech scenarios, like speech-enabling menus and buttons, will be enabled system-wide."<ref name="SpeechRecognitionLonghorn">{{cite web \|url=http://longhorn.msdn.microsoft.com/lhsdk/speech/speechconcepts.aspx \|title=Interacting with the Computer using Speech Input and Speech Output \|~~author~~date=2003 \|publisher=[[Microsoft]] ~~\|date=2003~~ \|work=[[MSDN]] \|~~archiveurl~~archive-url=https://web.archive.org/web/20040104193115/http://longhorn.msdn.microsoft.com/lhsdk/speech/speechconcepts.aspx \|~~archivedate~~archive-date=January 4, 2004 \|~~accessdate~~access-date=June 28, 2015}}</ref> During WinHEC 2004 Microsoft included WSR as part of a strategy to improve productivity on mobile PCs.<ref name="MobilePCs">{{cite web \|url=http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04023_WINHEC2004.ppt \|title=Windows For Mobile PCs And Tablet PCs — CY05 And Beyond \|last=Suokko \|first=Matti \|date=2004 \|publisher=[[Microsoft]] \|~~archiveurl~~format=PPT \|archive-url=https://web.archive.org/web/20051214170817/http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04023_WINHEC2004.ppt \|~~archivedate~~archive-date=December 14, 2005 \|~~format~~access-date=~~PPT \|accessdate=July~~May 15, ~~2015~~2020}}</ref><ref name="MobilePCs04">{{cite web \|url=http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04022_WINHEC2004.ppt \|title=Windows For Mobile PCs and Tablet PCs -— CY04 \|last=Fish \|first=Darrin \|date=2004 \|publisher=[[Microsoft]] \|~~archiveurl~~format=PPT \|archive-url=https://web.archive.org/web/20051214170759/http://download.microsoft.com/download/1/8/f/18f8cee2-0b64-41f2-893d-a6f2295b40c8/SW04022_WINHEC2004.ppt \|~~archivedate~~archive-date=December 14, 2005 \|~~format~~access-date=~~PPT \|accessdate=July~~May 15, ~~2015~~2020}}</ref> Microsoft later emphasized [[accessibility]], new mobility scenarios, support for additional languages, and improvements to the speech user experience at WinHEC 2005. Unlike the speech support included in Windows XP, which was integrated with the Tablet PC Input Panel and required switching between separate Commanding and Dictation modes, Windows Vista would introduce a dedicated interface for speech input on the desktop and would unify the separate speech modes;<ref name="NaturalInput">{{cite web \|url=http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt \|title=Natural Input on Mobile PC Systems \|last=Dresevic \|first=Bodin \|date=2005 \|publisher=[[Microsoft]] \|format=PPT \|~~archiveurl~~archive-url=https://web.archive.org/web/20051214132222/http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt \|~~archivedate~~archive-date=December 14, 2005 \|~~accessdate~~access-date=~~March~~May 2915, ~~2018~~2020}}</ref> users previously could not speak a command after dictating or vice versa without first switching between these two modes.<ref name="CommandingandDictation">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2005/08/01/446131.aspx \|title=Commanding and Dictation — One mode or two in Windows Vista? \|last=Chambers \|first=Rob \|date=August 1, 2005 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=June 30, 2015}}</ref> Windows Vista Beta 1 included integrated speech recognition.<ref name="WindowsVistaBeta1">{{cite web \|url=http://winsupersite.com/product-review/windows-vista-beta-1-review-part-3 \|title=Windows Vista Beta 1 Review (Part 3) \|last=Thurrott \|first=Paul ~~\|authorlink=Paul Thurrott~~ \|date=October 6, 2010 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20140823104310/http://winsupersite.com/product-review/windows-vista-beta-1-review-part-3 26\|archive-date=August 23, ~~2015~~2014 \|access-date=May 15, 2020}}</ref> To incentivize company employees to analyze WSR for software [[software bug\|glitch]]es and to provide feedback, Microsoft offered an opportunity for its testers to win a Premium model of the [[Xbox 360]].<ref name="MicrosoftWSRPoster">{{cite web \|url=http://www.brian.levy3.net/proj_msft_poster1.html \|title=Microsoft Speech Recognition poster \|last=Levy \|first=Brian \|date=2006 \|~~archiveurl~~archive-url=https://web.archive.org/web/20061011080004/http://brian.levy3.net/proj_msft_poster1.html \|~~archivedate~~archive-date=October 11, 2006 \|~~accessdate~~access-date=~~March~~May 1715, ~~2016~~2020}}</ref> During a demonstration by Microsoft on July 27, 2006—before Windows Vista's [[release to manufacturing]] (RTM)—a notable incident involving WSR occurred that resulted in an unintended output of "Dear aunt, let's set so double the killer delete select all" when several attempts to dictate led to consecutive output errors;<ref name="GoodDemos">{{cite web \|url=http://blogs.reuters.com/blog/archives/1991 \|title=Updated – When good demos go (very, very) bad \|last=Auchard \|first=Eric \|date=July 28, 2006 \|publisher=[[Thomson Reuters]] \|~~archiveurl~~archive-url=https://web.archive.org/web/20110521230956/http://blogs.reuters.com/blog/archives/1991 \|~~archivedate~~archive-date=May 21, 2011 \|~~accessdate~~url-status=dead \|access-date=March 29, 2018}}</ref><ref name="MSNBC">{{cite web\|url=~~http~~https://www.nbcnews.com/id/~~14158843~~wbna14158843 \|title=Software glitch foils Microsoft demo ~~\|author=[[NBC News]]~~ \|date=August 2, 2006 \|publisher=[[~~Associated~~NBC ~~Press~~News]] \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20180328233150/http://www.nbcnews.com/id/14158843/ 30\|archive-date=March 28, ~~2015~~2018 \|url-status=live \|access-date=May 15, 2020}}</ref> the incident was a subject of significant derision among analysts and journalists in the audience.,<ref name="NeedsWork">{{cite web \|url=http://www.infoworld.com/article/06/07/31/HNvoicevista_1.html \|title=Vista voice-recognition feature needs work \|last=Montalbano \|first=Elizabeth \|date=July 31, 2006 \|publisher=[[International Data Group\|IDG]] \|work=[[InfoWorld]] \|~~archiveurl~~archive-url=https://web.archive.org/web/20060805091528/http://www.infoworld.com/article/06/07/31/HNvoicevista_1.html \|~~archivedate~~archive-date=August 5, 2006 \|~~accessdate~~access-date=June 26, 2015}}</ref><ref name="Stammers">{{cite web \|url=http://www.techhive.com/article/126613/article.html \|title=Vista's Voice Recognition Stammers \|last=Montalbano \|first=Elizabeth \|date=July 31, 2006 \|publisher=[[International Data Group\|IDG]] \|work=TechHive \|~~accessdate~~archive-url=https://web.archive.org/web/20150703154114/http://www.techhive.com/article/126613/article.html \|archive-date=July 13, 2015 \|access-date=May 15, 2020}}</ref> ~~Microsoft~~despite ~~later~~another ~~revealed~~demonstration ~~that~~for application management and navigation being successful.<ref name="GoodDemos"/> Microsoft revealed these issues were due to an audio [[Gain (electronics)\|gain]] glitch that caused the ~~speech~~ recognizer to distort ~~the~~commands ~~dictated~~and ~~words~~dictations; the glitch was fixed before Windows Vista's release.<ref name="FAM">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2006/07/29/682479.aspx \|title=FAM: Vista SR Demo failure — And now you know the rest of the story ... \|last=Chambers \|first=Rob \|date=July 29, 2006 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~archive-url=~~June~~https://web.archive.org/web/20110522071447/http://blogs.msdn.com/b/robch/archive/2006/07/29/682479.aspx 26\|archive-date=May 22, ~~2015~~2011 \|access-date=May 15, 2020}}</ref> Reports from early 2007 indicated that WSR is vulnerable to attackers using speech recognition for malicious operations by playing certain audio commands through a target's speakers;<ref name="SpeechRecognitionHole">{{cite web \|url=http://news.bbc.co.uk/2/hi/technology/6320865.stm \|title=Vista has speech recognition hole \|date=February 1, 2007 \|publisher=[[British Broadcasting Corporation\|BBC]] \|work=[[BBC News]] \|~~accessdate~~archive-url=~~March~~https://web.archive.org/web/20070203051551/http://news.bbc.co.uk/2/hi/technology/6320865.stm \|archive-date=February 3, 2007 \|access-date=May 2915, ~~2018~~2020}}</ref><ref name="RemoteExploit">{{cite web \|url=https://www.engadget.com/2007/02/01/remote-exploit-of-vista-speech-reveals-fatal-flaw/ \|title=Remote 'exploit' of Vista Speech reveals fatal flaw \|last=Miller \|first=Paul \|date=February 1, 2007 \|publisher=[[AOL]] \|work=[[Engadget]] \|~~accessdate~~access-date=June 28, 2015}}</ref> it was the first vulnerability discovered after Windows Vista's [[Software release life cycle#General availability\|general availability]].<ref name="PCWorld">{{cite web \|url=http://www.pcworld.com/article/id,128737-c,vistalonghorn/article.html \|title=Honeymoon's Over: First Windows Vista Flaw \|last=Roberts \|first=Paul \|date=February 1, 2007 \|publisher=[[International Data Group\|IDG]] \|work=[[PCWorld]] \|~~archiveurl~~archive-url=https://web.archive.org/web/20070204030144/http://www.pcworld.com/article/id,128737-c,vistalonghorn/article.html \|~~archivedate~~archive-date=February 4, 2007 \|~~accessdate~~access-date=June 28, 2015}}</ref> Microsoft stated that although such an attack is theoretically possible, a number of mitigating factors and prerequisites would limit its effectiveness or prevent it altogether: a target would need the recognizer to be active and configured to properly interpret such commands; microphones and speakers would both need to be enabled and at sufficient volume levels; and an attack would require the computer to perform visible operations and produce audible feedback without users noticing. [[User Account Control]] would also prohibit the occurrence of privileged operations.<ref name="SpeechIssue">{{cite web \|url=https://blogs.technet.microsoft.com/msrc/2007/01/31/issue-regarding-windows-vista-speech-recognition/ \|title=Issue regarding Windows Vista Speech Recognition \|date=January 31, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft TechNet\|TechNet]] \|archive-url=https://web.archive.org/web/20160520045703/https://blogs.technet.microsoft.com/msrc/2007/01/31/issue-regarding-windows-vista-speech-recognition/ \|url-status=dead \|~~archivedate~~archive-date=May 20, 2016 \|~~accessdate~~access-date=March 31, 2018}}</ref> ===Windows 7=== [[File:DictationScratchpad.png\|thumb\|200px\|The dictation scratchpad in Windows 7 replaces the "enable dictation everywhere" option of Windows Vista.]] WSR was updated to use [[Microsoft UI Automation]] and its engine now uses the [[Technical features new to Windows Vista#Audio stack architecture\|WASAPI]] audio stack, substantially enhancing its performance and enabling support for [[echo suppression and cancellation\|echo cancellation]], respectively. The document harvester, which can analyze and collect text in email and documents to contextualize user terms has improved performance, and now runs periodically in the background instead of only after recognizer startup. Sleep mode has also seen performance improvements and, to address security issues, the recognizer is turned off by default after users speak "stop listening" instead of being suspended. Windows 7 also introduces an option to submit speech training data to Microsoft to improve future recognizer versions.<ref name="SRWindows7">{{cite web \|url=http://blogs.msdn.com/b/tsfaware/archive/2009/01/29/what-s-new-in-windows-speech-recognition.aspx \|title=What's new in Windows Speech Recognition? \|last=Brown \|first=Eric \|date=January 29, 2009 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~archive-url=~~March~~https://web.archive.org/web/20110128222812/http://blogs.msdn.com/b/tsfaware/archive/2009/01/29/what-s-new-in-windows-speech-recognition.aspx \|archive-date=January 28, ~~2018~~2011 \|access-date=May 15, 2020}}</ref> A new dictation scratchpad interface functions as a temporary document into which users can dictate or type text for insertion into applications that are not compatible with the [[Text Services Framework]].<ref name="SRWindows7"/> Windows Vista previously provided an "enable dictation everywhere option" for such applications.<ref name="DictationWSR">{{cite web \|url=https://blogs.msdn.microsoft.com/speech/2007/10/24/where-does-dictation-work-in-windows-speech-recognition/ \|title=Where does dictation work in Windows Speech Recognition? \|last=Brown \|first=Eric \|date=October 24, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=March 28, 2018}}</ref> ===Windows 8.x and Windows RT=== WSR can be used to control the [[Metro (design language)\|Metro]] user interface in Windows 8, Windows 8.1, and Windows RT with commands to open the [[Windows shell#Charms\|Charms bar]] ("Press Windows C"); to dictate or display commands in [[Universal Windows Platform apps#In Windows 8.x\|Metro-style apps]] ("Press Windows Z"); to perform tasks in apps (e.g., "Change to Celsius" in [[MSN#Weather\|MSN Weather]]); and to display all installed apps listed by the [[Start menu#Third version\|Start screen]] ("Apps").<ref name="Windows8SR">{{cite web \|url=http://windows.microsoft.com//en-US//windows-8//using-speech-recognition \|title=How to use Speech Recognition \|publisher=[[Microsoft]] \|work=Windows Support \|~~archiveurl~~archive-url=https://web.archive.org/web/20121025193813/http://windows.microsoft.com//en-US//windows-8//using-speech-recognition \|~~archivedate~~archive-date=October 25, 2012 \|~~accessdate~~access-date=December 24, 2018}}</ref><ref name="UpdatedGuidelines">{{cite web \|url=https://support.microsoft.com/en-us/help/14213/windows-how-to-use-speech-recognition \|title=How to use Speech Recognition in Windows \|date=August 31, 2016 \|publisher=[[Microsoft]] \|work=Windows Support \|~~accessdate~~access-date=December 24, 2018}}</ref> ===Windows 10=== WSR is featured in the [[Settings (Windows)\|Settings]] application starting with the Windows 10 April 2018 Update ([[Windows 10 version history\|Version 1803]]); the change first appeared in [[Windows Insider\|Insider]] Preview Build 17083.<ref name="WSRInsider">{{cite web \|url=https://blogs.windows.com/windowsexperience/2018/01/24/announcing-windows-10-insider-preview-build-17083-for-pc/ \|title=Announcing Windows 10 Insider Preview Build 17083 for PC \|last=Sarkar \|first=Dona \|date=January 24, 2018 \|publisher=[[Microsoft]] \|work=Windows Blogs \|~~accessdate~~archive-url=https://web.archive.org/web/20180124224723/https://blogs.windows.com/windowsexperience/2018/01/24/announcing-windows-10-insider-preview-build-17083-for-pc/ \|archive-date=January 824, 2018 \|access-date=May 15, ~~2019~~2020}}</ref> The April 2018 Update also introduces a new {{keypress\|Win}}+{{keypress\|Ctrl}}+{{keypress\|S}} keyboard shortcut to activate WSR.<ref name="KeyboardShortcutsAccessibility">{{cite web \|url=https://support.microsoft.com/en-us/help/13810/windows-keyboard-shortcuts-accessibility \|title=Windows keyboard shortcuts for accessibility \|publisher=[[Microsoft]] \|work=Windows Support \|~~archiveurl~~archive-url=https://web.archive.org/web/20181012161947/https://support.microsoft.com/en-us/help/13810/windows-keyboard-shortcuts-accessibility \|~~archivedate~~archive-date=October 12, 2018 \|~~accessdate~~access-date=January 8, 2019}}</ref> ===Windows 11=== In Windows 11 version 22H2, a second Microsoft app, Voice Access, was added in addition to WSR.<ref>{{Cite web \|title=Set up voice access - Microsoft Support \|url=https://support.microsoft.com/en-us/topic/set-up-voice-access-9fc44e29-12bf-4d86-bc4e-e9bb69df9a0e \|access-date=2022-12-10 \|website=support.microsoft.com}}</ref><ref>{{Cite web \|last=Hachman \|first=Mark \|title=New Windows 11 build tests Voice Access, Spotlight backgrounds \|url=https://www.pcworld.com/article/558293/new-windows-11-build-tests-voice-access-spotlight-backgrounds-feature.html \|access-date=2022-12-10 \|website=PCWorld \|language=en}}</ref> In December 2023 Microsoft announced that WSR is deprecated in favor of Voice Access and may be removed in a future build or release of Windows.<ref name="DeprecatedFeatures">{{cite web \|url=https://learn.microsoft.com/en-us/windows/whats-new/deprecated-features \|title=Deprecated features in the Windows client - What's new in Windows \|author=[[Microsoft]] \|access-date=December 7, 2023}}</ref> ==Overview and features== WSR allows a user to control applications and the Windows [[desktop metaphor\|desktop]] [[user interface]] ~~through voice commands. Applications—including most of those bundled with Windows—can be controlled~~ through voice commands.<ref name="Guide"/> Users can dictate text within documents, email, and forms; control the operating system user interface; perform [[keyboard shortcut]]s; and move the [[cursor (computing)\|mouse cursor]].<ref name="CommonCommands">{{cite web \|url=http://windows.microsoft.com/en-us/windows/common-speech-recognition-commands#1TC=windows-vista \|title=~~Common commands in~~Windows Speech Recognition commands \|~~author~~publisher=[[Microsoft]] \|work=Windows ~~How-to~~Support \|~~accessdate~~access-date=~~June~~May 3015, ~~2015~~2020}}</ref> ~~WSR relies on the Speech API developed by Microsoft,<ref name="TalkingWindowsVista"/> and third-party applications must support the Text Services Framework for dictation; the~~The majority of integrated applications in Windows Vista can be controlled ~~through speech.~~;<ref name="Guide">{{cite web \|url=https://msdn.microsoft.com/en-us/library/bb530325.aspx \|title=Windows Vista Speech Recognition Step-by-Step Guide \|last=Phillips \|first=Todd \|date=2007 \|publisher=[[Microsoft]] \|work=[[MSDN]] \|~~accessdate~~access-date=June 30, 2015}}</ref> ~~The~~third-party ~~following~~applications ~~languages~~must ~~are~~support ~~supported:~~the ~~[[traditional~~Text ~~Chinese~~Services ~~characters\|Chinese~~Framework ~~(Traditional)]],~~for ~~[[simplified~~dictation.<ref ~~Chinese characters\|Chinese (Simplified)]],~~name="TalkingWindowsVista"/> [[American English\|English (U.S.)]], [[British English\|English (U.K.)]], [[French language\|French]], [[German language\|German]], [[Japanese language\|Japanese]], [[Mandarin Chinese]], and [[Spanish language\|Spanish]] are supported languages.<ref name="SpeechRecognition">{{cite web \|url=https://www.microsoft.com/enable/products/windowsvista/speech.aspx \|title=Windows Speech Recognition \|publisher=[[Microsoft]] \|work=Microsoft Accessibility \|archive-url=https://web.archive.org/web/20070204044614/https://www.microsoft.com/enable/products/windowsvista/speech.aspx \|archive-date=February 4, 2007 \|access-date=May 15, 2020}}</ref> When ~~WSR is~~ started for the first time, itWSR presents a microphone setup wizard and an optional interactive step-by-step tutorial that users can commence to learn basic commands while adapting the recognizer to their specific voice characteristics;<ref name="Guide"/> the tutorial is estimated to require approximately 10 minutes to complete.<ref name="MSR8">{{cite web \|url=http://www.pcworld.com/article/3124761/windows/the-windows-weakness-no-one-mentions-speech-recognition.html \|title=The Windows weakness no one mentions: Speech recognition \|last=Hachman \|first=Mark \|date=May 10, 2017 \|publisher=[[International Data Group\|IDG]] \|work=[[PC World]] \|~~accessdate~~access-date=March 28, 2018}}</ref> The accuracy of the recognizer increases ~~regular~~ through regular use, which adapts it to contexts, grammars, patterns, and vocabularies.<ref name="SpeechRecognition"~~>{{cite web \|url=https:~~/~~/www.microsoft.com/enable/products/windowsvista/speech.aspx \|title=Windows Speech Recognition \|author=[[Microsoft]] \|work=Microsoft Accessibility \|accessdate=June 26, 2015}}</ref~~><ref name="Privacy"/> Custom language models ~~that adapt the recognizer to~~for the specific contexts, phonetics, and terminologies of users in particular occupational fields such as legal or medical are also supported.<ref name="CustomizedVocabularies">{{cite web \|url=https://blogs.msdn.microsoft.com/robch/2005/09/20/customized-speech-vocabularies-in-windows-vista/ \|title=Customized speech vocabularies in Windows Vista \|last=Chambers \|first=Rob \|date=September 20, 2005 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=March 29, 2018}}</ref> With [[Windows Search]],<ref name="ThurrottAllchin">{{cite web \|url=http://www.itprotoday.com/jim-allchin-talks-windows-vista \|title=Jim Allchin Talks Windows Vista \|last=Thurrott \|first=Paul ~~\|authorlink=Paul Thurrott~~ \|date=October 6, 2010 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|~~accessdate~~archive-url=https://web.archive.org/web/20180328102911/http://www.itprotoday.com/jim-allchin-talks-windows-vista \|archive-date=March 2928, 2018 \|access-date=May 15, 2020}}</ref> the recognizer also can optionally harvest text in documents, email, as well as handwritten [[tablet PC]] input to contextualize and disambiguate terms to improve accuracy; no information is sent to Microsoft.<ref name="Privacy">{{cite web \|url=http://download.microsoft.com/download/7/9/4/7945a146-fc32-48c2-8c14-83b1b36696e5/Windows%20Vista%20Privacy%20Statement.rtf \|title=Windows Vista Privacy Statement \|~~author~~date=2006 \|format=RTF \|publisher=[[Microsoft]] \|~~date~~archive-url=~~2006~~https://web.archive.org/web/20080830041216/http://download.microsoft.com/download/7/9/4/7945a146-fc32-48c2-8c14-83b1b36696e5/Windows%20Vista%20Privacy%20Statement.rtf \|~~format~~archive-date=~~RTF~~August 30, 2008 \|~~accessdate~~access-date=~~July~~May 115, ~~2015~~2020}}</ref> WSR is a locally processed speech recognition platform; it does not rely on cloud computing for accuracy, dictation, or recognition.<ref name="MicrosoftPrivacyStatement">{{cite web \|url=https://privacy.microsoft.com/en-us/privacystatement \|title=Microsoft Privacy Statement \|publisher=[[Microsoft]] \|~~accessdate~~access-date=May 12, 2020}}</ref> Speech profiles that store information about users are retained locally.<ref name="Privacy"/> Backups and transfers of profiles can be performed via [[Windows Easy Transfer]].<ref name="Transfer">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2007/02/15/transferring-windows-speech-recognition-profiles-from-one-machine-to-another.aspx \|title=Transferring Windows Speech Recognition profiles from one machine to another \|last=Chambers \|first=Rob \|date=February 15, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=June 28, 2015}}</ref> ===Interface=== Line 53 ⟶ 60: * '''Sleeping''': The recognizer will not listen for or respond to commands other than "Start listening" * '''Off''': The recognizer will not listen or respond to any commands; this mode can be enabled by speaking "Stop listening" Colors of the recognizer listening mode button denote its various modes of operation: blue when listening; blue-gray when sleeping; gray when turned off; and yellow when the user switches context (e.g., from the desktop to the taskbar) or when a voice command is misinterpreted. The status area can also display custom user information as part of [[Windows Speech Recognition#Macros\|Windows Speech Recognition Macros]].<ref name="WSRMacrosPreview">{{cite web \|url=http://kurtsh.com/2008/04/29/beta-windows-speech-recognition-macros-technology-preview/ \|title=BETA: 'Windows Speech Recognition Macros' Technology Preview \|last=Shintaku \|first=Kurt \|date=April 29, 2008 \|~~accessdate~~access-date=March 17, 2016}}</ref><ref name="ControlPCLifehacker">{{cite web \|url=http://lifehacker.com/391884/control-your-pc-with-your-voice \|title=Control Your PC with Your Voice \|last=Pash \|first=Adam \|date=May 20, 2008 \|publisher=[[Gawker Media]] \|work=[[Lifehacker]] \|~~accessdate~~access-date=March 17, 2016}}</ref> [[File:WSR-AlternatesPanel.png\|thumb\|200px\|right\|The alternates panel displaying suggestions for a phrase.]] ====Alternates panel==== An alternates panel disambiguation interface lists items interpreted as being relevant to a user's spoken word(s); if the word or phrase that a user desired to insert into an application is listed among results, a user can speak the corresponding number of the word or phrase in the results and confirm this choice by speaking "OK" to insert it within the application.<ref name="Modes">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2007/11/19/speech-macros-typing-mode-and-spelling-mode-in-windows-speech-recognition.aspx \|title=Speech Macros, Typing Mode and Spelling Mode in Windows Speech Recognition \|last=Chambers \|first=Rob \|date=November 19, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=August 25, 2015}}</ref> The alternates panel also appear when launching applications or speaking commands that refer to more than one item (e.g., speaking "Start Internet Explorer" may list both the web browser and a separate version with add-ons disabled). An ''ExactMatchOverPartialMatch'' entry in the [[Windows Registry]] can limit commands to items with exact names if there is more than one instance included in results.<ref name="Clarification">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2007/05/07/windows-speech-recognition-exactmatchoverpartialmatch.aspx \|title=Windows Speech Recognition — ExactMatchOverPartialMatch \|last=Chambers \|first=Rob \|date=May 7, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=August 24, 2015}}</ref> ===Common commands=== Line 66 ⟶ 73: :: '''Keyboard shortcuts:''' "Press ''keyboard key''"; "Press ''{{Keypress\|SHIFT}}'' plus ''{{Keypress\|a}}''"; "Press capital ''{{Keypress\|b}}''." :: Keys that can be pressed without first giving the press command include: {{Keypress\|Backspace}}, {{Keypress\|Delete}}, {{Keypress\|End}}, {{Keypress\|Enter}}, {{Keypress\|Home}}, {{Keypress\|Page Down}}, {{Keypress\|Page Up}}, and {{Keypress\|Tab}}.<ref name="CommonCommands"/> :: '''Mouse commands:''' "Click"; "Click ''that''"; "Double-click"; "Double-click ''that''"; "Mark"; "Mark ''that''"; "Right-click"; "Right-click ''that''"; "[[Windows Speech Recognition#Mousegrid\|~~Mousegrid~~MouseGrid]]".<ref name="CommonCommands"/> :: '''Window management commands:''' "Close (alternatively maximize, minimize, or restore) window"; "Close ''that''"; "Close ''name of open application''"; "Switch applications"; "Switch to ''name of open application''"; "Scroll ''direction''"; "Scroll ''direction'' in ''number of pages''"; "Show desktop"; "[[Windows Speech Recognition#Show numbers\|Show ~~numbers~~Numbers]]."<ref name="CommonCommands"/> : '''Speech recognition commands:''' "Start listening"; "Stop listening"; "Show speech options"; "Open speech dictionary"; "Move speech recognition"; "Minimize speech recognition"; "Restore speech recognition".<ref name="CommonCommands"/> In the English language, applicable commands can be shown by speaking "What can I say?"<ref name="SpeechRecognition"/> Users can also query the recognizer about tasks in Windows by speaking "How do I ''task name''" (e.g., "How do I install a printer?") which opens related help documentation.<ref name="General Commands">{{cite web \|url=https://blogs.msdn.microsoft.com/robch/2007/03/12/windows-speech-recognition-general-commands/ \|title=Windows Speech Recognition: General commands \|last=Chambers \|first=Rob \|date=March 12, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=May 1, 2017}}</ref> [[File:Mousegrid.png\|thumb\|160px\|right\|The MouseGrid command displaying a grid of numbers on the Windows Vista desktop.]] ====''MouseGrid''==== ''MouseGrid'' enables users to control the mouse cursor by overlaying numbers across nine regions on the screen; these regions gradually narrow as a user speaks the number(s) of the region on which to focus until the desired interface element is reached. Users can then issue commands including "Click ''number of region''," which moves the mouse cursor to the desired region and then clicks it; and "Mark ''number of region''", which allows an item (such as a [[icon (computing)\|computer icon]]) in a region to be selected, which can then be clicked with the previous ''click'' command. Users also can interact with multiple regions ~~of the mousegrid~~ at once.<ref name="CommonCommands"/> ====''Show Numbers''==== Applications and interface elements that do not present identifiable commands can still be controlled by asking the system to overlay numbers on top of them through a ''~~show~~Show ~~numbers~~Numbers'' command. Once active, speaking the overlaid number selects that item so a user can open it or perform other operations.<ref name="CommonCommands"/> ''Show ~~numbers~~Numbers'' was designed so that users could interact with items that are not readily identifiable.<ref ~~name="US7742923"~~>{{Cite patent\|country=US\|number=7742923\| title=Graphic user interface schemes for supporting speech recognition input systems ~~\|status=patent~~ \|assign1=[[Microsoft Corporation ]]\|~~invent5~~inventor4=~~Scholz,~~Robert Chambers;Oliver Scholz\|~~invent4~~inventor3-last=~~Chambers, Robert~~ Mowatt\|~~invent3~~inventor3-first=~~Mowatt,~~ David \|~~invent2~~inventor2-last=Murillo, \|inventor2-first=Oscar \|~~invent1~~inventor1-last=Bickel, \|inventor1-first=Ryan}}</ref> [[File:Show numbers.png\|thumb\|160px\|left\|The ~~show~~Show ~~numbers~~Numbers command overlaying numbers in the [[Games for Windows#Games Explorer\|Games Explorer]].]] ===Dictation=== WSR enables dictation of text in ~~the operating system~~applications and ~~applications~~Windows. If a dictation mistake occurs it can be corrected by speaking "Correct ''word''" or "Correct that" and the alternates panel will appear and provide suggestions for correction; these suggestions can be selected by speaking the number corresponding to the number of the suggestion ~~in the list~~ and by speaking "OK." If the desired item is not listed among suggestions, a user can speak it so that it might appear. Alternatively, users can speak "Spell it" or "I'll spell it myself" to speak the desired ~~item~~word on ~~a per~~letter-by-letter basis; users can use their personal alphabet or the [[NATO phonetic alphabet]] (e.g., "N as in November") when spelling.<ref name="CustomizedVocabularies"/> Multiple words in a sentence can be corrected simultaneously (for example, if a user speaks "dictating" but the recognizer interprets this word as "the thing," a user can state "correct the thing" to correct both words at once). In the English language over 100,000 words are recognized by default.<ref name="CustomizedVocabularies"/> ====Speech dictionary==== Line 84 ⟶ 93: ===Macros=== [[File:WSRMacroOptions.png\|thumb\|160px\|left\|An Aero Wizard interface displaying options to create speech recognition macros.]] WSR supports custom macros through a supplementary application by Microsoft that enables additional [[natural language processing\|natural language]] commands.<ref name="WSRM">{{cite web \|url=http://www.microsoft.com/en-us/download/details.aspx?id=13045 \|title=Windows Speech Recognition Macros \|~~author~~publisher=[[Microsoft]] \|work=Download Center \|~~accessdate~~access-date=June 29, 2015}}</ref><ref name="Ars">{{cite web \|url=https://arstechnica.com/information-technology/2008/04/wsr-macros-extend-windows-vistas-speech-recognition-feature/ \|title=WSR Macros extend Windows Vista's speech recognition feature \|last=Protalinski \|first=Emil \|date=April 30, 2008 \|publisher=[[Condé Nast]] \|work=[[ArsTechnica]] \|~~accessdate~~access-date=June 29, 2015}}</ref> As an example of this functionality, an email macro released by Microsoft enables a natural language command where a user can speak "send email to ''contact'' about ''subject''," which opens [[Microsoft Outlook]] to compose a new message with the designated contact and subject automatically inserted.<ref name="MicrosoftOutlook">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/06/09/macro-of-the-day-send-email-to-outlookcontact.aspx \|title=Macro of the Day: Send Email to [OutlookContact] \|last=Chambers \|first=Rob \|date=June 9, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=June 26, 2015}}</ref> Microsoft has also released sample macros for the speech dictionary,<ref name="SpeechDictionaryMacro">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/08/02/speech-macro-of-the-day-speech-dictionary.aspx \|title=Speech Macro of the Day: Speech Dictionary \|last=Chambers \|first=Rob \|date=August 2, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=September 3, 2015}}</ref> for Windows Media Player,<ref name="MediaPlayer">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/07/01/macro-of-the-day-windows-media-player.aspx \|title=Macro of the Day: Windows Media Player \|last=Chambers \|first=Rob \|date=July 1, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=June 26, 2015}}</ref> for [[Microsoft PowerPoint]],<ref name="NextSlide">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/06/03/macro-of-the-day-next-slide.aspx \|title=Macro of the day: Next Slide \|last=Chambers \|first=Rob \|date=June 3, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=September 3, 2015}}</ref> for [[speech synthesis]],<ref name="ReadThat">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/05/28/macro-of-the-day-read-that.aspx \|title=Macro of the Day: Read that \|last=Chambers \|first=Rob \|date=May 28, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=June 26, 2015}}</ref> to switch between multiple microphones,<ref name="Microphone">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/11/07/macro-of-the-day-microphone-control.aspx \|title=Macro of the Day: Microphone Control \|last=Chambers \|first=Rob \|date=November 7, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=June 30, 2015}}</ref> to customize various aspects of audio device configuration such as volume levels,<ref name="SpeakersMacro">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/08/18/macro-of-the-day-mute-the-speakers.aspx \|title=Macro of the Day: Mute the speakers! \|last=Chambers \|first=Rob \|date=August 18, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=September 3, 2015}}</ref> and for general natural language queries such as "What is the weather forecast?"<ref name="WeatherForecast">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/06/02/macro-of-the-day-tell-me-the-weather-forecast-for-redmond.aspx \|title=Macro of the Day: Tell me the weather forecast for Redmond \|last=Chambers \|first=Rob \|date=June 2, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=June 26, 2015}}</ref> "What time is it?"<ref name="ReadThat"/> and "What's the date?"<ref name="ReadThat"/> ~~Answers~~Responses to these ~~queries~~user inquiries are spoken ~~via~~back ato the user in the active [[Microsoft text-to-speech voices\|Microsoft text-to-speech ~~synthesizer~~voice]] installed on the machine. {\| class="wikitable mw-collapsible" style="margin-left: auto; margin-right: auto; border: none; font-size:80%; text-align: center;" \|- ! scope="col" \| Application or item ! scope="col" colspan="8" \| Sample macro phrases (''italics'' indicate substitutable words) \|- \| '''Microsoft Outlook''' \|\| Send email \| Send email to \| Send email to ''Makoto'' \| Send email to ''Makoto Yamagishi'' \| Send email to ''Makoto Yamagishi about'' \| Send email to ''Makoto Yamagishi about This week's meeting'' \| Refresh Outlook email contacts \|- \| '''Microsoft PowerPoint''' \|\| Next slide \| Previous slide \| Next \| Previous \| Go forward ''5'' slides \| Go back ''3'' slides \| Go to slide ''8'' \|- \| '''Windows Media Player''' \|\| Next track \| Previous song \| Play ''Beethoven'' \| Play something by ''Mozart'' \| Play the CD that has ''In the Hall of the Mountain King'' \| Play something written in ''1930'' \| Pause music \|- \| '''Microphones in Windows''' \|\| Microphone \| Switch microphone \| ''Microphone Array'' microphone \| Switch to ''Line'' \| Switch to ''Microphone Array'' \| Switch to ''Line'' microphone \| Switch to ''Microphone Array'' microphone \|- \| '''Volume levels in Windows''' \|\| Mute the speakers \| Unmute the speakers \| Turn off the audio \| Increase the volume \| Increase the volume by 2 times \| Decrease the volume by ''50'' \| Set the volume to ''66'' \|- \| ''' WSR Speech Dictionary''' \|\| Export the speech dictionary \| Add a pronunciation \| Add that [''selected text''] to the speech dictionary \| Block that [''selected text''] from the speech dictionary \| Remove that [''selected text''] \| [''Selected text''] sounds like... \| What does that [''selected text''] sound like? \|- \| '''Speech Synthesis''' \|\| Read that [''selected text''] \| Read the next 3 paragraphs \| Read the previous sentence \| Please stop reading \| What time is it? \| What's today's date? \| Tell me the weather forecast for ''Redmond'' \|- \|} Users and developers can create their own macros based on text transcription and substitution; application execution (with support for [[command-line interface#arguments\|command-line arguments]]); keyboard shortcuts; emulation of existing voice commands; or a combination of these items. [[~~extensible markup language\|~~XML]], [[JScript]] and [[VBScript]] are supported.<ref name="Modes"/> Macros can be limited to ~~individual~~specific applications ~~if desired~~<ref name="Application">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2008/06/30/making-a-speech-macro-application-specific.aspx \|title=Making a Speech macro Application Specific \|last=Chambers \|first=Rob \|date=June 30, 2008 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|~~accessdate~~access-date=September 3, 2015}}</ref> and rules for macros can be defined programmatically.<ref name="MicrosoftOutlook"/> For a macro to load, it must be stored in a ''Speech Macros'' folder within the active user's ''[[My Documents\|Documents]]'' directory. All macros are [[digital signature\|digitally signed]] by default if a [[public key certificate\|user certificate]] is available to ensure that stored commands are not altered or loaded by third-parties; if a certificate is not available, an administrator can create one.<ref name="WSRMacros">{{cite web \|url=http://download.microsoft.com/download/F/6/B/F6B71555-D73F-4273-9217-7D872D59BE31/Windows%20Speech%20Recognition%20Macros%20Release%20Notes.docx \|title=Windows Speech Recognition Macros Release Notes \|~~author~~date=2009 \|format=DOCX \|publisher=[[Microsoft]] \|~~date~~archive-url=~~2009~~https://web.archive.org/web/20110930121421/http://download.microsoft.com/download/F/6/B/F6B71555-D73F-4273-9217-7D872D59BE31/Windows%20Speech%20Recognition%20Macros%20Release%20Notes.docx \|~~format~~archive-date=~~DOCX~~September 30, 2011 \|~~accessdate~~access-date=~~June~~May 2815, ~~2015~~2020}}</ref> Configurable security levels can prohibit unsigned macros from being loaded; to prompt users to sign macros after creation; and to load unsigned macros.<ref name="Application"/> ==Performance== {{As of\|2017}} WSR uses Microsoft Speech Recognizer 8.0, the version introduced in Windows Vista. For dictation it was found to be 93.6% accurate without training by Mark Hachman, a Senior Editor of ''[[PC World]]''—a rate that is not as accurate as competing software. According to Microsoft, the rate of accuracy when trained is 99%. Hachman opined that Microsoft does not publicly discuss the feature because of the 2006 incident during the development of Windows Vista, with the result being that few users knew that documents could be dictated within Windows before the introduction of [[Cortana (virtual assistant)\|Cortana]].<ref name="MSR8"/> ==See also== * [[Braina]] * [[List of speech recognition software]] * [[Microsoft Cordless Phone System]]