Revision as of 19:20, 15 May 2020 edit Ian Wolfman (talk \| contribs) Extended confirmed users 915 edits →Overview and features: Minor edits for readability. ← Previous edit		Revision as of 19:28, 15 May 2020 edit undo Ian Wolfman (talk \| contribs) Extended confirmed users 915 edits Edits for consistency and readability, as well as a change to a citation as per the GA review. Next edit →
Line 16: ==History== Microsoft was involved in speech recognition and [[speech synthesis]] research for many years before WSR. In 1993, Microsoft hired [[Xuedong Huang]] from [[Carnegie Mellon University]] to lead its speech development efforts; the company's research led to the development of the [[Speech Application Programming Interface\|Speech API]] (SAPI) introduced in 1994.<ref name="TalkingWindowsVista">{{cite web \|url=http://msdn2.microsoft.com/en-us/magazine/cc163663.aspx \|title=Exploring New Speech Recognition And Synthesis APIs In Windows Vista \|last=Brown \|first=Robert \|publisher=[[Microsoft]] \|work=MSDN Magazine \|archiveurl=https://web.archive.org/web/20080307054756/http://msdn2.microsoft.com/en-us/magazine/cc163663.aspx \|archivedate=March 7, 2008 \|accessdate=June 26, 2015}}</ref> Speech recognition had also been used in previous Microsoft products. [[Office XP]] and [[Microsoft Office 2003\|Office 2003]] provided speech recognition capabilities among [[Internet Explorer]] and [[Microsoft Office]] applications;<ref name="SpeechXP">{{cite web \|url=https://support.microsoft.com/en-us/kb/306901 \|title=How To Use Speech Recognition in Windows XP \|publisher=[[Microsoft]] \|work=Windows Support \|archiveurl=https://web.archive.org/web/20150314222444/https://support.microsoft.com/en-us/kb/306901 \|archivedate=March 14, 2015 \|accessdate=May 15, 2020}}</ref> it also enabled limited speech functionality in [[Windows 98]], [[Windows ME]], [[Windows NT 4.0]], and [[Windows 2000]].<ref name="Description">{{cite web \|url=https://support.microsoft.com/en-us/kb/278927 \|title=Description of the speech recognition and handwriting recognition methods in Word 2002 \|publisher=[[Microsoft]] \|work=Windows Support \|archiveurl=https://web.archive.org/web/20150703125056/https://support.microsoft.com/en-us/kb/278927 \|archivedate=July 3, 2015 \|accessdate=March 26, 2018}}</ref> [[Windows XP]] [[Windows XP editions#Tablet PC Edition\|Tablet PC Edition]] 2002 included speech recognition capabilities with the Tablet PC Input Panel,<ref name="WindowsXPTabletPCEdition">{{cite web \|url=http://winsupersite.com/article/windows-xp2/windows-xp-tablet-pc-edition-reviewed-127413 \|title=Windows XP Tablet PC Edition Review \|last=Thurrott \|first=Paul \|authorlink=Paul Thurrott \|date=June 25, 2002 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|accessdate=June 26, 2015}}</ref><ref name="Natural">{{cite web \|url=http://download.microsoft.com/download/9/8/f/98f3fe47-dfc3-4e74-92a3-088782200fe7/TWDT05006_WinHEC05.ppt \|title=Natural Input On Mobile PC Systems \|last=Dresevic \|first=Bodin \|date=2005 \|publisher=[[Microsoft]] \|format=PPT \|accessdate=June 26, 2015}}</ref> and [[Microsoft Plus!#Microsoft Plus! for Windows XP\|Microsoft Plus! for Windows XP]] enabled voice commands for Windows Media Player.<ref name="VoiceCommand">{{cite web \|url=http://winsupersite.com/article/product-review/plus-for-windows-xp-review \|title=Plus! for Windows XP Review \|last=Thurrott \|first=Paul \|authorlink=Paul Thurrott \|date=October 6, 2010 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|accessdate=June 30, 2015}}</ref> However, these all required installation of speech recognition as a separate component; before Windows Vista, Windows did not include integrated or extensive speech recognition.<ref name="Natural"/> [[Microsoft Office 2007\|Office 2007]] and later versions rely on WSR for speech recognition services.<ref name="Office2007SR">{{cite web \|url=https://support.office.com/en-us/article/What-happened-to-speech-recognition-c6541b32-82df-4c18-bfa5-c411f45337d3 \|title=What happened to speech recognition? \|publisher=[[Microsoft]] \|work=Office Support \|archiveurl=https://web.archive.org/web/20161110044211/https://support.office.com/en-us/article/What-happened-to-speech-recognition-c6541b32-82df-4c18-bfa5-c411f45337d3 \|archivedate=November 10, 2016 \|accessdate=May 15, 2020}}</ref> ===Windows Vista=== Line 41: ==Overview and features== WSR allows a user to control the Windows [[desktop metaphor\|desktop]] [[user interface]] through voice commands. Applications—including most of those bundled with Windows—can be controlled through voice commands.<ref name="Guide"/> Users can dictate text within documents, email, and forms; control the operating system user interface; perform [[keyboard shortcut]]s; and move the [[cursor (computing)\|mouse cursor]].<ref name="CommonCommands">{{cite web \|url=http://windows.microsoft.com/en-us/windows/common-speech-recognition-commands#1TC=windows-vista \|title=Common commands in Speech Recognition \|publisher=[[Microsoft]] \|work=Windows Support \|accessdate=June 30, 2015}}</ref> WSR relies on ~~the Speech API developed by Microsoft~~SAPI,<ref name="TalkingWindowsVista"/> and third-party applications must support the Text Services Framework for dictation; the majority of integrated applications in Windows Vista can be controlled through speech.<ref name="Guide">{{cite web \|url=https://msdn.microsoft.com/en-us/library/bb530325.aspx \|title=Windows Vista Speech Recognition Step-by-Step Guide \|last=Phillips \|first=Todd \|date=2007 \|publisher=[[Microsoft]] \|work=[[MSDN]] \|accessdate=June 30, 2015}}</ref> [[American English\|English (U.S.)]], [[British English\|English (U.K.)]], [[French language\|French]], [[German language\|German]], [[Japanese language\|Japanese]], [[Mandarin Chinese]], and [[Spanish language\|Spanish]] are supported languages.<ref name="SpeechRecognition">{{cite web \|url=https://www.microsoft.com/enable/products/windowsvista/speech.aspx \|title=Windows Speech Recognition \|publisher=[[Microsoft]] \|work=Microsoft Accessibility \|archiveurl=https://web.archive.org/web/20070204044614/https://www.microsoft.com/enable/products/windowsvista/speech.aspx \|archivedate=February 4, 2007 \|accessdate=May 15, 2020}}</ref> When WSR is started for the first time, it presents an optional interactive step-by-step tutorial that users can commence to learn basic commands while adapting the recognizer to their specific voice characteristics;<ref name="Guide"/> the tutorial is estimated to require approximately 10 minutes to complete.<ref name="MSR8">{{cite web \|url=http://www.pcworld.com/article/3124761/windows/the-windows-weakness-no-one-mentions-speech-recognition.html \|title=The Windows weakness no one mentions: Speech recognition \|last=Hachman \|first=Mark \|date=May 10, 2017 \|publisher=[[International Data Group\|IDG]] \|work=[[PC World]] \|accessdate=March 28, 2018}}</ref> The accuracy of the recognizer increases through regular use, which adapts it to contexts, grammars, patterns, and vocabularies.<ref name="SpeechRecognition"~~>{{cite web \|url=https:~~/~~/www.microsoft.com/enable/products/windowsvista/speech.aspx \|title=Windows Speech Recognition \|publisher=[[Microsoft]] \|work=Microsoft Accessibility \|accessdate=June 26, 2015}}</ref~~><ref name="Privacy"/> Custom language models for the specific contexts, phonetics, and terminologies of users in particular occupational fields such as legal or medical are also supported.<ref name="CustomizedVocabularies">{{cite web \|url=https://blogs.msdn.microsoft.com/robch/2005/09/20/customized-speech-vocabularies-in-windows-vista/ \|title=Customized speech vocabularies in Windows Vista \|last=Chambers \|first=Rob \|date=September 20, 2005 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|accessdate=March 29, 2018}}</ref> With [[Windows Search]],<ref name="ThurrottAllchin">{{cite web \|url=http://www.itprotoday.com/jim-allchin-talks-windows-vista \|title=Jim Allchin Talks Windows Vista \|last=Thurrott \|first=Paul \|authorlink=Paul Thurrott \|date=October 6, 2010 \|publisher=[[Penton (company)\|Penton]] \|work=[[Windows IT Pro]] \|accessdate=March 29, 2018}}</ref> the recognizer also can optionally harvest text in documents, email, as well as handwritten [[tablet PC]] input to contextualize and disambiguate terms to improve accuracy; no information is sent to Microsoft.<ref name="Privacy">{{cite web \|url=http://download.microsoft.com/download/7/9/4/7945a146-fc32-48c2-8c14-83b1b36696e5/Windows%20Vista%20Privacy%20Statement.rtf \|title=Windows Vista Privacy Statement \|author=[[Microsoft]] \|date=2006 \|format=RTF \|accessdate=July 1, 2015}}</ref> WSR is a locally processed speech recognition platform; it does not rely on cloud computing for accuracy, dictation, or recognition.<ref name="MicrosoftPrivacyStatement">{{cite web \|url=https://privacy.microsoft.com/en-us/privacystatement \|title=Microsoft Privacy Statement \|publisher=[[Microsoft]] \|accessdate=May 12, 2020}}</ref> Speech profiles that store information about users are retained locally.<ref name="Privacy"/> Backups and transfers of profiles can be performed via [[Windows Easy Transfer]].<ref name="Transfer">{{cite web \|url=http://blogs.msdn.com/b/robch/archive/2007/02/15/transferring-windows-speech-recognition-profiles-from-one-machine-to-another.aspx \|title=Transferring Windows Speech Recognition profiles from one machine to another \|last=Chambers \|first=Rob \|date=February 15, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|accessdate=June 28, 2015}}</ref> Line 66: :: '''Keyboard shortcuts:''' "Press ''keyboard key''"; "Press ''{{Keypress\|SHIFT}}'' plus ''{{Keypress\|a}}''"; "Press capital ''{{Keypress\|b}}''." :: Keys that can be pressed without first giving the press command include: {{Keypress\|Backspace}}, {{Keypress\|Delete}}, {{Keypress\|End}}, {{Keypress\|Enter}}, {{Keypress\|Home}}, {{Keypress\|Page Down}}, {{Keypress\|Page Up}}, and {{Keypress\|Tab}}.<ref name="CommonCommands"/> :: '''Mouse commands:''' "Click"; "Click ''that''"; "Double-click"; "Double-click ''that''"; "Mark"; "Mark ''that''"; "Right-click"; "Right-click ''that''"; "[[Windows Speech Recognition#Mousegrid\|MouseGrid]]".<ref name="CommonCommands"/> :: '''Window management commands:''' "Close (alternatively maximize, minimize, or restore) window"; "Close ''that''"; "Close ''name of open application''"; "Switch applications"; "Switch to ''name of open application''"; "Scroll ''direction''"; "Scroll ''direction'' in ''number of pages''"; "Show desktop"; "[[Windows Speech Recognition#Show numbers\|Show Numbers]]."<ref name="CommonCommands"/> : '''Speech recognition commands:''' "Start listening"; "Stop listening"; "Show speech options"; "Open speech dictionary"; "Move speech recognition"; "Minimize speech recognition"; "Restore speech recognition".<ref name="CommonCommands"/> In the English language, applicable commands can be shown by speaking "What can I say?"<ref name="SpeechRecognition"/> Users can also query the recognizer about tasks in Windows by speaking "How do I ''task name''" (e.g., "How do I install a printer?") which opens related help documentation.<ref name="General Commands">{{cite web \|url=https://blogs.msdn.microsoft.com/robch/2007/03/12/windows-speech-recognition-general-commands/ \|title=Windows Speech Recognition: General commands \|last=Chambers \|first=Rob \|date=March 12, 2007 \|publisher=[[Microsoft]] \|work=[[Microsoft Developer Network\|MSDN]] \|accessdate=May 1, 2017}}</ref>

Windows Speech Recognition: Difference between revisions