Media Resource Control Protocol: Difference between revisions

Content deleted Content added
top: clean up, typo(s) fixed: , → , using AWB
No edit summary
 
(10 intermediate revisions by 7 users not shown)
Line 1:
{{short description|Audio communication protocol}}
{{primary sources|date=September 2011}}
'''Media Resource Control Protocol''' ('''MRCP''') is a communication protocol used by speech servers to provide various services (such as [[speech recognition]] and [[speech synthesis]]) to their clients. MRCP relies on another protocol, such as [[Real Time Streaming Protocol]] (RTSP) or [[Session Initiation Protocol]] ([[Session Initiation Protocol|SIP]]) for establishing a control session and audio streams between the client and the server.
 
MRCP uses a similar style of clear-text signaling as [[HTTP]] and many other Internet protocols, in which each message contains 3 sections: a first-line, a header and a body. The first line indicates the type of message as well as information such as response codes. The header contains a number of lines, each in the format '''<nowiki><header>: <data></nowiki>'''. The body, whose length is specified by the header, contains the details of the message.
 
Like HTTP, MRCP uses a request (usually issued by the client) and response model. Responses may simply acknowledge receipt of the request or give other information regarding its processing. For example, an MRCP client may request to send some audio [[data]] for processing (say, for speech recognition), to which the server could respond with a message containing a suitable [[port number]] to send the data, since MRCP does not have support for audio data specifically as this would have to be handled by some other protocol, such as [[Real-time Transport Protocol]] (RTP).
Line 8 ⟶ 9:
MRCP protocol version 2 has been approved as an [[Request for Comments|RFC]]. Version 2 uses [[Session Initiation Protocol|SIP]] for managing sessions and audio streams between the server and the clients, whereas version 1 did not specify the underlying protocol.
 
MRCP has been adopted by a wide range of commercial speech servers, such as [[Verbio Technologies]], [https://vernacular.ai/viva Skit.ai's VIVA], [[Microsoft Speech Server]], [[LumenVox Speech Engine]], [[NeospeechReadSpeaker]] SpeechspeechServer Engine/ServerMRCP, [[Nuance Communications|Nuance]] Recognizer and Vocalizer, Sestek TTS, Sestek Call Steering as well as commercial [[Interactive Voice Response]] software such as Blueworx Voice Response..
 
==External links==
* RFC{{IETF RFC|4463|link=no}}, A Media Resource Control Protocol (MRCP)
* RFC{{IETF RFC|6787|link=no}}, Media Resource Control Protocol version 2 (MRCPv2)
* [http://www.unimrcp.org UniMRCP], An open source cross-platform MRCP implementation