{{Short description|Multimedia information signaling}}
The [[Real-time Transport Protocol]] (RTP) specifies a general-purpose data format and [[network protocol]] for transmitting digital media streams on [[Internet Protocol]] (IP) networks. The details of media encoding, such as signal sampling rate, frame size and timing, are specified in an '''RTP payload format'''. The format parameters of the RTP payload are typically communicated between transmission endpoints with the [[Session Description Protocol]] (SDP), but other protocols, such as the [[XMPP|Extensible Messaging and Presence Protocol]] (XMPP), may be used.
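As an illustration of how SDP conveys these parameters, the following minimal Python sketch parses a single <code>a=rtpmap</code> attribute, whose general form is <code>a=rtpmap:&lt;payload type&gt; &lt;encoding name&gt;/&lt;clock rate&gt;[/&lt;encoding parameters&gt;]</code>. The names <code>RtpMap</code> and <code>parse_rtpmap</code> are hypothetical helpers chosen for this example, and the sketch ignores other SDP attributes such as <code>a=fmtp</code>.

```python
from typing import NamedTuple, Optional


class RtpMap(NamedTuple):
    """Illustrative container for one rtpmap entry (hypothetical name)."""
    payload_type: int
    encoding_name: str
    clock_rate: int
    channels: Optional[int]  # None when the optional third field is omitted


def parse_rtpmap(line: str) -> RtpMap:
    """Parse 'a=rtpmap:<pt> <name>/<clock rate>[/<encoding parameters>]'."""
    prefix = "a=rtpmap:"
    if not line.startswith(prefix):
        raise ValueError(f"not an rtpmap attribute: {line!r}")
    # Split "<pt> <encoding>" at the first space.
    pt_text, _, encoding = line[len(prefix):].strip().partition(" ")
    parts = encoding.strip().split("/")
    if len(parts) not in (2, 3):
        raise ValueError(f"malformed encoding description: {encoding!r}")
    payload_type = int(pt_text)
    if not 0 <= payload_type <= 127:
        raise ValueError(f"payload type out of range: {payload_type}")
    channels = int(parts[2]) if len(parts) == 3 else None
    return RtpMap(payload_type, parts[0], int(parts[1]), channels)


if __name__ == "__main__":
    print(parse_rtpmap("a=rtpmap:0 PCMU/8000"))
    print(parse_rtpmap("a=rtpmap:111 opus/48000/2"))
```

For audio formats the optional third field gives the number of channels; the sketch stores it as an integer and leaves it as <code>None</code> when absent.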
==
The technical parameters of payload formats for audio and video streams are standardised.
The standard also describes the process of registering new payload types with IANA.
* {{Sum RFC|3550|ref=yes}}
* {{Sum RFC|3551|ref=yes}}
* {{Sum RFC|3611|ref=yes}}
* {{Sum RFC|4856|ref=yes}}
==Text messaging payload types==
Payload formats and types for text messaging are defined in the following specifications:
* {{Sum RFC|4103|ref=yes|notes=yes}}
* {{Sum RFC|9071|ref=yes}}
==MIDI payload types==
Payload formats and types for [[MIDI]] are defined in the following specifications:
*{{Sum RFC|6295|ref=yes}}
*{{Sum RFC|4696|ref=yes}}
==Audio and video payload types==
Payload formats and types for audio and video are defined in the following specifications:
* {{Sum RFC|2029|ref=yes}}
* {{Sum RFC|2190|ref=yes}}
* {{Sum RFC|2198|ref=yes}}
* {{Sum RFC|2250|ref=yes}}
* {{Sum RFC|2343|ref=yes}}
* {{Sum RFC|2435|ref=yes}}
* {{Sum RFC|2586|ref=yes}}
* {{Sum RFC|2658|ref=yes}}
* {{Sum RFC|3190|ref=yes}}
* {{Sum RFC|3389|ref=yes}}
* {{Sum RFC|3497|ref=yes}}
* {{Sum RFC|3640|ref=yes}}
* {{Sum RFC|3952|ref=yes}}
* {{Sum RFC|4175|ref=yes}}
* {{Sum RFC|4184|ref=yes}}
* {{Sum RFC|4352|ref=yes}}
* {{Sum RFC|4587|ref=yes}}
* {{Sum RFC|4598|ref=yes}}
* {{Sum RFC|4629|ref=yes}}
* {{Sum RFC|4733|ref=yes}}
* {{Sum RFC|4749|ref=yes}}
* {{Sum RFC|4788|ref=yes}}
* {{Sum RFC|4867|ref=yes}}
* {{Sum RFC|5188|ref=yes}}
* {{Sum RFC|5215|ref=yes}}
* {{Sum RFC|5371|ref=yes}}
* {{Sum RFC|5391|ref=yes}}
* {{Sum RFC|5404|ref=yes}}
* {{Sum RFC|5574|ref=yes}}
* {{Sum RFC|5577|ref=yes}}
* {{Sum RFC|5584|ref=yes}}
* {{Sum RFC|5686|ref=yes}}
* {{Sum RFC|5993|ref=yes}}
* {{Sum RFC|6184|ref=yes}}
* {{Sum RFC|6190|ref=yes}}
* {{Sum RFC|6416|ref=yes}}
* {{Sum RFC|6469|ref=yes}}
* {{Sum RFC|7310|ref=yes}}
* {{Sum RFC|7587|ref=yes}}
* {{Sum RFC|7741|ref=yes}}
* {{Sum RFC|7798|ref=yes}}
* {{Sum RFC|9134|ref=yes}}
* {{Sum RFC|9607|ref=yes}}
* {{Sum RFC|9628|ref=yes}}
Payload identifiers 96–127 are used for payloads defined dynamically during a session. It is recommended to dynamically
Applications should always support PCMU (payload type 0)
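The numeric ranges involved can be summarised in a short Python sketch. The function name <code>classify_payload_type</code> is a hypothetical helper, and the labels are descriptive only; the authoritative assignment of each number below 72 is the one given in the table and in the IANA registry.

```python
def classify_payload_type(pt: int) -> str:
    """Return a coarse label for an RTP payload type number (illustrative)."""
    if not 0 <= pt <= 127:
        raise ValueError("the RTP payload type is a 7-bit field (0-127)")
    if pt >= 96:
        return "dynamic"          # 96-127: defined per session
    if pt >= 77:
        return "unassigned"       # 77-95
    if pt >= 72:
        return "reserved"         # 72-76: would collide with RTCP packet types
    return "static or unassigned" # 0-71: consult the table for the exact entry


if __name__ == "__main__":
    for pt in (0, 72, 79, 96, 127):
        print(pt, classify_payload_type(pt))
```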
{| class="wikitable sortable"
!No. of channels
!Clock rate (Hz)<ref group=note>The "clock rate" is the rate at which the timestamp in the RTP header is incremented, which need not be the same as the codec's sampling rate. For instance, video codecs typically use a clock rate of 90000 so their frames can be more precisely aligned with the RTCP NTP timestamp, even though video sampling rates are typically in the range of 1–60 samples per second.</ref>
!Frame size (
!Default packet
!Description
!References
|20
|ITU-T [[G.711]] PCM [[μ-Law]] audio 64 kbit/s
|
|-
|
|reserved, previously [[FS-1016]] [[CELP]] audio 4.8 kbit/s
|{{IETF RFC
|-
|
|reserved, previously ITU-T [[G.721]] [[ADPCM]] audio 32 kbit/s or ITU-T [[G.726]] audio 32 kbit/s
|{{IETF RFC
|-
|20
|European [[GSM-FR|GSM Full Rate]] audio 13 kbit/s (GSM 06.10)
|
|-
|30
|ITU-T [[G.723.1]] audio
|
|-
|20
|[[Interactive Multimedia Association|IMA]] [[ADPCM]] audio 32 kbit/s
|
|-
|20
|[[Interactive Multimedia Association|IMA]] [[ADPCM]] audio 64 kbit/s
|
|-
|20
|Experimental [[Linear predictive coding|Linear Predictive Coding]] audio 5.6 kbit/s
|
|-
|20
|ITU-T G.711 PCM [[A-Law]] audio 64 kbit/s
|
|-
|audio
|1
|8000{{refn|group=note|Although the sampling rate for G.722 is 16000, its clock rate is 8000 to remain backwards compatible with RFC 1890, which incorrectly used this value.}}
|any
|20
|ITU-T [[G.722]] audio 64 kbit/s
|{{IETF RFC|3551|link=no}}
|-
|any
|20
|[[Linear PCM]] 16-bit Stereo audio 1411.2 kbit/s,{{Ref RFC|2586}}{{Ref RFC|3108|rp=62}}{{Ref RFC|4856|rp=18}} uncompressed
|{{IETF RFC|3551|link=no}}{{Rp|p=27}}
|-
|20
|[[Linear PCM]] 16-bit audio 705.6 kbit/s, uncompressed
|{{IETF RFC|3551|link=no}}{{Rp|p=27}}
|-
|20
|[[QCELP|Qualcomm Code Excited Linear Prediction]]
|{{IETF RFC|2658|link=no}}, {{IETF RFC|3551|link=no}}{{Rp|p=28}}
|-
|
|
|[[Comfort noise]]. Payload type used with audio codecs that do not support comfort noise as part of the codec itself such as [[G.711]], [[G.722.1]], [[G.722]], [[G.726]], [[G.727]], [[G.728]], [[GSM 06.10]], [[Siren (codec)|Siren]], and
|
|-
|
|[[MPEG-1]] or [[MPEG-2]] audio only
|{{IETF RFC|2250|link=no}}, {{IETF RFC|3551|link=no}}
|-
|20
|ITU-T [[G.728]] audio 16 kbit/s
|
|-
|20
|[[Interactive Multimedia Association|IMA]] [[ADPCM]] audio 44.1 kbit/s
|
|-
|20
|IMA ADPCM audio 88.2 kbit/s
|
|-
|20
|ITU-T [[G.729]] and G.729a audio 8 kbit/s; Annex B is implied unless the <code>annexb=no</code> parameter is used
|{{IETF RFC|3551|link=no}},{{Rp|p=20}} {{IETF RFC|4856|link=no}}{{Rp|p=12}}
|-
|
|reserved, previously [[comfort noise]]
|
|-
|25
|CELLB
|video
|
|
|[[Sun Microsystems|Sun]] CellB video<ref>[https://docs.oracle.com/cd/E19504-01/802-5863/802-5863.pdf XIL Programmer's Guide], Chapter 22 "CellB Codec". August 1997. Retrieved on 2014-07-19.</ref>
|
|-
|
|[[JPEG]] video
|
|-
|
|
|[[Xerox PARC]]'s Network Video (nv)<ref>[https://www.cs.columbia.edu/~hgs/rtp/nv.html nv - network video on Henning Schulzrinne's website], [https://www.dgp.toronto.edu/tp/techdocs/NetVid.html Network Video on The University of Toronto's website], Retrieved on 2009-07-09.</ref><ref>[https://github.com/ronf/nv Ron Frederick's GitHub repository with source code]</ref>
|{{IETF RFC|3551|link=no}}{{Rp|p=32}}
|-
|
|ITU-T [[H.261]] video
|
|-
|
|MPEG-1 and MPEG-2 video
|
|-
|
|MPEG-2 [[MPEG transport stream|transport stream]]
|
|-
|
|[[H.263]] video, first version (1996)
|{{IETF RFC|2190|link=no}}, {{IETF RFC|3551|link=no}}
|-
|
|reserved because RTCP packet types 200–204 would otherwise be indistinguishable from RTP payload types 72–76 with the marker bit set
|
|-
|77–95
|unassigned
|
|
|
|
|
|note that RTCP packet type 207 (XR, Extended Reports) would be indistinguishable from RTP payload type 79 with the marker bit set
|{{IETF RFC|3551|link=no}}, {{IETF RFC|3611|link=no}}
|-
|
|[[H.263]] video, second version (1998)
|{{IETF RFC|2190|link=no}}, {{IETF RFC|3551|link=no}}, {{IETF RFC|4629|link=no}}
|-
|
|[[H.263]] video, third version (2000)
|
|-
|
|[[H.264]] video (MPEG-4 Part 10)
|{{IETF RFC
|-
|
|[[H.264]] video
|
|-
|
|[[H.265]] video (HEVC)
|
|-
|20, 30
|[[iLBC|Internet low Bitrate Codec]] 13.33 or 15.2 kbit/s
|
|-
|
|ITU-T [[G.711|G.711.1]] A-law
|
|-
|
|ITU-T [[G.711|G.711.1]] μ-law
|
|-
|
|ITU-T [[G.719]]
|
|-
|
|ITU-T [[G.722.1]] and G.722.1 Annex C
|
|-
|20
|ITU-T [[G.726]] audio 16 kbit/s
|
|-
|20
|ITU-T G.726 audio 24 kbit/s
|
|-
|20
|ITU-T G.726 audio 32 kbit/s
|
|-
|20
|ITU-T G.726 audio 40 kbit/s
|
|-
|20
|ITU-T [[G.729]] Annex D
|
|-
|20
|ITU-T [[G.729]] Annex E
|
|-
|
|ITU-T [[G.729.1]]
|
|-
|20
|ETSI [[GSM-EFR]] (GSM 06.60)
|
|-
|
|ETSI [[Half Rate|GSM-HR]] (GSM 06.20)
|
|-
|
|[[Adaptive Multi-Rate]] audio
|
|-
|
|[[AMR-WB|Adaptive Multi-Rate Wideband]] audio (ITU-T G.722.2)
|
|-
|
|[[AMR-WB+|Extended Adaptive Multi Rate – WideBand]] audio
|
|-
|
|[[Vorbis]] audio
|
|-
|20
|[[Opus (audio format)|Opus]] audio
|
|-
|
|[[Speex]] audio
|
|-
|
|Loss-Tolerant [[MP3]] audio
|{{IETF RFC
|-
|
|
|[[MPEG-4 Audio]] (includes [[Advanced_Audio_Coding|AAC]])
|{{IETF RFC
|-
|
|[[MPEG-4 Visual]]
|{{IETF RFC
|-
|
|[[MPEG-4]] Elementary Streams
|
|-
|
|[[VP8]] video
|
|-
|
|[[VP9]] video
|{{IETF RFC|9628|link=no}}
|-
|dynamic
|AV1
|video
|
|90000
|
|
|[[AV1]] video
|[https://aomediacodec.github.io/av1-rtp-spec/ av1-rtp-spec]
|-
|20
|[[Linear PCM]] 8-bit audio with 128 offset
|{{IETF RFC
|-
|20 (by analogy with L16)
|IEC 61119 12-bit nonlinear audio
|{{IETF RFC
|-
|20
|[[Linear PCM]] 16-bit audio
|{{IETF RFC
|-
|20 (by analogy with L16)
|[[Linear PCM]] 20-bit audio
|{{IETF RFC
|-
|20 (by analogy with L16)
|[[Linear PCM]] 24-bit audio
|{{IETF RFC
|-
|
|Uncompressed Video
|
|-
|
|[[Dolby AC-3]] audio
|
|-
|
|[[Dolby Digital Plus|Enhanced AC-3]] audio
|
|-
|
|[[Text over IP]]
|
|-
|dynamic
|EVRC<br>EVRC0<br>EVRC1
|audio
|
|[[Enhanced Variable Rate Codec|EVRC]] audio
|
|-
|dynamic
|
|[[Enhanced Variable Rate Codec B|EVRC-B]] audio
|
|-
|
|[[Enhanced Variable Rate Codec B|EVRC-WB]] audio
|
|-
|
|[[JPEG 2000]] video
|
|-
|
|[[UEMCLIP]] audio
|
|-
|
|[[Adaptive Transform Acoustic Coding|ATRAC]]3 audio
|
|-
|
|[[Adaptive Transform Acoustic Coding|ATRAC]]3+ audio
|
|-
|
|[[Adaptive Transform Acoustic Coding|ATRAC]] Advanced Lossless audio
|
|-
|
|
|[[DV (video format)|DV]] video
|{{IETF RFC
|-
|
|[[ITU-R BT.656]] video
|
|-
|
|Bundled MPEG-2 video
|
|-
|
|[[SMPTE 292M]] video
|
|-
|
|Redundant Audio Data
|
|-
|
|Variable-rate DVI4 audio
|
|-
|
|MPEG-1 Systems Streams video
|
|-
|
|MPEG-2 Program Streams video
|
|-
|
|tone
|
|-
|
|[[Dual-tone multi-frequency signaling|DTMF]] tone
|
|-
|4<ref group=note>For aptX, the packetization interval must be rounded down to the nearest packet interval that can contain an integer number of samples. So at sampling rates of 11025, 22050, or 44100, a packetization rate of "4" is rounded down to 3.99.</ref>
|[[aptX]] audio
|
|-
|dynamic
|jxsv
|video
|
|90000
|
|
|[[JPEG XS]] video
|{{IETF RFC|9134|link=no}}
|-
|dynamic
|scip
|audio/video
|
|8000 or 90000
|
|
|[[Secure Communications Interoperability Protocol|SCIP]]
|{{IETF RFC|9607|link=no}}
|}
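The reservation of payload types 72–76, and the remark on payload type 79, follow from the layout of the second header byte: in RTP it carries the marker bit in its most significant bit followed by the 7-bit payload type, while in RTCP the same byte carries the 8-bit packet type. The following Python sketch illustrates the overlap; the helper names and the abbreviated RTCP type table are assumptions made for this example.

```python
# Subset of RTCP packet types relevant to the table rows above.
RTCP_PACKET_TYPES = {
    200: "SR",
    201: "RR",
    202: "SDES",
    203: "BYE",
    204: "APP",
    207: "XR",
}


def second_header_byte(payload_type: int, marker: bool) -> int:
    """Second byte of an RTP header: marker bit (MSB) plus 7-bit payload type."""
    if not 0 <= payload_type <= 127:
        raise ValueError("payload type must fit in 7 bits")
    return (int(marker) << 7) | payload_type


def conflicting_rtcp_type(payload_type: int):
    """Name of the RTCP packet type this payload type mimics with the
    marker bit set, or None if there is no conflict in the subset above."""
    return RTCP_PACKET_TYPES.get(second_header_byte(payload_type, marker=True))


if __name__ == "__main__":
    for pt in (72, 76, 79, 96):
        print(pt, second_header_byte(pt, True), conflicting_rtcp_type(pt))
```

With the marker bit set, payload type 72 yields the byte value 200 (the RTCP sender report), and payload type 79 yields 207 (extended reports), which is why a receiver that multiplexes RTP and RTCP on one port could not tell them apart.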
{{reflist|group=note}}
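The arithmetic behind the clock-rate and aptX notes can be checked with a short Python sketch. It assumes that the RTP timestamp advances by the clock rate multiplied by the packet duration, and that the aptX interval is rounded down to a whole number of samples; the function names are hypothetical.

```python
from fractions import Fraction


def timestamp_increment(clock_rate_hz: int, packet_ms) -> Fraction:
    """RTP timestamp ticks covered by one packet of the given duration."""
    return Fraction(clock_rate_hz) * Fraction(packet_ms) / 1000


def rounded_down_interval(sample_rate_hz: int, nominal_ms: int):
    """Largest whole number of samples fitting in nominal_ms, and the
    resulting packet interval in milliseconds."""
    samples = (sample_rate_hz * nominal_ms) // 1000  # whole samples only
    return samples, samples * 1000 / sample_rate_hz


if __name__ == "__main__":
    # G.722 samples at 16000 Hz but uses an 8000 Hz RTP clock:
    print(timestamp_increment(8000, 20))                    # ticks per 20 ms packet
    # Video with a 90000 Hz clock at 30 frames per second:
    print(timestamp_increment(90000, Fraction(1000, 30)))   # ticks per frame
    # aptX at 44100 Hz with a nominal 4 ms interval:
    print(rounded_down_interval(44100, 4))
```

At 44100 Hz a nominal 4 ms interval would hold 176.4 samples, so it is rounded down to 176 samples, or about 3.99 ms, in line with the aptX note.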
==See also==
* [[Session Initiation Protocol]]
* [[
* [[Comparison of audio coding formats]]