One document matched: draft-ietf-avtcore-rtp-multi-stream-09.xml
<?xml version="1.0" encoding="US-ASCII"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<?rfc toc="yes" ?>
<?rfc strict="yes" ?>
<?rfc compact="yes" ?>
<?rfc sortrefs="yes" ?>
<?rfc symrefs="yes" ?>
<rfc category="std" docName="draft-ietf-avtcore-rtp-multi-stream-09"
ipr="trust200902" updates="3550, 4585">
<front>
<title abbrev="Multiple Media Streams in an RTP Session">Sending Multiple
Media Streams in a Single RTP Session</title>
<author fullname="Jonathan Lennox" initials="J." surname="Lennox">
<organization abbrev="Vidyo">Vidyo, Inc.</organization>
<address>
<postal>
<street>433 Hackensack Avenue</street>
<street>Seventh Floor</street>
<city>Hackensack</city>
<region>NJ</region>
<code>07601</code>
<country>USA</country>
</postal>
<email>jonathan@vidyo.com</email>
</address>
</author>
<author fullname="Magnus Westerlund" initials="M." surname="Westerlund">
<organization>Ericsson</organization>
<address>
<postal>
<street>Farogatan 2</street>
<city>SE-164 80 Kista</city>
<country>Sweden</country>
</postal>
<phone>+46 10 714 82 87</phone>
<email>magnus.westerlund@ericsson.com</email>
</address>
</author>
<author fullname="Qin Wu" initials="Q." surname="Wu">
<organization>Huawei</organization>
<address>
<postal>
<street>101 Software Avenue, Yuhua District</street>
<city>Nanjing, Jiangsu 210012</city>
<country>China</country>
</postal>
<email>sunseawq@huawei.com</email>
</address>
</author>
<author fullname="Colin Perkins" initials="C. " surname="Perkins">
<organization>University of Glasgow</organization>
<address>
<postal>
<street>School of Computing Science</street>
<city>Glasgow</city>
<code>G12 8QQ</code>
<country>United Kingdom</country>
</postal>
<email>csp@csperkins.org</email>
</address>
</author>
<date />
<area>RAI</area>
<workgroup>AVTCORE</workgroup>
<keyword>I-D</keyword>
<keyword>Internet-Draft</keyword>
<!-- TODO: more keywords -->
<abstract>
<t>This memo expands and clarifies the behaviour of Real-time Transport
Protocol (RTP) endpoints that use multiple synchronization sources
(SSRCs). This occurs, for example, when an endpoint sends multiple media
streams in a single RTP session. This memo updates RFC 3550 with regards
to handling multiple SSRCs per endpoint in RTP sessions, with a
particular focus on RTCP behaviour. It also updates RFC 4585 to update
and clarify the calculation of the timeout of SSRCs and the inclusion of
feedback messages.</t>
</abstract>
</front>
<middle>
<section anchor="introduction" title="Introduction">
<t>At the time the <xref target="RFC3550">Real-Time Transport Protocol
(RTP)</xref> was originally designed, and for quite some time after,
endpoints in RTP sessions typically only transmitted a single media
stream, and thus used a single synchronization source (SSRC) per RTP
session, where separate RTP sessions were typically used for each
distinct media type. Recently, however, a number of scenarios have
emerged in which endpoints wish to send multiple RTP media streams,
distinguished by distinct RTP synchronization source (SSRC) identifiers,
in a single RTP session. These are outlined in <xref
target="usecases"></xref>. Although the initial design of RTP did
consider such scenarios, the specification was not consistently written
with such use cases in mind. The specification is thus somewhat unclear
in places.</t>
<t>This memo updates <xref target="RFC3550"></xref> to clarify behaviour
in use cases where endpoints use multiple SSRCs. It also updates <xref
target="RFC4585"></xref> to resolve problems with regards to timeout of
inactive SSRCs, and to clarify behaviour around inclusion of feedback
messages.</t>
</section>
<section title="Terminology">
<t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
"OPTIONAL" in this document are to be interpreted as described in <xref
target="RFC2119">RFC 2119</xref> and indicate requirement levels for
compliant implementations.</t>
</section>
<section anchor="usecases" title="Use Cases For Multi-Stream Endpoints">
<t>This section discusses several use cases that have motivated the
development of endpoints that sends RTP data using multiple SSRCs in a
single RTP session.</t>
<section anchor="telepresence"
title="Endpoints with Multiple Capture Devices">
<t>The most straightforward motivation for an endpoint to send
multiple simultaneous RTP streams in a single RTP session is when an
endpoint has multiple capture devices, and hence can generate multiple
media sources, of the same media type and characteristics. For
example, telepresence systems of the type described by the CLUE
Telepresence Framework <xref target="I-D.ietf-clue-framework"></xref>
often have multiple cameras or microphones covering various areas of a
room, and hence send several RTP streams of each type within a single
RTP session.</t>
</section>
<section title="Multiple Media Types in a Single RTP Session">
<t>Recent work has updated <xref
target="I-D.ietf-avtcore-multi-media-rtp-session"> RTP</xref> and
<xref target="I-D.ietf-mmusic-sdp-bundle-negotiation"> SDP</xref> to
remove the historical assumption in RTP that media sources of
different media types would always be sent on different RTP sessions.
In this work, a single endpoint's audio and video RTP media streams
(for example) are instead sent in a single RTP session to reduce the
number of transport layer flows used.</t>
</section>
<section title="Multiple Stream Mixers">
<t>There are several RTP topologies which can involve a central device
that itself generates multiple RTP media streams in a session. An
example is a mixer providing centralized compositing for a
multi-capture scenario like that described in <xref
target="telepresence"></xref>. In this case, the centralized node is
behaving much like a multi-capturer endpoint, generating several
similar and related sources.</t>
<t>A more complex example is the selective forwarding middlebox,
described in Section 3.7 of <xref
target="I-D.ietf-avtcore-rtp-topologies-update"></xref>. This is a
middlebox that receives media streams from several endpoints, and then
selectively forwards modified versions of some RTP streams toward the
other endpoints to which it is connected. For each connected endpoint,
a separate media source appears in the session for every other source
connected to the middlebox, "projected" from the original streams, but
at any given time many of them can appear to be inactive (and thus are
receivers, not senders, in RTP). This sort of device is closer to
being an RTP mixer than an RTP translator, in that it terminates RTCP
reporting about the mixed streams, and it can re-write SSRCs,
timestamps, and sequence numbers, as well as the contents of the RTP
payloads, and can turn sources on and off at will without appearing to
be generating packet loss. Each projected stream will typically
preserve its original RTCP source description (SDES) information.</t>
</section>
<section title="Multiple SSRCs for a Single Media Source">
<t>There are also several cases where multiple SSRCs can be used to
send data from a single media source within a single RTP session.
These include, but are not limited to, transport robustness tools,
such as the RTP retransmission payload format <xref
target="RFC4588"></xref>, that require one SSRC to be used for the
media data and another SSRC for the repair data. Similarly, some
layered media encoding schemes, for example H.264 SVC <xref
target="RFC6190"></xref>, can be used in a configuration where each
layer is sent using a different SSRC within a single RTP session.</t>
</section>
</section>
<section title="Use of RTP by endpoints that send multiple media streams">
<t>RTP is inherently a group communication protocol. Each endpoint in an
RTP session will use one or more SSRCs, as will some types of RTP level
middlebox. Accordingly, unless restrictions on the number of SSRCs have
been signalled, RTP endpoints can expect to receive RTP data packets
sent using a number of different SSRCs, within a single RTP session.
This can occur irrespective of whether the RTP session is running over a
point-to-point connection or a multicast group, since middleboxes can be
used to connect multiple transport connections together into a single
RTP session (the RTP session is defined by the shared SSRC space, not by
the transport connections). Furthermore, if RTP mixers are used, some
SSRCs might only be visible in the contributing source (CSRC) list of an
RTP packet and in RTCP, and might not appear directly as the SSRC of an
RTP data packet.</t>
<t>Every RTP endpoint will have an allocated share of the available
session bandwidth, as determined by signalling and congestion control.
The endpoint MUST keep its total media sending rate within this share.
However, endpoints that send multiple media streams do not necessarily
need to subdivide their share of the available bandwidth independently
or uniformly to each media stream and its SSRCs. In particular, an
endpoint can vary the bandwidth allocation to different streams
depending on their needs, and can dynamically change the bandwidth
allocated to different SSRCs (for example, by using a variable rate
codec), provided the total sending rate does not exceed its allocated
share. This includes enabling or disabling media streams, or their
redundancy streams, as more or less bandwidth becomes available.</t>
</section>
<section title="Use of RTCP by Endpoints that send multiple media streams">
<t>The RTP Control Protocol (RTCP) is defined in Section 6 of <xref
target="RFC3550"></xref>. The description of the protocol is phrased in
terms of the behaviour of "participants" in an RTP session, under the
assumption that each endpoint is a participant with a single SSRC.
However, for correct operation in cases where endpoints have multiple
SSRC values, the specification MUST be interpreted as each SSRC counting
as a separate participant in the RTP session, so that an endpoint that
has multiple SSRCs counts as multiple participants.</t>
<section anchor="reporting" title="RTCP Reporting Requirement">
<t>An RTP endpoint that has multiple SSRCs MUST treat each SSRC as a
separate participant in the RTP session. Each SSRC will maintain its
own RTCP-related state information, and hence will have its own RTCP
reporting interval that determines when it sends RTCP reports. If the
mechanism in <xref
target="I-D.ietf-avtcore-rtp-multi-stream-optimisation"></xref> is not
used, then each SSRC will send RTCP reports for all other SSRCs,
including those co-located at the same endpoint.</t>
<t>If the endpoint has some SSRCs that are sending data and some that
are only receivers, then they will receive different shares of the
RTCP bandwidth and calculate different base RTCP reporting intervals.
Otherwise, all SSRCs at an endpoint will calculate the same base RTCP
reporting interval. The actual reporting intervals for each SSRC are
randomised in the usual way, but reports can be aggregated as
described in <xref target="compound"></xref>.</t>
</section>
<section anchor="sec-initial-reporting-interval"
title="Initial Reporting Interval">
<t>When a participant joins a unicast session, the following text from
Section 6.2 of <xref target="RFC3550"></xref> is relevant: "For
unicast sessions... the delay before sending the initial compound RTCP
packet MAY be zero." The basic assumption is that this also ought to
apply in the case of multiple SSRCs. Caution has to be exercised,
however, when an endpoint (or middlebox) with a large number of SSRCs
joins a unicast session, since immediate transmission of many RTCP
reports can create a significant burst of traffic, leading to
transient congestion and packet loss due to queue overflows.</t>
<t>To ensure that the initial burst of traffic generated by an RTP
endpoint is no larger than would be generated by a TCP connection, an
RTP endpoint MUST NOT send more than four compound RTCP packets with
zero initial delay when it joins an RTP session, independently of the
number of SSRCs used by the endpoint. Each of those initial compound
RTCP packets MAY include aggregated reports from multiple SSRCs,
provided the total compound RTCP packet size does not exceed the MTU,
and the avg_rtcp_size is maintained as in <xref
target="avg_rtcp_size"></xref>. Aggregating reports from several SSRCs
in the initial compound RTCP packets allows a substantial number of
SSRCs to report immediately. Endpoints SHOULD prioritize reports on
SSRCs that are likely to be most immediately useful, e.g., for SSRCs
that are initially senders.</t>
<t>An endpoint that needs to report on more SSRCs than will fit into
the four compound RTCP reports that can be sent immediately MUST send
the other reports later, following the usual RTCP timing rules
including timer reconsideration. Those reports MAY be aggregated as
described in <xref target="compound"></xref>.</t>
<t><list style="empty">
<t>Note: The above is based on an TCP initial window of 4 packets,
not the larger TCP initial windows for which there is an ongoing
experiment. The reason for this is a desire to be conservative,
since an RTP endpoint will also in many cases start sending RTP
data packets at the same time as these initial RTCP packets are
sent.</t>
</list></t>
</section>
<section anchor="compound"
title="Aggregation of Reports into Compound RTCP Packets">
<t>As outlined in <xref target="reporting"></xref>, an endpoint with
multiple SSRCs has to treat each SSRC as a separate participant when
it comes to sending RTCP reports. This will lead to each SSRC sending
a compound RTCP packet in each reporting interval. Since these packets
are coming from the same endpoint, it might reasonably be expected
that they can be aggregated to reduce overheads. Indeed, Section 6.1
of <xref target="RFC3550"></xref> allows RTP translators and mixers to
aggregate packets in similar circumstances:</t>
<t><list style="empty">
<t>"It is RECOMMENDED that translators and mixers combine
individual RTCP packets from the multiple sources they are
forwarding into one compound packet whenever feasible in order to
amortize the packet overhead (see Section 7). An example RTCP
compound packet as might be produced by a mixer is shown in Fig.
1. If the overall length of a compound packet would exceed the MTU
of the network path, it SHOULD be segmented into multiple shorter
compound packets to be transmitted in separate packets of the
underlying protocol. This does not impair the RTCP bandwidth
estimation because each compound packet represents at least one
distinct participant. Note that each of the compound packets MUST
begin with an SR or RR packet."</t>
</list></t>
<t>This allows RTP translators and mixers to generate compound RTCP
packets that contain multiple SR or RR packets from different SSRCs,
as well as any of the other packet types. There are no restrictions on
the order in which the RTCP packets can occur within the compound
packet, except the regular rule that the compound RTCP packet starts
with an SR or RR packet. Due to this rule, correctly implemented RTP
endpoints will be able to handle compound RTCP packets that contain
RTCP packets relating to multiple SSRCs.</t>
<t>Accordingly, endpoints that use multiple SSRCs can aggregate the
RTCP packets sent by their different SSRCs into compound RTCP packets,
provided 1) the resulting compound RTCP packets begin with an SR or RR
packet; 2) they maintain the average RTCP packet size as described in
<xref target="avg_rtcp_size"></xref>; and 3) they schedule packet
transmission and manage aggregation as described in <xref
target="agg"></xref>.</t>
<section anchor="avg_rtcp_size" title="Maintaining AVG_RTCP_SIZE">
<t>The RTCP scheduling algorithm in <xref target="RFC3550"></xref>
works on a per-SSRC basis. Each SSRC sends a single compound RTCP
packet in each RTCP reporting interval. When an endpoint uses
multiple SSRCs, it is desirable to aggregate the compound RTCP
packets sent by its SSRCs, reducing the overhead by forming a larger
compound RTCP packet. This aggregation can be done as described in
<xref target="agg"></xref>, provided the average RTCP packet size
calculation is updated as follows.</t>
<t>Participants in an RTP session update their estimate of the
average RTCP packet size (avg_rtcp_size) each time they send or
receive an RTCP packet (see Section 6.3.3 of <xref
target="RFC3550"></xref>). When a compound RTCP packet that contains
RTCP packets from several SSRCs is sent or received, the
avg_rtcp_size estimate for each SSRC that is reported upon is
updated using div_packet_size rather than the actual packet
size:</t>
<figure>
<artwork><![CDATA[
avg_rtcp_size = (1/16) * div_packet_size + (15/16) * avg_rtcp_size
]]></artwork>
</figure>
<t>where div_packet_size is packet_size divided by the number of
SSRCs reporting in that compound packet. The number of SSRCs
reporting in a compound packet is determined by counting the number
of different SSRCs that are the source of Sender Report (SR) or
Receiver Report (RR) RTCP packets within the compound RTCP packet.
Non-compound RTCP packets (i.e., RTCP packets that do not contain an
SR or RR packet <xref target="RFC5506"></xref>) are considered to
report on a single SSRC.</t>
<t>An SSRC that doesn't follow the above rule, and instead uses the
full RTCP compound packet size to calculate avg_rtcp_size, will
derive an RTCP reporting interval that is overly large by a factor
that is proportional to the number of SSRCs aggregated into compound
RTCP packets and the size of set of SSRCs being aggregated relative
to the total number of participants. This increased RTCP reporting
interval can cause premature timeouts if it is more than five times
the interval chosen by the SSRCs that understand compound RTCP that
aggregate reports from many SSRCs. A 1500 octet MTU can fit five
typical size reports into a compound RTCP packet, so this is a real
concern if endpoints aggregate RTCP reports from multiple SSRCs.</t>
<t>The issue raised in the previous paragraph is mitigated by the
modification in timeout behaviour specified in <xref
target="sec-timeout"></xref> of this memo. This mitigation is in
place in those cases where the RTCP bandwidth is sufficiently high
that an endpoint, using avg_rtcp_size calculated without taking into
account the number of reporting SSRCs, can transmit more frequently
than approximately every 5 seconds. Note, however, that the
non-updated endpoint's RTCP reporting is still negatively impacted
even if the premature timeout of its SSRCs are avoided. If
compatibility with non-updated endpoints is a concern, the number of
reports from different SSRCs aggregated into a single compound RTCP
packet SHOULD either be limited to two reports, or aggregation ought
not used at all. This will limit the non-updated endpoint's RTCP
reporting interval to be no larger than twice the RTCP reporting
interval that would be chosen by an endpoint following this
specification.</t>
</section>
<section anchor="agg"
title="Scheduling RTCP when Aggregating Multiple SSRCs">
<t>This section revises and extends the behaviour defined in Section
6.3 of <xref target="RFC3550"></xref>, and in Section 3.5.3 of <xref
target="RFC4585"></xref> if the RTP/AVPF profile or the RTP/SAVPF
profile is used, regarding actions to take when scheduling and
sending RTCP packets where multiple reporting SSRCs are aggregating
their RTCP packets into the same compound RTCP packet. These changes
to the RTCP scheduling rules are needed to maintain important RTCP
timing properties, including the inter-packet distribution, and the
behaviour during flash joins and other changes in session
membership.</t>
<t>The variables tn, tp, tc, T, and Td used in the following are
defined in Section 6.3 of <xref target="RFC3550"></xref>. The
variables T_rr_interval and T_rr_last are defined in <xref
target="RFC4585"></xref>.</t>
<t>Each endpoint MUST schedule RTCP transmission independently for
each of its SSRCs using the regular calculation of tn for the RTP
profile being used. Each time the timer tn expires for an SSRC, the
endpoint MUST perform RTCP timer reconsideration and, if applicable,
T_rr_interval based suppression. If the result indicates that a
compound RTCP packet is to be sent by that SSRC, and the
transmission is not an early RTCP packet <xref
target="RFC4585"></xref>, then the endpoint SHOULD try to aggregate
RTCP packets of additional SSRCs that are scheduled in the future
into the compound RTCP packet before it is sent. The reason to limit
or not aggregate at due to backwards compatibility reasons was
discussed earlier in <xref target="avg_rtcp_size"></xref>.</t>
<t>Aggregation proceeds as follows. The endpoint selects the SSRC
that has the smallest tn value after the current time, tc, and
prepares the RTCP packets that SSRC would send if its timer tn
expired at tc. If those RTCP packets will fit into the compound RTCP
packet that is being generated, taking into account the path MTU and
the previously added RTCP packets, then they are added to the
compound RTCP packet; otherwise they are discarded. This process is
repeated for each SSRC, in order of increasing tn, until the
compound RTCP packet is full, or all SSRCs have been aggregated. At
that point, the compound RTCP packet is sent.</t>
<t>When the compound RTCP packet is sent, the endpoint MUST update
tp, tn, and T_rr_last (if applicable) for each SSRC that was
included. These variables are updated as follows: <list
style="letters">
<t>For the first SSRC that reported in the compound RTCP packet,
set the effective transmission time, tt, of that SSRC to tc.</t>
<t>For each additional SSRC that reported in the compound RTCP
packet, calculate the transmission time that SSRC would have had
if it had not been aggregated into the compound RTCP packet.
This is derived by taking tn for that SSRC, then performing
reconsideration and updating tn until tp + T <= tn. Once this
is done, set the effective transmission time, tt, for that SSRC
to the calculated value of tn. If the RTP/AVPF profile or the
RTP/SAVPF profile is being used, then T_rr_interval based
suppression MUST NOT be used in this calculation.</t>
<t>Calculate average effective transmission time, tt_avg, for
the compound RTCP packet based on the tt values for all SSRCs
sent in the compound RTCP packet. Set tp for each of the SSRCs
sent in the compound RTCP packet to tt_avg. If the RTP/AVPF
profile or the RTP/SAVPF profile is being used, set T_tt_last
for each SSRC sent in the compound RTCP packet to tt_avg.</t>
<t>For each of the SSRCs sent in the compound RTCP packet,
calculate new tn values based on the updated parameters and the
usual RTCP timing rules, and reschedule the timers.</t>
</list></t>
<t>When using the RTP/AVPF profile or the RTP/SAVPF profile, the
above mechanism only attempts to aggregate RTCP packets when the
compound RTCP packet to be sent is not an early RTCP packet, and
hence the algorithm in Section 3.5.3 of <xref
target="RFC4585"></xref> will control RTCP scheduling. If
T_rr_interval == 0, or if T_rr_interval != 0 and option 1, 2a, or 2b
of the algorithm are chosen, then the above mechanism updates the
necessary variables. However, if the transmission is suppressed per
option 2c of the algorithm, then tp is updated to tc as aggregation
has not taken place.</t>
<t>Reverse reconsideration MUST be performed following Section 6.3.4
of <xref target="RFC3550"></xref>. In some cases, this can lead to
the value of tp after reverse reconsideration being larger than tc.
This is not a problem, and has the desired effect of proportionally
pulling the tp value towards tc (as well as tn) as the reporting
interval shrinks in direct proportion the reduced group size.</t>
<t>The above algorithm has been shown in simulations to maintain the
inter-RTCP packet transmission time distribution for each SSRC, and
to consume the same amount of bandwidth as non-aggregated RTCP
packets. With this algorithm the actual transmission interval for an
SSRC triggering an RTCP compound packet transmission is following
the regular transmission rules. The value tp is set to somewhere in
the interval [0,1.5/1.21828*Td] ahead of tc. The actual value is
average of one instance of tc and the randomized transmission times
of the additional SSRCs, thus the lower range of the interval is
more probable. This compensates for the bias that is otherwise
introduced by picking the shortest tn value out of the N SSRCs
included in aggregate.</t>
<t>The algorithm also handles the cases where the number of SSRCs
that can be included in an aggregated packet varies. An SSRC that
previously was aggregated and fails to fit in a packet still has its
own transmission scheduled according to normal rules. Thus, it will
trigger a transmission in due time, or the SSRC will be included in
another aggregate. The algorithm's behaviour under SSRC group size
changes is as follows:</t>
<t><list style="hanging">
<t
hangText="RTP sessions where the number of SSRC are growing:">When
the group size is growing, Td grows in proportion to the number
of new SSRCs in the group. When reconsideration is performed due
to expiry of the tn timer, that SSRC will reconsider the
transmission and with a certain probability reschedule the tn
timer. This part of the reconsideration algorithm is only
impacted by the above algorithm by having tp values that were in
the future instead of set to the time of the actual last
transmission at the time of updating tp.</t>
<t
hangText="RTP sessions where the number of SSRC are shrinking:">When
the group shrinks, reverse reconsideration moves the tp and tn
values towards tc proportionally to the number of SSRCs that
leave the session compared to the total number of participants
when they left. The setting of the tp value forward in time
related to the tc could be believed to have negative effect.
However, the reason for this setting is to compensate for bias
caused by picking the shortest tn out of the N aggregated. This
bias remains over a reduction in the number of SSRCs. The
reverse reconsideration compensates the reduction independently
of aggregation being used or not. The negative effect that can
occur on removing an SSRC is that the most favourable tn
belonged to the removed SSRC. The impact of this is limited to
delaying the transmission, in the worst case, one reporting
interval.</t>
</list></t>
<t>In conclusion the investigations performed has found no
significant negative impact on the scheduling algorithm.</t>
</section>
</section>
<section title="Use of RTP/AVPF or RTP/SAVPF Feedback">
<t>This section discusses the transmission of RTP/AVPF feedback
packets when the transmitting endpoint has multiple SSRCs. The
guidelines in this section also apply to endpoints using the RTP/SAVPF
profile.</t>
<section title="Choice of SSRC for Feedback Packets">
<t>When an RTP/AVPF endpoint has multiple SSRCs, it can choose what
SSRC to use as the source for the RTCP feedback packets it sends.
Several factors can affect that choice: <list style="symbols">
<t>RTCP feedback packets relating to a particular media type
SHOULD be sent by an SSRC that receives that media type. For
example, when audio and video are multiplexed onto a single RTP
session, endpoints will use their audio SSRC to send feedback on
the audio received from other participants.</t>
<t>RTCP feedback packets and RTCP codec control messages that
are notifications or indications regarding RTP data processed by
an endpoint MUST be sent from the SSRC used for that RTP data.
This includes notifications that relate to a previously received
request or command <xref target="RFC4585"></xref><xref
target="RFC5104"></xref>.</t>
<t>If separate SSRCs are used to send and receive media, then
the corresponding SSRC SHOULD be used for feedback, since they
have differing RTCP bandwidth fractions. This can also affect
the consideration if the SSRC can be used in immediate mode or
not.</t>
<t>Some RTCP feedback packet types require consistency in the
SSRC used. For example, if a <xref target="RFC5104">TMMBR
limitation</xref> is set by an SSRC, the same SSRC needs to be
used to remove the limitation.</t>
<t>If several SSRCs are suitable for sending feedback, it might
be desirable to use an SSRC that allows the sending of feedback
as an early RTCP packet.</t>
</list></t>
<t>When an RTCP feedback packet is sent as part of a compound RTCP
packet that aggregates reports from multiple SSRCs, there is no
requirement that the compound packet contains an SR or RR packet
generated by the sender of the RTCP feedback packet. For
reduced-size RTCP packets, aggregation of RTCP feedback packets from
multiple sources is not limited further than Section 4.2.2 of <xref
target="RFC5506"></xref>.</t>
</section>
<section title="Scheduling an RTCP Feedback Packet">
<t>When an SSRC has a need to transmit a feedback packet in early
mode it MUST schedule that packet following the algorithm in Section
3.5 of <xref target="RFC4585"></xref> modified as follows: <list
style="symbols">
<t>To determine whether an RTP session is considered to be a
point-to-point session or a multiparty session, an endpoint MUST
count the number of distinct RTCP SDES CNAME values used by the
SSRCs listed in the SSRC field of RTP data packets it receives
and in the "SSRC of sender" field of RTCP SR, RR, RTPFB, or PSFB
packets it receives. An RTP session is considered to be a
multiparty session if more than one CNAME is used by those
SSRCs, unless signalling indicates that the session is to be
handled as point to point, or RTCP reporting groups <xref
target="I-D.ietf-avtcore-rtp-multi-stream-optimisation"></xref>
are used. If RTCP reporting groups are used, an RTP session is
considered to be a point-to-point session if the endpoint
receives only a single reporting group, and considered to be a
multiparty session if multiple reporting groups are received, or
if a combination of reporting groups and SSRCs that are not part
of a reporting group are received. Endpoints MUST NOT determine
whether an RTP session is multiparty or point-to-point based on
the type of connection (unicast or multicast) used, or on the
number of SSRCs received.</t>
<t>When checking if there is already a scheduled compound RTCP
packet containing feedback messages (Step 2 in Section 3.5.2 of
<xref target="RFC4585"></xref>), that check MUST be done
considering all local SSRCs.</t>
<t>If an SSRC is not allowed to send an early RTCP packet, then
the feedback message MAY be queued for transmission as part of
any early or regular scheduled transmission that can occur
within the maximum useful lifetime of the feedback message
(T_max_fb_delay). This modifies the behaviour in bullet 4a) in
Section 3.5.2 of <xref target="RFC4585"></xref>.</t>
</list></t>
<t>The first bullet point above specifies a rule to determine if an
RTP session is to be considered a point-to-point session or a
multiparty session. This rule is straightforward to implement, but
is known to incorrectly classify some sessions as multiparty
sessions. The known problems are as follows:<list style="hanging">
<t
hangText="Endpoint with multiple synchronization contexts:">An
endpoint that is part of a point-to-point session can have
multiple synchronization contexts, for example due to forwarding
an external media source into a interactive real-time
conversation. In this case the classification will consider the
peer as two endpoints, while the actual RTP/RTCP transmission
will be under the control of one endpoint.</t>
<t hangText="Selective Forwarding Middlebox:">The SFM as defined
in <xref target="I-D.ietf-avtcore-rtp-topologies-update">Section
3.7 of </xref> has control over the transmission and
configurations between itself and each peer endpoint
individually. It also fully controls the RTCP packets being
forwarded between the individual legs. Thus, this type of
middlebox can be compared to the RTP mixer, which uses its own
SSRCs to mix or select the media it forwards, that will be
classified as a point-to-point RTP session by the above
rule.</t>
</list>In the above cases it is very reasonable to use <xref
target="I-D.ietf-avtcore-rtp-multi-stream-optimisation">RTCP
reporting groups</xref>. If that extension is used, an endpoint can
indicate that the multitude of CNAMEs are in fact under a single
endpoint or middlebox control by using only a single reporting
group.</t>
<t>The above rules will also classify some sessions where the
endpoint is connected to an RTP mixer as being point to point. For
example the mixer could act as gateway to an Any Source Multicast
based RTP session for the discussed endpoint. However, this will in
most cases be okay, as the RTP mixer provides separation between the
two parts of the session. The responsibility falls on the mixer to
act accordingly in each domain.</t>
<t>Finally, we note that signalling mechanisms could be defined to
override the rules when it would result in the wrong
classification.</t>
</section>
</section>
</section>
<section title="Adding and Removing SSRCs">
<t>The set of SSRCs present in a single RTP session can vary over time
due to changes in the number of endpoints in the session, or due to
changes in the number or type of media streams being sent.</t>
<t>Every endpoint in an RTP session will have at least one SSRC that it
uses for RTCP reporting, and for sending media if desired. It can also
have additional SSRCs, for sending extra media streams or for additional
RTCP reporting. If the set of media streams being sent changes, then the
set of SSRCs being sent will change. Changes in the media format or
clock rate might also require changes in the set of SSRCs used. An
endpoint can also have more active SSRCs than it has active RTP media
streams, and send RTCP relating to SSRCs that are not currently sending
RTP data packets so that its peers are aware of the SSRCs, and have the
associated context (e.g., clock synchronisation and an SDES CNAME) in
place to be able to play out media as soon as they becomes active.</t>
<t>In the following, we describe some considerations around adding and
removing RTP streams and their associated SSRCs.</t>
<section title="Adding RTP Streams">
<t>When an endpoint joins an RTP session it can have zero, one, or
more RTP streams it will send, or that it is prepared to send. If it
has no RTP stream it plans to send, it still needs an SSRC that will
be used to send RTCP feedback. If it will send one or more RTP
streams, it will need the corresponding number of SSRC values. The
SSRCs used by an endpoint are made known to other endpoints in the RTP
session by sending RTP and RTCP packets. SSRCs can also be signalled
using non-RTP means (e.g., <xref target="RFC5576"></xref>). Unless
restricted by signalling, an endpoint can, at any time, send an
additional RTP stream, identified by a new SSRC (this might be
associated with a signalling event, but that is outside the scope of
this memo). This makes the new SSRC visible to the other endpoints in
the session, since they share the single SSRC space inherent in the
definition of an RTP session.</t>
<t>An endpoint that has never sent an RTP stream will have an SSRC
that it uses for RTCP reporting. If that endpoint wants to start
sending an RTP stream, it is RECOMMENDED that it use its existing SSRC
for that stream, since otherwise the participant count in the RTP
session will be unnecessary increased, leading to a longer RTCP
reporting interval and larger RTCP reports due to cross reporting. If
the endpoint wants to start sending more than one RTP stream, it will
need to generate a new SSRC for the second and any subsequent RTP
streams.</t>
<t>An endpoint that has previously stopped sending an RTP stream, and
that wants to start sending a new RTP stream, cannot generally re-use
the existing SSRC, and often needs to generate a new SSRC, because an
SSRC cannot change media type (e.g., audio to video) or RTP timestamp
clock rate <xref target="RFC7160"></xref>, and because the SSRC might
be associated with a particular semantic by the application (note: an
RTP stream can pause and restart using the same SSRC, provided RTCP is
sent for that SSRC during the pause; these rules only apply to new RTP
streams reusing an existing SSRC).</t>
</section>
<section title="Removing RTP Streams">
<t>An SSRC is removed from an RTP session in one of two ways. When an
endpoint stops sending RTP and RTCP packets using an SSRC, then that
SSRC will eventually time out as described in Section 6.3.5 of <xref
target="RFC3550"></xref>. Alternatively, an SSRC can be explicitly
removed from use by sending an RTCP BYE packet as described in Section
6.3.7 of <xref target="RFC3550"></xref>. It is RECOMMENDED that SSRCs
are removed from use by sending an RTCP BYE packet. Note that <xref
target="RFC3550"></xref> requires that the RTCP BYE SHOULD be the last
RTP/RTCP packet sent in the RTP session for an SSRC. If an endpoint
needs to restart an RTP stream after sending an RTCP BYE for its SSRC,
it needs to generate a new SSRC value for that stream.</t>
<t>The finality of sending RTCP BYE, means that endpoints needs to
consider if the ceasing of transmission of an RTP stream is temporary
or more permanent. Temporary suspension of media transmission using a
particular RTP stream (SSRC) needs to maintain that SSRC as an active
participant, by continuing RTCP transmission for it. That way the
media sending can be resume immediately, knowing that the context is
in place. Permanent transmission halting needs to send RTCP BYE to
allow the other participants to use the RTCP bandwidth resources and
clean up their state databases.</t>
<t>An endpoint that ceases transmission of all its RTP streams but
remains in the RTP session MUST maintain at least one SSRC that is to
be used for RTCP reporting and feedback (i.e., it cannot send a BYE
for all SSRCs, but needs to retain at least one active SSRC). As some
Feedback packets can be bound to media type there might be need to
maintain one SSRC per media type within an RTP session. An alternative
can be to create a new SSRC to use for RTCP reporting and feedback.
However, to avoid the perception that an endpoint drops completely out
of an RTP session such a new SSRC ought to be first established before
terminating all the existing SSRCs.</t>
</section>
</section>
<section title="RTCP Considerations for Streams with Disparate Rates">
<t>An RTP session has a single set of parameters that configure the
session bandwidth. These are the RTCP sender and receiver fractions
(e.g., the SDP "b=RR:" and "b=RS:" lines <xref
target="RFC3556"></xref>), and the parameters of the <xref
target="RFC4585">RTP/AVPF profile</xref> (e.g., trr-int) if that profile
(or its <xref target="RFC5124"> secure extension, RTP/SAVPF</xref>) is
used. As a consequence, the base RTCP reporting interval, before
randomisation, will be the same for every sending SSRC in an RTP
session. Similarly, every receiving SSRC in an RTP session will have the
same base reporting interval, although this can differ from the
reporting interval chosen by sending SSRCs. This uniform RTCP reporting
interval for all SSRCs can result in RTCP reports being sent more often,
or too seldom, than is considered desirable for a RTP stream.</t>
<t>For example, consider a scenario when an audio flow sending at tens
of kilobits per second is multiplexed into an RTP session with a
multi-megabit high quality video flow. If the session bandwidth is
configured based on the video sending rate, and the default RTCP
bandwidth fraction of 5% of the session bandwidth is used, it is likely
that the RTCP bandwidth will exceed the audio sending rate. If the
reduced minimum RTCP interval described in Section 6.2 of <xref
target="RFC3550"></xref> is then used in the session, as appropriate for
video where rapid feedback on damaged I-frames is wanted, the uniform
reporting interval for all senders could mean that audio sources are
expected to send RTCP packets more often than they send audio data
packets. This bandwidth mismatch can be reduced by careful tuning of the
RTCP parameters, especially trr_int when the RTP/AVPF profile is used,
but cannot be avoided entirely as it is inherent in the design of the
RTCP timing rules, and affects all RTP sessions that contain flows with
greatly mismatched bandwidth.</t>
<t>Different media rates or desired RTCP behaviours can also occur with
SSRCs carrying the same media type. A common case in multiparty
conferencing is when a small number of video streams are shown in high
resolution, while the others are shown as low resolution thumbnails,
with the choice of which is shown in high resolution being voice
activity controlled. Here the differences are both in actual media rate
and in choices for what feedback messages might be needed. Other
examples of differences that can exist are due to the intended usage of
a media source. A media source carrying the video of the speaker in a
conference is different from a document camera. Basic parameters that
can differ in this case are frame-rate, acceptable end-to-end delay, and
the SNR fidelity of the image. These differences affect not only the
needed bit-rates, but also possible transmission behaviours, usable
repair mechanisms, what feedback messages the control and repair
requires, the transmission requirements on those feedback messages, and
monitoring of the RTP stream delivery. Other similar scenarios can also
exist.</t>
<t>Sending multiple media types in a single RTP session causes that
session to contain more SSRCs than if each media type was sent in a
separate RTP session. For example, if two participants each send an
audio and a video flow in a single RTP session, that session will
comprise four SSRCs, but if separate RTP sessions had been used for
audio and video, each of those two RTP sessions would comprise only two
SSRCs. Sending multiple media streams in an RTP session hence increases
the amount of cross reporting between the SSRCs, as each SSRC reports on
all other SSRCs in the session. This increases the size of the RTCP
reports, causing them to be sent less often than would be the case if
separate RTP sessions where used for a given RTCP bandwidth.</t>
<t>Finally, when an RTP session contains multiple media types, it is
important to note that the RTCP reception quality reports, feedback
messages, and extended report blocks used might not be applicable to all
media types. Endpoints will need to consider the media type of each SSRC
only send or process reports and feedback that apply to that particular
SSRC and its media type. Signalling solutions might have shortcomings
when it comes to indicating that a particular set of RTCP reports or
feedback messages only apply to a particular media type within an RTP
session.</t>
<t>From an RTCP perspective, therefore, it can be seen that there are
advantages to using separate RTP sessions for each media stream, rather
than sending multiple media streams in a single RTP session. However,
these are frequently offset by the need to reduce port use, to ease
NAT/firewall traversal, achieved by combining media streams into a
single RTP session. The following sections consider some of the issues
with using RTCP in sessions with multiple media streams in more
detail.</t>
<section anchor="sec-timeout-ssrc" title="Timing out SSRCs">
<t>Various issues have been identified with timing out SSRC values
when sending multiple media streams in an RTP session.</t>
<section anchor="sec-avpf-bug"
title="Problems with the RTP/AVPF T_rr_interval Parameter">
<t>The RTP/AVPF profile includes a method to prevent regular RTCP
reports from being sent too often. This mechanism is described in
Section 3.5.3 of <xref target="RFC4585"></xref>, and is controlled
by the T_rr_interval parameter. It works as follows. When a regular
RTCP report is sent, a new random value, T_rr_current_interval, is
generated, drawn evenly in the range 0.5 to 1.5 times T_rr_interval.
If a regular RTCP packet is to be sent earlier then
T_rr_current_interval seconds after the previous regular RTCP
packet, and there are no feedback messages to be sent, then that
regular RTCP packet is suppressed, and the next regular RTCP packet
is scheduled. The T_rr_current_interval is recalculated each time a
regular RTCP packet is sent. The benefit of suppression is that it
avoids wasting bandwidth when there is nothing requiring frequent
RTCP transmissions, but still allows utilization of the configured
bandwidth when feedback is needed.</t>
<t>Unfortunately this suppression mechanism skews the distribution
of the RTCP sending intervals compared to the regular RTCP reporting
intervals. The standard RTCP timing rules, including reconsideration
and the compensation factor, result in the intervals between sending
RTCP packets having a distribution that is skewed towards the upper
end of the range [0.5/1.21828, 1.5/1.21828]*Td, where Td is the
deterministic calculated RTCP reporting interval. With Td = 5s this
distribution covers the range [2.052s, 6.156s]. In comparison, the
RTP/AVPF suppression rules act in an interval that is 0.5 to 1.5
times T_rr_interval; for T_rr_interval = 5s this is [2.5s,
7.5s].</t>
<t>The effect of this is that the time between consecutive RTCP
packets when using T_rr_interval suppression can become large. The
maximum time interval between sending one regular RTCP packet and
the next, when T_rr_interval is being used, occurs when
T_rr_current_interval takes its maximum value and a regular RTCP
packet is suppressed at the end of the suppression period, then the
next regular RTCP packet is scheduled after its largest possible
reporting interval. Taking the worst case of the two intervals gives
a maximum time between two RTCP reports of 1.5*T_rr_interval +
1.5/1.21828*Td.</t>
<t>This behaviour can be surprising when Td and T_rr_interval have
the same value. That is, when T_rr_interval is configured to match
the regular RTCP reporting interval. In this case, one might expect
that regular RTCP packets are sent according to their usual
schedule, but feedback packets can be sent early. However, the
above-mentioned issue results in the RTCP packets actually being
sent in the range [0.5*Td, 2.731*Td] with a highly non-uniform
distribution, rather than the range [0.41*Td, 1.23*Td]. This is
perhaps unexpected, but is not a problem in itself. However, when
coupled with packet loss, it raises the issue of premature
timeout.</t>
</section>
<section anchor="sec-timeout" title="Avoiding Premature Timeout">
<t>In <xref target="RFC3550">RTP/AVP</xref> the timeout behaviour is
simple, and is 5 times Td, where Td is calculated with a Tmin value
of 5 seconds. In other words, if the configured RTCP bandwidth
allows for an average RTCP reporting interval shorter than 5
seconds, the timeout is 25 seconds of no activity from the SSRC (RTP
or RTCP), otherwise the timeout is 5 average reporting
intervals.</t>
<t><xref target="RFC4585">RTP/AVPF</xref> introduces different
timeout behaviours depending on the value of T_rr_interval. When
T_rr_interval is 0, it uses the same timeout calculation as RTP/AVP.
However, when T_rr_interval is non-zero, it replaces Tmin in the
timeout calculation, most likely to speed up detection of timed out
SSRCs. However, using a non-zero T_rr_interval has two consequences
for RTP behaviour.</t>
<t>First, due to suppression, the number of RTP and RTCP packets
sent by an SSRC that is not an active RTP sender can become very
low, because of the issue discussed in <xref
target="sec-avpf-bug"></xref>. As the RTCP packet interval can be as
long as 2.73*Td, then during a 5*Td time period an endpoint might in
fact transmit only a single RTCP packet. The long intervals result
in fewer RTCP packets, to a point where a single RTCP packet loss
can sometimes result in timing out an SSRC.</t>
<t>Second, the RTP/AVPF changes to the timeout rules reduce
robustness to misconfiguration. It is common to use RTP/AVPF
configured such that RTCP packets can be sent frequently, to allow
rapid feedback, however this makes timeouts very sensitive to
T_rr_interval. For example, if two SSRCs are configured one with
T_rr_interval = 0.1s and the other with T_rr_interval = 0.6s, then
this small difference will result in the SSRC with the shorter
T_rr_interval timing out the other if it stops sending RTP packets,
since the other RTCP reporting interval is more than five times its
own. When RTP/AVP is used, or RTP/AVPF with T_rr_interval = 0, this
is a non-issue, as the timeout period will be 25s, and differences
between configured RTCP bandwidth can only cause premature timeouts
when the reporting intervals are greater than 5s and differ by a
factor of five. To limit the scope for such problematic
misconfiguration, we propose an update to the RTP/AVPF timeout rules
in <xref target="sec-rtcp-timeout-spec"></xref>.</t>
</section>
<section title="Interoperability Between RTP/AVP and RTP/AVPF">
<t>If endpoints implementing the RTP/AVP and RTP/AVPF profiles (or
their secure variants) are combined within a single RTP session, and
the RTP/AVPF endpoints use a non-zero T_rr_interval that is
significantly below 5 seconds, there is a risk that the RTP/AVPF
endpoints will prematurely timeout the SSRCs of the RTP/AVP
endpoints, due to their different RTCP timeout rules. Conversely, if
the RTP/AVPF endpoints use a T_rr_interval that is significant
larger than 5 seconds, there is a risk that the RTP/AVP endpoints
will timeout the SSRCs of the RTP/AVPF endpoints.</t>
<t>Mixing endpoints using two different RTP profiles within a single
RTP session is NOT RECOMMENDED. However, if mixed RTP profiles are
used, and the RTP/AVPF endpoints are not updated to follow <xref
target="sec-rtcp-timeout-spec"></xref> of this memo, then the
RTP/AVPF session SHOULD be configured to use T_rr_interval = 4
seconds to avoid premature timeouts.</t>
<t>The choice of T_rr_interval = 4 seconds for interoperability
might appear strange. Intuitively, this value ought to be 5 seconds,
to make both the RTP/AVP and RTP/AVPF use the same timeout period.
However, the behaviour outlined in <xref
target="sec-avpf-bug"></xref> shows that actual RTP/AVPF reporting
intervals can be longer than expected. Setting T_rr_interval = 4
seconds gives actual RTCP intervals near to those expected by
RTP/AVP, ensuring interoperability.</t>
</section>
<section anchor="sec-rtcp-timeout-spec"
title="Updated SSRC Timeout Rules">
<t>To ensure interoperability and avoid premature timeouts, all
SSRCs in an RTP session MUST use the same timeout behaviour.
However, previous specification are inconsistent in this regard. To
avoid interoperability issues, this memo updates the timeout rules
as follows: <list style="symbols">
<t>For the RTP/AVP, RTP/SAVP, RTP/AVPF, and RTP/SAVPF profiles,
the timeout interval SHALL be calculated using a multiplier of
five times the deterministic RTCP reporting interval. That is,
the timeout interval SHALL be 5*Td.</t>
<t>For the RTP/AVP, RTP/SAVP, RTP/AVPF, and RTP/SAVPF profiles,
calculation of Td, for the purpose of calculating the
participant timeout only, SHALL be done using a Tmin value of 5
seconds and not the reduced minimal interval, even if the
reduced minimum interval is used to calculate RTCP packet
transmission intervals.</t>
</list> This changes the behaviour for the RTP/AVPF or RTP/SAVPF
profiles when T_rr_interval != 0. Specifically, the first paragraph
of Section 3.5.4 of <xref target="RFC4585"></xref> is updated to use
Tmin instead of T_rr_interval in the timeout calculation for
RTP/AVPF entities.</t>
</section>
</section>
<section anchor="sec-rtcp-tuning" title="Tuning RTCP transmissions">
<t>This sub-section discusses what tuning can be done to reduce the
downsides of the shared RTCP packet intervals. First, it is considered
what possibilities exist for the <xref target="RFC3551">RTP/AVP</xref>
profile, then what additional tools are provided by <xref
target="RFC4585">RTP/AVPF</xref>.</t>
<section title="RTP/AVP and RTP/SAVP">
<t>When using the RTP/AVP or RTP/SAVP profiles, the options for
tuning the RTCP reporting intervals are limited to the RTCP sender
and receiver bandwidth, and whether the minimum RTCP interval is
scaled according to the bandwidth. As the scheduling algorithm
includes both randomisation and reconsideration, one cannot simply
calculate the expected average transmission interval using the
formula for Td given in Section 6.3.1 of <xref
target="RFC3550"></xref>. However, by considering the inputs to that
expression, and the randomisation and reconsideration rules, we can
begin to understand the behaviour of the RTCP transmission
interval.</t>
<t>Let's start with some basic observations:<list style="letters">
<t>Unless the scaled minimum RTCP interval is used, then Td
prior to randomization and reconsideration can never be less
than Tmin. The default value of Tmin is 5 seconds.</t>
<t>If the scaled minimum RTCP interval is used, Td can become as
low as 360 divided by RTP Session bandwidth in kilobits per
second. In SDP the RTP session bandwidth is signalled using a
"b=AS" line. An RTP Session bandwidth of 72kbps results in Tmin
being 5 seconds. An RTP session bandwidth of 360kbps of course
gives a Tmin of 1 second, and to achieve a Tmin equal to once
every frame for a 25 frame-per-second video stream requires an
RTP session bandwidth of 9Mbps. Use of the RTP/AVPF or RTP/SAVPF
profile allows more frequent RTCP reports for the same
bandwidth, as discussed below.</t>
<t>The value of Td scales with the number of SSRCs and the
average size of the RTCP reports, to keep the overall RTCP
bandwidth constant.</t>
<t>The actual transmission interval for a Td value is in the
range [0.5*Td/1.21828,1.5*Td/1.21828], and the distribution is
skewed, due to reconsideration, with the majority of the
probability mass being above Td. This means, for example, that
for Td = 5s, the actual transmission interval will be
distributed in the range [2.052s, 6.156s], and tending towards
the upper half of the interval. Note that Tmin parameter limits
the value of Td before randomisation and reconsideration are
applied, so the actual transmission interval will cover a range
extending below Tmin.</t>
</list></t>
<t>Given the above, we can calculate the number of SSRCs, n, that an
RTP session with 5% of the session bandwidth assigned to RTCP can
support while maintaining Td equal to Tmin. This will tell us how
many media streams we can report on, keeping the RTCP overhead
within acceptable bounds. We make two assumptions that simplify the
calculation: that all SSRCs are senders, and that they all send
compound RTCP packets comprising an SR packet with n-1 report
blocks, followed by an SDES packet containing a 16 octet CNAME value
<xref target="RFC7022"></xref> (such RTCP packets will vary in size
between 54 and 798 octets depending on n, up to the maximum of 31
report blocks that can be included in an SR packet). If we put this
packet size, and a 5% RTCP bandwidth fraction into the RTCP interval
calculation in Section 6.3.1 of <xref target="RFC3550"></xref>, and
calculate the value of n needed to give Td = Tmin for the scaled
minimum interval, we find n=9 SSRCs can be supported (irrespective
of the interval, due to the way the reporting interval scales with
the session bandwidth). We see that to support more SSRCs without
changing the scaled minimum interval, we need to increase the RTCP
bandwidth fraction from 5%; changing the session bandwidth to a
higher value would reduce the Tmin. However, if using the default 5%
allocation of RTCP bandwidth, an increase will result in more SSRCs
being supported given a fixed Td target.</t>
<!-- csp: see page 1593 of notes for the above calculation -->
<t>Based on the above, when using the RTP/AVP profile or the
RTP/SAVP profile, the key limitation for rapid RTCP reporting in
small unicast sessions is going to be the Tmin value. The RTP
session bandwidth configured in RTCP has to be sufficiently high to
reach the reporting goals the application has following the rules
for the scaled minimal RTCP interval.</t>
</section>
<section title="RTP/AVPF and RTP/SAVPF">
<t>When using RTP/AVPF or RTP/SAVPF, we have a powerful additional
tool for tuning RTCP transmissions: the T_rr_interval parameter. Use
of this parameter allows short RTCP reporting intervals;
alternatively it gives the ability to sent frequent RTCP feedback
without sending frequent regular RTCP reports.</t>
<t>The use of the RTP/AVPF or RTP/SAVPF profile with T_rr_interval
set to a value greater than zero but smaller than Tmin allows more
frequent RTCP feedback than the RTP/AVP or RTP/SAVP profiles, for a
given RTCP bandwidth. This happens because Tmin is set to zero after
the transmission of the initial RTCP report, causing the reporting
interval for later packet to be determined by the usual RTCP
bandwidth-based calculation, with Tmin=0, and the T_rr_interval.
This has the effect that we are no longer restricted by the minimal
interval (whether the default 5 second minimum, or the reduced
minimum interval). Rather, the RTCP bandwidth and the T_rr_interval
are the governing factors, allowing faster feedback. Applications
that care about rapid regular RTCP feedback ought to consider using
the RTP/AVPF or RTP/SAVPF profile, even if they don't use the
feedback features of that profile.</t>
<t>The use of the RTP/AVPF or RTP/SAVPF profile allows RTCP feedback
packets to be sent frequently, without also requiring regular RTCP
reports to be sent frequently, since T_rr_interval limits the rate
at which regular RTCP packets can be sent, while still permitting
RTCP feedback packets to be sent. Applications that can use feedback
packets for some media streams, e.g., video streams, but don't want
frequent regular reporting for other media streams, can configure
the T_rr_interval to a value so that the regular reporting for both
audio and video is at a level that is considered acceptable for the
audio. They could then use feedback packets, which will include RTCP
SR/RR packets unless reduced size RTCP feedback packets <xref
target="RFC5506"></xref> are used, for the video reporting. This
allows the available RTCP bandwidth to be devoted on the feedback
that provides the most utility for the application.</t>
<t>Using T_rr_interval still requires one to determine suitable
values for the RTCP bandwidth value. Indeed, it might make this
choice even more important, as this is more likely to affect the
RTCP behaviour and performance than when using the RTP/AVP or
RTP/SAVP profile, as there are fewer limitations affecting the RTCP
transmission.</t>
<t>When T_rr_interval is non-zero, there are configurations that
need to be avoided. If the RTCP bandwidth chosen is such that the Td
value is smaller than, but close to, T_rr_interval, then the actual
regular RTCP packet transmission interval can become very large, as
discussed in <xref target="sec-avpf-bug"></xref>. Therefore, for
configuration where one intends to have Td smaller than
T_rr_interval, then Td is RECOMMENDED to be targeted at values less
than 1/4th of T_rr_interval which results in that the range becomes
[0.5*T_rr_interval, 1.81*T_rr_interval].</t>
<t>With the RTP/AVPF or RTP/SAVPF profiles, using T_rr_interval = 0
has utility, and results in a behaviour where the RTCP transmission
is only limited by the bandwidth, i.e., no Tmin limitations at all.
This allows more frequent regular RTCP reporting than can be
achieved using the RTP/AVP profile. Many configurations of RTCP will
not consume all the bandwidth that they have been configured to use,
but this configuration will consume what it has been given. Note
that the same behaviour will be achieved as long as T_rr_interval is
smaller than 1/3 of Td as that prevents T_rr_interval from affecting
the transmission.</t>
<t>There exists no method for using different regular RTCP reporting
intervals depending on the media type or individual media stream,
other than using a separate RTP session for each type or stream.</t>
</section>
</section>
</section>
<section anchor="security" title="Security Considerations">
<t>When using the secure RTP protocol (RTP/SAVP) <xref
target="RFC3711"></xref>, or the secure variant of the feedback profile
(RTP/SAVPF) <xref target="RFC5124"></xref>, the cryptographic context of
a compound secure RTCP packet is the SSRC of the sender of the first
RTCP (sub-)packet. This could matter in some cases, especially for
keying mechanisms such as <xref target="RFC3830">Mikey</xref> which
allow use of per-SSRC keying.</t>
<t>Otherwise, the standard security considerations of RTP apply; sending
multiple media streams from a single endpoint in a single RTP session
does not appear to have different security consequences than sending the
same number of media streams spread across different RTP sessions.</t>
</section>
<section anchor="iana" title="IANA Considerations">
<t>No IANA actions are needed.</t>
</section>
<section title="Acknowledgments">
<t>The authors like to thank Harald Alvestrand and everyone else who has
been involved in the development of this document.</t>
</section>
</middle>
<back>
<references title="Normative References">
<?rfc include='reference.RFC.2119'?>
<?rfc include='reference.RFC.3550'?>
<?rfc include='reference.RFC.3711'?>
<?rfc include='reference.RFC.4585'?>
<?rfc include='reference.RFC.5124'?>
<?rfc include='reference.RFC.5506'?>
</references>
<references title="Informative References">
<?rfc include='reference.RFC.3551'?>
<?rfc include='reference.RFC.3556'?>
<?rfc include='reference.RFC.3830'?>
<?rfc include='reference.RFC.4588'?>
<?rfc include='reference.RFC.5104'?>
<?rfc include='reference.RFC.5576'?>
<?rfc include='reference.RFC.6190'?>
<?rfc include='reference.RFC.7022'?>
<?rfc include='reference.RFC.7160'?>
<?rfc include='reference.I-D.ietf-avtcore-rtp-topologies-update'?>
<?rfc include='reference.I-D.ietf-mmusic-sdp-bundle-negotiation'?>
<?rfc include='reference.I-D.ietf-clue-framework'?>
<?rfc include='reference.I-D.ietf-avtcore-multi-media-rtp-session'?>
<?rfc include='reference.I-D.ietf-avtcore-rtp-multi-stream-optimisation'?>
</references>
</back>
</rfc>
| PAFTECH AB 2003-2026 | 2026-04-23 14:22:34 |