http://stupid.domain.name/ietf/

One document matched: draft-ietf-clue-signaling-00.xml
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<?rfc toc='yes'?>
<?rfc tocdepth='4'?>
<?rfc compact="yes"?>
<rfc category="std" ipr="trust200902" docName='draft-ietf-clue-signaling-00'>

<!--56789012345678901234567890123456789012345678901234567890123456789-->

<!-- Notes:

Paul's messages from 12/18/12 4:11PM and 12/19/12 11:21AM
formed a starting point for this draft.

-->

<front>
   <title abbrev="CLUE Signaling">
      CLUE Signaling
   </title>
   <author initials="P." surname="Kyzivat" fullname="Paul Kyzivat">
      <organization>Huawei</organization>
      <address>
         <email>pkyzivat@alum.mit.edu</email>
      </address>
   </author>
   <author initials="L." surname="Xiao" fullname="Lennard Xiao">
      <organization>Huawei</organization>
      <address>
         <email>lennard.xiao@huawei.com</email>
      </address>
   </author>
   <author initials="C." surname="Groves" fullname="Christian Groves">
      <organization>Huawei</organization>
      <address>
         <email>Christian.Groves@nteczone.com</email>
      </address>
   </author>
   <author initials="R." surname="Hansen" fullname="Robert Hansen">
         <organization>Cisco Systems</organization>
         <address>
            <email>rohanse2@cisco.com</email>
         </address>
      </author>
   
   <date year="2014" />
   <abstract>
      <t>

This document specifies how CLUE-specific signaling such as the 
<xref target="I-D.presta-clue-protocol">CLUE protocol</xref> and the 
<xref target="I-D.ietf-clue-datachannel">CLUE data channel</xref> are used with
each other and with existing signaling mechanisms such as SIP and SDP to produce a 
telepresence call.

      </t>
   </abstract>
</front>
<middle>
   <section title="Introduction">
      <t>
To enable devices to participate in a telepresence call, selecting the sources they
wish to view, receiving those media sources and displaying them in an optimal fashion,
CLUE involves two principal and inter-related protocol negotiations. SDP, conveyed
via SIP, is used to negotiate the specific media capabilities that can be delivered to
specific addresses on a device. Meanwhile, a <xref target="I-D.presta-clue-protocol">CLUE protocol</xref>,
transported via a <xref target="I-D.ietf-clue-datachannel">CLUE data channel</xref>, 
is used to negotiate the capture sources available, their attributes and any constraints in their use,
along which which captures the far end provides a device wishes to receive.
      </t>   
      <t>
Beyond negotiating the CLUE channel, SDP is also used to negotiate the details of supported
media streams and the maximum capability of each of those streams. As the
<xref target="I-D.ietf-clue-framework">CLUE Framework</xref> defines a manner in which
the media provider expresses their maximum encoding capabilities, SDP is also used
to express the encoding limits for each potential encoding. 
      </t>
	  <t>
Backwards-compatibility is an important consideration of the document: it is vital that a CLUE-capable device
contacting a device that does not support CLUE is able to fall back to a fully functional non-CLUE call.
The document also defines how a non-CLUE call may be upgraded to CLUE in mid-call, and similarly how CLUE
functionality can be removed mid-call to return to a standard non-CLUE call.
	  </t>
      <t>
This document originally also defined the CLUE protocol itself. These details have mostly
been split out into <xref target="I-D.presta-clue-protocol"/>
and expanded, but at present some details remain in this document.
      </t>

   </section>

   <section title="Terminology">
      <t>

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in <xref target="RFC2119"/>.

      </t>
      <t>

This document draws liberally from the terminology defined in the
<xref target="I-D.ietf-clue-framework">CLUE Framework</xref>.

      </t>
	  
	  <t>

Other terms introduced here:

      </t>
      <t><list style='hanging'>
	     <t hangText="CLUE data channel:">
A reliable, bidirectional, transport mechanism used to convey CLUE messages.
See <xref target="I-D.ietf-clue-datachannel" /> for more details..
         </t>
		 
         <t hangText="CLUE-capable device:">
A device that supports the <xref target="I-D.ietf-clue-datachannel">CLUE data channel</xref>,
the <xref target="I-D.presta-clue-protocol">CLUE protocol</xref> and the principles of CLUE negotiation.
         </t>
		 
		 <t hangText="CLUE-enabled device:">
A CLUE-capable device that wishes to negotiate a CLUE data channel and send and/or receive
CLUe-controlled media.
         </t>
		 
		 <t hangText="Non-CLUE device:">
A device that supports standard SIP and SDP, but either does not support CLUE, or that does but does not
currently wish to invoke CLUE capabilities.
         </t>
		 
		 <t hangText="CLUE-controlled media:">
A media "m" line that is under CLUE control; the capture source that provides 
the media on this "m" line is negotiated in CLUE.
There is a corresponding "non-CLUE-controlled" media term. See <xref target="sec.group" /> for details
of how this control is signalled in SDP
         </t>

      </list></t>

   </section>

    <section title="Media Feature Tag Definition" anchor="sec.tag">
        <t>
The "sip.telepresence" media feature tag indicates support for CLUE. A CLUE-capable device 
SHOULD include this media feature tag in its REGISTER requests and OPTION responses. It SHOULD
also include the media feature tag in INVITE and UPDATE <xref target="RFC3311" /> requests and responses.
        </t>
        <t>
Presence of the media feature tag in the contact field of a request or response
can be used to determine that the far end supports CLUE. 
        </t>
    </section>
   
   <section title="SDP Grouping Framework TELEPRESENCE Extension Semantics" anchor="sec.group">
   
		<section title="General">
            <t>
This section defines a new SDP Grouping Framework extension, TELEPRESENCE.
            </t>
            <t>
The TELEPRESENCE extension can be indicated using an SDP session-level
'group' attribute. Each SDP media "m" line that is included in this group, using
SDP media-level mid attributes, is CLUE-controlled, by a CLUE data channel also included in this
TELEPRESENCE group.
            </t>
		</section>
        
        <section title="The CLUE data channel and the TELEPRESENCE grouping semantic">
            <t>
The <xref target="I-D.ietf-clue-datachannel">CLUE data channel</xref> is a bidirectional
SCTP over DTLS channel used for the transport of CLUE messages. This channel must be established before CLUE
protocol messages can be exchanged and CLUE-controlled media can be sent.
			</t>
            <t>
The data channel is a generic transport that is not specific to CLUE - if a device wishes to use
the CLUE protocol on the data channel it MUST include a TELEPRESENCE group in the SDP and include the "mid"
of the "m" line for the data channel in that group. A TELEPRESENCE grup MUST NOT include the "mid"s for more than one
data channel, and the data channel "mid" MUST NOT be included in more than one TELEPRESENCE group.
            </t>
			<t>
Presence of the data channel in a CLUE group in an SDP offer or answer also serves, 
along with the 'sip.telepresence' media feature tag, as an indication that the device
supports CLUE and wishes to upgrade the call to include CLUE-controlled media.	A CLUE-enabled device
SHOULD include a data channel "m" line in offers and, when allowed by <xref target="RFC3264" />, answers.
            </t>
        </section>
        
        <section title="CLUE-controlled media and the TELEPRESENCE grouping semantic">
            <t>
CLUE-controlled media lines in an SDP are "m" lines in which the content of the media streams to be sent is
negotiated via the <xref target="I-D.presta-clue-protocol">CLUE protocol</xref>. For an "m" line to be
CLUE-controlled, its "mid" value MUST be included in a TELEPRESENCE group. CLUE-controlled media line "mid"s
MUST NOT be included in more than one TELEPRESENCE group.
            </t>
            <t>
CLUE-controlled media is controlled by the CLUE protocol as negotiated on the CLUE data channel with an "mid" included
in the TELEPRESENCE group. If no data channel is included in the group the other "m" lines in the group
are still considered CLUE-controlled and under all the restrictions of CLUE-controlled media specified in this document.
            </t>
            <t>
"m" lines not specified as under CLUE control follow normal rules for media streams negotiated in SDP as defined 
in documents such as <xref target="RFC3264" />.
            </t>            
        </section>                        
        
        <section title="SDP Offer/Answer Procedures">        
            <section title="Generating the Initial Offer">
                <section title="Signalling CLUE Encodings" anchor="sec.sdp_encodings">
                    <t>
The <xref target="I-D.ietf-clue-framework">CLUE Framework</xref> defines the concept of "encodings", which represent
the sender's encode ability. Each encoding the media provider wishes to signal is signalled via an "m"
line of the appropriate media type, which MUST be marked as sendonly with the "a=sendonly" attribute
or as inactive with the "a=inactive" attribute.
                    </t>
                    <t>
The encoder limits of active (eg, "a=sendonly") encodings can then be expressed using
existing SDP syntax. For instance, for H.264 see Table 6 in <xref target="RFC6184" /> for a list of
valid parameters for representing encoder sender stream limits.			
                    </t>
                    <t>
These encodings are CLUE-controlled and hence MUST include an "mid" in a TELEPRESENCE group as defined above.
                    </t>
                    <t>
As well as the normal restrictions defined in <xref target="RFC3264" /> media MUST NOT be sent on this stream
until the media provider has received a valid CLUE CONFIGURE message specifying the capture to be used for this stream.
In the case of RTP media this includes corresponding RTCP packets.
                    </t>                    
                    <t>
Every "m" line representing a CLUE encoding SHOULD contain a "label" attribute as defined in <xref target="RFC4574" />.
This label is used to identify the encoding by the sender in CLUE ADVERTISEMENT messages
and by the receiver in CLUE CONFIGURE messages.
                    </t>
                    
                    <section title="Media line directionality">
                        <t>
Presently, this specification mandates that CLUE-controlled "m"-lines must be unidirectional. This is because setting "m"-lines to "a=sendonly" allows the encoder limits
to be expressed, whereas in other cases codec attributes express the receive capabilities of a media line.
                        </t>
                        <t>
It is possible that in future versions of this draft or its successor this restriction will be relaxed. If a device does not feel there is a benefit to expressing encode
limitations, or if there are no meaningful codec-specific limitations to express (such as with many audio codecs) there are benefits to allowing bidirectional "m"-lines.
With bidirectional media lines recipients do not always need to create a new offer to add their own "m"-lines to express their send capabilities; if they can produce
an equal or lesser number of streams to send then they may not need additional "m"-lines. 
                        </t>
                        <t>
However, at present the need to express encode limitations and the wish to simplify the offer/answer procedure means that for the time being only unidirectional media
lines are allowed for CLUE-controlled media. The highly asymmetric nature of CLUE means that the probability of the recipient of the initial offer needing to make their
own offer to add additional "m"-lines is significantly higher than it is for most other SIP call scenarios, in which there is a tendancy for both sides to have similar numbers
of potential audio and video streams they can send. 
                        </t>
                    </section>
                    <section title="Alternate encoding limit syntaxes" anchor="sec.sdp_encoding_limits">
                        <t>
Note that while the expressing of CLUE encoding limits in SDP has been discussed at some length by the working group
and it has been agreed that this is the current, working assumption, formal consensus has not been agreed on this.
Alternatives include placing encoding limits in the CLUE ADVERTISEMENT message, or by using
alternate SDP syntax, such as is suggested in <xref target="I-D.groves-clue-latent-config" />.
                        </t>
                    </section>                    
                </section>
                <section title="Receiving CLUE-controlled media">
                    <t>
As well as including sendonly media lines to send CLUE-controlled media, the sender of the initial SDP offer MAY also include
"a=recvonly" media lines to preallocate "m" lines to receive media; these are described in more detail in the next section.
                    </t>
                </section>

                <section title="Interoperability with non-CLUE devices" anchor="sec.interop">
                    <t>
A CLUE-enabled device sending an initial SDP offer SHOULD NOT include any "m" line for 
CLUE-controlled media beyond the "m" line for the CLUE data channel, and SHOULD include
at least one non-CLUE-controlled media "m" line. 
                    </t>
                    <t>
If the device has evidence that the receiver is also CLUE-enabled, for instance due to
receiving an initial INVITE with no SDP but including a 'sip.telepresence' media feature tag,
the above recommendation is waived, and the initial offer MAY contain "m" lines for CLUE-controlled media.
                    </t>
                </section>
            </section>
            
            <section title="Generating the Answer">

                <section title="Negotiating use of CLUE and the CLUE data channel">
                    <t>
If the recipient wishes to enable CLUE for the call, they MUST negotiate data channel support 
for an "m" line, and include the "mid" of that "m" line in a corresponding TELEPRESENCE group. 
                    </t>
                </section>            
            
                <section title="Negotiating receipt of CLUE capture encodings in SDP">
                    <t>
A receiver who wishes to receive a CLUE stream via a specific encoding requires an "a=recvonly" "m" line 
that matches the "a=sendonly" encoding.
                    </t>
                    <t>
These "m" lines are CLUE-controlled and hence MUST include an "mid" the corresponding TELEPRESENCE group
corresponding to the encoding they wish to send.
                    </t>
                    <t>
In the case of RTCP for RTP media or any other media type that includes a bidirectional flow of packets for
unidirectional media streams, such bidirectional packets MUST NOT be sent until the media consumer
has received acknowledgement that the media provider has received a valid CLUE CONFIGURE message
specifying the capture to be used for this stream.
                    </t>
                </section>

            </section>
            
            <section title="Processing the initial Offer/Answer negotiation">
                <t>
In the event that both offer and answer include a data channel "m" line with a mid value included in
corresponding TELEPRESENCE groups CLUE has been successfully negotiated and the call is now
CLUE-enabled, otherwise the call is not CLUE enabled.
                </t>
                <section title="Successful CLUE negotiation">                    
                    <t>
In the event of successful CLUE enablement of the call, devices MUST now begin negotiation of the CLUE channel,
see <xref target="I-D.ietf-clue-datachannel" /> for negotiation details. If negotiation is successful, 
sending of <xref target="I-D.presta-clue-protocol">CLUE protocol</xref> messages can begin.
                  	</t>
                    <t>
A CLUE-enabled device MAY choose not
to send media on the non-CLUE-controlled channels during the period in which control of the CLUE-controlled	
media lines is being negotiated. However, a CLUE-enabled device MUST still be prepared to receive media on
non-CLUE-controlled media lines as defined in <xref target="RFC3264" />.                    
                    </t>
                    <t>
If either side of the call wishes to add additional CLUE-controlled "m" lines to send or receive CLUE-controlled
media they MAY now send a SIP request with a new SDP offer. Note that if BUNDLE has been successfully
negotiated and a Bundle Address Synchronization offer is required, the device to receive that offer SHOULD NOT
generate a new SDP offer until it has received that BAS offer.
                    </t>         
                </section>
                <section title="CLUE negotiation failure">
                    <t>
In the event that the negotiation of CLUE fails and the call is not CLUE enabled in the initial offer/answer
then CLUE is not in use in the call, and the CLUE-capable devices MUST either revert to
non-CLUE behaviour or terminate the call.
                    </t>
                </section>
            </section>
            
            <section title="Modifying the session">
            
                <section title="Enabling CLUE mid-call">
                    <t>
A CLUE-enabled device that receives an initial SDP offer from a non-CLUE-enabled device 
SHOULD include a new data channel "m" line and corresponding TELEPRESENCE group in any subsequent
offers it sends, to indicate that it is CLUE-enabled.
                    </t>
                    <t>
If, in an ongoing non-CLUE call, one or both sides of the call add the CLUE
data channel "m" line to their SDP and places the "mid" for that channel in corresponding
TELEPRESENCE groups then the call is now CLUE-enabled; negotiation of the data channel and subsequently
the CLUE protocol begin.
                    </t>                    
                </section>
                
                <section title="Disabling CLUE mid-call">
                    <t>
If, in an ongoing CLUE-enabled call, an SDP offer-answer negotiation completes in a fashion in which
either the CLUE data channel was not successfully negotiated or one side did not include the data 
channel in a matching TELEPRESENCE group then CLUE for this channel is disabled. In the event that this occurs, 
CLUE is no longer enabled and sending of all CLUE-controlled media associated with the corresponding TELEPRESENCE
group MUST stop.
                    </t>
                    <t>
Note that this is distinct to cases where the CLUE data channel fails or an error occurs on the CLUE protocol;
see <xref target="I-D.presta-clue-protocol" /> for details of media and state preservation in this circumstance.
                    </t>
                </section>            
            </section>
        </section>
    </section>	
    
    <section title="Interaction of CLUE protocol and SDP negotiations" anchor="sec.coordination">
      <t>
Information about media streams in CLUE is split between two message types:
SDP, which defines media addresses and limits, and the CLUE channel,
which defines properties of capture devices available, scene information and additional constraints.
As a result certain operations, such as advertising support for a new transmissible capture with associated stream,
cannot be performed atomically, as they require changes to both SDP and CLUE messaging.
      </t>
      <t>
This section defines how the negotiation of the two protocols interact,
provides some recommendations on dealing with intermediary stages in non-atomic operations,
and mandates additional constraints on when CLUE-configured media can be sent.   
      </t>      
      <section title="Independence of SDP and CLUE negotiation">         
         <t>
To avoid complicated state machines with the potential to reach invalid states if messages were to be lost,
or be rewritten en-route by middle boxes, the current proposal is that SDP and CLUE messages are independent.
The state of the CLUE channel does not restrict when an implementation may send a new SDP offer or answer,
and likewise the implementation's ability to send a new CLUE ADVERTISEMENT or CONFIGURE message is not restricted
by the results of or the state of the most recent SDP negotiation.
         </t>
         <t>
The primary implication of this is that a device may receive an SDP with a CLUE
encoding it does not yet have capture information for,
or receive a CLUE CONFIGURE message specifying a capture encoding for which the
far end has not negotiated a media stream in SDP.
         </t>
         <t>
CLUE messages contain an EncodingID which is used to identify a specific encoding in SDP.
The non-atomic nature of CLUE negotiation means that a sender may wish to send a new ADVERTISEMENT before the corresponding SDP message.
As such the sender of the CLUE message MAY include an EncodingID which does not currently match an extant id in SDP.
         </t>
      </section>
      <section title="Recommendations for operating with non-atomic operations">
         <t>
Generally, implementations that receive messages for which they have incomplete information
SHOULD wait until they have the corresponding information they lack before sending messages to make changes related to that information.
For instance, an implementation that receives a new SDP offer with three new "a=sendonly" 
CLUE "m" lines that has not received the corresponding CLUE ADVERTISEMENT providing the
capture information for those streams SHOULD NOT include corresponding "a=recvonly" lines in its answer,
but instead should make a new SDP offer when and if a new ADVERTISEMENT arrives with captures relevant to those encodings.
         </t>         
         <t>
Because of the constraints of offer/answer and because new SDP negotiations are generally more 'costly' than sending a new CLUE message,
implementations needing to make changes to both channels SHOULD prioritize sending the updated CLUE message over sending the new SDP message.
The aim is for the recipient to receive the CLUE changes before the SDP changes, 
allowing the recipient to send their SDP answers without incomplete information,
reducing the number of new SDP offers required.
         </t>
      </section>
     <section title="Constraints on sending media">
         <t>
While SDP and CLUE message states do not impose constraints on each other, both impose constraints on the sending of media - 
media MUST NOT be sent unless it has been negotiated in both CLUE and SDP:
an implementation MUST NOT send a specific CLUE capture encoding unless its most
recent SDP exchange contains an active media channel for that encoding AND
the far end has sent a CLUE CONFIGURE message specifying a valid capture for that encoding.
         </t>
      </section>
   </section>

   <section title="Multiplexing of CLUE-controlled media using BUNDLE" anchor="sec-bundle">
      <section title="Overview">
         <t>
A CLUE call may involve sending and/or receiving significant numbers of media streams. Conventionally, media
streams are sent and received on unique ports. However, each seperate port used for this purpose may impose
costs that a device wishes to avoid, such as the need to open that port on firewalls and NATs, the need to collect 
<xref target="RFC5245">ICE candidates</xref>, etc.
         </t>
         <t>
The <xref target="I-D.ietf-mmusic-sdp-bundle-negotiation">BUNDLE</xref> extension can be used to negotiate the multiplexing
of multiple media lines onto a single 5-tuple for sending and receiving media, allowing devices in calls to another BUNDLE-supporting
device to potentially avoid some of the above costs.
         </t>
         <t>
While CLUE-capable devices MAY support the BUNDLE extension for this purpose supporting the extension is not mandatory for a device to be CLUE-compliant.
         </t>
      </section>

      <section title="Usage of BUNDLE with CLUE">
         <t>
This specification imposes no additional requirements or restrictions on the usage of BUNDLE when used with CLUE. There is no restriction
on combining CLUE-controlled media lines and non-CLUE-controlled media lines in the same BUNDLE group or in multiple such groups. However,
there are several steps an implementation may wish to ameliorate the cost and time requirements of extra SDP offer/answer exchanges between
CLUE and BUNDLE.
         </t>
         <section title="Generating the Initial Offer">
            <t>
BUNDLE mandates that the initial SDP offer MUST use a unique address for each m-line with a non-zero port.
Because CLUE implementations generarlly will not include CLUE-controlled media lines with the exception of the data channel
CLUE devices that support large numbers of streams can avoid ever having to open
large numbers of ports if they successfully negotiate BUNDLE. 
            </t>
         </section>
         <section title="Bundle Address Synchronization">
            <t>
When using BUNDLE the initial offerer may be mandated to send a Bundle Address Synchronisation offer.
If the initial offerer also followed the recommendation of not including CLUE-controlled media lines
in their offer, they MAY choose to include them in this subsequent offer. In this circumstance the BUNDLE
specification recommends that the offerer does not "modify SDP parameters that could get the answerer to
reject the BAS offer". Including new CLUE-controlled media lines using codecs and other attributes used
in existing media lines should not increase the chance of the answerer rejecting the BAS offer; implementations
should consider carefully before including new codecs or other new SDP attributes in these CLUE-controlled media lines.
            </t>
         </section>
         <section title="Multiplexing of the data channel and RTP media">
         <t>
BUNDLE-supporting CLUE-enabled devices MAY include the data channel in the same BUNDLE group as RTP media.
In this case the device MUST be able to demultiplex the various transports - see section 7.2 of the
<xref target="I-D.ietf-mmusic-sdp-bundle-negotiation">BUNDLE draft</xref>. If the BUNDLE group includes
other protocols than the data channel transported via DTLS the device MUST also be able to differentiate the various protocols.
         </t>
         </section>
      </section>
   </section>

   <section title="Example: A call between two CLUE-capable endpoints" anchor="sec-clueexample">
      <t>
This example illustrates a call between two CLUE-capable endpoints.
Alice, initiating the call, is a system with three cameras and three screens.
Bob, receiving the call, is a system with two cameras and two screens.
A call-flow diagram is presented, followed by an summary of each message.
      </t>
      <t>
To manage the size of this section only video is considered, and SDP snippets only
illustrate video 'm' lines. ACKs are not discussed. Note that BUNDLE is not in use.
      </t>
      <figure>
         <artwork>
            <![CDATA[ 
                  	
              +----------+                      +-----------+
              |  Alice   |                      |    Bob    |
              |          |                      |           |
              +----+-----+                      +-----+-----+
                   |                                  |
                   |                                  |
                   | SIP INVITE 1 (BASIC SDP+COMEDIA) |
                   |--------------------------------->|
                   |                                  |
                   |                                  |
                   | SIP 200 OK 1 (BASIC SDP+COMEDIA) |
                   |<---------------------------------|
                   |                                  |
                   |                                  |
                   | SIP ACK 1                        |
                   |--------------------------------->|
                   |                                  |
                   |                                  |
                   |                                  |
                   |<########### MEDIA 1 ############>|
                   |   1 video A->B, 1 video B->A     |
                   |<################################>|
                   |                                  |
                   |                                  |
                   |                                  |
                   |<================================>|
                   |   CLUE CTRL CHANNEL ESTABLISHED  |
                   |<================================>|
                   |                                  |
                   |                                  |
                   | CLUE ADVERTISEMENT 1             |
                   |*********************************>|
                   |                                  |
                   |                                  |
                   |             CLUE ADVERTISEMENT 2 |
                   |<*********************************|
                   |                                  |
                   |                                  |
                   | SIP INVITE 2 (+3 sendonly)       |
                   |--------------------------------->|
                   |                                  |
                   |                                  |
                   |                 CLUE CONFIGURE 1 |
                   |<*********************************|
                   |                                  |
                   |                                  |
                   | CLUE RESPONSE 1                  |
                   |*********************************>|                                      
                   |                                  |
                   |                                  |
                   |       SIP 200 OK 2 (+2 recvonly) |
                   |<---------------------------------|
                   |                                  |
                   |                                  |
                   | SIP ACK 2                        |
                   |--------------------------------->|
                   |                                  |
                   |                                  |
                   |                                  |
                   |<########### MEDIA 2 ############>|
                   |   2 video A->B, 1 video B->A     |
                   |<################################>|
                   |                                  |
                   |                                  |
                   |       SIP INVITE 3 (+2 sendonly) |
                   |<---------------------------------|
                   |                                  |                   
                   |                                  |
                   | CLUE CONFIGURE 2                 |
                   |*********************************>|
                   |                                  |
                   |                                  |
                   |                  CLUE RESPONSE 2 |
                   |<*********************************|                
                   |                                  |
                   |                                  |
                   | SIP 200 OK 3 (+2 recvonly)       |
                   |--------------------------------->|                   
                   |                                  |
                   |                                  |
                   |                                  |
                   |                        SIP ACK 3 |
                   |<---------------------------------|
                   |                                  |
                   |                                  |
                   |                                  |
                   |<########### MEDIA 3 ############>|
                   |   2 video A->B, 2 video B->A     |
                   |<################################>|                   
                   |                                  |
                   |                                  |
                   |                                  |
                   v                                  v	
            ]]>
         </artwork>
      </figure>    
      <t>
In INVITE 1, Alice sends Bob a SIP INVITE including in the SDP body the basilar audio and video capabilities 
("BASIC SDP") and the information needed for opening a control channel to be used 
for CLUE protocol messages exchange, according to what is envisioned in the COMEDIA approach ("COMEDIA") 
for DTLS/SCTP channel <xref target="I-D.ietf-mmusic-sctp-sdp"/>.
A snippet of the SDP showing the grouping attribute and the video m-line are shown below (mid 3 represents the CLUE channel):  
         <figure>
            <artwork>
               <![CDATA[
   ...
   a=group:CLUE 3
   ...
   m=video 6002 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=sendrecv
   a=mid:2                           
               ]]>
            </artwork>
         </figure>         
      </t>
      <t>
Bob responds with a similar SDP (200 OK 1); due to their similiarity no SDP snippet is shown here.
Alice and Bob are each able to send a single audio and video stream (whether they choose to send this initial media
before CLUE has been negotiated is implementation-dependent). This is illustrated as MEDIA 1.
      </t>
      <t>
With the successful initial O/A Alice and Bob are also free to negotiate the CLUE channel.
Once this is successfully established CLUE negotiation can begin.   This is illustrated as CLUE CHANNEL ESTABLISHED.  
      </t>
      <t>
Alice now sends her CLUE Advertisement (ADVERTISEMENT 1). 
She advertises three static captures representing her three cameras. 
She also includes switched captures suitable for two- and one-screen systems.
All of these captures are in a single capture scene,
with suitable capture scene entries to tell Bob that he should either subscribe to the three static captures,
the two switched capture view or the one switched capture view.
Alice has no simultaneity constraints, so includes all six captures in one simultaneous set.
Finally, Alice includes an encoding group with three encoding IDs: "enc1", "enc2" and "enc3".
These encoding ids aren't currently valid, but will match the next SDP offer she sends.
      </t>  
      <t>
Bob received ADVERTISEMENT 1 but does not yet send a Configure message, 
because he has not yet received Alice's encoding information, 
so as yet he does not know if she will have sufficient resources to send him the two streams he ideally wants at a quality he is happy with.
      </t>  
      <t>
Bob also sends his CLUE ADVERTISEMENT (ADVERTISEMENT 2). 
He advertises two static captures representing his cameras. 
He also includes a single composed capture for single-screen systems,
in which he will composite the two camera views into a single video stream.
All three captures are in a single capture scene,
with suitable capture scene entries to tell Alice that she should either subscribe to the two static captures,
or the single composed capture.
Bob also has no simultaneity constraints, so includes all three captures in one simultaneous set.
Bob also includes a single encoding group with two encoding IDs: "foo" and "bar".
      </t>
      <t>
Similarly, Alices receives ADVERTISEMENT 2 but does not yet send a CONFIGURE message,
because she has not yet received Bob's encoding information.
      </t>
      <t>
Alice now sends INVITE 2. She maintains the sendrecv audio, video and CLUE m-lines,
and she adds three new sendonly m-lines to represents the maximum three encodings she can send.
Each of these m-lines has a label corresponding to one of the encoding ids from ADVERTISEMENT 1.
Each also has its mid added to the grouping attribute to show they are controlled by the CLUE channel.
A snippet of the SDP showing the grouping attribute and the video m-lines are shown below (mid 3 represents the CLUE channel):
         <figure>
            <artwork>
               <![CDATA[
   ...
   a=group:CLUE 3 4 5 6
   ...
   m=video 6002 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=sendrecv
   a=mid:2
   ...
   m=video 6004 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016
   a=sendonly
   a=mid:4
   a=label:enc1
   m=video 6006 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016
   a=sendonly
   a=mid:5
   a=label:enc2
   m=video 6008 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016
   a=sendonly
   a=mid:6
   a=label:enc3                           
               ]]>
            </artwork>
        </figure>         
      </t>    
      <t>
Bob now has all the information he needs to decide which streams to configure.
As such he now sends CONFIGURE 1. This requests the pair of switched captures that represent Alice's scene,
and he configures them with encoder ids "enc1" and "enc2". This also serves as an ack
for Alice's ADVERTISMENT 1.
      </t>
      <t>
Alice receives Bob's message CONFIGURE 1 and sends RESPONSE 1 to ack its receptions.
She does not yet send the capture encodings specified,
because at this stage Bob hasn't negotiated the ability to receive these streams in SDP.
      </t>
      <t>
Bob now sends his SDP answer as part of 200 OK 2.
Alongside his original audio, video and CLUE m-lines he includes two active recvonly m-lines and a zeroed m-line for the third.
He adds their mid values to the grouping attribute to show they are controlled by the CLUE channel.
A snippet of the SDP showing the grouping attribute and the video m-lines are shown below (mid 100 represents the CLUE channel):
         <figure>
            <artwork>
               <![CDATA[
   ...
   a=group:CLUE 11 12 100
   ...
   m=video 58722 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=sendrecv
   a=mid:10
   ...
   m=video 58724 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600   
   a=recvonly
   a=mid:11
   m=video 58726 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=recvonly
   a=mid:12
   m=video 0 RTP/AVP 96                          
               ]]>
            </artwork>
         </figure>         
      </t>
      <t>
On receiving 200 OK 2 from Bob Alice is now able to send the two streams of video Bob requested -
this is illustrated as MEDIA 2.
      </t>
      <t>
The constraints of offer/answer meant that Bob could not include his encoder information as new m-lines in 200 OK 2.
As such Bob now sends INVITE 3 to generate a new offer. Along with all the streams from 200 OK 2
Bob also includes two new sendonly streams.
Each stream has a label corresponding to the encoding ids in his ADVERTISEMENT 2 message.
He also adds their mid values to the grouping attribute to show they are controlled by the CLUE channel.
A snippet of the SDP showing the grouping attribute and the video m-lines are shown below (mid 100 represents the CLUE channel):
         <figure>
            <artwork>
               <![CDATA[
   ...
   a=group:CLUE 11 12 13 14 100
   ...
   m=video 58722 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=sendrecv
   a=mid:10
   ...
   m=video 58724 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600   
   a=recvonly
   a=mid:11
   m=video 58726 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=recvonly
   a=mid:12
   m=video 0 RTP/AVP 96
   m=video 58728 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016
   a=sendonly
   a=label:foo
   a=mid:13
   m=video 58730 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016
   a=sendonly
   a=label:bar
   a=mid:14
               ]]>
            </artwork>
         </figure>
      </t>
      <t>
Having received this Alice now has all the information she needs to send CONFIGURE 2.
She requests the two static captures from Bob, to be sent on encodings "foo" and "bar".
      </t>
      <t>
Bob receives Alice's message CONFIGURE 2 and sends RESPONSE 2 to ack its receptions. 
Bob does not yet send the capture encodings specified,
because Alice hasn't yet negotiated the ability to receive these streams in SDP.
      </t>      
      <t>
Alice now sends 200 OK 3, matching two recvonly m-lines to Bob's new sendonly lines.
She includes their mid values in the grouping attribute to show they are controlled by the CLUE channel.
A snippet of the SDP showing the grouping attribute and the video m-lines are shown below (mid 3 represents the CLUE channel):
         <figure>
            <artwork>
               <![CDATA[
   ...
   a=group:CLUE 3 4 5 7 8
   ...
   m=video 6002 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=sendrecv
   a=mid:2
   ...
   m=video 6004 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016
   a=sendonly
   a=mid:4
   a=label:enc1
   m=video 6006 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016
   a=sendonly
   a=mid:5
   a=label:enc2
   m=video 0 RTP/AVP 96
   m=video 6010 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=recvonly
   a=mid:7
   m=video 6012 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=recvonly
   a=mid:8
               ]]>
         </artwork>
      </figure>
      </t>
      <t>
Finally, on receiving 200 OK 3 Bob is now able to send the two streams of video Alice requested -
this is illustrated as MEDIA 3.
      </t>
      <t>
Both sides of the call are now sending multiple video streams with their sources defined via CLUE negotiation.
As the call progresses either side can send new ADVERTISEMENT or CONFIGURE or new SDP negotiation to add,
remove or change what they have available or want to receive.
      </t>
   </section>     

   <section title="Example: A call between a CLUE-capable and non-CLUE endpoint" anchor="sec-nonclueexample">
      <t>
In this brief example Alice is a CLUE-capable endpoint making a call to Bob,
who is not CLUE-capable, i.e., it is not able to use the CLUE protocol.
      </t>
         <figure>
            <artwork>
               <![CDATA[

      +----------+                      +-----------+
      |   EP1    |                      |    EP2    |
      |          |                      |           |
      +----+-----+                      +-----+-----+
           |                                  |
           |                                  |
           | SIP INVITE 1 (BASIC SDP+COMEDIA) |
           |--------------------------------->|
           |                                  |
           |                                  |
           | 200 0K 1 (BASIC SDP+*NO*COMEDIA) |
           |<---------------------------------|
           |                                  |
           |                                  |
           | ACK 1                            |
           |--------------------------------->|
           |                                  |
           |                                  |
           |                                  |
           |<########### MEDIA 1 ############>|
           |   1 video A->B, 1 video B->A     |
           |<################################>|
           |                                  |
           |                                  |
           |                                  |
           |                                  |
           v                                  v	
               ]]>
            </artwork>
         </figure>
      <t>
In INVITE 1, Alice sends Bob a SIP INVITE including in the SDP body the basilar audio and video capabilities 
("BASIC SDP") and the information needed for opening a control channel to be used 
for CLUE protocol messages exchange, according to what is envisioned in the COMEDIA approach ("COMEDIA") 
for DTLS/SCTP channel <xref target="I-D.ietf-mmusic-sctp-sdp"/>.
A snippet of the SDP showing the grouping attribute and the video m-line are shown below (mid 3 represents the CLUE channel):  
         <figure>
            <artwork>
               <![CDATA[
   ...
   a=group:CLUE 3
   ...
   m=video 6002 RTP/AVP 96
   a=rtpmap:96 H264/90000
   a=fmtp:96 profile-level-id=42e016;max-mbps=108000;max-fs=3600
   a=sendrecv
   a=mid:2                           
               ]]>
            </artwork>
         </figure>         
      </t>
      <t>
Bob is not CLUE capable, and hence does not recognize the "CLUE" semantic for the grouping attribute,
not does he support the CLUE channel. He responds with an answer with audio and video,
but with the CLUE channel zeroed.
      </t>
      <t>
From the lack of the CLUE channel Alice understands that Bob does not support CLUE, or does not wish to use it.
Both sides are now able to send a single audio and video stream to each other.
Alice at this point begins to send her fallback video:
in this case likely a switched view from whichever camera shows the current loudest participant on her side.
      </t>
   </section>

   <section title="CLUE requirements on SDP O/A" anchor="sec.sdp_rqmts">
      <t>
The current proposal calls for a new "CLUE" semantic for the <xref target="RFC5888">SDP Grouping Framework</xref>.
      </t>
      <t>
Any other SDP extensions required to support CLUE signaling should also be specified here.
Then we will need to take action within MMUSIC to make those happen.
This section should be empty and removed before this document becomes an RFC.
      </t>
      <t>
NOTE: The RTP mapping document 
<xref target="I-D.even-clue-rtp-mapping"/>
is also likely to call for SDP extensions.
We will have to reconcile how to coordinate these two documents.
      </t>
   </section>

   <section title="SIP Signaling" anchor="sec.sip_signaling">
      <t>
(Placeholder) This may be unremarkable. If so we can drop it.
      </t>
   </section>

   <section title="CLUE over RTCWEB">
      <t>
We may want to rule this out of scope for now. 
But we should be thinking about this.
      </t>
   </section>

   <section title="Open Issues">
      <t>

Here are issues pertinent to signaling that need resolution.
Resolution will probably result in changes somewhere in this document,
but may also impact other documents.


        <list style='symbols'>
         <t>
While the preference is to multiplex multiple capture encodings over a single RTP session,
this will not always be desirable or possible. The factors that prevent multiplexing
may come from either the provider or the consumer. So the extent of multiplexing
must be negotiated. The decision about how to multiplex affects the number and
grouping of m-lines in the SDP. The endpoint of a CLUE session that sends an offer
needs to know the mapping of capture encodings to m-lines for both sides. 
<vspace blankLines='1'/>
AFAIK this issue hasn't yet been considered at all.
         </t>
         <t>
The current method for expressing encodings in SDP limits the parameters available
when describing H264 encoder capabilities to those defined in Table 6 in <xref target="RFC6184" />
         </t>
        </list>
      </t>
   </section>

   <section title="What else?">
      <t>
      </t>
   </section>

   <section title="Acknowledgements">
      <t>
The team focusing on this draft consists of:
Roni Even,
Rob Hansen,
Christer Holmberg,
Paul Kyzivat,
Simon Pietro-Romano,
Roberta Presta.
      </t>
      <t>
Christian Groves has contributed detailed comments and suggestions.
      </t>
      <t>
The author list should be updated as people contribute substantial text to this document.
      </t>
   </section>

   <section title="IANA Considerations">
      <t>

TBD

      </t>
   </section>
   <section title="Security Considerations">
      <t>

TBD

      </t>
   </section>

   <section title="Change History">
      <t><list style='hanging'>
            <t hangText="-00:"> Revision by Rob Hansen
              <list style='symbols'>
               <t>
Submitted as -00 working group document
               </t>
              </list>
             </t>
            <t hangText="draft-kyzivat-08:"> Revisions by Rob Hansen
              <list style='symbols'>
               <t>
Added media feature tag for CLUE support ('sip.telepresence')
               </t>
               <t>
Changed grouping semantic from 'CLUE' to 'TELEPRESENCE'
               </t>
               <t>
Restructured document to be more centred on the grouping semantic and its use with O/A
               </t>
               <t>
Lots of additional text on usage of the grouping semantic
               </t>
               <t>
Stricter definition of CLUE-controlled m lines and how they work
               </t>
               <t>
Some additional text on defining what happens when CLUE supports is added or removed
               </t>
               <t>
Added details on when to not send RTCP for CLUE-controlled "m" lines.
               </t>
               <t>
Added a section on using BUNDLE with CLUE
               </t>
                <t>
Updated data channel references to point at new WG document rather than indivual draft
               </t>
              </list>
             </t>    
	     <t hangText="draft-kyzivat-07:"> Revisions by Rob Hansen
          <list style='symbols'>
           <t>
Removed the text providing arguments for encoding limits being in SDP and encoding groups in the CLUE protocol
in favor of the specifics of how to negotiate encodings in SDP
           </t>
           <t>
Added normative language on the setting up of a CLUE call, and added sections on mid-call changes to the
CLUE status.
           </t>
           <t>
Added references to <xref target="I-D.ietf-clue-datachannel" /> where appropriate.
           </t>
           <t>
Added some terminology for various types of CLUE and non-CLUE states of operation.
           </t>
		   <t>
Moved language related to topics that should be in <xref target="I-D.ietf-clue-datachannel" /> and
<xref target="I-D.presta-clue-protocol" />, but that has not yet been resolved in those documents, into
an appendix.
		   </t>
          </list>
         </t>  
         <t hangText="draft-kyzivat-06:"> Revisions by Rob Hansen
          <list style='symbols'>
           <t>
Removed CLUE message XML schema and details that are now in draft-presta-clue-protocol
           </t>
           <t>
Encoding limits in SDP section updated to note that this has been investigated and discussed
and is the current working assumption of the WG, though consensus has not been fully achieved.
           </t>
           <t>
A section has also been added on the current mandation of unidirectional "m"-lines.
           </t>
           <t>
Updated CLUE messaging in example call flow to match draft-presta-clue-protocol-03
           </t>
          </list>
         </t>         
         <t hangText="draft-kyzivat-05:"> Revisions by pkyzivat:
          <list style='symbols'>
            <t>
Specified versioning model and mechanism.
            </t>
            <t>
Added explicit response to all messages.
            </t>
            <t>
Rearranged text to work with the above changes. 
(Which rendered diff almost useless.)
            </t>
          </list>
         </t>
         <t hangText="draft-kyzivat-04:"> Revisions by Rob Hansen: ???</t>
         <t hangText="draft-kyzivat-03:"> Revisions by pkyzivat: 
          <list style='symbols'>
            <t>
Added a syntax section with an XML schema for CLUE messages.
This is a strawhorse, and is very incomplete, but it establishes
a template for doing this based on elements defined in the data model.
(Thanks to Roberta for help with this!)
            </t>
            <t>
Did some rewording to fit the syntax section in and reference it.
            </t>
            <t>
Did some relatively minor restructuring of the document to make
it flow better in a logical way.
            </t>
          </list>

         </t>
         <t hangText="draft-kyzivat-02:"> A bunch of revisions by pkyzivat: 
         <list style='symbols'>
            <t>
Moved roberta's call flows to a more appropriate place in the document.
            </t>
            <t>
New section on versioning.
            </t>
            <t>
New section on NAK.
            </t>
            <t>
A couple of possible alternatives for message acknowledgment.
            </t>
            <t>
Some discussion of when/how to signal changes in provider state.
            </t>
            <t>
Some discussion about the handling of transport errors.
            </t>
            <t>
Added a change history section.
            </t>
         </list>
These were developed by Lennard Xiao, Christian Groves and Paul, 
so added Lennard and Christian as authors.

         </t>
         <t hangText="draft-kyzivat-01:">

Updated by roberta to include some sample call flows.

         </t>
         <t hangText="draft-kyzivat-00:">

Initial version by pkyzivat. Established general outline for the document,
and specified a few things thought to represent wg consensus.

         </t>
      </list></t>
   </section>

</middle>

<back>
   <references title="Normative References">
      <?rfc include="reference.RFC.2119"?>
      <?rfc include="reference.I-D.ietf-clue-framework"?>
      <?rfc include="reference.I-D.presta-clue-data-model-schema"?>
      <?rfc include="reference.I-D.presta-clue-protocol"?>
	  <?rfc include="reference.I-D.ietf-clue-datachannel"?>
	  <?rfc include="reference.I-D.groves-clue-latent-config"?> 
      <?rfc include="reference.I-D.ietf-mmusic-sctp-sdp"?>
      <?rfc include="reference.I-D.tuexen-tsvwg-sctp-dtls-encaps"?>
      <?rfc include="reference.RFC.4574"?>
      <?rfc include="reference.RFC.5888"?>
   </references>
   <references title="Informative References">
      <?rfc include="reference.RFC.3264"?>
      <?rfc include="reference.RFC.3311"?>
      <?rfc include="reference.RFC.5245"?>
      <?rfc include="reference.RFC.4353"?>
      <?rfc include="reference.RFC.6120"?>
      <?rfc include="reference.RFC.6184"?>
      <?rfc include="reference.I-D.even-clue-sdp-clue-relation"?>
      <?rfc include="reference.I-D.even-clue-rtp-mapping"?>
      <?rfc include="reference.I-D.hansen-clue-sdp-interaction"?>
      <?rfc include="reference.I-D.ietf-mmusic-sdp-bundle-negotiation"?>
   </references>
   
   <section title="CLUE Signalling and data channel concerns" anchor="sec.transition">
      <t>
[The specifics of the CLUE signaling protocol are in the process of being defined in
<xref target="I-D.presta-clue-protocol"/>, while the negotiation of the CLUE data channel
is being defined in <xref target="I-D.ietf-clue-datachannel" />. As such, considerable text originally
in this section have been transitioned to these document. The following text relates to issues that
are no longer the focus of this document, but remain important and unresolved, and so have been preserved here.]
      </t>        

         <section title="Protocol Versioning and Options" anchor="sec.versioning">

            <section title="Versioning Objectives">
               <t>
The CLUE versioning mechanism addresses the following needs:
               </t>
               <t>
                  <list style='symbols'>
                     <t>Coverage:
                        <list style='symbols'>
                        <t>Versioning of basic behavior and options,</t>
                        <t>CLUE message exchange,</t>
                        <t>CLUE message exchange,</t>
                        <t>coordinated use of SIP and SDP,</t>
                        <t>required media behavior.</t>
                        </list>
                     </t>
                     <t>Remain fixed for the duration of the CLUE channel</t>
                     <t>Be extensible for configuration of new options.</t>
                     <t>
Be sufficient (with extensions) for all envisioned future versions.
                     </t>
                  </list>
               </t>
            </section>

            <section title="Versioning Overview">
               <t>
An initial message exchange on the CLUE channel handles the negotiation of version and options.
               </t>
               <t>
                  <list style='symbols'>
                     <t>
Dedicated message types are used for this negotiation.
                     </t>
                     <t>
The negotiation is repeated if the CLUE channel is reestablished.
                     </t>
                  </list>
               </t>
               <t>
The version usage is similar in philosophy to XMPP:
               </t>
               <t>
                  <list style='symbols'>
                     <t>
See <xref target="RFC6120"/> section 4.7.5.
                     </t>
                     <t>
A version has major and minor components. (Each a non-negative integer.)
                     </t>
                     <t>
Major version changes denote non-interoperable changes.
                     </t>
                     <t>
Minor version changes denote schema changes that are backward compatible by ignoring unknown XML elements, or other backward compatible changes.
                     </t>
                     <t>
If a common major version cannot be negotiated, then CLUE MUST NOT be used.
                     </t>
                     <t>
The same message exchange also negotiates options.
                     </t>
                     <t>
Each option is denoted by a unique XML element in the negotiation.

                     </t>
                  </list>
               </t>
               <t>
<xref target="fig.simpleOptNeg"/> shows the negotiation in simplified form:
               </t>
               <t>
               </t>
               <figure anchor="fig.simpleOptNeg" 
                       title="Basic Option Negotiation (simplified)"
                       align="center" >
                  <artwork align="center"><![CDATA[
| Supported       Supported |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| Required         Required |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
|  Advertise/Configure/...  |
|<------------------------->|
                  ]]></artwork>
               </figure>
               <t>
Dedicated message types are used for the negotiation because:
               </t>
               <t>
                  <list style='symbols'>
                     <t>
The protocol can then ensure that the negotiation is done first, and once. 
Not changing mid-session means an endpoint can plan ahead, and predict what may be used and what might be received.
                     </t>
                     <t>
This provides extensible framework for negotiating optional features.
                     </t>
                     <t>
A full option negotiation can be completed before other messages are exchanged.
                     </t>
                  </list>
               </t>
               <t>
<xref target="fig.simpleSupportedMsg"/> and  <xref target="fig.simpleRequiredMsg"/>
are simplified examples of the Supported and Required messages: 
               </t>

               <figure anchor="fig.simpleSupportedMsg" 
                       title="Supported Message (simplified)"
                       align="center" >
                  <artwork align="center"><![CDATA[
<supported>
	<version major=“1” minor=“0”>
	  <!– May repeat version if multiple
         major versions supported.      ->
   <!- Options follow ->
   <mediaProvider/>
	...
</supported>
                  ]]></artwork>
               </figure>
               <t>
               </t>

               <figure anchor="fig.simpleRequiredMsg" 
                       title="Required Message (simplified)"
                       align="center" >
                  <artwork align="center"><![CDATA[
<required>
	<version major=“1” minor=“0”>
	  <!– Requested options of peer follow ->
   <!- Options follow ->
   <mediaProvider/>
	...
</required>
                  ]]></artwork>
               </figure>
               <t>
               </t>

            </section>

            <section title="Version Negotiation">
               <t>
<![CDATA[ 
The Supported message includes one or more <version> elements, each denoting a major/minor version combination that the sender of the message is capable of supporting. 
]]>
               </t>
               <t>
<![CDATA[ 
The <version> element contains both a major and minor version. Each is a non-negative integer. Each <version> element in the message MUST contain a unique major version number, distinct from the major version number in all the other <version> elements in the message. The minor version in a <version> element denotes the largest minor version the sender supports for the corresponding major version. (Minor versions are always backwards compatible, so support for a minor version implies support for all smaller minor versions.)
]]>
               </t>
               <t>
Each endpoint of the CLUE channel sends a Supported message, and receives the Supported message sent by the other end. Then each end compares the versions sent and the versions received to determine the version to be used for this CLUE session. 
               </t>
               <t>
                  <list style='symbols'>
                     <t>
If there is no major version in common between the two ends, negotiation fails.
                     </t>
                     <t>
<![CDATA[ 
The <version> elements from the two ends that have the largest matching major version are selected.
]]>
                     </t>
                     <t>
After exchange each end determines compatible version numbers to be used for encoding and decoding messages, and other behavior in the CLUE session.

                        <list style='symbols'>
                           <t>
<![CDATA[ 
The <version> elements from the two ends that have the largest matching major version are selected.
]]>
                           </t>
                           <t>
The side that sent the smaller minor version chooses the one it sent.
                           </t>
                           <t>
The side that sent the larger minor version may choose the minor version it received, or the one it sent, or any value between those two. 
                           </t>
                        </list>
                     </t>
                     <t>
<![CDATA[ 
Each end then sends a Required message with a single <version> element containing the major and minor versions it has chosen.
]]>
<vspace blankLines='1'/>
[[Note: “required” is the wrong semantic for this. Might want a better message name.]]
                     </t>
                     <t>
Each end then behaves in accord with the specifications denoted by the version it chose. This continues until the end of the CLUE session, or until changed as a result of another version negotiation when the CLUE channel is reestablished.
<vspace blankLines='1'/>
[[Note: The version negotiation remains in effect even if the CLUE channel is lost.]]

                     </t>
                  </list>
               </t>
            </section>

            <section title="Option Negotiation">
               <t>
Option negotiation is used to agree upon which options will be available for use within the CLUE session. (It does not say that these options must be used.) This may be used for both standard and proprietary options. (As used here, and option could be either a feature described as part of this specification that is optional to implement, or a feature defined in a separate specification that extends this one.)
               </t>
               <t>
Each end includes, within the Supported message it sends, elements describing those options it is willing and able to use with this CLUE session.
               </t>
               <t>
Each side, upon receiving a Supported message, selects from that message those option elements that it wishes the peer to use. (If/when occasion for that use arises.) It then includes those selected elements into the Required message that it sends.
               </t>
               <t>
Within a received Supported message, unknown option elements MUST be ignored. This includes elements that are of a known type that is not known to denote an option.
               </t>
            </section>

            <section title="Option Elements">
               <t>
Each option is denoted, in the Supported and Required messages, by an XML element. There are no special rules for these elements – they can be any XML element. The attributes and body of the element may carry further information about the option. The same element type is used to denote the option in the Supported message and the corresponding Required message, but the attributes and body may differ according to  option-specific rules. This may be used to negotiate aspects of a particular option. The ordering of option elements is irrelevant within the Supported and Required messages, and need not be consistent in the two. 
               </t>
                <t>
<![CDATA[ 
Only one option element is defined in this document: <mediaProvider>.
]]>
               </t>
               <section title="<mediaProvider>">
                  <t>
<![CDATA[ 
The <mediaProvider> element, when placed in a Supported message, indicates that the sender is willing and able to send ADVERTISEMENT messages and receive CONFIGURE messages.
When placed in a Required message, the <mediaProvider> element indicates that the sender is willing, able, and desirous of receiving ADVERTISEMENT messages and sending CONFIGURE messages. If an endpoint does not receive <mediaProvider> in a Required message, it MUST NOT send ADVERTISEMENT messages. For common cases <mediaProvider> should be supported and required by both endpoints, to enable bidirectional exchange of media. If not required by either end, the CLUE session is useless. This is an error condition, and SHOULD result in termination of the CLUE channel.
]]>
                  </t>
                  <t>
<![CDATA[ 
The <mediaProvider> element has no defined attributes or body.
]]>
                  </t>
               </section>

            </section>

            <section title="Version & option negotiation errors">
               <t>
The following are errors that may be detected and reported during version negotiation:
               </t>
               <t>
                  <list style='symbols'>
                     <t>
Version incompatibility
<vspace blankLines='1'/>
There is no common value between the major version numbers sent in a Supported message and those in the received Supported message.
                 </t>
                     <t>
Option incompatibility
<vspace blankLines='1'/>
<![CDATA[ 
This can occur if options supported by one endpoint are inconsistent with those supported by the other endpoint. E.g., The <mediaProvider> option is not specified by either endpoint. Options SHOULD be specified so as to make it difficult for this problem to occur. 
]]>
<vspace blankLines='1'/>
<![CDATA[ 
This error may also be used to indicate that insufficient options have been required among the two ends for a useful session to result. This can occur with a feature that needs to be present on at least one end, but not on a specific end. E.g., The <mediaProvider> option was Supported by at least one of the endpoints, but it was not Required by either.
]]>
<vspace blankLines='1'/>
This may also be used to indicate that an option element in the Required message has attributes or body content that is syntactically correct, but in inconsistent with the rules for option negotiation specified for that particular element. The definition of each option must specify the negotiation rules for that option.
                     </t>
                     <t>
Unsupported option
<vspace blankLines='1'/>
An option element type received in a Required message did not appear in the corresponding Supported element.
<vspace blankLines='1'/>
(Unsupported options received in a Supported message do not trigger this error. They are ignored.)
                     </t>
                  </list>
               </t>
               <t>
These errors are reported using the normal message error reporting mechanism.
               </t>
               <t>
Other applicable error codes may also be returned in response to a Supported or Required message. 
               </t>
               <t>
Errors that occur at this stage result in negotiation failure. When this occurs, CLUE cannot be used until the end of the SIP session, or until a new CLUE channel is negotiated and a subsequent version negotiation succeeds. The SIP session may continue without CLUE features.
               </t>
            </section>

            <section title="Definition and Use of Version Numbers">
               <t>
[[NOTE: THIS IS AWKWARD. SUGGESTIONS FOR BETTER WAYS TO DEFINE THIS ARE WELCOME.]]
               </t>
               <t>
This document defines CLUE version 1.0 (major=1, minor=0). This denotes the normative behavior defined in this document and other documents upon which it normatively depends, including but is not limited to:
               </t>
               <t>
                  <list style='symbols'>
                     <t>
the schema defined in <xref target="I-D.presta-clue-protocol"/>;
                     </t>
                     <t>
the schema defined in [clue-data-model];
                     </t>
                     <t>
the protocol used to exchange CLUE messages;
                     </t>
                     <t>
the protocol defined herein that defines valid sequence of CLUE messages;
                     </t>
                     <t>
the specific rules defined herein for employing SIP, SDP, and RTP to realize the CLUE messages.
                     </t>
                  </list>
               </t>
                <t>
Given two CLUE versions Vx and Vy, then Vx is backward compatible with Vy if and only if:
               </t>
               <t>
                  <list style='symbols'>
                     <t>
All messages valid according to the schema of Vx are also valid according to the schemas of Vy
                     </t>
                     <t>
All messages valid according to the schema of Vy can be made valid according to the schemas of Vx by deleting elements undefined in the schemas of Vx. 
<vspace blankLines='1'/>
[[NOTE: THIS PROBABLY NEEDS WORK!]]
                     </t>
                     <t>
All normative behaviors defined for Vx are defined consistently for Vy. 
<vspace blankLines='1'/>
[[NOTE: SOME HAND WAVING HERE.]]
                     </t>
                  </list>
               </t>
               <t>
Revisions, updates,  to any of the documents denoted by Version 1.0 MAY result in the definition of a new CLUE version. If they do, then this document MUST be revised to define the new version.
               </t>
               <t>
The CLUE version to be defined in a revision to this document MUST be determined as follows:
               </t>
               <t>
                  <list style='symbols'>
                     <t>
If the revision and the document being revised are mutually backward compatible (they are functionally equivalent), then the CLUE version MUST remain unchanged.
                     </t>
                     <t>
Else if the revision is backward compatible with the document being revised, then the CLUE major version MUST remain unchanged, and the CLUE minor version MUST be increased by one (1).
                     </t>
                     <t>
Else the CLUE major version must be increased by one (1), and the CLUE minor version set to zero (0).
                     </t>
                  </list>
               </t>
               <t>
When a CLUE implementation sends a Supported message, it MUST include the CLUE versions it is willing and able to conform with.
               </t>
            </section>

            <section title="Version & Option Negotiation Examples">
               <t>
               </t>

               <section title="Successful Negotiation - Multi-version">
                  <t>
                  </t>
                  <figure align="center">
                     <artwork align="center"><![CDATA[
| Supported       Supported |
| Version 2.0               |
| Version 1.2   Version 1.1 |
| mediaProv       mediaProv |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| OK response   OK response |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| Required         Required |
| Version 1.2   Version 1.1 |
| mediaProv       mediaProv |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| OK response   OK response |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
|         Advertise         |
|<------------------------->|
|                           |
|         Configure         |
|<------------------------->|
                     ]]></artwork>
                  </figure>
                  <t>
The endpoint on the left can support versions 1.2 and 2.0, and
because of backward compatibility can support versions 1.0 and 1.1.
The endpoint on the right supports only version 2.0.
Both endpoints with to both provide and consume media. 
They each send a Supported message indicating what they support.
                  </t>
                  <t>
The element on the left, upon receiving the Supported message, 
determines that it is permitted to use version 1.2 or 1.1,
and decides to use 1.2. 
It sends a Required message containing version 1.2 and also includes
the mediaProvider option element, because it wants its peer to
provide media.
                  </t>
                  <t>
The element on the right, upon receiving the Supported message, 
selects version 1.1 because it is the highest version in common 
to the two sides. It sends a Required message containing version 1.1
because that is the highest version in common. It also includes
the mediaProvider option element, because it wants its peer to
provide media.
                  </t>
                  <t>
Upon receiving the Required messages, both endpoints determine
that they should send ADVERTISEMENTs.
                  </t>
                  <t>
ADVERTISEMENT and CONFIGURE messages will flow in both directions.
                  </t>

               </section>

               <section title="Successful Negotiation - Consumer-Only Endpoint">
                  <t>
                  </t>
                  <figure align="center">
                     <artwork align="center"><![CDATA[
| Supported       Supported |
| Version 1.0   Version 1.0 |
| mediaProv       (no opts) |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| OK response   OK response |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| Required         Required |
| Version 1.0   Version 1.0 |
| (no opts)       mediaProv |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| OK response   OK response |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
|  Advertise                |
|-------------------------->|
|                           |
|                 Configure |
|<--------------------------|
                     ]]></artwork>
                  </figure>
                  <t>
The endpoint on the right consumes media, but doesn't provide any 
so it doesn't include the mediaProvider option element in the Supported message it sends. 
                  </t>
                  <t>
The element on the left would like to include a mediaProvider option element
in the Requirements message it sends, but can't because it did not receive one in
the Supported message it received.
                  </t>
                  <t>
ADVERTISEMENT messages will only go from left to right, 
and CONFIGURE messages will only go from right to left.
                  </t>

               </section>

               <section title="Successful Negotiation - Provider-Only Endpoint">
                  <t>
                  </t>
                  <figure align="center">
                     <artwork align="center"><![CDATA[
| Supported       Supported |
| Version 1.0   Version 1.0 |
| mediaProv       mediaProv |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| OK response   OK response |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| Required         Required |
| Version 1.0   Version 1.0 |
| (no opts)       mediaProv |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| OK response   OK response |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
|  Advertise                |
|-------------------------->|
|                           |
|                 Configure |
|<--------------------------|
                     ]]></artwork>
                  </figure>
                  <t>
The endpoint on the left provides media but does not consume any
so it includes the mediaProvider option element in the Supported message it sends,
but does't include the mediaProvider option element in the Required message it sends.
                  </t>
                  <t>
ADVERTISEMENT messages will only go from left to right, 
and CONFIGURE messages will only go from right to left.
                  </t>

               </section>

               <section title="Version Incompatibility">
                  <t>
                  </t>
                  <figure align="center">
                     <artwork align="center"><![CDATA[
| Supported       Supported |
| Version 1.2   Version 2.1 |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| Version           Version |
| Incompat.       Incompat. |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
|     close clue channel    |
|<------------------------->|
|                           |
|     legacy mode or BYE    |
|<------------------------->|
                     ]]></artwork>
                  </figure>
                  <t>
Upon receiving the Supported message, each endpoint discovers there is
no major version in common, so CLUE usage is not possible. 
Each sends an error response indicating this and then ceases CLUE usage.
                  </t>

               </section>

               <section title="Option Incompatibility">
                  <t>
                  </t>
                  <figure align="center">
                     <artwork align="center"><![CDATA[
| Supported       Supported |
| Version 1.0   Version 1.0 |
| mediaProv       mediaProv |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| Required         Required |
| (no opts)       (no opts) |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| Option             Option |
| Incompat.       Incompat. |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
|     close clue channel    |
|<------------------------->|
|                           |
|     legacy mode or BYE    |
|<------------------------->|
                     ]]></artwork>
                  </figure>
                  <t>
Neither of the endpoints is willing to provide media. 
It makes no sense to continue CLUE operation in this situation.
Each endpoint realizes this upon receiving the Supported message,
sends an error response indicating this and then ceases CLUE usage.
                  </t>

               </section>

               <section title="Syntax Error">
                  <t>
                  </t>
                  <figure align="center">
                     <artwork align="center"><![CDATA[
| Supported          !@#$%^ |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
| syntax error  OK response |
|------------\ /------------|
|             X             |
|<-----------/ \----------->|
|                           |
|     close clue channel    |
|-------------------------->|
|                           |
|     legacy mode or BYE    |
|<------------------------->|
                     ]]></artwork>
                  </figure>
                  <t>
                  </t>

               </section>

            </section>

         </section>

         <section title="Message Transport" anchor="sec.transport">
            <t>

CLUE messages are transported over a bidirectional CLUE channel.  
In a two-party CLUE session, a CLUE channel connects the two endpoints.
In a CLUE conference, each endpoint has a CLUE channel connecting it to an MCU. 
(In conferences with cascaded mixers <xref target="RFC4353"/>, 
two MCUs will be connected by a CLUE channel.)

            </t>

            <section title="CLUE Channel Lifetime">
               <t>
The transport mechanism used for CLUE messages is DTLS/SCTP as specified in 
<xref target="I-D.tuexen-tsvwg-sctp-dtls-encaps"/> and 
<xref target="I-D.ietf-mmusic-sctp-sdp"/>. 
A CLUE channel consists of one SCTP stream in each direction over a DTLS/SCTP session.
The mechanism for establishing the DTLS/SCTP session is described in
<xref target="I-D.ietf-clue-datachannel"/>.
               </t>
               <t>
The CLUE channel will usually be offered during the initial SIP INVITE,
and remain connected for the duration of the CLUE/SIP session.
However this need not be the case. The CLUE channel may be established
mid-session after desire and capability for CLUE have been determined,
and the CLUE channel may be dropped mid-call if the desire and/or capability
to support it is lost.
               </t>
               <t>
There may be cases when it becomes necessary to "reset" the CLUE channel.
This by be as a result of an error on the underlying SCTP association,
a need to change the endpoint address of the SCTP association, 
loss of CLUE protocol state, or something else TBD.
               </t>
               <t>
The precise mechanisms used to determine when a reset is required,
and how to accomplish it and return to a well defined state are TBS.
               </t>
            </section>

            <section title="Channel Error Handling" anchor="sec.channel-errors">
               <t>
We will need to specify behavior in the face of transport errors that are so
severe that they can't be managed via CLUE messaging within the CLUE channel.
Some errors of this sort are:
               <list style='symbols'>
                  <t>
Unable to establish the SCTP association after signaling it in SDP.
                  </t>
                  <t>
CLUE channel setup rejected by peer.
                  </t>
                  <t>
Error reported by transport while writing message to CLUE channel. 
                  </t>
                  <t>
Error reported by transport while reading message from CLUE channel.
                  </t>
                  <t>
Timeout - overdue acknowledgement of a CLUE message. 
(Requirements for now soon a message must be responded to are TBD.)
                  </t>
                  <t>
Application fault. CLUE protocol state lost.
                  </t>
               </list>
The worst case is to drop the entire CLUE call.
Another possibility is to fall back to legacy compatibility mode.
Or perhaps a "reset" can be done on the protocol. E.g. this might be
accomplished by sending a new O/A and establishing a replacement SCTP association.
Or a new CLUE channel might be established within the existing SCTP association.
               </t>
            </section>

         </section>

         <section title="Message Framing">
            <t>
Message framing is provided by the SCTP transport protocol.
Each CLUE message is carried in one SCTP message.
            </t>
         </section>

   </section>
</back>
</rfc>
PAFTECH AB 2003-2026
2026-04-24 03:02:03