One document matched: draft-ietf-alto-deployments-13.xml


<?xml version="1.0" encoding="US-ASCII"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<?xml-stylesheet type='text/xsl' href='rfc2629.xslt' ?>
<?rfc toc="yes" ?>
<?rfc symrefs="yes" ?>
<?rfc sortrefs="yes"?>
<?rfc iprnotified="no" ?>
<?rfc strict="no" ?>
<?rfc compact="no" ?>
<?rfc subcompact="no" ?>
<rfc category="info" docName="draft-ietf-alto-deployments-13"
     ipr="trust200902">

  <front>

    <title abbrev="Deployment Considerations">ALTO Deployment
    Considerations</title>

    <!--<author fullname="Martin Stiemerling" initials="M." surname="Stiemerling" role="editor">-->
    <author fullname="Martin Stiemerling" initials="M." surname="Stiemerling">
      <organization abbrev="NEC Europe Ltd.">NEC Laboratories
      Europe</organization>

      <address>
        <postal>
          <street>Kurfuerstenanlage 36</street>

          <code>69115</code>

          <city>Heidelberg</city>

          <country>Germany</country>
        </postal>

        <phone>+49 6221 4342 113</phone>

        <facsimile>+49 6221 4342 155</facsimile>

        <email>martin.stiemerling@neclab.eu</email>

        <uri>http://ietf.stiemerling.org</uri>
      </address>
    </author>

        <author fullname="Sebastian Kiesel" initials="S." surname="Kiesel">
            <organization abbrev="University of Stuttgart">
                University of Stuttgart Information Center
            </organization>
            <address>
                <postal>
                    <street>
                        Networks and Communication Systems Department
                    </street>
                    <street>Allmandring 30</street>
                    <city>Stuttgart</city>
                    <code>70550</code>
                    <country>Germany</country>
                </postal>
                <email>ietf-alto@skiesel.de</email>
                <uri>http://www.rus.uni-stuttgart.de/nks/</uri>
            </address>
        </author>

    <author fullname="Michael Scharf" initials="M." surname="Scharf">
      <organization abbrev="Alcatel-Lucent">Alcatel-Lucent</organization>

      <address>
        <postal>
          <street>Lorenzstrasse 10</street>

          <code>70435</code>

          <city>Stuttgart</city>

          <country>Germany</country>
        </postal>

        <email>michael.scharf@alcatel-lucent.com</email>

      </address>
    </author>

    <author fullname="Hans Seidel" initials="H." surname="Seidel">
      <organization abbrev="BENOCS">BENOCS GmbH</organization>

      <address>
    
        <email>hseidel@benocs.com</email>

      </address>
    </author>
 
    <author fullname="Stefano Previdi" initials="S." surname="Previdi">
      <organization abbrev="Cisco">Cisco Systems, Inc.</organization>

      <address>
        <postal>
          <street>Via Del Serafico 200</street>

          <code>00191</code>

          <city>Rome</city>

          <country>Italy</country>
        </postal>

        <email>sprevidi@cisco.com</email>

      </address>
    </author>
	
    <date year="2016"/>

    <area>APP</area>

    <workgroup>ALTO</workgroup>

    <keyword>ALTO</keyword>

    <keyword>ALTO Deployment Considerations</keyword>

    <abstract>
      <t>Many Internet applications are used to access resources such
      as pieces of information or server processes that are available
      in several equivalent replicas on different hosts. This
      includes, but is not limited to, peer-to-peer file sharing
      applications. The goal of Application-Layer Traffic Optimization
      (ALTO) is to provide guidance to applications that have to
      select one or several hosts from a set of candidates, which are
      able to provide a desired resource. This memo discusses
      deployment related issues of ALTO. It addresses different use
      cases of ALTO such as peer-to-peer file sharing and CDNs and
      presents corresponding examples. The document also includes
      recommendations for network administrators and application
      designers planning to deploy ALTO, such recommendations how
	  to generate ALTO map information.</t>
    </abstract>

  </front>

  <middle>

    <section title="Introduction">

      <t>Many Internet applications are used to access resources such
      as pieces of information or server processes that are
      available in several equivalent replicas on different
      hosts. This includes, but is not limited to, peer-to-peer (P2P)
      file sharing applications and Content Delivery Networks
      (CDNs). The goal of Application-Layer Traffic Optimization
      (ALTO) is to provide guidance to applications that have to
      select one or several hosts from a set of candidates, which are
      able to provide a desired resource. The basic ideas and problem
      space of ALTO is described in <xref target="RFC5693"></xref> and
      the set of requirements is discussed in <xref
      target="RFC6708"></xref>. The ALTO protocol is specified in
      <xref target="RFC7285"></xref>. An ALTO server discovery 
      procedure is defined in <xref target="RFC7286"></xref>.</t>

      <t>This document discusses use cases and operational issues that
      can be expected when ALTO gets deployed. This includes, but is
      not limited to, location of the ALTO server, imposed load to the
      ALTO server, and from whom the queries are performed. This document
	  provides guidance which ALTO services to use, and
      it summarizes known challenges as well as deployment experiences,
	  including potential processes to generate ALTO network and cost
      maps. It thereby complements the
      management considerations in the protocol specification <xref
      target="RFC7285"></xref>, which are independent
      of any specific use of ALTO.</t>

    </section>

    <section title="General Considerations">

      <section title="ALTO Entities">

	<section anchor="sec.general_deployment" title="Baseline Scenario">

	  <t>The ALTO protocol <xref
	  target="RFC7285"></xref> is a client/server
	  protocol, operating between a number of ALTO clients and an ALTO
	  server, as sketched in <xref target="fig.overview"></xref>.</t>

	  <t><figure anchor="fig.overview"
		     title="Baseline deployment scenario of the ALTO protocol">
          <artwork><![CDATA[
              +----------+
              |  ALTO    |
              |  Server  |
              +----------+
                    ^
             _.-----|------.
         ,-''       |       `--.
       ,'           |           `.
      (     Network |             )
       `.           |           ,'
         `--.       |       _.-'
             `------|-----''
                    v
 +----------+  +----------+   +----------+
 |  ALTO    |  |  ALTO    |...|  ALTO    |
 |  Client  |  |  Client  |   |  Client  |
 +----------+  +----------+   +----------+
]]></artwork>
	  </figure></t>

	  <t>This document uses the terminology introduced in <xref
	  target="RFC5693"></xref>. In particular, the following terms
	  are defined by <xref target="RFC5693"></xref>:</t>

	  <t><list style="symbols">

	    <t>ALTO Service: Several resource providers may be able to
	    provide the same resource. The ALTO service gives
	    guidance to a resource consumer and/or resource directory
	    about which resource provider(s) to select in order to
	    optimize the client's performance or quality of
	    experience, while improving resource consumption in the
	    underlying network infrastructure.</t>

	    <t>ALTO Server: A logical entity that provides interfaces
	    to the queries to the ALTO service.</t>

	    <t>ALTO Client: The logical entity that sends ALTO
	    queries.  Depending on the architecture of the
	    application, one may embed it in the resource consumer
	    and/or in the resource directory.</t>

	  </list></t>

	  <t>According to that definition, both an ALTO server and an
	  ALTO client are logical entities. An ALTO service may be
	  offered by more than one ALTO servers. In ALTO deployments,
	  the functionality of an ALTO server can therefore be
	  realized by several server instances, e.g., by using load
	  balancing between different physical servers. The term ALTO
	  server should not be confused with use of a single physical
	  server.</t>

	</section>

	<section anchor="sec.general_overview" title="Placement of ALTO Entities">

	  <t>The ALTO server and ALTO clients can be situated at
	  various entities in a network deployment. The first
	  differentiation is whether the ALTO client is located on the
	  actual host that runs the application, as shown in <xref
	  target="fig.tracker_less"></xref>, or if the ALTO client is
	  located on a resource directory, as shown in <xref
	  target="fig.tracker"></xref>.</t>

	  <t><figure anchor="fig.tracker_less"
		     title="Overview of protocol interaction between ALTO elements without a resource directory">
            <artwork><![CDATA[
                                           +--------------+
                                           |     App      |
                                           +-----------+  |
                                       ===>|ALTO Client|  |****
                                    ===    +-----------+--+   *
                                 ===                    *     *
                              ===                       *     *
   +-------+     +-------+<===             +--------------+   *
   |       |     |       |                 |     App      |   *
   |       |.....|       |<========        +-----------+  |   *
   |       |     |       |        ========>|ALTO Client|  |   *
   +-------+     +-------+<===             +-----------+--+   *
   Source of       ALTO       ==                        *     *
   topological    Server        ==                      *     *
   information                    ==       +--------------+   *
                                    ==     |     App      |   *
                                      ==   +-----------+  |****
                                        ==>|ALTO Client|  |
                                           +-----------+--+
                                             Application
   Legend:
   === ALTO protocol
   *** Application protocol
   ... Provisioning protocol
]]></artwork>
          </figure></t>

	  <t><xref target="fig.tracker_less"></xref> shows the
	  operational model for an ALTO client running at
	  endpoints. An example would be a peer-to-peer file sharing
	  application that does not use a tracker, such as edonkey. In
	  addition, ALTO clients at peers could also be used in a
	  similar way even if there is a tracker, as further discussed
	  in <xref target="sec.p2p_tracker_cons"></xref>.</t>

	  <t><figure anchor="fig.tracker"
		     title="Overview of protocol interaction between ALTO elements with a resource directory">
            <artwork><![CDATA[
                                                    +-----+
                                                  **| App |****
                                                **  +-----+   *
                                              **       *      *
                                            **         *      *
   +-------+     +-------+     +--------------+        *      *
   |       |     |       |     |              |     +-----+   *
   |       |.....|       |     +-----------+  |*****| App |   *
   |       |     |       |<===>|ALTO Client|  |     +-----+   *
   +-------+     +-------+     +-----------+--+        *      *
   Source of       ALTO          Resource   **         *      *
   topological    Server         directory    **       *      *
   information                                  **  +-----+   *
                                                  **| App |****
                                                    +-----+
                                                  Application
   Legend:
   === ALTO protocol
   *** Application protocol
   ... Provisioning protocol
]]></artwork>
          </figure></t>

	  <t>In <xref target="fig.tracker"></xref>, a use case with a
	  resource directory is illustrated, e.g., a tracker in
	  peer-to-peer file-sharing. Both deployment scenarios may
	  differ in the number of ALTO clients that access an ALTO
	  service: If an ALTO client is implemented in a resource
	  directory, an ALTO server may be accessed by a limited and less
	  dynamic set of clients, whereas in the general case any host
	  could be an ALTO client. This use case is further detailed in
	  <xref target="sec.p2p_cons"></xref>.</t>

	  <t>Using ALTO in CDNs may be similar to a resource directory
	  <xref target="I-D.jenkins-alto-cdn-use-cases"></xref>. The
	  ALTO server can also be queried by CDN entities to get
	  guidance about where a particular client accessing data
	  in the CDN is exactly located in the Internet Service
	  Provider's network, as discussed in <xref
	  target="sec.cdn_cons"></xref>.</t>

	</section>

      </section>

      <section anchor="sec.alto_apps" title="Classification of Deployment Scenarios">

	<section anchor="sec.alto_classification" title="Roles in ALTO Deployments">

	  <t>ALTO is a general-purpose protocol and it is intended to
	  be used by a wide range of applications. This implies that
	  there are different possibilities where the ALTO entities
	  are actually located, i.e., if the ALTO clients and the ALTO
	  server are in the same Internet Service Provider (ISP)
	  domain, or if the clients and the ALTO server are
	  managed/owned/located in different domains.</t>

          <t>An ALTO deployment involves four kinds of entities:</t>

          <t><list style="numbers">

	    <t>Source of topological information</t>

	    <t>ALTO server</t>

	    <t>ALTO client</t>

	    <t>Resource consumer (using the ALTO guidance)</t>

	  </list></t>

          <t>Each of these entities corresponds to a certain role, which
	  results in requirements and constraints on the interaction
	  between the entities.</t>
	  
          <t>A key design objective of the ALTO service is that each
          these four roles can be separated, i.e., they can be
          realized by different organizations or disjoint system
          components. ALTO is inherently designed for use in
          multi-domain environments. Most importantly, ALTO is
          designed to enable deployments in which the ALTO server and
          the ALTO client are not located within the same
          administrative domain.</t>

	  <t>As explained in <xref target="RFC5693"></xref>, from
	  this follows that at least three different kinds of entities
	  can operate an ALTO server:</t>

          <t><list style="numbers">

            <t>Network operators. Network Service Providers (NSPs)
            such as Internet Service Providers (ISPs) may have
            detailed knowledge of their network topology and
            policies. In this case, the source of the topology
            information and the provider of the ALTO server may be
            part of the same organization.</t>

            <t>Third parties. Topology information could also be
            collected by entities separate from network operators but
            that may either have collected network information or have
            arrangements with network operators to learn the network
            information. Examples of such entities could be Content
            Delivery Network (CDN) operators or companies specialized
            on offering ALTO services on behalf of ISPs. </t>

            <t>User communities. User communities could run
            distributed measurements for estimating the topology of
            the Internet. In this case the topology information may
            not originate from ISP data.</t>

          </list></t>

	  <t>Regarding the interaction between ALTO server and client,
	  ALTO deployments can be differentiated according to the
	  following aspects:</t>

	  <t><list style="numbers">

	    <t>Applicable trust model: The deployment of ALTO can
	    differ depending on whether ALTO client and ALTO server
	    are operated within the same organization and/or network,
	    or not. This affects a lot of constraints, because the
	    trust model is very different. For instance, as discussed
	    later in this memo, the level-of-detail of maps can depend
	    on who the involved parties actually are.</t>

	    <t>Composition of the user group: The main use case of ALTO is to
	    provide guidance to any Internet application. However, an
	    operator of an ALTO server could also decide to offer
	    guidance only to a set of well-known ALTO clients, e. g., after
	    authentication and authorization. In the peer-to-peer
	    application use case, this could imply that only selected
	    trackers are allowed to access the ALTO server. The
	    security implications of using ALTO in closed groups
	    differ from the public Internet.</t>

	    <t>Covered destinations: In general, an ALTO server has to
	    be able to provide guidance for all potential
	    destinations. Yet, in practice a given ALTO client may
	    only be interested in a subset of destinations, e.g.,
	    only in the network cost between a limited set of resource
	    providers. For instance, CDN optimization may not need the
	    full ALTO cost maps, because traffic between individual
	    residential users is not in scope. This may imply that an
	    ALTO server only has to provide the costs that matter for
	    a given user, e. g., by customized maps.</t>

          </list></t>

	  <t>The following sections enumerate different classes of use
	  cases for ALTO, and they discuss deployment implications of
	  each of them. An ALTO server can in principle be operated by
	  any organization, and there is no requirement that an ALTO
	  server is deployed and operated by an ISP. Yet, since the ALTO
	  solution is designed for ISPs, most examples in this
	  document assume that the operator of an ALTO server is a
	  network operator (e.g., an ISP or the network department in
	  a large enterprise) that offers ALTO guidance in particular
	  to users of this network.</t>

	  <t>It must be emphasized that any application using ALTO
	  must also work if no ALTO servers can be found or if no
	  responses to ALTO queries are received, e.g., due to
	  connectivity problems or overload situations (see also <xref
	  target="RFC6708"></xref>).</t>

	</section>

	<section title="Information Exposure">

	  <t>There are basically two different approaches how an ALTO
	  server can provide network information and guidance:</t>

	  <t><list style="numbers">

            <t>The ALTO server provides maps that contain
            provider-defined cost values between network location
            groupings (e.g., sets of IP prefixes). These maps can be
            retrieved by clients via the ALTO protocol, and the actual
            processing of the map data is done inside the
            client. Since the maps contain (aggregated) cost
            information for all endpoints, the client does not have to
            reveal any internal operational data, such as the IP
            addresses of candidate resource providers. The ALTO protocol
            supports this mode of operation by the Network and Cost
            Map Service.</t>

            <t>The ALTO server provides a query interface that returns
            costs or rankings for explicitly specified endpoints. This
            means that the query of the ALTO client has to include
            additional information (e.g., a list of IP addresses). The
            server then calculates and returns costs or rankings for
            the endpoints specified in the request (e.g., a sorted
            list of the IP addresses). In ALTO, this method can be
            realized by the Endpoint Cost Service.</t>

          </list></t>

	  <t>Both approaches have different privacy implications for
	  the server and client:</t>

	  <t>For the client, approach 1 has the advantage that all
	  operational information stays within the client and is not
	  revealed to the provider of the server. However, this
	  service implies that a network operator providing an ALTO
	  server has to expose a certain amount of information about
	  its network structure (e.g., IP prefixes or topology
	  information in general).</t>

	  <t>For the operator of a server, approach 2 has the
	  advantage that the query responses reveal less topology
	  information to ALTO clients. But this method requires that
	  client sends internal operational information to the server,
	  such as the IP addresses of hosts also running the
	  application. For clients, such data can be sensitive.</t>

	  <t>As a result, both approaches have their pros and cons, as
	  further detailed in <xref target="risks"></xref>.</t>

	</section>

	<section title="More Advanced Deployments">

	  <t>From an ALTO client's perspective, there are different
	  ways to use ALTO:</t>

	  <t><list style="numbers">

	    <t>Single service instance with single metric guidance: An
	    ALTO client only obtains guidance regarding a single
	    metric from a single ALTO service, e.g., an ALTO server
	    that is offered by the network service provider of the
	    corresponding access network. Corresponding ALTO server
	    instances can be discovered e.g. by ALTO server discovery
	    <xref target="RFC7286"></xref>
	    <xref target="I-D.kiesel-alto-xdom-disc"></xref>. Being a
	    REST-ful protocol, an ALTO service can use known methods
	    to balance the load between different server instances or
	    between clusters of servers, i.e., an ALTO server can be
	    realized by many instances with a load balancing
	    scheme. The ALTO protocol also supports the use of
	    different URIs for different ALTO features.</t>

	    <t>Single service instance with multiple metric guidance: An
	    ALTO client could also query an ALTO service for different
	    kinds of information, e.g., cost maps with different
	    metrics. The ALTO protocol is extensible and permits such
	    operation. However, ALTO does not define how a client
	    shall deal with different forms of guidance, and it is up
	    to the client to determine what provided information may
	    indeed be useful.</t>

	    <t>Multiple service instances: An ALTO client can also decide
	    to access multiple ALTO servers providing guidance,
	    possibly from different operators or organizations. Each
	    of these services may only offer partial guidance, e.g.,
	    for a certain network partition. In that case, it may be
	    difficult for an ALTO client to compare the guidance from
	    different services. Different organization may use
	    different methods to determine maps, and they may also
	    have different (possibly even contradicting or competing)
	    guidance objectives. How to discover multiple ALTO servers
	    and how to deal with conflicting guidance is an open
	    issue.</t>

          </list></t>

	  <t>There are also different options regarding the
	  synchronization of guidance offered by an ALTO service:</t>

	  <t><list style="numbers">

	    <t>Authoritative servers: An ALTO server instance can provide
	    guidance for all destinations for all kinds of ALTO
	    clients.</t>

	    <t>Cascaded servers: An ALTO server may itself include an
	    ALTO client and query other ALTO servers, e.g., for
	    certain destinations. This results is a cascaded
	    deployment of ALTO servers, as further explained
	    below.</t>

	    <t>Inter-server synchronization: Different ALTO servers
	    my communicate by other means. This approach is not further
	    discussed in this document.</t>

          </list></t>

	  <!--<section anchor="advanced" title="Cascading ALTO Servers">-->

	  <t>An assumption of the ALTO design is that ISP operate ALTO
	  servers independently, irrespectively of other ISPs. This
	  may true for most envisioned deployments of ALTO but there
	  may be certain deployments that may have different
	  settings. <xref target="fig.alto-proxy"></xref> shows such
	  setting with a university network that is connected to two
	  upstream providers. NREN is a National Research and
	  Education Network, which provides cheap high-speed connectivity
      to specific destinations, e.g., other universities.
      ISP is a commercial upstream provider from which the university
      buys connectivity to all destinations that cannot be reached
      via the NREN.
      The university, as well as ISP,
	  are operating their own ALTO server. The ALTO clients,
	  located on the peers will contact the ALTO server located at
	  the university.</t>

	  <t><figure anchor="fig.alto-proxy" title="Example of a cascaded ALTO server">
	    <artwork><![CDATA[
       +-----------+
       |    ISP    |
       |   ALTO    |<==========================++
       |  Server   |                           ||
       +-----------+                           ||
         ,-------.            ,------.         ||
      ,-'         `-.      ,-'         `-.     ||
     /   Commercial  \    /               \    ||
    (    Upstream     )  (       NREN      )   ||
     \   ISP         /    \               /    ||
      `-.         ,-'      `-.         ,-'     ||
         `---+---'            `+------'        ||
             |                 |               ||
             |                 |               ||
             |,-------------.  |               \/
           ,-+               `-+          +-----------+
         ,'      University     `.        |University |
        (        Network          )       |   ALTO    |
         `.                      /        |  Server   |
           `-.               +--'         +-----------+
              `+------------'|              /\     /\
               |             |              ||     ||
      +--------+-+         +-+--------+     ||     ||
      |   Peer1  |         |   PeerN  |<====++     ||
      +----------+         +----------+            ||
           /\                                      ||
           ||                                      ||
           ++======================================++

   Legend:
   === ALTO protocol
]]></artwork>
	  </figure></t>

	  <t>In this setting all "destinations" useful for the peers
	  within NREN are free-of-charge for the peers located in the
	  university network (i.e., they are preferred in the rating
	  of the ALTO server). However, all traffic that is not
	  towards NREN will be handled by the ISP upstream
	  provider. Therefore, the ALTO server at the university may
	  also include the guidance given by the ISP ALTO server
	  in its replies to the ALTO clients. This is an example for
	  cascaded ALTO servers.</t>

	  <!--</section>-->

	</section>

      </section>

    </section>

    <section anchor="sec.ISP_deployment_req_general" title="Deployment Considerations by ISPs">

      <section anchor="sec.guidance" title="Objectives for the Guidance to Applications">

	<!--<section anchor="sec.ISP_deployment" title="Motivation for Traffic Optimization">-->

	<section anchor="sec.ISP_deployment_req" title="General Objectives for Traffic Optimization">

	  <t>The Internet consists of many networks. The networks are
	  operated by Network Service Providers (NSP) or Internet
	  Service Providers (ISP), which also
	  include e.g. universities, enterprises, or other
	  organizations. The Internet provides network connectivity,
	  e.g., by access networks, such as cable networks, xDSL
	  networks, 3G/4G mobile networks, etc. Network operators need
	  to manage, to control and to audit the traffic. Therefore,
	  it is important to understand how to deploy an ALTO service
	  and its expected impact.</t>

	  <t>The general objective of ALTO is to give guidance to
	  applications on what endpoints (e.g., IP addresses or IP
	  prefixes) are to be preferred according to the operator of
	  the ALTO server. The ALTO protocol gives means to let the
	  ALTO server operator express its preference, whatever this
	  preference is.</t>

	  <t>ALTO enables ISPs to support application-level traffic
	  engineering by influencing application resource provider
	  selection. This traffic engineering can have different
	  objectives:</t>

	  <t><list style='numbers'>

	    <t>Inter-network traffic localization: ALTO can help to
	    reduce inter-domain traffic. The networks of ISPs are
	    interconnected through peering points.  From a business view,
	    the inter-network settlement is needed for exchanging
	    traffic between these networks. These peering agreements
	    can be costly. To reduce these costs, a simple objective
	    is to decrease the traffic exchange across the peering
	    points and thus keep the traffic in the own network or
	    Autonomous System (AS) as far as possible.</t>

	    <t>Intra-network traffic localization: In case of large
	    ISPs, the network may be grouped into several networks,
	    domains, or Autonomous Systems (ASs). The core network
	    includes one or several backbone networks, which are
	    connected to multiple aggregation, metro, and access
	    networks. If traffic can be limited to certain areas such
	    as access networks, this decreases the usage of backbone
	    and thus helps to save resources and costs.</t>

	    <t>Network off-loading: Compared to fixed networks, mobile
	    networks have some special characteristics, including
	    smaller link bandwidth, high cost, limited radio frequency
	    resource, and limited terminal battery. In mobile
	    networks, wireless links should be used efficiently. For
	    example, in the case of a P2P service, it is likely that
	    hosts should prefer retrieving data from hosts in fixed
	    networks, and avoid retrieving data from mobile hosts.</t>

	    <t>Application tuning: ALTO is also a tool to optimize the
	    performance of applications that depend on the network and
	    perform resource provider selection decisions among network
	    endpoints. And example is the network-aware selection of
	    Content Delivery Network (CDN) caches.</t>
	  
	  </list></t>

	  <t>In the following, these objectives are explained in more
	  detail with examples.</t>

	</section>

	<section title="Inter-Network Traffic Localization">
	<!--<section title="Keeping Traffic Local in a Network">-->

          <t>ALTO guidance can be used to keep traffic local in a
          network, for instance in order to reduce peering costs. 
          An ALTO server can let applications prefer other
          hosts within the same network operator's network instead of
          randomly connecting to other hosts that are located in
          another operator's network. Here, a network operator would
          always express its preference for hosts in its own network,
          while hosts located outside its own network are to be
          avoided (i.e., they are undesired to be considered by the
          applications). <xref target="fig.network_local"></xref>
          shows such a scenario where hosts prefer hosts in the same
          network (e.g., Host 1 and Host 2 in ISP1 and Host 3 and Host
          4 in ISP2). </t>

          <t><figure anchor="fig.network_local"
		     title="Inter-network traffic localization">
              <artwork><![CDATA[
                         ,-------.         +-----------+
       ,---.          ,-'         `-.      |   Host 1  |
    ,-'     `-.      /     ISP 1   ########|ALTO Client|
   /           \    /              #  \    +-----------+
  /    ISP X    \   |              #  |    +-----------+
 /               \  \              ########|   Host 2  |
;             +----------------------------|ALTO Client|
|             |   |   `-.         ,-'      +-----------+
|             |   |      `-------'                      
|     Inter-  |   |      ,-------.         +-----------+
:     network |   ;   ,-'         `########|   Host 3  |
 \    traffic |  /   /     ISP 2   # \     |ALTO Client|
  \           | /   /              #  \    +-----------+
   \          |/    |              #  |    +-----------+
    `-.     ,-|     \              ########|   Host 4  |
       `---'  +----------------------------|ALTO Client| 
                      `-.         ,-'      +-----------+ 
                         `-------'                       

    Legend:
    ### preferred "connections"
    --- non-preferred "connections"
]]></artwork>
	  </figure></t>

	  <t>Examples for corresponding ALTO maps can be found in
	  <xref target="sec.ISP_deployment2"></xref>. Depending on the
	  application characteristics, it may not be possible or even
	  not be desirable to completely localize all traffic.</t>

        </section>

        <section title="Intra-Network Traffic Localization">
        <!--<section title="Objective: Intra-Network Localization/Bottleneck Off-Loading">-->

          <t>The previous section describes the results of the ALTO
          guidance on an inter-network level. In the same way, ALTO can also
          be used for intra-network localization. In this case, ALTO
          provides guidance which internal hosts are to be preferred
          inside a single network (e.g., one AS). This application-level 
          traffic engineering can reduce
	  the capacity requirements in the core network of an ISP. <xref
          target="fig.intra_network_local"></xref> shows such a
          scenario where Host 1 and Host 2 are located in an access net 1 of
          ISP 1 and connect via a low capacity link to the core
          of the same ISP 1. If Host 1 and Host 2 exchange their data
          with remote hosts, they would probably congest the
          bottleneck link.</t>

          <t><figure anchor="fig.intra_network_local"
		     title="Intra-network traffic localization">
              <artwork><![CDATA[
           Bottleneck    ,-------.         +-----------+
       ,---.     |    ,-'         `-.      |   Host 1  |
    ,-'     `-.  |   /     ISP 1   ########|ALTO Client|
   /           \ |  /    (Access   #  \    +-----------+
  /    ISP 1    \|  |     net 1)   #  |    +-----------+
 /   (Core       V  \              ########|   Host 2  |
;    network) +--X~~~X---------------------|ALTO Client|
|             |   |   `-.         ,-'      +-----------+
|             |   |      `-------'                      
|             |   |      ,-------.         +-----------+
:             |   ;   ,-'         `########|   Host 3  |
 \            |  /   /     ISP 1   # \     |ALTO Client|
  \           | /   /     (Access  #  \    +-----------+
   \          |/    |      net 2)  #  |    +-----------+
    `-.     ,-X     \              ########|   Host 4  |
       `---'  ~~~~~~~X---------------------|ALTO Client| 
                ^     `-.         ,-'      +-----------+ 
                |        `-------'                       
             Bottleneck
    Legend:
    ### preferred "connections"
    --- non-preferred "connections"
]]></artwork>
	  </figure></t>

          <t>The operator can guide the hosts in such a situation to try first
          local hosts in the same network islands, avoiding or at least
          lowering the effect on the bottleneck link, as shown in <xref
          target="fig.intra_network_local"></xref>.</t>

          <t>The objective is to avoid bottlenecks by optimized
          endpoint selection at application level. ALTO is not a
          method to deal with the congestion at the bottleneck.</t>

	</section>

        <section title="Network Off-Loading">
        <!--<section title="Objective: Off-Loading Traffic from Network"> -->

          <t>Another scenario is off-loading traffic from
          networks. This use of ALTO can be beneficial in particular
          in mobile networks. A network operator may have
          the desire to guide hosts in its own mobile network to use hosts outside this
          mobile network. One reason can be that the wireless network
          is not made for the load cause by, e.g., peer-to-peer
          applications, and it therefore makes sense when peers fetch
          their data from remote peers in other parts of the
          Internet.</t>

          <t><figure anchor="fig.network_de_local"
		     title="ALTO traffic network de-localization">
	    <artwork><![CDATA[
                         ,-------.         +-----------+
       ,---.          ,-'         `-.      |   Host 1  |
    ,-'     `-.      /     ISP 1   +-------|ALTO Client|
   /           \    /    (Mobile   |  \    +-----------+
  /    ISP X    \   |    network)  |  |    +-----------+
 /               \  \              +-------|   Host 2  |
;             #############################|ALTO Client|
|             #   |   `-.         ,-'      +-----------+
|             #   |      `-------'                      
|             #   |      ,-------.  
:             #   ;   ,-'         `-.
 \            #  /   /     ISP 2     \ 
  \           # /   /     (Fixed      \
   \          #/    |     network)    |    +-----------+
    `-.     ,-#     \                 /    |   Host 3  |
       `---'  #############################|ALTO Client| 
                      `-.         ,-'      +-----------+ 
                         `-------'                       

    Legend:
    ### preferred "connections"
    --- non-preferred "connections"
]]></artwork>
	  </figure></t>

	  <t><xref target="fig.network_de_local"></xref> shows the
	  result of such a guidance process where Host 2 prefers a
	  connection with Host 3 instead of Host 1, as shown in <xref
	  target="fig.network_local"></xref>.</t>

	  <t>A realization of this scenario may have certain
	  limitations and may not be possible in all cases. For
	  instance, it may require that the ALTO server can
	  distinguish mobile and non-mobile hosts, e.g., based on
	  their IP address. This may depend on mobility solutions and
	  may not be possible or accurate. In general, ALTO is not
	  intended as a fine-grained traffic engineering solution for
	  individual hosts. Instead, it typically works on aggregates
	  (e.g., if it is known that certain IP prefixes are often
	  assigned to mobile users).</t>

        </section>

        <section title="Application Tuning">

	  <t>ALTO can also provide guidance to optimize the
	  application-level topology of networked applications, e.g.,
	  by exposing network performance information. Applications
	  can often run their own measurements to determine network
	  performance, e.g., by active delay measurements or bandwidth
	  probing, but such measurements result in overhead and
	  complexity. Accessing an ALTO server can be a simpler
	  alternative. In addition, an ALTO server may also expose
	  network information that applications cannot easily measure
	  or reverse-engineer.</t>

	</section>

      </section>

      <section title="Provisioning of ALTO Topology Data">

	<section anchor="sec.ISP_deployment_req_other" title="High-Level Process and Requirements">

	  <t>A process to generate ALTO topology information typically comprises several
	  steps. The first step is to gather information, which is described 
	  in the following section. The subsequent sections then describe
      how the gathered data can be processed, and
	  which methods can be applied to generate the information exposed by ALTO,
 	  such as network and cost maps.</t>
	
	  <t>Providing ALTO guidance can result in a win-win situation
	  both for network providers and users of the ALTO
	  information. Applications possibly get a better performance,
	  while the network provider has means to optimize the
	  traffic engineering and thus its costs. Yet, there can be
	  security concerns with exposing topology data. Corresponding
	  limitations are discussed in <xref
	  target="sec.security.leakage"/>.</t>

	  <t>ISPs may have important privacy requirements when
	  deploying ALTO, which have to be taken into account when
	  processing ALTO topology data. In particular, an ISP may not be willing to
	  expose sensitive operational details of its network. The
	  topology abstraction of ALTO enables an ISP to expose the
	  network topology at a desired granularity only, determined
	  by security policies.</t>

	  <t>With the Endpoint Cost Service (ECS), the ALTO client does
	  not have to implement any specific algorithm or mechanism
	  in order to retrieve, maintain and process network topology
	  information (of any kind). The complexity of the network
	  topology (computation, maintenance and distribution) is kept
	  in the ALTO server and ECS is delivered on demand. This
	  allows the ALTO server to enhance and modify the way the
	  topology information sources are used and combined. This
	  simplifies the enforcement of privacy policies of the
	  ISP.</t>

	  <t>The ALTO Network Map and Cost Map service expose an
	  abstracted view on the ISP network topology. Therefore, 
	  care is needed when constructing those maps in
	  order to take privacy policies into account, as further
	  discussed in <xref target="host_group_descriptors"/>. The
	  ALTO protocol also supports further features such as endpoint
	  properties, which could also be used to expose topology guidance.
	  The privacy considerations for ALTO maps also apply to such
	  ALTO extensions.</t>	   

	</section>
	  
	<section anchor="sec.data_sources" title="Data Collection from Data Sources">
	
	  <t>The first step in the process of generating ALTO information
 	  is to gather the required information from the network.
	  An ALTO server can collect topological information from a
	  variety of sources in the network and provides a cohesive,
	  abstracted view of the network topology to applications
	  using an ALTO client. Topology data sources that may include routing
	  protocols, network policies, state and performance
	  information, geo-location, etc. 
	  An ALTO server requires at least
	  some topology and/or routing information, i.e., information about
	  present endpoints and their interconnection.
	  With this information it is in principle possible
	  to compute paths between all known endpoints.
	  Based on such basic data, the ALTO
	  server builds an ALTO-specific network topology that
	  represents the network as it should be understood and
	  utilized by applications (resource consumers) at endpoints
	  using ALTO services (e.g., Network/Cost Map Service or ECS).
	  A basic dataset can be extended by many other information
	  obtainable from the network.</t>
 
  	  <t>The ALTO protocol does not assume a specific network
	  technology or topology. In principle, ALTO can be used with various types
	  of addresses (Endpoint Addresses). <xref
	  target="RFC7285"></xref> defines the use of IPv4/IPv6
	  addresses or prefixes in ALTO, but further address types
	  could be added by extensions.  In this document, only the
	  use of IPv4/IPv6 addresses is considered.</t>

	  <t> The exposure of network topology
	  information is controlled and managed by the ALTO server.
          ALTO abstract network topologies can be automatically
	  generated from the physical or logical topology of the
	  network, e.g., using "live" network data.  
	  The generation would typically be based on
	  policies and rules set by the network operator. The maps and
	  the guidance can significantly differ depending on the use
	  case, the network architecture, and the trust relationship
	  between ALTO server and ALTO client, etc. Besides the
	  security requirements that consist of not delivering any
	  confidential or critical information about the
	  infrastructure, there are efficiency requirements in terms
	  of what aspects of the network are visible and required by
	  the given use case and/or application.</t>
	  
	  <t>The ALTO server operator has to ensure that the ALTO
	  topology does not reveal any details that would endanger
	  the network integrity and security.  For instance, ALTO is
	  not intended to leak raw Interior Gateway Protocol (IGP) or
	  Border gateway Protocol (BGP) databases to ALTO clients.</t>

	  <t><figure anchor="fig.data_sources" title="Potential data sources for ALTO">
	    <artwork><![CDATA[
        +--------+   +--------+
        |  ALTO  |   |  ALTO  |
        | Client |   | Client |
        +--------+   +--------+
               /\     /\
               ||     || ALTO protocol
               ||     ||
               \/     \/
              +---------+
              |  ALTO   |
              | Server  |
              +---------+
               :   :   :    
               :   :   :
      +........+   :   +........+ Provisioning
      :            :            : protocol
      :            :            :
 +---------+  +---------+  +---------+
 |   BGP   |  |   I2RS  |  |   NMS   | Potential
 | Speaker |  |  Client |  |   OSS   | data sources
 +---------+  +---------+  +---------+
      ^            ^            ^
      |            |            |
 Link-State      I2RS      SNMP/NETCONF,
  NLRI for       data      traffic statistics,
  IGP/BGP                  IPFIX, etc.
]]></artwork>
	  </figure></t>	
	  
	  <t>As illustrated in <xref
	  target="fig.data_sources"></xref>, the topology data used by
	  an ALTO server can originate from different data
	  sources:</t>

	  <t><list style='symbols'>

	    <t>Relevant information sources are interior gateway protocols (IGPs)
		or the Border Gateway Protocol (BGP).
		An ALTO server could get network routing information by listening to IGPs and/or peering with BGP speakers.
		For data collection, link-state protocols are more suitable since every router propagates
        its information throughout the whole network.  Hence, it is possible
        to obtain information about all routers and their neighbors from one
        single router in the network.  In contrast, distance-vector protocols
        are less suitable since routing information is only shared among
        neighbors.  To obtain the whole topology with distance-vector routing
        protocols it is necessary to retrieve routing information from every
        router in the network.</t>
	  
	    <t>The document <xref
	    target="I-D.ietf-idr-ls-distribution"></xref> describes a
	    mechanism by which links state and traffic engineering
	    information can be collected from networks and shared with
	    external components using the BGP routing protocol. This
	    is achieved using a new BGP Network Layer Reachability
	    Information (NLRI) encoding format.  The mechanism is
	    applicable to physical and virtual IGP links and can also
	    include Traffic Engineering (TE) data.  For instance,
	    prefix data can be carried and originated in BGP, while TE
	    data is originated and carried in an IGP.  The mechanism
	    described is subject to policy control.</t>

	    <t>The Interface to the Routing System (I2RS) is a
	    solution for state transfer in and out of the Internet's
	    routing system <xref
	    target="I-D.ietf-i2rs-architecture"></xref>. An ALTO
	    server could use an I2RS client to observe routing-related
	    information. With the rise of Software-Defined Networking (SDN) and 
		a decoupling of network data and control plane, topology
		information could also be fetched from an SDN controller.
		This scenario is not further discussed in the remainder
		of this document.</t>
		
	    <t>An ALTO server can also leverage a Network Management
	    System (NMS) or an Operations Support System (OSS) as
	    data sources. NMS or OSS solutions are used to control,
	    operate, and manage a network, e.g., using the Simple
	    Network Management Protocol (SNMP) or NETCONF.  As
	    explained for instance in <xref
	    target="RFC7491"></xref>, the
	    NMS and OSS can be consumers of network events reported
	    and can act on these reports as well as displaying them to
	    users and raising alarms.  The NMS and OSS can also access
	    the Traffic Engineering Database (TED) and Label Switched
	    Path Database (LSP-DB) to show the users the current state
	    of the network. In addition, NMS and OSS systems may have
	    access to IGP/BGP routing information, network inventory
	    data (e.g., links, nodes, or link properties not visible
	    to routing protocols, such as Shared Risk Link Groups),
	    statistics collection system that provides traffic
	    information, such as traffic demands or link utilization
	    obtained from IP Flow Information Export (IPFIX), as well
	    as other Operations, Administration, and Maintenance (OAM)
            information (e.g., syslog). NMS or OSS systems
	    also may have functions to correlate and orchestrate
	    information originating from other data sources. For
	    instance, it could be required to correlate IP prefixes
	    with routers (Provider, Provider Edge, Customer Edge,
	    etc.), IGP areas, VLAN IDs, or policies.</t>
		
	  </list></t>

	  <t>The data sources mentioned so far are only a subset of potential
      topology sources and depending on the network type, (e.g. mobile,
      satellite network) different hardware and protocols are in operation
      to form and maintain the network.</t>
	  
	  <t>In general it is challenging to gather detailed information about
      the whole Internet, since the network consists of multiple domains and in many
      cases it is not possible to collect information across network
      borders.  Hence, potential information sources may be limited to
	  a certain domain.</t>
	  
	</section>

	<section anchor="host_group_descriptors" title="Partitioning and Grouping of IP Address Ranges">

	  <t>ALTO introduces provider-defined network location
	  identifiers called Provider-defined Identifiers (PIDs) to
	  aggregate network endpoints in the Map Services.  Endpoints
	  within one PID may be treated as single entity, assuming
	  proximity based on network topology or other similarity.  A
	  key use case of PIDs is to specify network preferences
	  (costs) between PIDs instead of individual endpoints. It is
	  up to the operator of the ALTO server how to group endpoints
	  and how to assign PIDs. For example, a PID may denote a
	  subnet, a set of subnets, a metropolitan area, a POP, an
	  autonomous system, or a set of autonomous systems.</t>

	  <t>This document only considers deployment scenarios in
	  which PIDs expand to a set of IP address ranges (CIDR). A
	  PID is characterized by a string identifier and its
	  associated set of endpoint addresses <xref
	  target="RFC7285"></xref>. If an ALTO server offers the Map
	  Service, corresponding identifiers have to be
	  configured.</t>

	  <t>An automated ALTO implementation may use dynamic
	  algorithms to aggregate network topology. However, it is
	  often desirable to have a mechanism through which the
	  network operator can control the level and details of
	  network aggregation based on a set of requirements and
	  constraints. This will typically be governed by policies
	  that enforce a certain level of abstraction and prevent
	  leakage of sensitive operational data.</t>

	  <t>For instance, an ALTO server may leverage BGP information
	  that is available in a networks service provider network
	  layer and compute the group of prefix. An example are BGP
	  communities, which are used in MPLS/IP networks as a common
	  mechanism to aggregate and group prefixes. A BGP community
	  is an attribute used to tag a prefix to group prefixes based
	  on mostly any criteria (as an example, most ISP networks
	  originate BGP prefixes with communities identifying the
	  Point of Presence (PoP) where the prefix has been
	  originated). These BGP communities could be used to map IP
	  address ranges to PIDs. By an additional policy, the ALTO
	  server operator may decide an arbitrary cost defined between
	  groups. Alternatively, there are algorithms that allow the
	  dynamic computation of costs between groups. The ALTO
	  protocol itself is independent of such algorithms and
	  policies.</t>

	</section>

	<section anchor="rating_criteria" title="Rating Criteria and/or Cost Calculation">

	  <t>An ALTO server indicates preferences amongst network
	  locations in the form of path costs.  Path costs are generic
	  costs and can be internally computed by the operator of the
	  ALTO server according to its own policy. For a given ALTO
	  network map, an ALTO cost map defines directional path costs
	  pairwise amongst the set of source and destination network
	  locations defined by the PIDs.</t>

	  <t>The ALTO protocol permits the use of different cost
	  types. An ALTO cost type is defined by the combination of a
	  cost metric and a cost mode. The cost metric identifies what
	  the costs represent. The cost mode identifies how the costs
	  should be interpreted, e.g., whether returned costs should
	  be interpreted as numerical values or ordinal rankings. The
	  ALTO protocol also allows the definition of additional
	  constraints defining which elements of a cost map shall be
	  returned.</t>

	  <t>The ALTO protocol specification <xref target="RFC7285"></xref>
          defines the "routingcost" cost metric as basic set of rating
	  criteria, which has to be supported
	  by all implementations. This cost metric conveys a
	  generic measure for the cost of routing traffic from a
	  source to a destination.  A lower value indicates a higher
	  preference for traffic to be sent from a source to a
	  destination. It is up to the ALTO server how that
	  metric is calculated.</t>

      <t>It is possible to calculate the "routingcost" cost metric
	  based on actual routing protocol information. Typically, Interior Gateway Protocols
      (IGP) provide details about endpoints and links within the own network, 
      while the Bordger Gateway Protocol (BGPs) is
      used to provide details about links to endpoints in other networks. 
	  Besides topology
	  and routing information networks have a
      multitude of other attributes about its state, condition, and
      operation.  That comprises but is not limited to attributes like link
      utilization, bandwidth and delay, ingress/egress points of data flows
      from/towards endpoints outside of the network up to the location of
      nodes and endpoints.</t>
	  
	  <t>In order to enable use of extended information,
	  there is an extension procedure for adding new ALTO
	  cost types. The following list gives an overview on further
	  rating criteria that have been proposed or which are in use
	  by ALTO-related prototype implementations. This list is not
	  intended as normative text; a definition of further metrics
	  can be found for instance in <xref
	  target="I-D.wu-alto-te-metrics"></xref>. Instead, the only
	  purpose of the following list is to document and discuss
	  rating criteria that have been proposed so far. It can also
	  depend on the use case of ALTO whether such rating criteria
	  are useful, and whether the corresponding information would
	  indeed be made available by ISPs.</t>

	  <!--<section anchor="rating_criteria_distance" 
	      title="Distance-related Rating Criteria">-->

	  <t>Distance-related rating criteria:</t>
	  <t><list style='symbols'>

  	    <t>Relative topological distance: The term relative means
  	    that a larger numerical value means greater distance, but
  	    it is up to the ALTO service how to compute the values,
  	    and the ALTO client will not be informed about the nature
  	    of the information. One way of generating this kind of
  	    information may be counting AS hops, but when querying
  	    this parameter, the ALTO client must not assume that the
  	    numbers actually are AS hops. In addition to the AS path,
  	    a relative cost value could also be calculated taking into
  	    account other routing protocol parameters, such as BGP
  	    local preference or multi-exit discriminator (MED)
  	    attributes.</t>

            <t>Absolute topological distance, expressed in the number
            of traversed autonomous systems (AS).</t>

            <t>Absolute topological distance, expressed in the number
            of router hops (i.e., how much the TTL value of an IP
            packet will be decreased during transit).</t>

            <t>Absolute physical distance, based on knowledge of the
            approximate geo-location (e.g., continent, country) of an IP
            address.</t>

	  </list></t>
	  <!--</section>-->

	  <!--<section anchor="rating_criteria_performance" 
	      title="Performance-related Rating Criteria">-->

	  <t>Performance-related rating criteria:</t>

	  <t><list style='symbols'>

	    <t>The minimum achievable throughput between the resource
	    consumer and the candidate resource provider, which is
	    considered useful by the application (only in ALTO
	    queries).</t>

            <t>An arbitrary upper bound for the throughput from/to the
            candidate resource provider (only in ALTO responses). This
            may be, but is not necessarily the provisioned access
            bandwidth of the candidate resource provider.</t>
            
	    <t>The maximum round-trip time (RTT) between resource
	    consumer and the candidate resource provider, which is
	    acceptable for the application for useful communication
	    with the candidate resource provider (only in ALTO queries).</t>
            
	    <t>An arbitrary lower bound for the RTT between resource
	    consumer and the candidate resource provider (only in ALTO
	    responses). This may be, for example, based on
	    measurements of the propagation delay in a completely
	    unloaded network.  </t>

	  </list></t>

	  <!--<section anchor="rating_criteria_charging" 
	      title="Charging-related Rating Criteria">-->
	  <t>Charging-related rating criteria:</t>

	  <t><list style='symbols'>

	    <t>Traffic volume caps, in case the Internet access of the
            resource consumer is not charged by "flat rate". For each
            candidate resource provider, the ALTO service could
            indicate the amount of data that may be transferred
            from/to this resource provider until a given point in
            time, and how much of this amount has already been
            consumed.  Furthermore, it would have to be indicated how
            excess traffic would be handled (e.g., blocked, throttled,
            or charged separately at an indicated price). The
            interaction of several applications running on a host, out
            of which some use this criterion while others don't, as
            well as the evaluation of this criterion in resource
            directories, which issue ALTO queries on behalf of other
            endpoints, are for further study.</t>

	    <t>Other metrics representing an abstract cost, e.g.,
	    determined by policies that distinguish "cheap" from
	    "expensive" IP subnet ranges without detailing the
	    cost function.</t>

	  </list></t>
	  <!--</section>-->

	  <t>These rating criteria are subject to the remarks below:</t>

	  <t>The ALTO client must be aware that with high probability
	  the actual performance values will differ from whatever an ALTO
	  server exposes. In particular, an ALTO client must not
	  consider a throughput parameter as a permission to send data
	  at the indicated rate without using congestion control
	  mechanisms.</t>

	  <t>The discrepancies are due to various reasons, including,
	  but not limited to the facts that</t>

	  <t><list style='symbols'>

	    <t>the ALTO service is not an admission control system</t>

	    <t>the ALTO service may not know the instantaneous
	    congestion status of the network</t>

	    <t>the ALTO service may not know all link bandwidths,
	    i.e., where the bottleneck really is, and there may be
	    shared bottlenecks</t>

	    <t>the ALTO service may not have all information about
	    the actual routing</t>

            <t>the ALTO service may not know whether the candidate endpoint
            itself is overloaded</t>

            <t>the ALTO service may not know whether the candidate endpoint
            throttles the bandwidth it devotes for the considered
            application</t>

            <t>the ALTO service may not know whether the candidate
            endpoint will throttle the data it sends to the client (e.g.,
            because of some fairness algorithm, such as
            tit-for-tat).</t>

	  </list></t>

	  <t>Because of these inaccuracies and the lack of complete,
	  instantaneous state information, which are inherent to the
	  ALTO service, the application must use other mechanisms
	  (such as passive measurements on actual data transmissions)
	  to assess the currently achievable throughput, and it must
	  use appropriate congestion control mechanisms in order to
	  avoid a congestion collapse.  Nevertheless, the rating
	  criteria may provide a useful shortcut for quickly excluding
	  candidate resource providers from such probing, if it is
	  known in advance that connectivity is in any case worse than
	  what is considered the minimum useful value by the
	  respective application.</t>

	  <!--</section>-->

	  <!--<section anchor="rating_criteria_inappropriate" 
	      title="Inappropriate Rating Criteria">-->

	  <t>Rating criteria that should not be defined for and used
	  by the ALTO service include:</t>

	  <t><list style='symbols'>

            <t>Performance metrics that are closely related to the
            instantaneous congestion status. The definition of
            alternate approaches for congestion control is explicitly
            out of the scope of ALTO. Instead, other appropriate
            means, such as using TCP based transport, have to be used
            to avoid congestion.</t>

	    <t>Performance metrics that raise privacy concerns. For
	    instance, it has been questioned whether an ALTO service
	    could publicly expose the provisioned access bandwidth,
	    e.g. of cable / DSL customers, because this could enables
	    identification of "premium" customers.</t>

            <!--
            <t>The provisioned access bandwidth, e.g. of cable / DSL
            customers.  This has been proposed several times and questioned,
            because of problems with privacy, fears that "premium" customers
            with high access bandwidth might attract so much traffic that
            their service becomes de-facto worse, etc.</t>
            -->

	  </list></t>
	  <!--</section>-->

	</section>

      </section>

      <section anchor="risks" title="Known Limitations of ALTO">

	<section title="General Limitations">

         <t>ALTO is designed as a protocol between clients integrated
         in applications and servers that provide network information
         and guidance (e.g., basic network location structure and
         preferences of network paths). The objective is to modify
         network resource consumption patterns at application level
         while maintaining or improving application performance. This
         design focus results in a number of characteristics of
         ALTO:</t>

	  <t><list style="symbols">

	    <t>Endpoint focus: In typical ALTO use cases, neither the
	    consumer of the topology information (i.e., the ALTO
	    client) nor the considered resources (e.g., files at
	    endpoints) are part of the network. The ALTO server
	    presents an abstract network topology containing only
	    information relevant to an application overlay for
	    better-than-random resource provider selection among its
	    endpoints.  The ALTO protocol specification <xref
	    target="RFC7285"></xref> is not designed to expose network
	    internals such as routing tables or configuration data
	    that are not relevant for application-level resource provider
	    selection decisions in network endpoints.</t>

	    <t>Abstraction: The ALTO services such as the Network/Cost
	    Map Service or the ECS provide an abstract view of the
	    network only.  The operator of the ALTO server has full
	    control over the granularity (e.g., by defining policies
	    how to aggregate subnets into PIDs) and the
	    level-of-detail of the abstract network representation
	    (e.g., by deciding what cost types to support).</t>

	    <t>Multiple administrative domains: The ALTO protocol is
	    designed for use cases where the ALTO server and client
	    can be located in different organizations or trust
	    domains. ALTO assumes a loose coupling between server and
	    client. In addition, ALTO does not assume that an ALTO
	    client has any a priori knowledge about the ALTO server
	    and its supported features. An ALTO server can be
	    discovered automatically.</t>

            <t>Read-only: ALTO is a query/response protocol to
            retrieve guidance information. Neither network/cost map
            queries nor queries to the endpoint cost service are
            designed to affect state in the network.</t>

	  </list></t>

          <t>If ALTO shall be deployed for use cases violating these
          assumptions, the protocol design may result in limitations.</t>

	  <t>For instance, in an Application-Based Network Operation
	  (ABNO) environment the application could issue explicit
	  service request to the network <xref
	  target="RFC7491"></xref>.  In
	  this case, the application would require detailed knowledge
	  about the internal network topology and the actual state.  A
	  network configuration would also require a corresponding
	  security solution for authentication and authorization.
	  ALTO is not designed for operations to control, operate, and
	  manage a network.</t>

          <t>Such deployments could be addressed by network management
          solutions, e.g., based on SNMP <xref
          target="RFC3411"></xref> or NETCONF <xref
          target="RFC6241"></xref> and YANG <xref
          target="RFC6020"></xref> that are typically designed to
          manipulate configuration state. Reference <xref
          target="RFC7491"></xref>
          contains a more detailed discussion of interfaces between
          components such as Element Management System (EMS), Network
          Management System (NMS), Operations Support System (OSS),
          Traffic Engineering Database (TED), Label Switched Path
          Database (LSP-DB), Path Computation Element (PCE), and other
          Operations, Administration, and Maintenance (OAM)
          components.</t>

	</section>
      
	<section title="Limitations of Map-based Approaches">

	  <t>The specification of the Map Service in the ALTO protocol
	  <xref target="RFC7285"></xref> is based on the concept of
	  network maps. A network map partitions the network into
	  Provider-defined Identifiers (PIDs) that group one or more
	  endpoints (e.g., subnetworks) to a single aggregate. The
	  "costs" between the various PIDs are stored in a
	  cost map. Map-based approaches lower the signaling load on
	  the server as maps have to be retrieved only if they
	  change.</t>

	  <t>One main assumption for map-based approaches is that the
	  information provided in these maps is static for a long
	  period of time. This assumption is fine as
	  long as the network operator does not change any parameter,
	  e.g., routing within the network and to the upstream peers,
	  IP address assignment stays stable (and thus the mapping to
	  the partitions). However, there are several cases where this
	  assumption is not valid:</t>

	  <t><list style="numbers">

            <t>ISPs reallocate IP subnets from time to time;</t>

            <t>ISPs reallocate IP subnets on short notice;</t>

            <t>IP prefix blocks may be assigned to a router that serves
            a variety of access networks;</t>

            <t>Network costs between IP prefixes may change depending
            on the ISP's routing and traffic engineering.</t>

          </list></t>

	  <!-- text below is a copy of Rich Woundy's comment -->

          <t>These effects can be explained as follows:</t>

	  <t>Case 1: ISPs may reallocate IP subnets within their
	  infrastructure from time to time, partly to ensure the
	  efficient usage of IPv4 addresses (a scarce resource), and
	  partly to enable efficient route tables within their network
	  routers. The frequency of these "renumbering events" depend
	  on the growth in number of subscribers and the availability
	  of address space within the ISP. As a result, a subscriber's
	  household device could retain an IP address for as short
	  as a few minutes, or for months at a time or even
	  longer.</t>

	  <t>It has been suggested that ISPs providing ALTO services
	  could sub-divide their subscribers' devices into different
	  IP subnets (or certain IP address ranges) based on the
	  purchased service tier, as well as based on the location in
	  the network topology. The problem is that this
	  sub-allocation of IP subnets tends to decrease the
	  efficiency of IP address allocation, in particular for
	  IPv4. A growing ISP that needs to maintain high efficiency
	  of IP address utilization may be reluctant to jeopardize
	  their future acquisition of IP address space.</t>

	  <t>However, this is not an issue for map-based approaches if
	  changes are applied in the order of days.</t>

	  <!-- text above is a copy of Rich Woundy's comment -->

	  <t>Case 2: ISPs can use techniques that allow the
	  reallocation of IP prefixes on very short notice, i.e.,
	  within minutes. An IP prefix that has no IP address
	  assignment to a host anymore can be reallocated to areas
	  where there is currently a high demand for IP addresses.</t>

	  <t>Case 3: In residential access networks (e.g., DSL,
	  cable), IP prefixes are assigned to broadband gateways,
	  which are the first IP-hop in the access-network between the
	  Customer Premises Equipment (CPE) and the Internet. The
	  access-network between CPE and broadband gateway (called
	  aggregation network) can have varying characteristics (and
	  thus associated costs), but still using the same IP
	  prefix. For instance one IP address IP1 out of a given CIDR
	  prefix can be assigned to a VDSL access line (e.g., 2 MBit/s uplink)
	  while another IP address IP2 within the same given
	  CIDR prefix is assigned to a slow ADSL line (e.g., 128 kbit/s
	  uplink). These IP addresses may be assigned on a first come
	  first served basis, i.e., a single IP address out of the
	  same CIDR prefix can change its associated costs quite
	  fast. This may not be an issue with respect to the used
	  upstream provider (thus the cross ISP traffic) but depending
	  on the capacity of the aggregation-network this may raise to
	  an issue.</t>

	  <!-- Below: Michael's comments -->

	  <t>Case 4: The routing and traffic engineering inside an ISP
	  network, as well as the peering with other autonomous
	  systems, can change dynamically and affect the information
	  exposed by an ALTO server. As a result, cost maps and
	  possibly also network maps can change.</t>

	</section>

	<section title="Limitations of Non-Map-based Approaches">
        
	  <t>The specification of the ALTO protocol <xref
	  target="RFC7285"></xref> also includes the
	  Endpoint Cost Service (ECS) mechanism. ALTO clients can ask
	  the ALTO server for guidance for specific IP addresses,
	  thereby avoiding the need of processing maps. This can
	  mitigate some of the problems mentioned in the previous
	  section.</t>

	  <t>However frequent requests, particularly with long lists
	  of IP addresses, may overload the ALTO server.
          The server has to rank each
	  received IP address, which causes load at the server. This
	  may be amplified when not only a single ALTO
	  client is asking for guidance, but a larger number of
	  them. The results of the ECS are also more difficult to
	  cache than ALTO maps. Therefore, the ALTO client may have
	  to await the server response before starting a communication,
	  which results in an additional delay.</t>

	  <t>Caching of IP addresses at the ALTO client or the usage
	  of the H12 approach <xref
	  target="I-D.kiesel-alto-h12"></xref> in conjunction with
	  caching may lower the query load on the ALTO server.</t>

	  <t>When ALTO server receives an ECS request, it may not have
	  the most appropriate topology information in order to
	  accurately determine the ranking. <xref
	  target="RFC7285"></xref> generally assumes
	  that a server can always offer some guidance. In such a case
	  the ALTO server could adopt one of the following
	  strategies:</t>

	  <t><list style="symbols">

	    <t>Reply with available information (best effort).</t>

	    <t>Query another ALTO server presumed to have better
	    topology information and return that response (cascaded
	    servers).</t>

	    <t>Redirect the request to another ALTO server presumed to
	    have better topology information (redirection).</t>
	  </list></t>

	  <t>The protocol mechanisms and decision processes that would
	  be used to determine if redirection is necessary and which
	  mode to use is out of the scope of this document, since
	  protocol extensions could be required.</t>

	</section>

      </section>

      <section anchor="sec.monitoring" title="Monitoring ALTO">

        <section title="Impact and Observation on Network Operation">

          <t>ALTO presents a new opportunity for managing network
          traffic by providing additional information to clients.  In
          particular, the deployment of an ALTO server may shift
          network traffic patterns, and the potential impact to
          network operation can be large. An ISP providing ALTO may
          want to assess the benefits of ALTO as part of the
          management and operations (cf. <xref
          target="RFC7285"></xref>). For instance, the
          ISP might be interested in understanding whether the
          provided ALTO maps are effective, and in order to decide
          whether an adjustment of the ALTO configuration would be
          useful. Such insight can be obtained from a monitoring
          infrastructure. An ISP offering ALTO could consider the
          impact on (or integration with) traffic engineering and the
          deployment of a monitoring service to observe the effects of
          ALTO operations. The measurement of impacts can be
          challenging because ALTO-enabled applications may not
          provide related information back to the ALTO service
          provider.</t>

	  <t>To construct an effective monitoring infrastructure, the
	  ALTO service provider should decide how to monitor the
	  performance of ALTO and identify and deploy data sources to
	  collect data to compute the performance metrics. In certain
	  trusted deployment environments, it may be possible to
	  collect information directly from ALTO clients. It may also
	  be possible to vary or selectively disable ALTO guidance for
	  a portion of ALTO clients either by time, geographical
	  region, or some other criteria to compare the network
	  traffic characteristics with and without ALTO. Monitoring
	  an ALTO service could also be realized by third parties. In
	  this case, insight into ALTO data may require a trust
	  relationship between the monitoring system operator and the
	  network service provider offering an ALTO service.</t>

          <t>The required monitoring depends on the network
          infrastructure and the use of ALTO, and an exhaustive
          description is outside the scope of this document.</t>

	</section>

        <section title="Measurement of the Impact">

	  <t>ALTO realizes an interface between the network and
	  applications. This implies that an effective monitoring
	  infrastructure may have to deal with both network and
	  application performance metrics.  This document does not
	  comprehensively list all performance metrics that could be
	  relevant, nor does it formally specify metrics.</t>

	  <t>The impact of ALTO can be classified regarding a
	  number of different criteria:</t>

	  <t><list style='symbols'>

	    <t>Total amount and distribution of traffic: ALTO enables
	    ISPs to influence and localize traffic of applications
	    that use the ALTO service. An ISP may therefore be
	    interested in analyzing the impact on the traffic, i.e.,
	    whether network traffic patterns are shifted. For
	    instance, if ALTO shall be used to reduce the inter-domain
	    P2P traffic, it makes sense to evaluate the total amount
	    of inter-domain traffic of an ISP. Then, one possibility
	    is to study how the introduction of ALTO reduces the total
	    inter-domain traffic (inbound and/our outbound). If the
	    ISPs intention is to localize the traffic inside his
	    network, the network-internal traffic distribution will be
	    of interest. Effectiveness of localization can be
	    quantified in different ways, e.g., by the load on core
	    routers and backbone links, or by considering more
	    advanced effects, such as the average number of hops that
	    traffic traverses inside a domain.</t>

	    <t>Application performance: The objective of ALTO is
	    improve application performance. ALTO can be used by very
	    different types applications, with different communication
	    characteristics and requirements. For instance, if ALTO
	    guidance achieves traffic localization, one would expect
	    that applications achieve a higher throughput and/or
	    smaller delays to retrieve data. If application-specific
	    performance characteristics (e.g., video or audio quality)
	    can be monitored, such metrics related to user experience
	    could also help to analyze the benefit of an ALTO
	    deployment. If available, selected statistics from the
	    TCP/IP stack in hosts could be leveraged, too.</t>

	  </list></t>

	  <t>Of potential interest can also be the share of
	  applications or customers that actually use an offered ALTO
	  service, i.e., the adoption of the service.</t>

	  <t>Monitoring statistics can be aggregated, averaged, and
	  normalized in different ways. This document does not mandate
	  specific ways how to calculate metrics.</t>

	</section>

        <section title="System and Service Performance">

	  <t>A number of interesting parameters can be measured at the
	  ALTO server. <xref target="RFC7285"></xref>
	  suggests certain ALTO-specific metrics to be monitored:</t>

	  <t><list style='symbols'>

	    <t>Requests and responses for each service listed in a
	    Information Directory (total counts and size in
	    bytes).</t>

	    <t>CPU and memory utilization</t>

	    <t>ALTO map updates</t>

	    <t>Number of PIDs</t>

	    <t>ALTO map sizes (in-memory size, encoded size, number of
	    entries)</t>

	  </list></t>

	  <t>This data characterizes the workload, the system
	  performance as well as the map data. Obviously, such data
	  will depend on the implementation and the actual deployment
	  of the ALTO service. Logging is also recommended in <xref
	  target="RFC7285"></xref>.</t>

	</section>

	<section title="Monitoring Infrastructures">

	  <t>Understanding the impact of ALTO may require interaction
	  between different systems, operating at different layers.
	  Some information discussed in the preceding sections is only
	  visible to an ISP, while application-level performance can
	  hardly be measured inside the network. It is possible that
	  not all information of potential interest can directly be
	  measured, either because no corresponding monitoring
	  infrastructure or measurement method exists, or because it
	  is not easily accessible.</t>

	  <t>One way to quantify the benefit of deploying ALTO is to
	  measure before and after enabling the ALTO service. In
	  addition to passive monitoring, some data could also be
	  obtained by active measurements, but due to the resulting
	  overhead, the latter should be used with care. Yet, in all
	  monitoring activities an ALTO service provider has to take
	  into account that ALTO clients are not bound to ALTO server
	  guidance as ALTO is only one source of information, and any
	  measurement result may thus be biased.</t>

	  <t>Potential sources for monitoring the use of ALTO include:</t>

	  <t><list style='symbols'>

	    <t>Network Operations, Administration, and Maintenance (OAM)
	    systems: Many ISPs deploy OAM systems to monitor the
	    network traffic, which may have insight into traffic
	    volumes, network topology, and bandwidth information
	    inside the management area. Data can be obtained by SNMP,
	    NETCONF, IP Flow Information Export (IPFIX), syslog,
	    etc.</t>

	    <t>Applications/clients: Relevant data could be obtained
	    by instrumentation of applications.</t>

	    <t>ALTO server: If available, log files or other
	    statistics data could be analyzed.</t>

	    <t>Other application entities: In several use cases, there 
	    are other application entities that could provide data as well.
	    For instance, there may be centralized log servers that collect
	    data.</t>

	  </list></t>

	  <t>In many ALTO use cases some data sources are located
	  within an ISP network while some other data is gathered at
	  application level. Correlation of data could require a
	  collaboration agreement between the ISP and an application
	  owner, including agreements of data interchange formats,
	  methods of delivery, etc. In practice, such a collaboration
	  may not be possible in all use cases of ALTO, because the
	  monitoring data can be sensitive, and because the
	  interacting entities may have different priorities. Details
	  of how to build an over-arching monitoring system for
	  evaluating the benefits of ALTO are outside the scope of
	  this memo.</t>

	</section>

      </section>

      <section anchor="sec.ISP_deployment2" title="Abstract Map Examples for Different Types of ISPs">

	<section title="Small ISP with Single Internet Uplink">

          <t>The ALTO protocol does not mandate how to determine costs
          between endpoints and/or determine map data. In complex
          usage scenarios this can be a non-trivial problem. In order
          to show the basic principle, this and the following sections
          explain for different deployment scenarios how ALTO maps
          could be structured.</t>

	  <t>For a small ISP, the inter-domain traffic optimizing
	  problem is how to decrease the traffic exchanged with other
	  ISPs, because of high settlement costs. By using the ALTO
	  service to optimize traffic, a small ISP can define two
	  "optimization areas": one is its own network; the other one
	  consists of all other network destinations. The cost map can
	  be defined as follows: the cost of a link between clients of
	  the inner ISP's network is lower than between clients of the outer
	  ISP's network and clients of inner ISP's network. As a
	  result, a host with an ALTO client inside the network of this
	  ISP will prefer retrieving data from hosts connected to the
	  same ISP.</t>

	  <t>An example is given in <xref
	  target="fig.small_ISPs3"/>. It is assumed that ISP A is a
	  small ISP only having one access network. As operator of the
	  ALTO service, ISP A can define its network to be one
	  optimization area, named as PID1, and define other networks
	  to be the other optimization area, named as PID2. C1 is
	  denoted as the cost inside the network of ISP A. C2 is
	  denoted as the cost from PID2 to PID1, and C3 from PID1
	  to PID2. For the sake of simplicity, in the following C2=C3
	  is assumed. In order to keep traffic local inside ISP A, it
	  makes sense to define: C1<C2</t>

	  <figure anchor="fig.small_ISPs3"
		  title="Example ALTO deployment in small ISPs">
	    <artwork><![CDATA[
           -----------
       ////           \\\\
     //                   \\
   //                       \\                  /-----------\
  | +---------+               |             ////             \\\\
  | | ALTO    |  ISP A        |    C2      |    Other Networks   |
 |  | Service |  PID 1         <-----------     PID 2
  | +---------+  C1           |----------->|                     |
  |                           |  C3 (=C2)   \\\\             ////
   \\                       //                  \-----------/
     \\                   //
       \\\\           ////
           -----------
]]></artwork>
	  </figure>	

	  <t>A simplified extract of the corresponding ALTO network
	  and cost maps is listed in <xref
	  target="fig.small_ISP_network_map"/> and <xref
	  target="fig.small_ISP_cost_map"/>, assuming that the network
	  of ISP A has the IPv4 address ranges 192.0.2.0/24 and
	  198.51.100.0/25. In this example, the cost values C1 and C2
	  can be set to any number C1<C2.</t>

	  <figure anchor="fig.small_ISP_network_map"
		  title="Example ALTO network map">
	    <artwork><![CDATA[
   HTTP/1.1 200 OK
   ...
   Content-Type: application/alto-networkmap+json

   {
    ...
     "network-map" : {
       "PID1" : {
         "ipv4" : [
           "192.0.2.0/24",
           "198.51.100.0/25"
         ]
       },
       "PID2" : {
         "ipv4" : [
           "0.0.0.0/0"
         ],
         "ipv6" : [
           "::/0"
         ]
       }
     }
   }
]]></artwork>
	  </figure>	

	  <figure anchor="fig.small_ISP_cost_map"
		  title="Example ALTO cost map">
	    <artwork><![CDATA[
   HTTP/1.1 200 OK
   ...
   Content-Type: application/alto-costmap+json

   {
       ...
       "cost-type" : {"cost-mode"  : "numerical",
                      "cost-metric": "routingcost"
       }
     },
     "cost-map" : {
       "PID1": { "PID1": C1,  "PID2": C2 },
       "PID2": { "PID1": C2,  "PID2": 0 },
     }
   }
]]></artwork>
	  </figure>	

	</section>

	<section title="ISP with Several Fixed Access Networks">

	  <t>This example discusses a P2P application traffic
	  optimization use case for a larger ISP with a fixed network
	  comprising several access networks and a core network. The
	  traffic optimizing objectives include (1) using the
	  backbone network efficiently, (2) adjusting the traffic
	  balance in different access networks according to traffic
	  conditions and management policies, and (3) achieving a
	  reduction of settlement costs with other ISPs.</t>

	  <t>Such a large ISP deploying an ALTO service may want to
	  optimize its traffic according to the network topology of
	  its access networks. For example, each access network could
	  be defined to be one optimization area, i.e., traffic should
	  be kept local withing that area if possible. This can be
	  achieved by mapping each area to a PID. Then the costs
	  between those access networks can be defined according to a
	  corresponding traffic optimizing requirement by this
	  ISP. One example setup is further described below and also
	  shown in <xref target="fig.large_ISPs"/>.</t>
	
	  <t>In this example, ISP A has one backbone network and three
	  access networks, named as AN A, AN B, and AN C. A P2P
	  application is used in this example. For a reasonable
	  application-level traffic optimization, the first
	  requirement could be a decrease of the P2P traffic on the
	  backbone network inside the Autonomous System of ISP A and
	  the second requirement could be a decrease of the P2P
	  traffic to other ISPs, i.e., other Autonomous Systems. The
	  second requirement can be assumed to have priority over the
	  first one. Also, we assume that the settlement rate with ISP
	  B is lower than with other ISPs. ISP A can deploy an ALTO
	  service to meet these traffic distribution requirements. In
	  the following, we will give an example of an ALTO setting
	  and configuration according to these requirements.</t>
	
	  <t>In the network of ISP A, the operator of the ALTO server
	  can define each access network to be one optimization area,
	  and assign one PID to each access network, such as PID 1,
	  PID 2, and PID 3. Because of different peerings with
	  different outer ISPs, one can define ISP B to be one
	  additional optimization area and assign PID 4 to it. All
	  other networks can be added to a PID to be one further
	  optimization area (PID 5).</t>
	
	  <t>In the setup, costs (C1, C2, C3, C4, C5, C6, C7, C8) can
	  be assigned as shown in <xref
	  target="fig.large_ISPs"/>. Cost C1 is denoted as the link
	  cost in inner AN A (PID 1), and C2 and C3 are defined
	  accordingly. C4 is denoted as the link cost from PID 1 to
	  PID 2, and C5 is the corresponding cost from PID 3, which is
	  assumed to have a similar value. C6 is the cost between PID
	  1 and PID 3. For simplicity, this scenario assumes
	  symmetrical costs between the AN this example. C7 is denoted
	  as the link cost from the ISP B to ISP A. C8 is the link
	  cost from other networks to ISP A.</t>
	
	  <t>According to previous discussion of the first requirement
	  and the second requirement, the relationship of these costs
	  will be defined as: (C1, C2, C3) < (C4, C5, C6) < (C7)
	  < (C8)</t>


	  <figure anchor="fig.large_ISPs"
		  title="ALTO deployment in large ISPs with layered fixed network structures">
            <artwork><![CDATA[
 +------------------------------------+         +----------------+
 | ISP A   +---------------+          |         |                |
 |         |    Backbone   |          |   C7    |      ISP B     |
 |     +---+    Network    +----+     |<--------+      PID 4     |
 |     |   +-------+-------+    |     |         |                |
 |     |           |            |     |         |                |
 |     |           |            |     |         +----------------+
 | +---+--+     +--+---+     +--+---+ |
 | |AN A  |  C4 |AN B  |  C5 |AN C  | |
 | |PID 1 +<--->|PID 2 |<--->+PID 3 | |
 | |C1    |     |C2    |     |C3    | |         +----------------+
 | +---+--+     +------+     +--+---+ |         |                |
 |     ^                        ^     |   C8    | Other Networks |
 |     |                        |     |<--------+ PID 5          |
 |     +------------------------+     |         |                |
 |                  C6                |         |                |
 +------------------------------------+         +----------------+
]]></artwork>
	  </figure>
	
	</section>

	<section title="ISP with Fixed and Mobile Network">

	  <t>An ISP with both mobile network and fixed network may
	  focus on optimizing the mobile traffic by keeping traffic in
	  the fixed network as much as possible, because wireless
	  bandwidth is a scarce resource and traffic is costly in
	  mobile network. In such a case, the main requirement of
	  traffic optimization could be decreasing the usage
	  of radio resources in the mobile network. An ALTO service
	  can be deployed to meet these needs.</t>
	
	  <t><xref target="fig.mobile_ISPs2"/> shows an example: ISP A
	  operates one mobile network, which is connected to a
	  backbone network.  The ISP also runs two fixed access
	  networks AN A and AN B, which are also connected to the
	  backbone network. In this network structure, the mobile
	  network can be defined as one optimization area, and PID 1
	  can be assigned to it. Access networks AN A and B can also
	  be defined as optimization areas, and PID 2 and PID 3 can be
	  assigned, respectively. The cost values are then defined as
	  shown in <xref target="fig.mobile_ISPs2"/>.</t>
	
	  <t>To decrease the usage of wireless link, the relationship
	  of these costs can be defined as follows:</t>
	
	  <t>From view of mobile network: C4 < C1. This means that
	  clients in mobile network requiring data resources from other
	  clients will prefer clients in AN A to clients in the mobile
	  network.  This policy can decrease the usage of wireless
	  link and power consumption in terminals.</t>
	
	  <t>From view of AN A: C2 < C6, C5 = maximum cost. This
	  means that clients in other optimization area will avoid
	  retrieving data from the mobile network.</t>

	  <figure anchor="fig.mobile_ISPs2" title="ALTO deployment in ISPs with mobile network">
	    <artwork><![CDATA[
 +-----------------------------------------------------------------+
 |                                                                 |
 |  ISP A                 +-------------+                          |
 |               +--------+   ALTO      +---------+                |
 |               |        |   Service   |         |                |
 |               |        +------+------+         |                |
 |               |               |                |                |
 |               |               |                |                |
 |               |               |                |                |
 |       +-------+-------+       | C6    +--------+------+         |
 |       |     AN A      |<--------------|      AN B     |         |
 |       |     PID 2     |   C7  |       |      PID 3    |         |
 |       |     C2        |-------------->|      C3       |         |
 |       +---------------+       |       +---------------+         |
 |             ^    |            |              |     ^            |
 |             |    |            |              |     |            |
 |             |    |C4          |              |     |            |
 |          C5 |    |            |              |     |            |
 |             |    |   +--------+---------+    |     |            |
 |             |    +-->|  Mobile Network  |<---+     |            |
 |             |        |  PID 1           |          |            |
 |             +------- |  C1              |----------+            |
 |                      +------------------+                       |
 +-----------------------------------------------------------------+
]]></artwork>
	  </figure>			

	  <t>These examples show that for ALTO in particular the
	  relationships between different costs matter; the operator of
	  the server has several degrees of freedom how to set the
	  absolute values.</t>

	</section>

  </section>

  <section anchor="sec.live" title="Comprehensive Example for Map Calculation">
	
	  <t>In addition to the previous, abstract examples, this section presents a more detailed scenario
	  with a realistic IGP and BGP routing protocol configuration. 
	  This example was first described in <xref target="I-D.seidel-alto-map-calculation"></xref>.</t>

	  <section anchor="sec.scenario" title="Example Network">
	
   <t><xref target="fig.live_example"></xref> depicts a network which is used to explain the steps carried
   out in the course of this example.  The network consists of nine
   routers (R1 to R9). Two of them are border routers (R1 + R8)
   connected to neighbored networks (AS 2 to AS 4).  Furthermore, AS 4
   is not directly connected to the local network, but has AS 3 as
   transit network.  The links between the routers are point-to-point
   connections, hence a /30 subnet is sufficient for each.  These
   connections also form the core network with the
   100.1.1.0/24 subnet.  This subnet is large enough to provide /30
   subnets for all router interconnections.  In addition to the core
   network, the local network also has five client networks attached to
   five different routers (R2, R5, R6, R7 and R9).  Each client network
   is a /24 subnet with 100.1.10x.0 (x = [1..5]) as network address.</t>
	
	  <figure anchor="fig.live_example" title="Example Network">
	    <artwork><![CDATA[
+--------------+      +--------+      +--------+     +--------------+
|100.1.102.0/24+------+   R6   |      |   R7   +-----+100.1.103.0/24|
+--------------+      +----+---+      +----+---+     +--------------+
                           |               |
+--------------+           |               |
|     AS 2     |           |               |
| 100.2.0.0/16 |           |               |
+-------+------+           |               |
        |                  |               |
        |                  |               |
    +---+----+        +----+---+      +----+---+     +--------------+
    |   R1   +--------+   R3   +------+   R5   |-----+100.1.104.0/24|
    +---+----+        +----+---+      +----+---+     +--------------+
        |     \      /     |               |
        |      \    /      |               |
        |       \  /       |               |         +--------------+
        |        \/        |               |         |     AS 4     |
        |        /\        |               |         | 100.4.0.0/16 |
        |       /  \       |               |         +------+-------+
        |      /    \      |               |                |
        |     /      \     |               |                |
    +---+----+        +----+---+      +----+---+     +------+-------+
    |   R2   |        |   R4   |      |   R8   +-----+     AS 3     |
    +---+----+        +----+---+      +----+---+     | 100.3.0.0/16 |
        |                  |               |         +--------------+
        |                  |               |
        |                  |               |
+-------+------+           |          +----+---+     +--------------+
|100.1.101.0/24|           +----------+   R9   +-----+100.1.105.0/24|
+--------------+                      +--------+     +--------------+
]]></artwork>
	  </figure>			
	
   <t>The example network utilizes two different routing protocols, one for
   IGP and another for EGP routing.  The used IGP is a link-state
   protocol such as IS-IS.  The applied link weights are shown in Figure 2.
   To obtain the topology and routing information from the network, the
   topology data source must be connected directly to one of the routers
   (R1...R9). Furthermore, the topology data source must be enabled to communicate
   with the router and vice versa.</t>

   <t>The Border Gateway Protocol (BGP) is used in this scenario
   to route between autonomous systems (AS).  External BGP is
   running on the two border routers R1 and R8.  Furthermore, internal
   BGP is used to propagate external as well as internal prefixes within
   the network boundaries.  It is running on every router with an
   attached client network (R2, R5, R6, R7 and R9).  Since no route
   reflector is present it is necessary to fetch routes from each BGP
   router separately.</t>

   	  <figure anchor="fig.live_weights" title="Example Network Link Weights">
	    <artwork><![CDATA[
           R1   R2   R3   R4   R5   R6   R7   R8   R9
       R1   0   15   15   20    -    -    -    -    -
       R2  15    0   20    -    -    -    -    -    -
       R3  15   20    0    5    5   10    -    -    -
       R4  20    -    5    0    5    -    -    -   20
       R5   -    -    5    5    0    -   10   10    -
       R6   -    -   10    -    -    0    -    -    -
       R7   -    -    -    -   10    -    0    -    -
       R8   -    -    -    -   10    -    -    0   10
       R9   -    -    -   20    -    -    -   10    0
]]></artwork>
	  </figure>			
   
   <t>For monitoring purposes it is possible to enable e.g. SNMP
   or NETCONF on the routers within the
   network.  This way an ALTO server may obtain
   several additional information about the state of the network.  As
   example, utilization, latency, and bandwidth information could
   be retrieved periodically from the network components to get
   and keep an up-to-date view on the network situation.</t>

   <t>In the following, it is assumed that the listed information are collected from the network:</t>
	
   <t><list style="symbols">
  
     <t>IS-IS: topology, Link weights</t>

     <t>BGP: prefixes, AS numbers, AS distances, metrics</t>

     <t>SNMP: latency, utilization, bandwidth</t>

	</list></t>

	</section>

    <section anchor="sec.live_data_processing" title="Potential Data Processing and Storage">

   <t>Due to the variety of data source available in a network it may be
   necessary to aggregate the information and define a suitable data
   model that can hold the information efficiently and easily accessible.  One
   potential model is an annotated directed graph that represents
   the topology. The attributes can be annotated at the
   corresponding positions in the graph. In the following it
   is shown how such a topology graph could describe the example topology.</t>

   <t>In the topology graph, a node represents a router in the network,
   while the edges stand for the links that connect the routers.  Both
   routers and links have a set of attributes that store information
   gathered from the network.</t>
 
   <t>Each router could be associated with a basic set of information, such as:</t>

   <t><list style="symbols">

	 <t>ID:  Unique ID within the network to identify the router.</t>

     <t>Neighbor IDs:  List of directly connected routers.</t>

     <t>Endpoints:  List of connected endpoints.  The endpoints may also have
      further attributes themselves depending on the network and address
      type.  Such potential attributes are costs for reaching the
      endpoint from the router, AS numbers, or AS distances. 
	  Endpoints may belong to more than one router, for example
      when they are assigned to link interfaces.</t>
	 
   </list></t>

   <t>In addition to the basic set many more attributes may be assigned to
   router nodes.  This mainly depends on the utilized data sources.
   Examples for such additional attributes are geographic location, host name and/
   or interface types, just to name a few.</t>

   <t>A suitable information set for a link is described in the following list:</t>

   <t><list style="symbols">

      <t>Source ID:  ID of the source router of the link.</t>

      <t>Destination ID:  ID of the destination router of the link.</t>

      <t>Weight:  The cost to cross the link defined by the used IGP.</t>

   </list></t>
     
   <t>Additional attributes that provide technical details and state
   information can be assigned to links as well.  The availability of
   such additional attributes depends on the utilized data sources.
   Such attributes can be characteristics like maximum bandwidth,
   utilization, or latency on the link as well as the link type.</t>

   <t>The example network shown in
   <xref target="fig.live_example"></xref> represents such an internal network graph
   where the routers R1 to R9 represent the nodes and the connections
   between them are the links. For instance, R2 has one directly attached IPv4 endpoint that belongs to its own
   AS.</t>
   	
	  <figure anchor="fig.live_router_r2" title="Example Router R2">
	    <artwork><![CDATA[
   ID:  2

   Neighbor IDs:  1,3 (R1, R3)

   Endpoints:

      Endpoint:  100.1.101.0/24

      Metric:  10 (default client subnet metric)

      ASNumber:  1 (our own AS)

      ASDistance:  0

   Host Name:  R2
]]></artwork>
	  </figure>			
	     
   <t>Router R8 has two attached IPv4 endpoints.  The first one belongs to a
   directly neighbored AS with AS number 3.  The AS distance from our
   network to AS3 is 1.  The second endpoint belongs to an AS (AS4) that
   is no direct neighbor but directly connected to AS3.  To reach
   endpoints in AS4 it is necessary to cross AS3, which increases the AS
   distance by one.</t>
   
	  <figure anchor="fig.live_router_r8" title="Example Router R8">
	    <artwork><![CDATA[
   ID:  8

   Neighbor IDs:  5,9 (R5, R9)

   Endpoints:

      Endpoint:  100.3.0.0/16

      Metric:  100

      ASNumber:  3

      ASDistance:  1

      Endpoint:  100.4.0.0/16

      Metric:  200

      ASNumber:  4

      ASDistance:  2

   Host Name:  R8
]]></artwork>
	  </figure>			
	
   <t>In the example, the link attributes are equal for all links and only
   their values differ. It is assumed that the attributes utilization, bandwidth, and
   latency are collected e.g. via SNMP or NETCONF. In the topology of <xref target="fig.live_example"></xref>
   the links between R1 and R2 would then have
   the following link attributes:</t>

   	  <figure anchor="fig.live_link_data" title="Link Attributes">
	    <artwork><![CDATA[
   R1->R2:

   Source ID:  1

   Destination ID:  2

   Weight:  15

   Bandwidth:  10Gbit/s

   Utilization:  0.1

   Latency:  2ms

   R2->R1:

   Source ID:  2

   Destination ID:  1

   Weight:  15

   Bandwidth:  10Gbit/s

   Utilization:  0.55

   Latency:  5ms
]]></artwork>
	  </figure>			
	  
   <t>It has to be emphasized that values for utilization and latency can be very volatile.</t>

   	</section>

    <section anchor="sec.live_network_map" title="Calculation of Network Map">

   <t>The goal of the ALTO map calculation process is to get from the graph
   presentation of the network to a coarse-grained matrix presentation.  The first step is to generate the
   network map.  Only after the network map has been generated it is
   possible to compute the cost map, since it relies on the network map.</t>

   <t>To generate an ALTO network map a grouping function is required.  A
   grouping function processes information from the network graph to
   group endpoints into PIDs.  The way of grouping is manifold and
   algorithm can utilize all information provided by the network graph
   to perform the grouping.  The functions may omit certain endpoints to
   simplify the map or to hide details about the network that are not
   intended to be published with the resulting ALTO network map.</t>

   <t>For IP endpoints, which are either an IP (version 4 or version 6)
   address or prefix, <xref target="RFC7285"/> requires the use of longest-prefix
   matching algorithm  to map IPs to PIDs.
   This requirement results in the constraint that every IP must be mapped
   to a PID and that the same prefix or address is not mapped to more
   than one PID.  To meet the first constraint every calculated map must
   provide a default PID that contains the prefixes 0.0.0.0/0 for IPv4
   and ::/0 for IPv6.  Both prefixes cover their entire address space
   and if no other PID matches an IP endpoint the default PID will.  The
   second constraint must be met by the grouping function that
   assigns endpoints to PIDs. In case of collision the grouping
   function must decide which PID gets an endpoint.  These
   or other constraints may apply to other endpoint types depending
   on the used matching algorithm.</t>

   <t>A simple example for such grouping is to compose PIDs by
   host names. For instance,
   each router's host name is selected as the name for a
   PID and the attached endpoints are the member endpoints of the
   corresponding PID.  Additionally, backbone prefixes should not
   appear in the map so they are filtered out.  The following table
   shows the resulting ALTO network map based on the example network
   from <xref target="fig.live_example"></xref>:</t>

	  <figure anchor="fig.live_netmap" title="Example ALTO Network Map">
	    <artwork><![CDATA[
       PID  |  Endpoints
   ---------+-----------------------------------
        R1  |  100.2.0.0/16
        R2  |  100.1.101.0/24
        R5  |  100.1.104.0/24
        R6  |  100.1.102.0/24
        R7  |  100.1.103.0/24
        R8  |  100.3.0.0/16, 100.4.0.0/16
        R9  |  100.1.105.0/24
    default |  0.0.0.0/0, ::/0
]]></artwork>
	  </figure>			

   <t>Since router R3 and R4 have no endpoints assigned they are not
   represented in the network map.  Furthermore, as previously mentioned,
   the "default" PID was added to represent all endpoints that are not
   part of the example network.</t>

   	</section>

    <section anchor="sec.live_cost_map" title="Calculation of Cost Map">

   <t>After successfully creating the network map, the typical next step is to
   calculate the costs between the generated PIDs, which form the cost map.
   Those costs are calculated by cost functions.
   A cost function may calculate values unidirectional, which means it is
   necessary to compute the costs from every PID to every PID.  In
   general, it is possible to use all available information in the
   network graph to compute the costs.  In case a PID contains more than
   one IP address or subnet the cost function may calculate a set of cost
   values for each source/destination IP pair.  In that case a tie-
   breaker function is required which decides the resulting cost value.
   Such tie-breaker can be simple functions such as minimum, maximum, or
   average value.</t>

   <t>No matter what metric the cost function is using, the path
   from source to destination is usually defined by the minimum path weight.
   The path weight is the sum of link weights of all traversed links.
   The path may be determined for instance with the Bellman-Ford or Dijkstra
   algorithm.  Hence, the cost function must first determine the path
   before it can determine any other metric value.  As a result, the
   metric value can differ from the expectation where the path with the
   shortest path weight was not considered.  But it is also possible
   that more than one path with the same minimum path weight exist,
   which means it is not entirely clear which path is going to be
   selected by the network.  Hence, a tie-breaker similar to the one
   used to resolve costs for PIDs with multiple endpoints is necessary.</t>

   <t>An important note is that <xref target="RFC7285"/> does not require cost maps to
   provide costs for every PID pair, so if no path cost can be
   calculated for a certain pair the corresponding field in the cost map
   is left out.  Administrators may also not want to provide cost values
   for other PID pairs for arbitrary reasons.  Such pairs may be defined
   before the cost calculation is performed.</t>

   <t>Based on the network map example shown in the previous section it is possible to
   calculate the cost maps.  In this example, the selected metric for the
   cost map is the minimum number of hops necessary to get from source
   to destination PID.  Our chosen tie-breaker selects the minimum hop count
   when more than one value is returned by the cost function.</t>

   	  <figure anchor="fig.live_hopcount" title="Example ALTO Hopcount Cost Map">
	    <artwork><![CDATA[
      PID  | default | R1  | R2  | R5  | R6  | R7  | R8  | R9  |
   --------+---------+-----+-----+-----+-----+-----+-----+-----|
   default |    x    |  x  |  x  |  x  |  x  |  x  |  x  |  x  |
      R1   |    x    |  0  |  2  |  3  |  3  |  4  |  4  |  3  |
      R2   |    x    |  2  |  0  |  3  |  3  |  4  |  4  |  4  |
      R5   |    x    |  3  |  3  |  0  |  3  |  2  |  2  |  3  |
      R6   |    x    |  3  |  3  |  3  |  0  |  4  |  4  |  4  |
      R7   |    x    |  4  |  4  |  2  |  4  |  0  |  3  |  4  |
      R8   |    x    |  4  |  4  |  2  |  4  |  3  |  0  |  2  |
      R9   |    x    |  3  |  4  |  3  |  4  |  4  |  2  |  0  |
]]></artwork>
	  </figure>			

   <t>It should be mentioned that R1->R9 has several paths with equal
   path weights.  The paths R1->R3->R5->R8->R9, R1->R3->R4->R9 and
   R1->R4->R9 all have a path weight of 40.  Due to the minimum value
   tie-breaker 3 hops for the path R1->R4->R9 is chosen as value.
   Furthermore, since the "default" PID is sort of virtual PID with no
   endpoints that are part of the example network, no cost values are
   calculated for other PIDs from or towards it.</t>
	
	</section>
	
  </section>
	
       <section anchor="sec.alto_p2p_expectations" title="Deployment Experiences">

	<t>There are multiple interoperable implementations of the ALTO protocol. Some
	experiences in implementating and using ALTO for large-scale networks have been documented in <xref
	target="I-D.seidel-alto-map-calculation"></xref> and are here summarized:</t>

    <t><list style="symbols">

	 <t>Data collection: Retrieving topology information typically
	 requires implementing several protocols other than ALTO for data collection.
	 For such other protocols, ALTO deployments faced protocol behaviors
	 that were different to what would be
	 expected from the specification of the corresponding protocol.
	 This includes behavior caused by older versions of the protocol
     specification, a lax interpretation on the remote side or simply
     incompatibility with the corresponding standard. This sort of problems
	 in collecting data can make an ALTO deployment more complicated, even if
	 it is unrelated to ALTO protocol itself.</t> 

     <t>Data processing: Processing network information can be very complex and quite
     resource-demanding. Gathering information from an autonomous system connected
     to Internet may imply that a server must store and
     process hundreds of thousands of prefixes, several hundreds of
     megabytes of IPFIX/Netflow information per minute, and information from
     hundreds of routers and attributes of thousands of links.  A
     lot of disk memory, RAM, and CPU cycles as well as efficient
     algorithms are required to process the information. Operators of an
	 ALTO server have to be aware that significant compute resources are not only required
	 for the ALTO server, but also for the corresponding data collection.</t>
	 
     <t>Network map calculation:  
     Large IP based networks consist of hundreds of thousands of prefixes,
     which have to be mapped to PIDs in the process of network map calculation.
     As a result, network maps get very large (up to tens of megabytes).
     However, depending on the design of the network and the chosen
     grouping function the calculated network maps contains redundancy
     that can be removed.  There are at least two ways to reduce the size
     by removing redundancy.
     First, adjacent IP prefixes can be merged.  When a PID has two
     adjacent prefix entries it can merge them together to one larger
     prefix.  It is mandatory that both prefixes are in the same PID.
     However, it cannot be ruled out that the large prefix is assigned to
     another PID.  This must be checked and it is up to the grouping
     function whether it merges the prefixes and removes the larger prefix
     from the other PID or not.  A simple example, when a PID comprises
     the prefixes 192.168.0.0/24 and 192.168.1.0/24 it can easily merge
     them to 192.168.0.0/23.
     Second, a prefix and its next-longer-prefix match may be in the same
     PID.  In this case, the smaller prefix can simply be removed since it
     is redundant for obvious reasons.  A simple example, a PID comprises
     the prefixes 192.168.0.0/22 and 192.168.1.0/24 and the /22 is the
     next-longer prefix match of the /24, the /24 prefix can simply be
     removed.  In contrast, if another PID contains the 192.168.0.0/23
     prefix, the entry 192.168.1.0/24 cannot be removed since the next-longer prefix
     is not in the same PID anymore. Operators of an ALTO server thus have
	 to analyze whether their address assignment schemes allows such tuning.</t>

     <t>Cost map calculation: One known implementation challenge with
	 cost map calculations is the
     vast amount of CPU cycles that may be required to calculate the costs
     in large networks.  This is particular problematic if costs are calculated
     between the endpoints of each source-destination PID pair. 
	 Very often several to many endpoints of a PID are attached to the
     same node, so the same path cost is calculated several times.  This
     is clearly inefficient. A remedy could be more sophisticated algorithms,
     such as looking up the routers the
     endpoints of each PID are connected to in our network graph and
     calculated cost map based on the costs between the routers. When 
	 deploying and configuring ALTO servers, administrators should consider
	 the impact of huge cost maps and possibly ensure that map sizes do not
	 get too large.</t>
	 
   </list></t>

	<t>In addition, further deployment experiences have been documented.
	One real example is described in greater
	detail in reference <xref
	target="I-D.lee-alto-chinatelecom-trial"></xref>.</t>

        <t>Also, experiments have been conducted with ALTO-like
        deployments in Internet Service Provider (ISP) networks. For
        instance, NTT performed tests with their HINT server
        implementation and dummy nodes to gain insight on how an
        ALTO-like service can influence peer-to-peer systems <xref
        target="RFC6875"></xref>. The results
        of an early experiment conducted in the Comcast network are
        documented in <xref target="RFC5632"></xref>.</t>
 
     </section>

    </section>

    <section anchor="sec.p2p_cons" title="Using ALTO for P2P Traffic Optimization">

      <section title="Overview">

	<section anchor="sec.p2phistory" title="Usage Scenario">

          <t>Originally, peer-to-peer (P2P) applications were the
          main driver for the development of ALTO. In this use case it
          is assumed that one party (usually the operator of a
          "managed" IP network domain) will disclose information about
          the network through ALTO.  The application overlay will
          query this information and optimize its behavior in order to
          improve performance or Quality of Experience in the
          application while reducing the utilization of the underlying
          network infrastructure.  The resulting win-win situation is
          assumed to be the incentive for both parties to provide or
          consume the ALTO information, respectively.</t>

	  <t>P2P systems can be built without or with use of a
	  centralized resource directory ("tracker"). The scope of
	  this section is the interaction of P2P applications with the
	  ALTO service. In this scenario, the resource consumer
	  ("peer") asks the resource directory for a list of candidates
          that can provide the desired resource.
	  There are different options for how ALTO can be deployed in such
	  use cases with a centralized resource directory.</t>

	  <t>For efficiency reasons (i.e., message size), usually only
	  a subset of all resource providers known to the resource
	  directory will be returned to the resource consumer.  Some
	  or all of these resource providers, plus further resource
	  providers learned by other means such as direct
	  communication between peers, will be contacted by the
	  resource consumer for accessing the resource.  The purpose
	  of ALTO is giving guidance on this peer selection, which is
	  supposed to yield better-than-random results.  The tracker
	  response as well as the ALTO guidance are most beneficial in
	  the initial phase after the resource consumer has decided to
	  access a resource, as long as only few resource providers
	  are known. Later, when the resource consumer has already
	  exchanged some data with other peers and measured the
	  transmission speed, the relative importance of ALTO may
	  dwindle.</t>

	</section>

	<section title="Applicability of ALTO" anchor="sec.p2p_tracker_cons">

	  <t>A tracker-based P2P application can leverage ALTO in
	  different ways. In the following, the different alternatives
	  and their pros and cons are discussed.</t>

	  <t><figure anchor="fig.localALTOServer"
		     title="Global tracker and local ALTO servers">
	    <artwork><![CDATA[
                         ,-------.         +-----------+
       ,---.          ,-'         ========>|   Peer 1  |********
    ,-'     `-.      /     ISP 1  V  \     |ALTO Client|       *
   /           \    / +-------------+ \    +-----------+       *
  /    ISP X    \   | + ALTO Server | |    +-----------+       *
 /               \  \ +-------------+<====>|   Peer 2  |       *
;   +---------+   :  \               /     |ALTO Client|****** *
|   | Global  |   |   `-.         ,-'      +-----------+     * *
|   | Tracker |   |      `-------'                           * *
|   +---------+   |      ,-------.         +-----------+     * *
:        *        ;   ,-'         ========>|   Peer 3  |     * *
 \       *       /   /     ISP 2  V  \     |ALTO Client|**** * *
  \      *      /   / +-------------+ \    +-----------+   * * *
   \     *     /    | | ALTO Server | |    +-----------+   * * *
    `-.  *  ,-'     \ +-------------+<====>|   Peer 4  |** * * *
       `-*-'         \               /     |ALTO Client| * * * *
         *            `-.         ,-'      +-----------+ * * * *
         *               `-------'                       * * * *
         *                                               * * * *
         *******************************************************
    Legend:
    === ALTO protocol
    *** Application protocol
]]></artwork>
	    </figure></t>

            <t><xref target="fig.localALTOServer"></xref> depicts a
            tracker-based P2P system with several peers. The peers
            (i.e., resource consumers) embed an ALTO client to improve
            the resource provider selection. The tracker (i.e., resource
            directory) itself may be hosted and operated by another
            entity. A tracker outside the networks of the ISPs of the
            peers may be a typical use case. For instance, a tracker
            like Pirate Bay can serve Bittorrent peers world-wide. The
            figure only shows one tracker instance, but deployments
            with several trackers could be possible, too.</t>

	    <t>The scenario depicted in <xref
	    target="fig.localALTOServer"></xref> lets the peers
	    directly communicate with their ISP's ALTO server (i.e.,
	    ALTO client embedded in the peers), thus giving the peers
	    the most control on which information they query for, as
	    they can integrate information received from one tracker
	    or several trackers and through direct peer-to-peer
	    knowledge exchange. For instance, the latter approach is
	    called peer exchange (PEX) in bittorent.  In this
	    deployment scenarios, the peers have to discover a
	    suitable ALTO server (e.g., offered by their ISP, as
	    described in <xref target="RFC7286"></xref>).</t>

	    <t>There are also tracker-less P2P system architectures
	    that do not rely on centralized resource directories,
	    e.g., unstructured P2P networks. Regarding the use of ALTO,
	    their deployment would be similar to
	    <xref target="fig.localALTOServer"></xref>, since the ALTO
	    client would be embedded in the peers as well. This option
	    is not further considered in this memo.</t>

	    <figure anchor="fig.global_tracker"
		  title="Global tracker accessing ALTO server at various ISPs">
             <artwork><![CDATA[
                              ,-------.
       ,---.               ,-'         `-.   +-----------+
    ,-'     `-.           /     ISP 1     \  |   Peer 1  |********
   /           \         / +-------------+ \ |           |       *
  /    ISP X    \   ++====>| ALTO Server |  )+-----------+       *
 /               \  ||   \ +-------------+ / +-----------+       *
; +-----------+   : ||    \               /  |   Peer 2  |       *
| |  Tracker  |<====++     `-.         ,-'   |           |****** *
| |ALTO Client|   |           `-------'      +-----------+     * *
| +-----------+<====++        ,-------.                        * *
:        *        ; ||     ,-'         `-.   +-----------+     * *
 \       *       /  ||    /     ISP 2     \  |   Peer 3  |     * *
  \      *      /   ||   / +-------------+ \ |           |**** * *
   \     *     /    ++====>| ALTO Server |  )+-----------+   * * *
    `-.  *  ,-'          \ +-------------+ / +-----------+   * * *
       `-*-'              \               /  |   Peer 4  |** * * *
         *                 `-.         ,-'   |           | * * * *
         *                    `-------'      +-----------+ * * * *
         *                                                 * * * *
         *                                                 * * * *
         *********************************************************
    Legend:
    === ALTO protocol
    *** Application protocol
]]></artwork>
	  </figure>

	  <t>An alternative deployment scenario for a tracker-based
	  system is depicted in <xref
	  target="fig.global_tracker"></xref>. Here, the tracker
	  embeds the ALTO client. As already explained, the tracker itself may be hosted and
	  operated by an entity different than the ISP hosting and
	  operating the ALTO server. The key difference to the previously
	  discussed use case is that the ALTO client is different from the
	  resource consumer. 
          Initially, the tracker has to discover the handling ALTO
          server for each new peer <xref target="RFC7286"></xref>
          <xref target="I-D.kiesel-alto-xdom-disc"></xref>. The peers
          do not query the ALTO servers themselves. This gives the
          peers a better initial selection of candidates, but does not
          consider peers learned through direct peer-to-peer knowledge
          exchange.</t>

	    <t><figure anchor="fig.p4p_approach"
		       title="Local trackers and local ALTO servers (P4P approach)">
	      <artwork><![CDATA[
                   ISP 1  ,-------.         +-----------+
        ,---.          +-------------+******|   Peer 1  |
     ,-'     `-.      /|   Tracker   |\     |           |
    /           \    / +-------------+****  +-----------+
   /    ISP X    \   |       ===       | *  +-----------+
  /               \  \ +-------------+ / *  |   Peer 2  |
 ;   +---------+   :  \| AlTO Server |/  ***|           |
 |   | Global  |   |   +-------------+      +-----------+
 |   | Tracker |   |      `-------'
 |   +---------+   |                        +-----------+
 :        *        ;      ,-------.         |   Peer 3  |
  \       *       /    +-------------+  ****|           |
   \      *      /    /|   Tracker   |***   +-----------+
    \     *     /    / +-------------+ \    +-----------+
     `-.  *  ,-'     |       ===       |    |   Peer 4  |**
        `-*-'        \ +-------------+ /    |           | *
          *           \| ALTO Server |/     +-----------+ *
          *            +-------------+                    *
          *        ISP 2  `-------'                       *
          *************************************************
     Legend:
     === ALTO protocol
     *** Application protocol
]]></artwork>
	    </figure></t>

	    <t>There are some attempts to let ISPs deploy their
	    own trackers, as shown in <xref
	    target="fig.p4p_approach"></xref>. In this case, the
	    client cannot get guidance from the ALTO server,
	    other than by talking to the ISP's tracker. However, the
	    peers would have still chance the contact other trackers,
	    deployed by entities other than the peer's ISP.</t>

	</section>

      </section>

      <section title="Deployment Recommendations">

	<section title="ALTO Services">

	  <t>The ALTO protocol specification <xref
	  target="RFC7285"></xref> details how an ALTO client
	  can query an ALTO server for guiding information and receive
	  the corresponding replies. In case of peer-to-peer networks,
	  two different ALTO services can be used: The Cost Map
	  Service is often preferred as solution by peer-to-peer
	  software implementors and users, since it avoids
	  disclosing peer IP addresses to a centralized
	  entity. Alternatively, network operators may have a
	  preference for the Endpoint Cost Service (ECS), since it does not
	  require exposure of the network topology.</t>

	  <t>For actual use of ALTO in P2P applications, both software
	  vendors and network operators have to agree which ALTO
	  services to use. The ALTO protocol is flexible and supports
	  both services. Note that for other use cases of ALTO, in
	  particular in more controlled environments, both the Cost
	  Map Service as well as Endpoint Cost Service might be
	  feasible and it is more an engineering trade-off whether to
	  use a map-based or query-based ALTO service.</t>

	</section>

	<section anchor="sec.alto_in_tracker_p2p" title="Guidance Considerations">

	  <t>As explained in <xref target="sec.p2p_tracker_cons"></xref>,
	  for a tracker-based P2P application there are two
	  fundamentally different possibilities where to place the
	  ALTO client:</t>

	  <t><list style='numbers'>

	    <t>ALTO client in the resource consumer ("peer")</t>

	    <t>ALTO client in the resource directory ("tracker")</t>

	  </list></t>

	  <t>Both approaches have advantages and drawbacks that have
	  to be considered. If the ALTO client is in the resource
	  consumer (<xref target="fig.localALTOServer"></xref>), a
	  potentially very large number of clients has to be
	  deployed. Instead, when using an ALTO client in the resource
	  directory (<xref target="fig.global_tracker"></xref> and
	  <xref target="fig.p4p_approach"></xref>), ostensibly peers
	  do not have to directly query the ALTO server. In this case,
	  an ALTO server could even not permit access to peers.</t>

	  <t>However, it seems to be beneficial for all participants
	  to let the peers directly query the ALTO server. Considering
	  the plethora of different applications that could use ALTO,
	  e.g. multiple tracker or non-tracker based P2P systems or
	  other applications searching for relays, this renders the
	  ALTO service more useful. The peers are also the single
	  point having all operational knowledge to decide whether to
	  use the ALTO guidance and how to use the ALTO guidance. For
	  a given peer one can also expect that an ALTO server of the
	  corresponding ISP provides useful guidance and can be
	  discovered.</t>

          <t>Yet, ALTO clients in the resource consumer also have
          drawbacks compared to use in the resource directory. In the
          following, both scenarios are compared more in detail in
          order to explain the impact on ALTO guidance and the need
          for third-party ALTO queries.</t>

          <t>In the first scenario (see <xref target="fig.rcq"/>), the
          peer (resource consumer) queries the tracker (resource
          directory) for the desired resource (F1).  The resource
          directory returns a list of potential resource providers
          without considering ALTO (F2).  It is then the duty of the
          resource consumer to invoke ALTO (F3/F4), in order to
          solicit guidance regarding this list.</t>

	  <t><figure anchor="fig.rcq" 
		     title="Basic message sequence chart for resource consumer-initiated ALTO query">
            <artwork><![CDATA[
Peer w. ALTO cli.            Tracker               ALTO Server 
--------+--------       --------+--------       --------+--------
        | F1 Tracker query      |                       |    
        |======================>|                       |
        | F2 Tracker reply      |                       |    
        |<======================|                       |    
        | F3 ALTO protocol query                        |
        |---------------------------------------------->|    
        | F4 ALTO protocol reply                        |
        |<----------------------------------------------|    
        |                       |                       |    

====  Application protocol (i.e., tracker-based P2P app protocol)
----  ALTO protocol
]]></artwork>
	  </figure></t>

	  <t>In the second scenario (see <xref target="fig.3pq"/>),
	  the resource directory has an embedded ALTO client, which we
	  will refer to as Resource Directory ALTO Client (RDAC) in
	  this document. After receiving a query for a given resource
	  (F1) the resource directory invokes the RDAC to evaluate all
	  resource providers it knows (F2/F3).  Then it returns a,
	  possibly shortened, list containing the "best" resource
	  providers to the resource consumer (F4).</t>

	  <t><figure anchor="fig.3pq" 
		     title="Basic message sequence chart for third-party ALTO query">
            <artwork><![CDATA[
      Peer               Tracker w. RDAC           ALTO Server 
--------+--------       --------+--------       --------+--------
        | F1 Tracker query      |                       |    
        |======================>|                       |
        |                       | F2 ALTO cli. p. query |
        |                       |---------------------->|    
        |                       | F3 ALTO cli. p. reply |
        |                       |<----------------------|
        | F4 Tracker reply      |                       |    
        |<======================|                       |    
        |                       |                       |    

====  Application protocol (i.e., tracker-based P2P app protocol)
----  ALTO protocol
]]></artwork>
	  </figure></t>

	  <t>Note: The message sequences depicted in <xref
	  target="fig.rcq"/> and <xref target="fig.3pq"/> may occur
	  both in the target-aware and the target-independent query
	  mode (cf. <xref target="RFC6708"></xref>). In the
	  target-independent query mode no message exchange with the
	  ALTO server might be needed after the tracker query, because
	  the candidate resource providers could be evaluated using a
	  locally cached "map", which has been retrieved from the ALTO
	  server some time ago.</t>

	  <t>The first approach has the following problem: While the
	  resource directory might know thousands of peers taking part
	  in a swarm, the list returned to the resource consumer is
	  usually shortened for efficiency reasons. Therefore, the
	  "best" (in the sense of ALTO) potential resource providers
	  might not be contained in that list anymore, even before
	  ALTO can consider them.</t>
            
	  <t>Much better traffic optimization could be achieved if the
	  tracker would evaluate all known peers using ALTO. This list
	  would then include a significantly higher fraction of "good"
	  peers. If the tracker returned "good" peers only, there
	  might be a risk that the swarm might disconnect and split
	  into several disjunct partitions.  However, finding the
	  right mix of ALTO-biased and random peer selection is out of
	  the scope of this document.</t>
            
	  <t>Therefore, from an overall optimization perspective, the
	  second scenario with the ALTO client embedded in the
	  resource directory is advantageous, because it is ensured
	  that the addresses of the "best" resource providers are
	  actually delivered to the resource consumer. An
	  architectural implication of this insight is that the ALTO
	  server discovery procedures must support third-party
	  discovery. That is, as the tracker issues ALTO queries on
	  behalf of the peer which contacted the tracker, the tracker
	  must be able to discover an ALTO server that can give
	  guidance suitable for that respective peer (see <xref
	  target="I-D.kiesel-alto-xdom-disc"></xref>).</t>

	  <t>In principle, a combined approach could also be
	  possible. For instance, a tracker could use a coarse-grained
	  "global" ALTO server to find the peers in the general
	  vicinity of the requesting peer, while peers could use
	  "local" ALTO servers for a more fine-grained guidance. Yet,
	  there is no known deployment experience for such a combined
	  approach.</t>

	</section>

      </section>

    </section>

    <section anchor="sec.cdn_cons" title="Using ALTO for CDNs">

      <section title="Overview">

	<section title="Usage Scenario">

	  <t>This section briefly introduces the usage of ALTO for
	  Content Delivery Networks (CDNs), as explained in <xref
	  target="I-D.jenkins-alto-cdn-use-cases"></xref>. CDNs are
	  used in the delivery of some Internet services
	  (e.g. delivery of websites, software updates and video
	  delivery) from a location closer to the location of the
	  user. A CDN typically consists of a network of servers often
	  attached to Internet Service Provider (ISP) networks. The
	  point of attachment is often as close to content consumers
	  and peering points as economically or operationally feasible
	  in order to decrease traffic load on the ISP backbone and to
	  provide better user experience measured by reduced latency
	  and higher throughput.</t>

	  <t>CDNs use several techniques to redirect a client to a
	  server (surrogate). A request routing function within a CDN
	  is responsible for receiving content requests from user
	  agents, obtaining and maintaining necessary information
	  about a set of candidate surrogates, and for selecting and
	  redirecting the user agent to the appropriate surrogate. One
	  common way is relying on the DNS system, but there are many
	  other ways, see <xref target="RFC3568"></xref>.</t>

	  <t><figure anchor="fig.cdn_arch"
		     title="Use of ALTO information for CDN request routing">
            <artwork><![CDATA[
+--------------------+
| CDN Request Router |
|  with ALTO Client  |
+--------------------+
          /\
          || ALTO protocol
          ||
          \/
      +---------+
      |  ALTO   |
      | Server  |
      +---------+
           :
           : Provisioning protocol
           :
     ,-----------.
  ,-'  Source of  `-.
 (    topological    )
  `-. information ,-'
     `-----------'
]]></artwork>
	  </figure></t>

	  <t>In order to derive the optimal benefit from a CDN it is
	  preferable to deliver content from the servers (caches) that
	  are "closest" to the end user requesting the
	  content. "closest" may be as simple as geographical or IP
	  topology distance, but it may also consider other
	  combinations of metrics and CDN or Internet Service Provider
	  (ISP) policies. As illustrated in <xref
	  target="fig.cdn_arch"/>, ALTO could provide this
	  information.</t>

	  <t><figure anchor="fig.cdn_redirection" 
		     title="Example of CDN surrogate selection">
           <artwork><![CDATA[
User Agent                  Request Router                 Surrogate
     |                             |                           |
     |     F1 Initial Request      |                           |
     +---------------------------->|                           |
     |                             +--+                        |
     |                             |  | F2 Surrogate Selection |
     |                             |<-+       (using ALTO)     |
     |   F3 Redirection Response   |                           |
     |<----------------------------+                           |
     |                             |                           |
     |     F4 Content Request      |                           |
     +-------------------------------------------------------->|
     |                             |                           |
     |                             |          F5 Content       |
     |<--------------------------------------------------------+
     |                             |                           |
]]></artwork>
	  </figure></t>

	  <t><xref target="fig.cdn_redirection"/> illustrates the
	  interaction between a user agent, a request router, and a
	  surrogate for the delivery of content in a single CDN.  As
	  explained in <xref
	  target="I-D.jenkins-alto-cdn-use-cases"></xref>, the user
	  agent makes an initial request to the CDN (F1). This may be
	  an application-level request (e.g., HTTP) or a DNS
	  request. In the second step (F2), the request router selects
	  an appropriate surrogate (or set of surrogates) based on the
	  user agent's (or its proxy's) IP address, the request
	  router's knowledge of the network topology (which can be
	  obtained by ALTO) and reachability cost between CDN caches
	  and end users, and any additional CDN policies. Then (F3),
	  the request router responds to the initial request with an
	  appropriate response containing a redirection to the
	  selected cache, for example by returning an appropriate DNS
	  A/AAAA record, a HTTP 302 redirect, etc. The user agent uses
	  this information to connect directly to the surrogate and
	  request the desired content (F4), which is then delivered
	  (F5).</t>

	</section>

	<section title="Applicability of ALTO">

	  <t>The most simple use case for ALTO in a CDN context is to
	  improve the selection of a CDN surrogate or origin. In this
	  case, the CDN makes use of an ALTO server to choose a better
	  CDN surrogate or origin than would otherwise be the
	  case. Although it is possible to obtain raw network map and
	  cost information in other ways, for example passively
	  listening to the ISP's routing protocols or use of active
	  probing, the use of an ALTO service to expose that
	  information may provide additional control to the ISP over
	  how their network map/cost is exposed.  Additionally it may
	  enable the ISP to maintain a functional separation between
	  their routing plane and network map computation functions.
	  This may be attractive for a number of reasons, for
	  example:</t>

	  <t><list style="symbols">

	    <t>The ALTO service could provide a filtered view of the
	    network and/or cost map that relates to CDN locations and
	    their proximity to end users, for example to allow the ISP
	    to control the level of topology detail they are willing
	    to share with the CDN.</t>

	    <t>The ALTO service could apply additional policies to the
	    network map and cost information to provide a CDN-specific
	    view of the network map/cost, for example to allow the ISP
	    to encourage the CDN to use network links that would not
	    ordinarily be preferred by a Shortest Path First routing
	    calculation.</t>

	    <t>The routing plane may be operated and controlled by a
	    different operational entity (even within a single ISP) than
	    the CDN. Therefore, the CDN may not be able to passively
	    listen to routing protocols, nor may it have access to
	    other network topology data (e.g., inventory
	    databases).</t>

	  </list></t>

	  <t>When CDN servers are deployed outside of an ISP's network
	  or in a small number of central locations within an ISP's
	  network, a simplified view of the ISP's topology or an
	  approximation of proximity is typically sufficient to enable
	  the CDN to serve end users from the optimal server/location.
	  As CDN servers are deployed deeper within ISP networks it
	  becomes necessary for the CDN to have more detailed
	  knowledge of the underlying network topology and costs
	  between network locations in order to enable the CDN to
	  serve end users from the most optimal servers for the
	  ISP.</t>

	  <t>The request router in a CDN will typically also take
	  into account criteria and constraints that are not related
	  to network topology, such as the current load of CDN surrogates, 
	  content owner policies, end user subscriptions, etc. This document
	  only discusses use of ALTO for network information.</t>

	  <!-- DNS -->

	  <t>A general issue for CDNs is that the CDN logic
	  has to match the client's IP address with the closest CDN
	  surrogate, both for DNS or HTTP redirect based approaches
	  (see, for instance, <xref
	  target="I-D.penno-alto-cdn"></xref>). This matching is not
	  trivial, for instance, in DNS based approaches, where the IP
	  address of the DNS original requester is unknown (see <xref
	  target="I-D.ietf-dnsop-edns-client-subnet"></xref> for a
	  discussion of this and a solution approach).</t>

	  <t>In addition to use by a single CDN, ALTO can also be used
	  in scenarios that interconnect several CDNs. This use case
	  is detailed in <xref
	  target="I-D.seedorf-cdni-request-routing-alto"></xref>.</t>

	</section>

      </section>

      <section title="Deployment Recommendations">

	<section title="ALTO Services">

	  <!-- Map -->

	  <t>In its simplest form an ALTO server would provide an ISP
	  with the capability to offer a service to a CDN that
	  provides network map and cost information. The CDN can use
	  that data to enhance its surrogate and/or origin
	  selection. If an ISP offers an ALTO network and cost map
	  service to expose a cost mapping/ranking between end user IP
	  subnets (within that ISP's network) and CDN surrogate IP
	  subnets/locations, periodic updates of the maps may be
	  needed. As introduced in <xref target="risks"></xref>), it
	  is common for broadband subscribers to obtain their IP
	  addresses dynamically and in many deployments the IP subnets
	  allocated to a particular network region can change
	  relatively frequently, even if the network topology itself
	  is reasonably static.</t>

	  <!-- ECS -->

	  <t>An alternative would be to use the ALTO Endpoint Cost
	  Service (ECS): When an end user requests a given content, the CDN
	  request router issues an ECS request with the endpoint
	  address (IPv4/IPv6) of the end user (content requester) and
	  the set of endpoint addresses of the surrogate (content
	  targets).  The ALTO server receives the request and ranks
	  the addresses based on their
	  distance from the content requester. Once the request router
	  obtained from the ALTO server the ranked list of locations
	  (for the specific user), it can incorporate this information
	  into its selection mechanisms in order to point the user to
	  the most appropriate surrogate.</t>

	  <!-- Map vs. ECS -->

	  <t>Since CDNs operate in a controlled environment, the ALTO
	  network/cost map service and ECS have a similar level of
	  security and confidentiality of network-internal
	  information. However, the network/cost map service and ECS
	  differ in the way the ALTO service is delivered and address
	  a different set of requirements in terms of topology
	  information and network operations.</t>

	  <t>If a CDN already has means to model connectivity
	  policies, the map-based approaches could possibly be
	  integrated into that. If the ECS service is preferred, a
	  request router that uses ECS could cache the results of ECS
	  queries for later usage in order to address the scalability
	  limitations of ECS and to reduce the number of transactions
	  between CDN and ALTO server. The ALTO server may indicate in
	  the reply message how long the content of the message is to
	  be considered reliable and insert a lifetime value that will
	  be used by the CDN in order to cache (and then flush or
	  refresh) the entry.</t>

	</section>

	<section title="Guidance Considerations">

	  <t>In the following it is discussed how a CDN could make use
	  of ALTO services.</t>

	  <t>In one deployment scenario, ALTO could expose ISP end
	  user reachability to a CDN. The request router needs to have
	  information about which end user IP subnets are reachable via
	  which networks or network locations. The network map
	  services offered by ALTO could be used to expose this
	  topology information while avoiding routing plane peering
	  between the ISP and the CDN. For example, if CDN surrogates
	  are deployed within the access or aggregation network, the
	  ISP is likely to want to utilize the surrogates deployed in
	  the same access/aggregation region in preference to
	  surrogates deployed elsewhere, in order to alleviate the
	  cost and/or improve the user experience.</t>

	  <t>In addition, CDN surrogates could also use ALTO guidance,
	  e.g., if there is more than one upstream source of content
	  or several origins. In this case, ALTO could help a
	  surrogate with the decision about which upstream source to
	  use. This specific variant of using ALTO is not further
	  detailed in this document.</t>

	  <t>If content can be provided by several CDNs, there may be
	  a need to interconnect these CDNs. In this case, ALTO can be
	  uses as an interface <xref
	  target="I-D.seedorf-cdni-request-routing-alto"></xref>, in
	  particular for footprint and capabilities advertisement.</t>

	  <t>Other and more advanced scenarios of deploying ALTO are
	  also listed in <xref
	  target="I-D.jenkins-alto-cdn-use-cases"></xref> and <xref
	  target="I-D.penno-alto-cdn"></xref>.</t>

	  <!-- Granularity -->

	  <t>The granularity of ALTO information required depends on
	  the specific deployment of the CDN.  For example, an
	  "over-the-top" CDN whose surrogates are deployed only within
	  the Internet backbone may only require knowledge of which
	  end user IP subnets are reachable via which ISPs' networks,
	  whereas a CDN deployed within a particular ISP's network
	  requires a finer granularity of knowledge.</t>

	  <!-- Ranking and Network Events -->

	  <t>An ALTO server ranks addresses based on topology information
	  it acquires from the network.  By default, according to
	  <xref target="RFC7285"></xref>, distance in ALTO represents an
	  abstract "routingcost" that can be computed for instance
	  from routing protocol information. But an ALTO server may
	  also take into consideration other criteria or other
	  information sources for policy, state, and performance
	  information (e.g., geo-location), as explained in <xref
	  target="sec.data_sources"/>.</t>

	  <t>The different methods and algorithms through which the
	  ALTO server computes topology information and rankings is
	  out of the scope of this document. If rankings are
	  based on routing protocol information, it is obvious that
	  network events may impact the ranking computation. Due to
	  internal redundancy and resilience mechanisms inside current
	  networks, most of the network events happening in the
	  infrastructure will be handled internally in the network,
	  and they should have limited impact on a CDN. However,
	  catastrophic events such as main trunks failures or backbone
	  partitioning will have to be taken into account by the ALTO
	  server to redirect traffic away from the impacted area.</t>

	  <t>An ALTO server implementation may want to keep state
	  about ALTO clients so to inform and signal to these clients
	  when a major network event happened, e.g., by a notification
	  mechanism. In a CDN/ALTO interworking architecture with few
	  CDN components interacting with the ALTO server there are
	  less scalability issues in maintaining state about clients
	  in the ALTO server, compared to ALTO guidance to any
	  Internet user.</t>

	</section>

      </section>

    </section>

    <section title="Other Use Cases">

      <t>This section briefly surveys and references other use cases
      that have been tested or suggested for ALTO deployments.</t>

      <section title="Application Guidance in Virtual Private Networks (VPNs)">

	<t>Virtual Private Network (VPN) technology is widely used in
	public and private networks to create groups of users that are
	separated from other users of the network and allows these
	users to communicate among themselves as if they were on a private
	network. Network Service Providers (NSPs) offer different
	types of VPNs. <xref target="RFC4026"></xref> distinguishes between
	Layer 2 VPN (L2VPN) and Layer 3 VPN (L3VPN) using different
	sub-types. In the following, the term "VPN" is used to refer
	to provider supplied virtual private networking.</t>

	<t>From the perspective of an application at an endpoint, a
	VPN may not be very different to any other IP connectivity
	solution, but there are a number of specific applications that
	could benefit from ALTO topology exposure and guidance in
	VPNs. As in the general Internet, one advantage is
	that applications do not have to perform excessive
	measurements on their own. For instance, potential use cases
	for ALTO application guidance in VPN environments are:</t>

        <t><list style="symbols">
          <t>Enterprise application optimization: Enterprise customers often
          run distributed applications that exchange large amounts of data,
          e.g., for synchronization of replicated data bases. Network topology 
          information could be useful for placement of replicas as well as for 
          the scheduling of transfers.</t>

          <t>Private cloud computing solution: An enterprise customer could
          run its own data centers at the four sites. The cloud management system
          could want to understand the network costs between different
          sites for intelligent routing and placement
          decisions of Virtual Machines (VMs) among the VPN sites.</t>

          <t>Cloud-bursting: One or more VPN endpoints could be located
          in a public cloud. If an enterprise customer needs additional
          resources, they could be provided by a public cloud, which is
          accessed through the VPN. Network topology awareness would
          help to decide in which data center of the public cloud 
          those resources should be allocated.</t>
        </list></t>

        <t>These examples focus on enterprises, which are typical
        users of VPNs. VPN customers typically have no insight into
        the network topology that transports the VPN. Similar to
        other ALTO use cases, better-than-random application-level
        decisions would be enabled by an ALTO server offered by the
        NSP, as illustrated in <xref target="fig.vpn"></xref>.</t>

        <t><figure title="Using ALTO in VPNs" anchor="fig.vpn">
	  <artwork><![CDATA[
                    +---------------+
                    |  Customer's   |
                    |   management  |
                    |  application  |.
                    | (ALTO client) |  .
                    +---------------+    .  VPN provisioning
                           /\              . (out-of-scope)
                           || ALTO           .
                           \/                  .
                 +---------------------+       +----------------+
                 |     ALTO server     |       | VPN portal/OSS |
                 |   provided by NSP   |       | (out-of-scope) |
                 +---------------------+       +----------------+
                            : VPN network
                            : and cost maps
                            :
                  /---------:---------\ Network service provider
                  |         :         |
     +-------+   _______________________   +-------+
     | App a | ()_____. .________. .____() | App d |
     +-------+    |   | |        | |  |    +-------+
                  \---| |--------| |--/           
                      | |        | |
                      |^|        |^| Customer VPN
                       V          V
                   +-------+  +-------+
                   | App b |  | App c |
                   +-------+  +-------+
]]></artwork>
	</figure></t>

        <t>A common characteristic of these use cases is that
        applications will not necessarily run in the public Internet,
        and that the relationship between the provider and customer of
        the VPN is rather well-defined. Since VPNs often run in a
        managed environment, an ALTO server may have access to
        topology information (e.g., traffic engineering data) that
        would not be available for the public Internet, and it may
        expose it to the customer of the VPN only.</t>

	<t>Also, a VPN will not necessarily be static. The customer
	could possibly modify the VPN and add new VPN sites by a Web
	portal, network management systems, or other Operation Support
	Systems (OSS) solutions. Prior to adding a new VPN site, an
	application will not have connectivity to that site, i.e.,
	an ALTO server could offer access to information that an
	application cannot measure on its own (e.g., expected delay to
	a new VPN site).</t>

	<t>The VPN use cases, requirements, and solutions are further
	detailed in <xref
	target="I-D.scharf-alto-vpn-service"></xref>.</t>

      </section>

      <section anchor="sec.p2pcache" title="In-Network Caching">

	<t>Deployment of intra-domain P2P caches has been proposed for
	cooperation between the network operator and the P2P
	service providers, e.g., to reduce the bandwidth consumption in
	access networks <xref
	target="I-D.deng-alto-p2pcache"></xref>.</t>

	<t><figure anchor="fig.p2pcache" 
		   title="General architecture of intra-ISP caches">
	  <artwork><![CDATA[
         +--------------+                +------+
         | ISP 1 network+----------------+Peer 1|
         +-----+--------+                +------+
         |
+--------+------------------------------------------------------+
|        |                                      ISP 2 network   |
|  +---------+                                                  |
|  |L1 Cache |                                                  |
|  +-----+---+                                                  |
|        +--------------------+----------------------+          |
|        |                    |                      |          |
| +------+------+      +------+-------+       +------+-------+  |
| | AN1         |      | AN2          |       | AN3          |  |
| | +---------+ |      | +----------+ |       |              |  |
| | |L2 Cache | |      | |L2 Cache  | |       |              |  |
| | +---------+ |      | +----------+ |       |              |  |
| +------+------+      +------+-------+       +------+-------+  |
|        |                                           |          |
|        +--------------------+                      |          |
|        |                    |                      |          |
| +------+------+      +------+-------+       +------+-------+  |
| | SUB-AN11    |      | SUB-AN12     |       | SUB-AN31     |  |
| | +---------+ |      |              |       |              |  |
| | |L3 Cache | |      |              |       |              |  |
| | +---------+ |      |              |       |              |  |
| +------+------+      +------+-------+       +------+-------+  |
|        |                    |                      |          |
+--------+--------------------+----------------------+----------+
         |                    |                      |
     +---+---+            +---+---+                  |
     |       |            |       |                  |
  +--+--+ +--+--+      +--+--+ +--+--+            +--+--+
  |Peer2| |Peer3|      |Peer4| |Peer5|            |Peer6|
  +-----+ +-----+      +-----+ +-----+            +-----+
]]></artwork>
	</figure></t>

	<t><xref target="fig.p2pcache"/> depicts the overall
	architecture of potential P2P cache deployments inside an
	ISP 2 with various access network types. As shown in the
	figure, P2P caches may be deployed at various levels,
	including the interworking gateway linking with other ISPs,
	internal access network gateways linking with different types
	of accessing networks (e.g. WLAN, cellular and wired), and
	even within an accessing network at the entries of individual
	WLAN sub-networks. Moreover, depending on the network context
	and the operator's policy, each cache can be a Forwarding
	Cache or a Bidirectional Cache <xref
	target="I-D.deng-alto-p2pcache"></xref>.</t>

	<t>In such a cache architecture, the locations of caches could
	be used as dividers of different PIDs to guide intra-ISP
	network abstraction and mark costs among them according to the
	location and type of relevant caches.</t>

	<t>Further details and deployment considerations can be found
	in <xref target="I-D.deng-alto-p2pcache"></xref>.</t>

      </section>

      <section anchor="sec.abno" title="Other Application-based Network Operations">

        <t>An ALTO server can be part of an overall framework for
        Application-Based Network Operations (ABNO) <xref
        target="RFC7491"></xref> that
        brings together different technologies for gathering
        information about the resources available in a network, for
        consideration of topologies and how those topologies map to
        underlying network resources, for requesting path computation,
        and for provisioning or reserving network resources. Such an
        architecture may include additional components such as a Path
        Computation Element (PCE) for on-demand and
        application-specific reservation of network connectivity,
        reliability, and resources (such as bandwidth). Some 
        use cases how to leverage ALTO for joint network and
        application-layer optimization are explained in <xref
        target="RFC7491"></xref>.</t>

      </section>

    </section>

    <section title="Security Considerations">
<!--
      <t>The ALTO requirement document <xref target="RFC6708"></xref>
      and the ALTO protocol specification <xref
      target="RFC7285"></xref> discuss risk and protection strategies
      for the authenticity and integrity of ALTO information, a
      potential undesirable guidance from authenticated ALTO
      information, the confidentiality of ALTO information, the
      privacy of ALTO users, and the availability of the ALTO
      service. All those issues and potential countermeasures have to
      be taken into account when deploying an ALTO service.</t>

      <t>The following subsection further details key security issues
      that an operator has to consider when deploying ALTO in the use
      cases discussed this document. This document focuses security
      considerations relevant to operators and administrators of
      an ALTO service; security considerations for clients can be found
      in <xref target="RFC7285"></xref>.</t>
-->

        <t>Security concerns were extensively discussed from the 
        very beginning of the development of the ALTO protocol, and
        they have been considered in detail in the ALTO requirements
        document <xref target="RFC6708"/> as well as in the ALTO
        protocol specification document <xref target="RFC7285"/>.
        The two main security concerns are related to the unwanted disclosure
        of information through ALTO and the negative impact of
        specially crafted, wrong ("faked") guidance presented to an ALTO
        client.  In addition to this, the usual concerns related to
        the operation of any networked application apply.
        </t>

        <t>This section focuses on the peer-to-peer use case, which is
        - from a security perspective - probably the most difficult
        ALTO use case that has been considered. Special attention is
        given to the two main security concerns.</t>

        <section anchor="sec.security.trustboundary"
            title="ALTO as a Protocol Crossing Trust Boundaries">

            <t>The optimization of peer-to-peer applications was the
            first use case and the impetus for the development of the
            ALTO protocol, in particular file sharing applications
            such as BitTorrent <xref target="RFC5594"/>.</t>

            <t>As explained in <xref target="sec.p2phistory"/>, for
            the publisher of the ALTO information (i.e., the ALTO
            server operator) it is not always clear who is in charge
            of the P2P application overlay.  Some P2P applications do not
            have any central control entity and the whole overlay
            consists only of the peers, which are under control of the
            individual users.  Other P2P applications may have some
            control entities such as super peers or trackers, but
            these may be located in foreign countries and under the
            control of unknown organizations. As outlined in <xref
            target="sec.alto_in_tracker_p2p"/>, in some scenarios it
            may be very beneficial to forward ALTO information to such
            trackers, super peers, etc. located in remote networks.
            This situation is aggravated by the
            vast number of different P2P applications which are
            evolving quickly and often without any coordination with
            the network operators.</t>

            <t>In summary it can be said that in many instances of the
            P2P use case, the ALTO protocol bridges the border between
            the "managed" IP network infrastructure under strict
            administrative control and one or more "unmanaged"
            application overlays, i.e., overlays for which it is hard
            to tell who is in charge of them. This differs from
            more controlled environments (e.g., in the CDN use case),
            in which bilateral agreements between the producer and
            consumer of guidance are possible.</t>

        </section>
            
<!--
        <t>unmanaged overlay on managed network. trust boundary. disclosure of topology, app behavior. aiding and abetting. throtteling - independent operators.</t>
-->
      <section anchor="sec.security.leakage"
        title="Information Leakage from the ALTO Server">

        <t>An ALTO server will be provisioned with information about
        the ISP's network and possibly also with information about
        neighboring ISPs. This information (e.g., network topology,
        business relations, etc.) is often considered to be
        confidential to the ISP and can include very sensitive
        information. ALTO does not require any particular level of
        details of information disclosure, and hence the provider
        should evaluate how much information is revealed and the
        associated risks.</t>

	<!--
        <t>Furthermore, if the ALTO information is very fine grained, it may
        also be considered sensitive with respect to user privacy.  For
        example, consider a hypothetical (i.e., not yet standardized)
        endpoint property "provisioned access link bandwitdh" or "access
        technology (ADSL, VDSL, FTTH, etc.)" and an ALTO service that
        publishes this property for individual IP addresses. This service
        could not only be used for traffic optimization but, for example,
        also for targeted advertising. Web sites could add special banner
        advertisements, e.g., for luxury products or computer products, for
        clients at IP addresses that have exceptionally good connectivity.
        This idea would be based on the assumption that a subscriber willing
        to pay a higher price for better connectivity is an indication that
        this is a household with better-than-average income and/or computer
        professionals or enthusiasts living there.</t>
	-->

        <t>Furthermore, if the ALTO information is very fine grained,
        it may also be considered sensitive with respect to user
        privacy. For example, consider a hypothetical endpoint
        property "provisioned access link bandwidth" or "access
        technology (ADSL, VDSL, FTTH, etc.)" and an ALTO service that
        publishes this property for individual IP addresses. This
        information could not only be used for traffic optimization
        but, for example, also for targeted advertising to residential
        users with exceptionally good (or bad) connectivity, such as
        special banner ads. For an advertisement system it would be
        more complex to obtain such information otherwise, e.g., by
        bandwidth probing.</t>

        <t>Different scenarios related to the unwanted disclosure of
        an ALTO server's information have been itemized and categorized
        in RFC 6708, Section 5.2.1., cases (1)-(3) <xref target="RFC6708"/>.
        </t>

        <t>In some use cases it is not possible to
        use access control (see <xref target="sec.security.accesscontrol"/>)
        to limit the distribution of ALTO knowledge to a small set of
        trusted clients. In these scenarios it seems tempting not to
        use network maps and cost maps at all, and instead completely
        rely on endpoint cost service and endpoint ranking in the ALTO
        server. While this practice may indeed reduce the amount of
        information that is disclosed to an individual ALTO client,
        some issues should be considered:  First, when using the map based
        approach, it is trivial to analyze the maximum amount of information
        that could be disclosed to a client: the full maps. In contrast,
        when providing endpoint cost service only, the ALTO server operator
        could be prone to a false feeling of security, while clients use
        repeated queries and/or collaboration to gather more information
        than they are expected to get (see Section 5.2.1., case (3) in  
        <xref target="RFC6708"/>). Second, the endpoint cost service
        reveals more information about the user or application behavior
        to the ALTO server, e.g., which other hosts are considered as
        peers for the exchange of a significant amount of data
        (see Section 5.2.1., cases (4)-(6) in <xref target="RFC6708"/>).</t>

	<t>Consequently, users may be more reluctant to use the ALTO service
        at all if it is based on the endpoint cost service
        instead of providing network and cost maps. Given that some
        popular P2P applications are sometimes used for purposes
        such as distribution of files without the explicit permission
        from the copyright owner, it may also be in the interest of
        the ALTO server operator that an ALTO server cannot infer
        the behavior of the application to be optimized. One possible
        conclusion could be to publish network and cost maps through
        ALTO that are so coarse-grained that they do not violate
        the network operator's or the user's interests.</t>

        <t>In other use cases in more controlled environments (e.g., in the
        CDN use case) bilateral agreements, access control (see 
        <xref target="sec.security.accesscontrol"/>), and encryption could be
        used to reduce the risk of information leakage.</t>

      </section>

      <section anchor="sec.security.accesscontrol" title="ALTO Server Access">

        <t>Depending on the use case of ALTO, it may be desired to
        apply access restrictions to an ALTO server, i.e., by
        requiring client authentication. According to <xref
        target="RFC7285"></xref>, ALTO requires that
        HTTP Digest Authentication is supported, in order to
        achieve client authentication and possibly to limit the number
        of parties with whom ALTO information is directly shared. TLS
        Client Authentication may also be supported.</t>

        <t>In general, well-known security management techniques and
        best current practices <xref target="RFC4778"></xref> for
        operational ISP infrastructure also apply to an ALTO
        service, including functions to protect the system from
        unauthorized access, key management, reporting
        security-relevant events, and authorizing user access and
        privileges.</t>

	<t>For peer-to-peer applications, a potential deployment
	scenario is that an ALTO server is solely accessible by peers
	from the ISP network (as shown in <xref
	target="fig.localALTOServer"></xref>). For instance, the
	source IP address can be used to grant only access from that
	ISP network to the server. This will "limit" the number of
	peers able to attack the server to the user's of the ISP
	(however, including botnet computers).</t>

        <t>If the ALTO server has to be accessible by parties not
        located in the ISP's network (see <xref
        target="fig.global_tracker"></xref>), e.g., by a third-party
        tracker or by a CDN system outside the ISP's network, the
        access restrictions have to be looser. In the extreme
        case, i.e., no access restrictions, each and every host in the
        Internet can access the ALTO server. This might no be the
        intention of the ISP, as the server is not only subject to
        more possible attacks, but also the server load could increase,
	since possibly more ALTO clients have to be served.</t>

	<t>There are also use cases where the access to the ALTO
	server has to be much more strictly controlled, i. e., where
	an authentication and authorization of the ALTO client to the
	server may be needed. For instance, in case of CDN
	optimization the provider of an ALTO service as well as
	potential users are possibly well-known. Only CDN entities may
	need ALTO access; access to the ALTO servers by residential
	users may neither be necessary nor be desired.</t>

        <t>Access control can also help to prevent Denial-of-Service
        attacks by arbitrary hosts from the
        Internet. Denial-of-Service (DoS) can both affect an ALTO
        server and an ALTO client. A server can get overloaded if too
        many requests hit the server, or if the query load of the
        server surpasses the maximum computing capacity. An ALTO
        client can get overloaded if the responses from the sever are,
        either intentionally or due to an implementation mistake, too
        large to be handled by that particular client.</t>

      </section>

      <section title="Faking ALTO Guidance">

        <t>The ALTO services enables an ALTO service provider to
        influence the behavior of network applications. An attacker
        who is able to generate false replies, or e.g. an attacker who
        can intercept the ALTO server discovery procedure, can provide
        faked ALTO guidance.</t>

        <t>Here is a list of examples how the ALTO guidance could be
        faked and what possible consequences may arise:</t>

	<t><list style="hanging">

	  <t hangText="Sorting:">An attacker could change to sorting
	  order of the ALTO guidance (given that the order is of
	  importance, otherwise the ranking mechanism is of interest),
	  i.e., declaring peers located outside the ISP as peers to be
	  preferred. This will not pose a big risk to the network or
	  peers, as it would mimic the "regular" peer operation
	  without traffic localization, apart from the
	  communication/processing overhead for ALTO. However, it
	  could mean that ALTO is reaching the opposite goal of
	  shuffling more data across ISP boundaries, incurring more
	  costs for the ISP. In another example, fake guidance could
	  give unrealistically low costs to devices in an ISP's mobile
	  network, thus encouraging other devices to contact them,
	  thereby degrading the ISP's mobile network and causing
	  customer dissatisfaction.</t>

	  <t hangText="Preference of a single peer:">A single IP
	  address (thus a peer) could be marked as to be preferred all
	  over other peers. This peer can be located within the local
	  ISP or also in other parts of the Internet (e.g., a web
	  server). This could lead to the case that quite a number of
	  peers to trying to contact this IP address, possibly causing
	  a Denial-of-Service (DoS) attack.</t>
          
	</list></t>

        <t>It has not yet been investigated how a faked or wrong ALTO
        guidance by an ALTO server can impact the operation of the
        network and also the applications, e.g., peer-to-peer
        applications.</t>

      </section>

    </section>

    <section title="IANA Considerations">

      <t>This document makes no specific request to IANA.</t>

    </section>

    <section title="Conclusion">

      <t>This document discusses how the ALTO protocol can be deployed
      in different use cases and provides corresponding guidance and
      recommendations to network administrators and application
      developers.</t>

    </section>

    <section title="Acknowledgments">

      <t>This memo is the result of contributions made by several
      people:</t>

      <t><list style="symbols">

	  <t>Xianghue Sun, Lee Kai, and Richard Yang contributed text
	  on ISP deployment requirements and monitoring.</t>

	  <t>Stefano Previdi contributed parts of the <xref
	  target="sec.cdn_cons"/> on "Using ALTO for CDNs".</t>

	  <t>Rich Woundy contributed text to <xref
	  target="risks"/>.</t>

	  <t>Lingli Deng, Wei Chen, Qiuchao Yi, and Yan Zhang
	  contributed <xref target="sec.p2pcache"/>.</t>

      </list></t>

      <t>Thomas-Rolf Banniza, Vinayak Hegde, Qin Wu, and Wendy Roome provided
      very useful comments and reviewed the document.</t>

      <t>Martin Stiemerling is partially supported by the CHANGE
      project (http://www.change-project.eu), a research project
      supported by the European Commission under its 7th Framework
      Program (contract no.  257422).  The views and conclusions
      contained herein are those of the authors and should not be
      interpreted as necessarily representing the official policies or
      endorsements, either expressed or implied, of the CHANGE project
      or the European Commission.</t>

    </section>

  </middle>

  <back>

    <references title="Normative References">

      <?rfc include="reference.RFC.5693"?>

      <?rfc include="reference.RFC.6708"?>

      <?rfc include="reference.RFC.7285"?>

      <?rfc include="reference.RFC.7286"?>

    </references>

    <references title="Informative References">

      <?rfc include="reference.RFC.3411"?>

      <?rfc include="reference.RFC.3568"?>

      <?rfc include="reference.RFC.4026"?>

      <?rfc include="reference.RFC.4778"?>

      <?rfc include="reference.RFC.5594"?>

      <?rfc include="reference.RFC.5632"?>

      <?rfc include="reference.RFC.6020"?>

      <?rfc include="reference.RFC.6241"?>

      <?rfc include="reference.RFC.6875"?>

      <?rfc include="reference.RFC.7491"?>

      <?rfc include="reference.I-D.kiesel-alto-xdom-disc"?>

      <?rfc include="reference.I-D.jenkins-alto-cdn-use-cases"?>

      <?rfc include="reference.I-D.seedorf-cdni-request-routing-alto"?>

      <?rfc include="reference.I-D.penno-alto-cdn"?>

      <?rfc include="reference.I-D.scharf-alto-vpn-service"?>

      <?rfc include="reference.I-D.deng-alto-p2pcache"?>

      <?rfc include="reference.I-D.ietf-idr-ls-distribution"?>

      <?rfc include="reference.I-D.ietf-i2rs-architecture"?>

      <?rfc include="reference.I-D.lee-alto-chinatelecom-trial"?>

      <?rfc include="reference.I-D.kiesel-alto-h12"?>

      <?rfc include="reference.I-D.ietf-dnsop-edns-client-subnet"?>

      <?rfc include="reference.I-D.wu-alto-te-metrics"?>

      <?rfc include="reference.I-D.seidel-alto-map-calculation"?>

    </references>

  </back>

</rfc>

PAFTECH AB 2003-20262026-04-24 05:41:04