RFC 3714 - IAB Concerns Regarding Congestion Control for Voice T

Network Working Group                                      S. Floyd, Ed.
Request for Comments: 3714                                 J. Kempf, Ed.
Category: Informational                                      March 2004

             IAB Concerns Regarding Congestion Control for
                     Voice Traffic in the Internet

Status of this Memo

   This memo provides information for the Internet community.  It does
   not specify an Internet standard of any kind.  Distribution of this
   memo is unlimited.

Copyright Notice

Abstract

   This document discusses IAB concerns about effective end-to-end
   congestion control for best-effort voice traffic in the Internet.
   These concerns have to do with fairness, user quality, and with the
   dangers of congestion collapse.  The concerns are particularly
   relevant in light of the absence of a widespread Quality of Service
   (QoS) deployment in the Internet, and the likelihood that this
   situation will not change much in the near term.  This document is
   not making any recommendations about deployment paths for Voice over
   Internet Protocol (VoIP) in terms of QoS support, and is not claiming
   that best-effort service can be relied upon to give acceptable
   performance for VoIP.  We are merely observing that voice traffic is
   occasionally deployed as best-effort traffic over some links in the
   Internet, that we expect this occasional deployment to continue, and
   that we have concerns about the lack of effective end-to-end
   congestion control for this best-effort voice traffic.

Table of Contents

   1.  Introduction . . . . . . . . . . . . . . . . . . . . . . . . .  2
   2.  An Example of the Potential for Trouble. . . . . . . . . . . .  4
   3.  Why are Persistent, High Drop Rates a Problem? . . . . . . . .  6
       3.1.  Congestion Collapse. . . . . . . . . . . . . . . . . . .  6
       3.2.  User Quality . . . . . . . . . . . . . . . . . . . . . .  7
       3.3.  The Amorphous Problem of Fairness. . . . . . . . . . . .  8
   4.  Current efforts in the IETF. . . . . . . . . . . . . . . . . . 10
       4.1.  RTP. . . . . . . . . . . . . . . . . . . . . . . . . . . 10
       4.2.  TFRC . . . . . . . . . . . . . . . . . . . . . . . . . . 11
       4.3.  DCCP . . . . . . . . . . . . . . . . . . . . . . . . . . 12

       4.4.  Adaptive Rate Audio Codecs . . . . . . . . . . . . . . . 12
       4.5.  Differentiated Services and Related Topics . . . . . . . 13
   5.  Assessing Minimum Acceptable Sending Rates . . . . . . . . . . 13
       5.1.  Drop Rates at 4.75 kbps Minimum Sending Rate . . . . . . 17
       5.2.  Drop Rates at 64 kbps Minimum Sending Rate . . . . . . . 18
       5.3.  Open Issues. . . . . . . . . . . . . . . . . . . . . . . 18
       5.4.  A Simple Heuristic . . . . . . . . . . . . . . . . . . . 19
   6. Constraints on VoIP Systems . . . . . . . . . . . . . . . . . . 20
   7.  Conclusions and Recommendations. . . . . . . . . . . . . . . . 20
   8.  Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . 21
   9.  References . . . . . . . . . . . . . . . . . . . . . . . . . . 21
       9.1.  Normative References . . . . . . . . . . . . . . . . . . 21
       9.2.  Informative References . . . . . . . . . . . . . . . . . 22
   10. Appendix - Sending Rates with Packet Drops . . . . . . . . . . 26
   11. Security Considerations. . . . . . . . . . . . . . . . . . . . 29
   12. IANA Considerations. . . . . . . . . . . . . . . . . . . . . . 29
   13. Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . 30
   14. Full Copyright Statement . . . . . . . . . . . . . . . . . . . 31

1.  Introduction

   While many in the telephony community assume that commercial VoIP
   service in the Internet awaits effective end-to-end QoS, in reality
   voice service over best-effort broadband Internet connections is an
   available service now with growing demand.  While some ISPs deploy
   QoS on their backbones, and some corporate intranets offer end-to-end
   QoS internally, end-to-end QoS is not generally available to
   customers in the current Internet.  Given the current commercial
   interest in VoIP on best-effort media connections, it seems prudent
   to examine the potential effect of real time flows on congestion.  In
   this document, we perform such an analysis.  Note, however, that this
   document is not making any recommendations about deployment paths for
   VoIP in terms of QoS support, and is not claiming that best-effort
   service can be relied upon to give acceptable performance for VoIP.
   This document is also not discussing signalling connections for VoIP.
   However, voice traffic is in fact occasionally deployed as best
   effort traffic over some links in the Internet today, and we expect
   this occasional deployment to continue.  This document expresses our
   concern over the lack of effective end-to-end congestion control for
   this best-effort voice traffic.

   Assuming that VoIP over best-effort Internet connections continues to
   gain popularity among consumers with broadband connections, the
   deployment of end-to-end QoS mechanisms in public ISPs may be slow.
   The IETF has developed standards for QoS mechanisms in the Internet
   [DIFFSERV, RSVP] and continues to be active in this area [NSIS,COPS].
   However, the deployment of technologies requiring change to the
   Internet infrastructure is subject to a wide range of commercial as

   well as technical considerations, and technologies that can be
   deployed without changes to the infrastructure enjoy considerable
   advantages in the speed of deployment.  RFC 2990 outlines some of the
   technical challenges to the deployment of QoS architectures in the
   Internet [RFC2990].  Often, interim measures that provide support for
   fast-growing applications are adopted, and are successful enough at
   meeting the need that the pressure for a ubiquitous deployment of the
   more disruptive technologies is reduced.  There are many examples of
   the slow deployment of infrastructure that are similar to the slow
   deployment of QoS mechanisms, including IPv6, IP multicast, or of a
   global PKI for IKE and IPsec support.

   Interim QoS measures that can be deployed most easily include
   single-hop or edge-only QoS mechanisms for VoIP traffic on individual
   congested links, such as edge-only QoS mechanisms for cable access
   networks.  Such local forms of QoS could be quite successful in
   protecting some fraction of best-effort VoIP traffic from congestion.
   However, these local forms of QoS are not directly visible to the
   end-to-end VoIP connection.  A best-effort VoIP connection could
   experience high end-to-end packet drop rates, and be competing with
   other best-effort traffic, even if some of the links along the path
   might have single-hop QoS mechanisms.

   The deployment of IP telephony is likely to include best-effort
   broadband connections to public-access networks, in addition to other
   deployment scenarios of dedicated IP networks, or as an alternative
   to band splitting on the last mile of ADSL deployments or QoS
   mechanisms on cable access networks.  There already exists a
   rapidly-expanding deployment of VoIP services intended to operate
   over residential broadband access links (e.g., [FWD, Vonage]).  At
   the moment, many public-access IP networks are uncongested in the
   core, with low or moderate levels of link utilization, but this is
   not necessarily the case on last hop links.  If an IP telephony call
   runs completely over the Internet, the connection could easily
   traverse congested links on both ends.  Because of economic factors,
   the growth rate of Internet telephony is likely to be greatest in
   developing countries, where core links are more likely to be
   congested, making congestion control an especially important topic
   for developing countries.

   Given the possible deployment of IP telephony over congested best-
   effort networks, some concerns arise about the possibilities of
   congestion collapse due to a rapid growth in real-time voice traffic
   that does not practice end-to-end congestion control.  This document
   raises some concerns about fairness, user quality, and the danger of
   congestion collapse that would arise from a rapid growth in best-
   effort telephony traffic on best-effort networks.  We consider best-
   effort telephony connections that have a minimum sending rate and

   that compete directly with other best-effort traffic on a path with
   at least one congested link, and address the specific question of
   whether such traffic should be required to terminate, or to suspend
   sending temporarily, in the face of a persistent, high packet drop
   rate, when reducing the sending rate is not a viable alternative.

   The concerns in this document about fairness and the danger of
   congestion collapse apply not only to telephony traffic, but also to
   video traffic and other best-effort real-time traffic with a minimum
   sending rate.  RFC 2914 already makes the point that best-effort
   traffic requires end-to-end congestion control [RFC2914].  Because
   audio traffic sends at such a low rate, relative to video and other
   real-time traffic, it is sometimes claimed that audio traffic doesn't
   require end-to-end congestion control.  Thus, while the concerns in
   this document are general, the document focuses on the particular
   issue of best-effort audio traffic.

   Feedback can be sent to the IAB mailing list at iab@ietf.org, or to
   the editors at floyd@icir.org and kempf@docomolabs-usa.com.  Feedback
   can also be sent to the end2end-interest mailing list [E2E].

2.  An Example of the Potential for Trouble

   At the November, 2002, IEPREP Working Group meeting in Atlanta, a
   brief demonstration was made of VoIP over a shared link between a
   hotel room in Atlanta, Georgia, USA, and Nairobi, Kenya.  The link
   ran over the typical uncongested Internet backbone and access links
   to peering points between either endpoint and the Internet backbone.
   The voice quality on the call was very good, especially in comparison
   to the typical quality obtained by a circuit-switched call with
   Nairobi.  A presentation that accompanied the demonstration described
   the access links (e.g., DSL, T1, T3, dialup, and cable modem links)
   as the primary source of network congestion, and described VoIP
   traffic as being a very small percentage of the packets in commercial
   ISP traffic [A02].  The presentation further stated that VoIP
   received good quality in the presence of packet drop rates of 5-40%
   [AUT].  The VoIP call used an ITU-T G.711 codec, plus proprietary FEC
   encoding, plus RTP/UDP/IP framing.  The resulting traffic load over
   the Internet was substantially more than the 64 kbps required by the
   codec.  The primary congestion point along the path of the
   demonstration was a 128 kbps access link between an ISP in Kenya and
   several of its subscribers in Nairobi.  So the single VoIP call
   consumed more than half of the access link capacity, capacity that is
   shared across several different users.

   Note that this network configuration is not a particularly good one
   for VoIP.  In particular, if there are data services running TCP on
   the link with a typical packet size of 1500 bytes, then some voice

   packets could be delayed an additional 90 ms, which might cause an
   increase in the end to end delay above the ITU-recommended time of
   150 ms [G.114] for speech traffic.  This would result in a delay
   noticeable to users, with an increased variation in delay, and
   therefore in call quality, as the bursty TCP traffic comes and goes.
   For a call that already had high delay, such as the Nairobi call from
   the previous paragraph, the increased jitter due to competing TCP
   traffic also increases the requirements on the jitter buffer at the
   receiver.  Nevertheless, VoIP usage over congested best-effort links
   is likely to increase in the near future, regardless of VoIP's
   superior performance with "carrier class" service.  A best-effort
   VoIP connection that persists in sending packets at 64 Kbps,
   consuming half of a 128 Kbps access link, in the face of a drop rate
   of 40%, with the resulting user-perceptible degradation in voice
   quality, is not behaving in a way that serves the interests of either
   the VoIP users or the other concurrent users of the network.

   As the Nairobi connection demonstrates, prescribing universal
   overprovisioning (or more precisely, provisioning sufficient to avoid
   persistent congestion) as the solution to the problem is not an
   acceptable generic solution.  For example, in regions of the world
   where circuit-switched telephone service is poor and expensive, and
   Internet access is possible and lower cost, provisioning all Internet
   links to avoid congestion is likely to be impractical or impossible.

   In particular, an over-provisioned core is not by itself sufficient
   to avoid congestion collapse all the way along the path, because an
   over-provisioned core can not address the common problem of
   congestion on the access links.  Many access links routinely suffer
   from congestion.  It is important to avoid congestion collapse along
   the entire end-to-end path, including along the access links (where
   congestion collapse would consist of congested access links wasting
   scarce bandwidth carrying packets that will only be dropped
   downstream).  So an over-provisioned core does not by itself
   eliminate or reduce the need for end-to-end congestion avoidance and
   control.

   There are two possible mechanisms for avoiding this congestion
   collapse: call rejection during busy periods, or the use of end-to-
   end congestion control.  Because there are currently no
   acceptance/rejection mechanisms for best-effort traffic in the
   Internet, the only alternative is the use of end-to-end congestion
   control.  This is important even if end-to-end congestion control is
   invoked only in those very rare scenarios with congestion in
   generally-uncongested access links or networks.  There will always be
   occasional periods of high demand, e.g., in the two hours after an
   earthquake or other disaster, and this is exactly when it is
   important to avoid congestion collapse.

   Best-effort traffic in the Internet does not include mechanisms for
   call acceptance or rejection.  Instead, a best-effort network itself
   is largely neutral in terms of resource management, and the
   interaction of the applications' transport sessions mutually
   regulates network resources in a reasonably fair fashion.  One way to
   bring voice into the best-effort environment in a non-disruptive
   manner is to focus on the codec and look at rate adaptation measures
   that can successfully interoperate with existing transport protocols
   (e.g., TCP), while at the same time preserving the integrity of a
   real-time, analog voice signal; another way is to consider codecs
   with fixed sending rates.  Whether the codec has a fixed or variable
   sending rate, we consider the appropriate response when the codec is
   at its minimum data rate, and the packet drop rate experienced by the
   flow remains high.  This is the key issue addressed in this document.

3.  Why are Persistent, High Drop Rates a Problem?

   Persistent, high packet drop rates are rarely seen in the Internet
   today, in the absence of routing failures or other major disruptions.
   This happy situation is due primarily to low levels of link
   utilization in the core, with congestion typically found on lower-
   capacity access links, and to the use of end-to-end congestion
   control in TCP.  Most of the traffic on the Internet today uses TCP,
   and TCP self-corrects so that the two ends of a connection reduce the
   rate of packet sending if congestion is detected.  In the sections
   below, we discuss some of the problems caused by persistent, high
   packet drop rates.

3.1.  Congestion Collapse

   One possible problem caused by persistent, high packet drop rates is
   that of congestion collapse.  Congestion collapse was first observed
   during the early growth phase of the Internet of the mid 1980s
   [RFC896], and the fix was provided by Van Jacobson, who developed the
   congestion control mechanisms that are now required in TCP
   implementations [Jacobson88, RFC2581].

   As described in RFC 2914, congestion collapse occurs in networks with
   flows that traverse multiple congested links having persistent, high
   packet drop rates [RFC2914].  In particular, in this scenario packets
   that are injected onto congested links squander scarce bandwidth
   since these packets are only dropped later, on a downstream congested
   link.  If congestion collapse occurs, all traffic slows to a crawl
   and nobody gets acceptable packet delivery or acceptable performance.
   Because congestion collapse of this form can occur only for flows
   that traverse multiple congested links, congestion collapse is a

   potential problem in VoIP networks when both ends of the VoIP call
   are on an congested broadband connection such as DSL, or when the
   call traverses a congested backbone or transoceanic link.

3.2.  User Quality

   A second problem with persistent, high packet drop rates concerns
   service quality seen by end users.  Consider a network scenario where
   each flow traverses only one congested link, as could have been the
   case in the Nairobi demonstration above.  For example, imagine N VoIP
   flows sharing a 128 Kbps link, with each flow sending at least 64
   Kbps.  For simplicity, suppose the 128 Kbps link is the only
   congested link, and there is no traffic on that link other than the N
   VoIP calls.  We will also ignore for now the extra bandwidth used by
   the telephony traffic for FEC and packet headers, or the reduced
   bandwidth (often estimated as 70%) due to silence suppression.  We
   also ignore the fact that the two streams composing a bidirectional
   VoIP call, one for each direction, can in practice add to the load on
   some links of the path.  Given these simplified assumptions, the
   arrival rate to that link is at least N*64 Kbps.  The traffic
   actually forwarded is at most 2*64 Kbps (the link bandwidth), so at
   least (N-2)*64 Kbps of the arriving traffic must be dropped.  Thus, a
   fraction of at least (N-2)/N of the arriving traffic is dropped, and
   each flow receives on average a fraction 1/N of the link bandwidth.
   An important point to note is that the drops occur randomly, so that
   no one flow can be expected statistically to present better quality
   service to users than any other.  Everybody's voice quality therefore
   suffers.

   It seems clear from this simple example that the quality of best-
   effort VoIP traffic over congested links can be improved if each VoIP
   flow uses end-to-end congestion control, and has a codec that can
   adapt the bit rate to the bandwidth actually received by that flow.
   The overall effect of these measures is to reduce the aggregate
   packet drop rate, thus improving voice quality for all VoIP users on
   the link.  Today, applications and popular codecs for Internet
   telephony attempt to compensate by using more FEC, but controlling
   the packet flow rate directly should result in less redundant FEC
   information, and thus less bandwidth, thereby improving throughput
   even further.  The effect of delay and packet loss on VoIP in the
   presence of FEC has been investigated in detail in the literature
   [JS00, JS02, JS03, MTK03].  One rule of thumb is that when the packet
   loss rate exceeds 20%, the audio quality of VoIP is degraded beyond
   usefulness, in part due to the bursty nature of the losses [S03].  We
   are not aware of measurement studies of whether VoIP users in
   practice tend to hang up when packet loss rates exceed some limit.

   The simple example in this section considered only voice flows, but
   in reality, VoIP traffic will compete with other flows, most likely
   TCP.  The response of VoIP traffic to congestion works best by taking
   into account the congestion control response of TCP, as is discussed
   in the next subsection.

3.3.  The Amorphous Problem of Fairness

   A third problem with persistent, high packet drop rates is fairness.
   In this document we consider fairness with regard to best-effort VoIP
   traffic competing with other best-effort traffic in the Internet.
   That is, we are explicitly not addressing the issues raised by
   emergency services, or by QoS-enabled traffic that is known to be
   treated separately from best-effort traffic at a congested link.

   While fairness is a bit difficult to quantify, we can illustrate the
   effect by adding TCP traffic to the congested link discussed in the
   previous section.  In this case, the non-congestion-controlled
   traffic and congestion-controlled TCP traffic [RFC2914] share the
   link, with the congestion-controlled traffic's sending rate
   determined by the packet drop rate experienced by those flows.  As in
   the previous section, the 128 Kbps link has N VoIP connections each
   sending 64 Kbps, resulting in packet drop rate of at least (N-2)/N on
   the congested link.  Competing TCP flows will experience the same
   packet drop rates.  However, a TCP flow experiencing the same packet
   drop rates will be sending considerably less than 64 Kbps.  From the
   point of view of who gets what amount of bandwidth, the VoIP traffic
   is crowding out the TCP traffic.

   Of course, this is only one way to look at fairness.  The relative
   fairness between VoIP and TCP traffic can be viewed several different
   ways, depending on the assumptions that one makes on packet sizes and
   round-trip times.  In the presence of a fixed packet drop rate, for
   example, a TCP flow with larger packets sends more (in Bps, bytes per
   second) than a TCP flow with smaller packets, and a TCP flow with a
   shorter round-trip time sends more (in Bps) than a TCP flow with a
   larger round-trip time.  In environments with high packet drop rates,
   TCP's sending rate depends on the algorithm for setting the
   retransmit timer (RTO) as well, with a TCP implementation having a
   more aggressive RTO setting sending more than a TCP implementation
   having a less aggressive RTO setting.

   Unfortunately, there is no obvious canonical round-trip time for
   judging relative fairness of flows in the network.  Agreement in the
   literature is that the majority of packets on most links in the
   network experience round-trip times between 10 and 500 ms [RTTWeb].

   (This does not include satellite links.)  As a result, if there was a
   canonical round-trip for judging relative fairness, it would have to
   be within that range.  In the absence of a single representative
   round-trip time, the assumption of this paper is that it is
   reasonable to consider fairness between a VoIP connection and a TCP
   connection with the same round-trip time.

   Similarly, there is no canonical packet size for judging relative
   fairness between TCP connections.  However, because the most common
   packet size for TCP data packets is 1460 bytes [Measurement], we
   assume that it is reasonable to consider fairness between a VoIP
   connection, and a TCP connection sending 1460-byte data packets.
   Note that 1460 bytes is considerably larger than is typically used
   for VoIP packets.

   In the same way, while RFC 2988 specifies TCP's algorithm for setting
   TCP's RTO, there is no canonical value for the minimum RTO, and the
   minimum RTO heavily affects TCP's sending rate in times of high
   congestion [RFC2988].  RFC 2988 specifies that TCP's RTO must be set
   to SRTT + 4*RTTVAR, for SRTT the smoothed round-trip time, and for
   RTTVAR the mean deviation of recent round-trip time measurements.
   RFC 2988 further states that the RTO "SHOULD" have a minimum value of
   1 second.  However, it is not uncommon in practice for TCP
   implementations to have a minimum RTO as low as 100 ms.  For the
   purposes of this document, in considering relative fairness, we will
   assume a minimum RTO of 100 ms.

   As an additional complication, TCP connections that use fine-grained
   timestamps can have considerably higher sending rates than TCP
   connections that do not use timestamps, in environments with high
   packet drop rates.  For TCP connections with fine-grained timestamps,
   a valid round-trip time measurement is obtained when a retransmitted
   packet is successfully received and acknowledged by the receiver; in
   this case a backed-off retransmit timer can be un-backed-off as well.
   For TCP connections without timestamps, a valid round-trip time
   measurement is only obtained when the transmission of a new packet is
   received and acknowledged by the receiver.  This limits the
   opportunities for the un-backing-off of a backed-off retransmit
   timer.  In this document, in considering relative fairness, we use a
   TCP connection without timestamps, since this is the dominant use of
   TCP in the Internet.

   A separate claim that has sometimes been raised in terms of fairness
   is that best-effort VoIP traffic is inherently more important that
   other best-effort traffic (e.g., web surfing, peer-to-peer traffic,
   or multi-player games), and therefore merits a larger share of the
   bandwidth in times of high congestion.  Our assumption in this
   document is that TCP traffic includes pressing email messages,

   business documents, and emergency information downloaded from web
   pages, as well as the more recreational uses cited above.  Thus, we
   do not agree that best-effort VoIP traffic should be exempt from
   end-to-end congestion control due to any claims of inherently more
   valuable content.  (One could equally logically argue that because
   email and instant messaging are more efficient forms of communication
   than VoIP in terms of bandwidth usage, as a result email and instant
   messaging are more valuable uses of scarce bandwidth in times of high
   congestion.)  In fact, the network is incapable of making a judgment
   about the relative user value of traffic.  The default assumption is
   that all best-effort traffic has equal value to the network provider
   and to the user.

   We note that this discussion of relative fairness does not in any way
   challenge the right of ISPs to allocate bandwidth on congested links
   to classes of traffic in any way that they choose.  (For example,
   administrators rate-limit the bandwidth used by peer-to-peer traffic
   on some links in the network, to ensure that bandwidth is also
   available for other classes of traffic.)  This discussion merely
   argues that there is no reason for entire classes of best-effort
   traffic to be exempt from end-to-end congestion control.

4.  Current efforts in the IETF

   There are four efforts currently underway in IETF to address issues
   of congestion control for real time traffic: an upgrade of the RTP
   specification, TFRC, DCCP, and work on audio codecs.

4.1.  RTP

   RFC 1890, the original RTP Profile for Audio and Video Control, does
   not discuss congestion control [RFC1890].  The revised document on
   "RTP Profile for Audio and Video Conferences with Minimal Control"
   [RFC3551] discusses congestion control in Section 2.  [RFC3551] says
   the following:

      "If best-effort service is being used, RTP receivers SHOULD
      monitor packet loss to ensure that the packet loss rate is within
      acceptable parameters.  Packet loss is considered acceptable if a
      TCP flow across the same network path and experiencing the same
      network conditions would achieve an average throughput, measured
      on a reasonable timescale, that is not less than the RTP flow is
      achieving.  This condition can be satisfied by implementing
      congestion control mechanisms to adapt the transmission rate (or
      the number of layers subscribed for a layered multicast session),
      or by arranging for a receiver to leave the session if the loss
      rate is unacceptably high."

      "The comparison to TCP cannot be specified exactly, but is
      intended as an "order-of-magnitude" comparison in timescale and
      throughput.  The timescale on which TCP throughput is measured is
      the round-trip time of the connection.  In essence, this
      requirement states that it is not acceptable to deploy an
      application (using RTP or any other transport protocol) on the
      best-effort Internet which consumes bandwidth arbitrarily and does
      not compete fairly with TCP within an order of magnitude."

   Note that [RFC3551] says that receivers "SHOULD" monitor packet loss.
   [RFC3551] does not explicitly say that the RTP senders and receivers
   "MUST" detect and respond to a persistent high loss rate.  Since
   congestion collapse can be considered a "danger to the Internet" the
   use of "MUST" would be appropriate for RTP traffic in the best-effort
   Internet, where the VoIP traffic shares a link with other traffic,
   since "danger to the Internet" is one of two criteria given in RFC
   2119 for the use of "MUST" [RFC2119].  Different requirements may
   hold for a private best-effort IP network provisioned solely for
   VoIP, where the VoIP traffic does not interact with the wider
   Internet.

4.2.  TFRC

   As mentioned in RFC 3267, equation-based congestion control is one of
   the possibilities for VoIP.  TCP Friendly Rate Control (TFRC) is the
   equation-based congestion control mechanism that has been
   standardized in the IETF.  The TFRC specification, "TCP Friendly Rate
   Control (TFRC): Protocol Specification" [RFC3448], says the
   following:

      "TFRC ... is reasonably fair when competing for bandwidth with TCP
      flows, but has a much lower variation of throughput over time
      compared with TCP, making it more suitable for applications such
      as telephony or streaming media where a relatively smooth sending
      rate is of importance.  ...  TFRC is designed for applications
      that use a fixed packet size, and vary their sending rate in
      packets per second in response to congestion.  Some audio
      applications require a fixed interval of time between packets and
      vary their packet size instead of their packet rate in response to
      congestion.  The congestion control mechanism in this document
      cannot be used by those applications; TFRC-PS (for TFRC-
      PacketSize) is a variant of TFRC for applications that have a
      fixed sending rate but vary their packet size in response to
      congestion.  TFRC-PS will be specified in a later document."

   There is no draft available for TFRC-PS yet, unfortunately, but
   several researchers are still working on these issues.

4.3.  DCCP

   The Datagram Congestion Control Protocol (DCCP) is a transport
   protocol being standardized in the IETF for unreliable flows, with
   the application being able to specify either TCP-like or TFRC
   congestion control [DCCP03].

   DCCP currently has two Congestion Control IDentifiers or CCIDs; these
   are CCID 2 for TCP-like congestion control and CCID 3 for TFRC
   congestion control.  As TFRC-PS becomes available and goes through
   the standards process, we would expect DCCP to create a new CCID,
   CCID 4, for use with TFRC-PS congestion control.

4.4.  Adaptive Rate Audio Codecs

   A critical component in the design of any real-time application is
   the selection of appropriate codecs, specifically codecs that operate
   at a low sending rate, or that will reduce the sending rate as
   throughput decreases and/or packet loss increases.  Absent this, and
   in the absence of the response to congestion recommended in this
   document, the real-time application is likely to significantly
   increase the risk of Internet congestion collapse, thereby adversely
   impacting the health of the deployed Internet.  If the codec is
   capable of reducing its bit rate in response to congestion, this
   improves the scaling of the number of VoIP or TCP sessions capable of
   sharing a congested link while still providing acceptable performance
   to users.  Many current audio codecs are capable of sending at a low
   bit rate, in some cases adapting their sending rate in response to
   congestion indications from the network.

   RFC 3267 describes RTP payload formats for use with the Adaptive
   Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) audio
   codecs [RFC 3267].  The AMR codec supports eight speech encoding
   modes having bit rates between 4.75 and 12.2 kbps, with the speech
   encoding performed on 20 ms speech frames, and is able to reduce the
   transmission rate during silence periods.  The payload format
   specified in RFC 3267 includes forward error correction (FEC) and
   frame interleaving to increase robustness against packet loss
   somewhat.  The AMR codec was chosen by the Third Generation
   Partnership Project (3GPP) as the mandatory codec for third
   generation (3G) cellular systems, and RFC 3267 recommends that AMR or
   AMR-WB applications using the RTP payload format specified in RFC
   3267 use congestion control, though no specific mechanism is
   recommended.  RFC 3267 gives "Equation-Based Congestion Control for
   Unicast Applications" as an example of a congestion control mechanism
   suitable for real-time flows [FHPW00].

   The "Internet Low Bit Rate Codec", iLBC, is an IETF effort to develop
   an IPR-free codec for robust voice communication over IP [ILBRC].
   The codec is designed for graceful speech quality degradation in the
   case of lost packets, and has a payload bit rate of 13.33 kbps for 30
   ms frames or 15.20 kbps for 20 ms frames.

   There are several unencumbered low-rate codec algorithms in Ivox (the
   Interactive VOice eXchange) [IVOX], with plans to add additional
   variable rate codecs.  For example, LPC2400 (a.k.a. LQ2400) is a 2400
   bps LPC based codec with an enhancement to permit "silence
   detection".  The 2400 bps codec is reported to have a "slight robotic
   quality" [A03] (even without the additional complications of packet
   loss).  The older multirate codec described in [KFK79, KF82] is an
   LPC codec that works at two rates, 2.4 kbps and 9.6 kbps, and can
   optionally send additional "residual" bits for enhanced quality at a
   higher bit rate.

   Off-the-shelf ITU-T vocoders such as G.711 were generally designed
   explicitly for circuit-switched networks and are not as well-adapted
   for Internet use, even with the addition of FEC on top.

4.5.  Differentiated Services and Related Topics

   The Differentiated Services Working Group [DIFFSERV], which concluded
   in 2003, completed standards for the Differentiated Services Field
   (DS Field) in the IPv4 and IPv6 Headers [RFC2474], including several
   per-hop forwarding behaviors [RFC2597, RFC3246].  The Next Steps in
   Signaling Working Group [NSIS] is developing an optimized signalling
   protocol for QoS, based in part on earlier work of the Resource
   Reservation Setup Protocol Working Group [RSVP].  We do not discuss
   these and related efforts further in this document, since this
   document concerns only that VoIP traffic that might be carried as
   best-effort traffic over some congested link in the Internet.

5.  Assessing Minimum Acceptable Sending Rates

   Current IETF work in the DCCP and AVT working groups does not
   consider the problem of applications that have a minimum sending rate
   and are not able to go below that sending rate.  This clearly must be
   addressed in the TFRC-PS draft.  As suggested in the RTP document, if
   the loss rate is persistently unacceptably high relative to the
   current sending rate, and the best-effort application is unable to
   lower its sending rate, then the only acceptable answer is for that
   flow to discontinue sending on that link.  For a multicast session,
   this could be accomplished by the receiver withdrawing from the
   multicast group.  For a unicast session, this could be accomplished
   by the unicast connection terminating, at least for a period of time.

   We can formulate a problem statement for the minimum sending rate in
   the following way.  Consider a best-effort, adaptive audio
   application that is able to adapt down to a minimum sending rate of N
   Bps (bytes per second) of application data, sending M packets per
   second.  Is this a sufficiently low sending rate that the best-effort
   flow is never required to terminate due to congestion, or to reduce
   its sending rate in packets per second still further? In other words,
   is N Bps an acceptable minimum sending rate for the application,
   which can be continued in the face of congestion without terminating
   or suspending the application?

   We assume, generously for VoIP, that the limitation of the network is
   in bandwidth in bytes per second (Bps), and not in CPU cycles or in
   packets per second (pps).  If the limitation in the network is in
   bandwidth, this is a limitation in Bps, while if the limitation is in
   router processing capacity in packets, this would be a limitation in
   pps.  We note that TCP sends fixed-size data packets, and reduces its
   sending rate in pps when it adapts to network congestion, thus
   reducing the load on the forward path both in Bps and in pps.  In
   contrast, for adaptive VoIP applications, the adaption is sometimes
   to keep the same sending rate in pps, but to reduce the packet size,
   reducing the sending rate in Bps.  This fits the needs of audio as an
   application, and is a good response on a network path where the
   limitation is in Bps.  Such behavior would be a less appropriate
   response for a network path where the limitation is in pps.

   If the network limitation in fact is in Bps, then all that matters in
   terms of congestion is a flow's sending rate on the wire in Bps.  If
   this assumption of a network limitation in Bps is false, then the
   sending rate in pps could contribute to congestion even when the
   sending rate in Bps is quite moderate.  While the ideal would be to
   have a transport protocol that is able to detect whether the
   bottleneck links along the path are limited in Bps or in pps, and to
   respond appropriately when the limitation is in pps, such an ideal is
   hard to achieve. We would not want to delay the deployment of
   congestion control for telephony traffic until such an ideal could be
   accomplished.  In addition, we note that the current TCP congestion
   control mechanisms are themselves not very effective in an
   environment where there is a limitation along the reverse path in
   pps.  While the TCP mechanisms do provide an incentive to use large
   data packets, TCP does not include any effective congestion control
   mechanisms for the stream of small acknowledgement packets on the
   reverse path.  Given the arguments above, it seems acceptable to us
   to assume a network limitation in Bps rather than in pps in
   considering the minimum sending rate of telephony traffic.

   Assuming 40-byte packet headers (IP, RTP, and UDP or DCCP), the
   application data sending rate of N Bps and M pps translates to a
   sending rate on the wire of B = N+40M Bps.  If the application uses
   additional FEC (Forward Error Correction), the FEC bits must be added
   in as well.  In our example, we ignore bandwidth adjustments that are
   needed to take into account the additional overhead for FEC or the
   reduced sending rate for silence periods.  We also are not taking
   into account the possible role of header compression on congested
   edge links, which can reduce significantly the number of bytes used
   for headers on those links.

   Now, consider an equivalent-rate TCP connection with data packets of
   P bytes and a round-trip time of R seconds.  Taking into account
   header size, such a TCP connection with a sending rate on the wire of
   B Bps is sending B/(P+40) pps, or, equivalently, BR/(P+40) ppr
   (packets per round-trip time).

   Restating the question in terms of the above expressions for VoIP and
   TCP: if the best-effort VoIP connection is experiencing a persistent
   packet drop rate of D, and is at its minimum sending rate on the wire
   of B Bps, when should the application or transport protocol terminate
   or suspend the VoIP connection?

   One answer to this question is to find the sending rate in ppr for a
   TCP connection sending at the same rate on the wire in Bps, and to
   use the TCP response function to determine whether a conformant TCP
   connection would be able to maintain a sending rate close to that
   sending rate with the same persistent drop rate D.  If the sending
   rate of the VoIP connection is significantly higher than the sending
   rate of a conformant TCP connection under the same conditions, and
   the VoIP connection is unable to reduce its sending rate on the wire,
   then the VoIP connection should terminate or suspend.

   As discussed above, there are two reasons for requiring the
   application to terminate:

      1) Avoiding congestion collapse, given the possibility of multiple
         congested links,

      2) Fairness for congestion-controlled TCP traffic sharing the
         link.

   In addition, if an application requires a minimum service level from
   the network in order to operate, and that service level is
   consistently not achieved, then the application should terminate or
   suspend sending.

   One counter-argument is that users will just hang up anyway with a
   high packet drop rate so there is no point in enforcing a minimum
   acceptable rate.  Users might hang up, but they might also just keep
   on talking, with the occasional noise getting though, for minutes or
   longer waiting for a short period of clarity.  Another counter-
   argument is that nobody really benefits from VoIP connections being
   terminated or suspended when persistent packet drop rates exceed the
   allowable packet drop rate for the configured minimum sending rate.
   This is untrue, since the termination of these VoIP connections could
   allow competing TCP and VoIP traffic to make some progress.

   In the next section, we illustrate the approach outlined above for
   VoIP flows with minimum sending rates of 4.75 and 64 kbps
   respectively, and show that in practice such an approach would not
   seem too burdensome for VoIP traffic.  This approach implies that the
   VoIP traffic would terminate or suspend when the packet drop rate
   significantly exceeds 40% for a VoIP flow with a minimum sending rate
   of 4.75 kbps.  If VoIP is to deliver "carrier quality" or even near
   "carrier quality" on best-effort links, conditioning deployment on
   the ability to maintain maximum sending rates during periods of
   persistent packet drops rates exceeding 40% does not suggest a
   service model that will see widespread acceptance among consumers, no
   matter what the price differential.  Good packet throughput is vital
   for the delivery of acceptable VoIP service.

   For a VoIP flow that stops sending because its minimum sending rate
   is too high for the steady-state packet drop rate, we have not
   addressed the question of when a VoIP flow might be able to start
   sending again, to see if the congestion on the end-to-end path has
   changed.  This issue has been addressed in a proposal for
   Probabilistic Congestion Control [PCC].

   We note that if the congestion indications are in the form of ECN-
   marked packets (Explicit Congestion Notification), as opposed to
   dropped packets, then the answers about when a flow with a minimum
   sending rate would have to stop sending are somewhat different.  ECN
   allows routers to explicitly notify end-nodes of congestion by ECN-
   marking instead of dropping packets [RFC3168].  If packets are ECN-
   marked instead of dropped in the network, then there are no concerns
   of congestion collapse or of user quality (for the ECN-capable
   traffic, at any rate), and what remains are concerns of fairness with
   competing flows.  Second, in regimes with very high congestion, TCP
   has a higher sending rate with ECN-marked than with dropped packets,
   in part because of different dynamics in terms of un-backing-off a
   backed-off retransmit timer.

5.1.  Drop Rates at 4.75 kbps Minimum Sending Rate

   Consider an adaptive audio application with an RTT of R=0.1 seconds
   that is able to adapt down to a minimum sending rate of 4.75 kbps
   application data, sending M=20 packets per second.  This sending rate
   translates to N=593 Bps of application data, for a sending rate on
   the wire of B=1393 Bps.  An equivalent-rate TCP connection with data
   packets of P=1460 bytes and a round-trip time of R=0.1 seconds would
   be sending BR/(P+40) = 0.09 ppr.

   Table 1 in the Appendix looks at the packet drop rate experienced by
   a TCP connection with the RTO set to twice the RTT, and gives the
   corresponding sending rate of the TCP connection in ppr.  The second
   column gives the sending rate estimated by the standard analytical
   approach, and the third, fourth, and fifth columns give the average
   sending rate from simulations with random packet drops or marks.  The
   sixth column gives the sending rates from experiments on a 4.8-
   RELEASE FreeBSD machine.  The analytical approaches require an RTO
   expressed as a multiple of the RTT, and Table 1 shows the results for
   the RTO set to 2 RTT.  In the simulations, the minimum RTO is set to
   twice the RTT.  See the Appendix for more details.

   For a sending rate of 0.09 ppr and an RTO set to 2 RTT, Table 1 shows
   that the analytical approach gives a corresponding packet drop rate
   of roughly 50%, while the simulations in the fifth column and the
   experiments in the sixth column give a packet drop rate of between
   35% and 40% to maintain a sending rate of 0.09 ppr.  (For a reference
   TCP connection using timestamps, shown in the fourth column, the
   simulations give a packet drop rate of 55% to maintain a sending rate
   of 0.09 ppr.)  Of the two approaches for determining TCP's
   relationship between the sending rate and the packet drop rate, the
   analytic approach and the use of simulations, we consider the
   simulations to be the most realistic, for reasons discussed in the
   Appendix.  This suggests a packet drop rate of 40% would be
   reasonable for a TCP connection with an average sending rate of 0.09
   ppr.  As a result, a VoIP connection with an RTT of 0.1 sec and a
   minimum sending rate of 4.75 kbps would be required to terminate or
   suspend when the persistent packet drop rate significantly exceeds
   40%.

   These estimates are sensitive to the assumed round-trip time of the
   TCP connection.  If we assumed instead that the equivalent-rate TCP
   connection had a round-trip time of R=0.01 seconds, the equivalent-
   rate TCP connection would be sending BR/(P+40) = 0.009 ppr.  However,
   we have also assumed a minimum RTO for TCP connections of 0.1
   seconds, which in this case would mean an RTO of at least 10 RTT.
   For this setting of the RTO, we would use Table 2 from the appendix
   to determine the average TCP sending rate for a particular packet

   drop rate.  The simulations in the fifth column of Table 2 suggest
   that a TCP connection with an RTT of 0.01 sec and an RTO of 10 RTT
   would be able to send 0.009 ppr with a packet drop rate of 45%.  (For
   the same TCP connection using timestamps, shown in the fourth column,
   the simulations give a packet drop rate of 60-65% to maintain a
   sending rate of 0.009 ppr.)

   Thus, for a VoIP connection with an RTT of 0.01 sec and a minimum
   sending rate of 4.75 kbps, the VoIP connection would be required to
   terminate or suspend when the persistent packet drop rate exceeded
   45%.

5.2.  Drop Rates at 64 kbps Minimum Sending Rate

   The effect of increasing the minimum acceptable sending rate to 64
   kbps is effectively to decrease the packet drop rate at which the
   application should terminate or suspend sending.  For this section,
   consider a codec with a minimum sending rate of 64 kbps, or N=8000
   Bps, and a packet sending rate of M=50 pps.  (This would be
   equivalent to 160-byte data packets, with 20 ms. per packet.)  The
   sending rate on the wire is B = N+40M Bps, including headers, or
   10000 Bps.  A TCP connection having that sending rate, with packets
   of size P=1460 bytes and a round-trip time of R=0.1 seconds, sends
   BR/(P+40) = 0.66 ppr.  From the fifth column of Table 1, for an RTO
   of 2 RTT, this corresponds to a packet drop rate between 20 and 25%.
   [For a TCP connection using fine-grained timestamps, as shown in the
   fourth column of Table 1, this sending rate corresponds to a packet
   drop rate between 25% and 35%.]  As a result, a VoIP connection with
   an RTT of 0.1 sec and a minimum sending rate of 64 kbps would be
   required to terminate or suspend when the persistent packet drop rate
   significantly exceeds 25%.

   For an equivalent-rate TCP connection with a round-trip time of
   R=0.01 seconds and a minimum RTO of 0.1 seconds (giving an RTO of 10
   RTT), we use the fifth column of Table 2, which shows that a sending
   rate of 0.066 ppr corresponds to a packet drop rate of roughly 30%.
   [For a TCP connection using fine-grained timestamps, as shown in the
   fourth column of Table 2, this sending rate corresponds to a packet
   drop rate of roughly 45%.]  Thus, for a VoIP connection with an RTT
   of 0.01 sec and a minimum sending rate of 64 kbps, the VoIP
   connection would be required to terminate or suspend when the
   persistent packet drop rate exceeded 30%.

5.3.  Open Issues

   This document does not attempt to specify a complete protocol.  For
   example, this document does not specify the definition of a
   persistent packet drop rate.  The assumption would be that a

   "persistent packet drop rate" would refer to the packet drop rate
   over a significant number of round-trip times, e.g., at least five
   seconds.  Another possibility would be that the time interval for
   measuring the persistent drop rate is a function of the lifetime of
   the connection, with longer-lived connections using longer time
   intervals for measuring the persistent drop rate.

   The time period for detecting persistent congestion also affects the
   potential synchronization of VoIP sessions all terminating or
   suspending at the same time in response to shared congestion.  If
   flows use some randomization in setting the time interval for
   detecting persistent congestion, or use a time interval that is a
   function of the connection lifetime, this could help to prevent all
   VoIP flows from terminating at the same time.

   Another design issue for a complete protocol concerns whether a flow
   terminates when the packet drop rate is too high, or only suspends
   temporarily.  For a flow that suspends temporarily, there is an issue
   of how long it should wait before resuming transmission.  At the very
   least, the sender should wait long enough so that the flow's overall
   sending rate doesn't exceed the allowed sending rate for that packet
   drop rate.

   The recommendation of this document is that VoIP flows with minimum
   sending rates should have corresponding configured packet drop rates,
   such that the flow terminates or suspends when the persistent packet
   drop rate of the flow exceeds the configured rate.  If the persistent
   packet drop rate increases over time, flows with higher minimum
   sending rates would have to suspend sending before flows with lower
   minimum sending rates.  If VoIP flows terminate when the persistent
   packet drop rate is too high, this could lead to scenarios where VoIP
   flows with lower minimum sending rates essentially receive all of the
   link bandwidth, while the VoIP flows with higher minimum sending
   rates are required to terminate.  However, if VoIP flows suspend
   sending for a time when the persistent packet drop rate is too high,
   instead of terminating entirely, then the bandwidth could end up
   being shared reasonably fairly between VoIP flows with different
   minimum sending rates.

5.4.  A Simple Heuristic

   One simple heuristic for estimating congestion would be to use the
   RTCP reported loss rate as an indicator.  For example, if the RTCP-
   reported lost rate is greater than 30%, or N back-to-back RTCP
   reports are missing, the application could assume that the network is
   too congested, and terminate or suspend sending.

6.  Constraints on VoIP Systems

   Ultimately, attempting to run VoIP on congested links, even with
   adaptive rate codecs and minimum packet rates, is likely to run into
   hard constraints due to the nature of real time traffic in heavily
   congested scenarios.  VoIP systems exhibit a limited ability to scale
   their packet rate.  If the number of packets decreases, the amount of
   audio per packet is greater and error concealment at the receiver
   becomes harder.  Any error longer than phoneme length, which is
   typically 40 to 100 ms depending on the phoneme and speaker, is
   unrecoverable.  Ideally, applications want sub 30ms packets and this
   is what most voice codecs provide.  In addition, voice media streams
   exhibit greater loss sensitivity at lower data rates.  Lower-data
   rate codecs maintain more end-to-end state and as a result are
   generally more sensitive to loss.

   We note that very-low-bit-rate codecs have proved useful, although
   with some performance degradation, in very low bandwidth, high noise
   environments (e.g., 2.4 kbps HF radio).  For example, 2.4 kbps codecs
   "produce speech which although intelligible is far from natural
   sounding" [W98].  Figure 5 of [W98] shows how the speech quality with
   several forms of codecs varies with the bit rate of the codec.

7.  Conclusions and Recommendations

   In the near term, VoIP services are likely to be deployed, at least
   in part, over broadband best-effort connections.  Current real time
   media encoding and transmission practice ignores congestion
   considerations, resulting in the potential for trouble should VoIP
   become a broadly deployed service in the near to intermediate term.
   Poor user quality, unfairness to other VoIP and TCP users, and the
   possibility of sporadic episodes of congestion collapse are some of
   the potential problems in this scenario.

   These problems can be mitigated in applications that use fixed-rate
   codecs by requiring the best-effort VoIP application to specify its
   minimum bit throughput rate.  This minimum bit rate can be used to
   estimate a packet drop rate at which the application would terminate.

   This document specifically recommends the following:

   (1) In IETF standards for protocols regarding best-effort flows with
   a minimum sending rate, a packet drop rate must be specified, such
   that the best-effort flow terminates, or suspends sending
   temporarily, when the steady-state packet drop rate significantly
   exceeds the specified drop rate.

   (2) The specified drop rate for the minimum sending rate should be
   consistent with the use of Tables 1 and 2 as illustrated in this
   document.

   We note that this is a recommendation to the IETF community, as a
   specific follow-up to RFC 2914 on Congestion Control Principles.

   This is not a specific or complete protocol specification.

   Codecs that are able to vary their bit rate depending on estimates of
   congestion can be even more effective in providing good quality
   service while maintaining network efficiency under high load
   conditions.  Adaptive variable-bit-rate codecs are therefore
   preferable as a means of supporting VOIP sessions on shared use
   Internet environments.

   Real-time traffic such as VoIP could derive significant benefits from
   the use of ECN, where routers may indicate congestion to end-nodes by
   marking packets instead of dropping them.  However, ECN is only
   standardized to be used with transport protocols that react
   appropriately to marked packets as indications of congestion.  VoIP
   traffic that follows the recommendations in this document could
   satisfy the congestion-control requirements for using ECN, while VoIP
   traffic with no mechanism for terminating or suspending when the
   packet dropping and marking rate was too high would not.  However, we
   repeat that this document is not a complete protocol specification.
   In particular, additional mechanisms would be required before it was
   safe for applications running over UDP to use ECN.  For example,
   before using ECN, the sending application would have to ensure that
   the receiving application was capable of receiving ECN-related
   information from the lower-layer UDP stack, and of interpreting this
   ECN information as a congestion indication.

8.  Acknowledgements

   We thank Brian Adamson, Ran Atkinson, Fred Baker, Jon Crowcroft,
   Christophe Diot, Alan Duric, Jeremy George, Mark Handley, Orion
   Hodson, Geoff Huston, Eddie Kohler, Simon Leinen, David Meyer, Jean-
   Francois Mule, Colin Perkins, Jon Peterson, Mike Pierce, Cyrus
   Shaoul, and Henning Schulzrinne for feedback on this document.  (Of
   course, these people do not necessarily agree with all of the
   document.)  Ran Atkinson and Geoff Huston contributed to the text of
   the document.

   The analysis in Section 6.0 resulted from a session at the whiteboard
   with Mark Handley.  We also thank Alberto Medina for the FreeBSD
   experiments showing TCP's sending rate as a function of the packet
   drop rate.

9.  References

9.1.  Normative References

   [RFC2119]     Bradner, S., "Key words for use in RFCs to Indicate
                 Requirement Levels", BCP 14, RFC 2119, March 1997.

   [RFC2988]     Paxson, V. and M. Allman, "Computing TCP's
                 Retransmission Timer", RFC 2988, November 2000.

   [RFC3267]     Sjoberg, J., Westerlund, M., Lakaniemi, A. and Q. Xie,
                 "Real-Time Transport Protocol (RTP) Payload Format and
                 File Storage Format for the Adaptive Multi-Rate (AMR)
                 and Adaptive Multi-Rate Wideband (AMR-WB) Audio
                 Codecs", RFC 3267, June 2002.

9.2.  Informative References

   [A02]         Ran Atkinson, An ISP Reality Check, Presentation to
                 ieprep, 55th IETF Meeting, November 2002.  URL
                 "http://www.ietf.cnri.reston.va.us/proceedings/
                 02nov/219.htm#slides".

   [A03]         Brian Adamson, private communication, June 2003.

   [BBFS01]      Deepak Bansal, Hari Balakrishnan, Sally Floyd, and
                 Scott Shenker, Dynamic Behavior of Slowly-Responsive
                 Congestion Control Algorithms, SIGCOMM 2001.

   [COPS]        Durham, D., Ed., Boyle, J., Cohen, R., Herzog, S.,
                 Rajan, R. and A. Sastry, "The COPS (Common Open Policy
                 Service) Protocol", RFC 2748, January 2000.

   [DCCP03]      Eddie Kohler, Mark Handley, Sally Floyd, and Jitendra
                 Padhye, Datagram Congestion Control Protocol (DCCP),
                 internet-draft Work in Progress, March 2003.  URL
                 "http://www.icir.org/kohler/dcp/".

   [DIFFSERV]    Differentiated Services (diffserv), Concluded Working
                 Group, URL
                 "http://www.ietf.cnri.reston.va.us/html.charters/
                 OLD/diffserv-charter.html".

   [E2E]         The end2end-interest mailing list, URL
                 "http://www.postel.org/mailman/listinfo/end2end-
                 interest".

   [FHPW00]      S. Floyd, M. Handley, J. Padhye, J. Widmer, "Equation-
                 Based Congestion Control for Unicast Applications", ACM
                 SIGCOMM 2000.

   [FM03]        S. Floyd and R. Mahajan, Router Primitives for
                 Protection Against High-Bandwidth Flows and Aggregates,
                 internet draft (not yet submitted).

   [FWD]         Free World Dialup, URL "www.pulver.com/fwd/".

   [IEPREP02]    Internet Emergency Preparedness (ieprep), Minutes, 55th
                 IETF Meeting, November 2002.  URL
                 "http://www.ietf.cnri.reston.va.us/proceedings/
                 02nov/219.htm#cmr".

   [ILBRC]       S.V. Andersen, et. al., Internet Low Bit Rate Codec,
                 Work in Progress, March 2003.

   [G.114]       Recommendation G.114 - One-way Transmission Time, ITU,
                 May 2003.  URL "http://www.itu.int/itudoc/itu-
                 t/aap/sg12aap/recaap/g.114/".

   [IVOX]        The Interactive VOice eXchange, URL
                 "http://manimac.itd.nrl.navy.mil/IVOX/".

   [Jacobson88]  V. Jacobson, Congestion Avoidance and Control, ACM
                 SIGCOMM '88, August 1988.

   [AUT]         The maximum feasible drop rate for VoIP traffic depends
                 on the codec.  These numbers are a range for a variety
                 of codecs; voice quality begins to deteriorate for many
                 codecs around a 10% drop rate. Note from authors.

   [JS00]        Wenyu Jiang and Henning Schulzrinne, Modeling of Packet
                 Loss and Delay and Their Effect on Real-Time Multimedia
                 Service Quality, NOSSDAV, 2000.  URL
                 "http://citeseer.nj.nec.com/jiang00modeling.html".

   [JS02]        Wenyu Jiang and Henning Schulzrinne, Comparison and
                 Optimization of Packet Loss Repair Methods on VoIP
                 Perceived Quality under Bursty Loss, NOSSDAV, 2002.
                 URL "http://www1.cs.columbia.edu/~wenyu/".

   [JS03]        Wenyu Jiang, Kazummi Koguchi, and Henning Schulzrinne,
                 QoS Evaluation of VoIP End-points, ICC 2003.  URL
                 "http://www1.cs.columbia.edu/~wenyu/".

   [KFK79]       G.S. Kang, L.J. Fransen, and E.L. Kline, "Multirate
                 Processor (MRP) for Digital Voice Communications", NRL
                 Report 8295, Naval Research Laboratory, Washington DC,
                 March 1979.

   [KF82]        G.S. Kang and L.J. Fransen, "Second Report of the
                 Multirate Processor (MRP) for Digital Voice
                 Communications", NRL Report 8614, Naval Research
                 Laboratory, Washington DC, September 1982.

   [Measurement] Web page on "Measurement Studies of End-to-End
                 Congestion Control in the Internet", URL
                 "http://www.icir.org/floyd/ccmeasure.html".  The
                 section on "Network Measurements at Specific Sites"
                 includes measurement data about the distribution of
                 packet sizes on various links in the Internet.

   [MTK03]       A. P. Markopoulou, F. A. Tobagi, and M. J. Karam,
                 "Assessing the Quality of Voice Communications Over
                 Internet Backbones", IEEE/ACM Transactions on
                 Networking, V. 11 N. 5, October 2003.

   [NSIS]        Next Steps in Signaling (nsis), IETF Working Group, URL
                 "http://www.ietf.cnri.reston.va.us/html.charters/nsis-
                 charter.html".

   [PCC]         Joerg Widmer, Martin Mauve, and Jan Peter Damm.
                 Probabilistic Congestion Control for Non-Adaptable
                 Flows.  Technical Report 3/2001, Department of
                 Mathematics and Computer Science, University of
                 Mannheim.  URL "http://www.informatik.uni-
                 mannheim.de/informatik/pi4/projects/
                 CongCtrl/pcc/index.html".

   [PFTK98]      J. Padhye, V. Firoiu, D. Towsley, J. Kurose, Modeling
                 TCP Throughput: A Simple Model and its Empirical
                 Validation, Tech Report TF 98-008, U. Mass, February
                 1998.

   [RFC896]      Nagle, J., "Congestion Control in IP/TCP", RFC 896,
                 January 1984.

   [RFC1890]     Schulzrinne, H., "RTP Profile for Audio and Video
                 Conferences with Minimal Control", RFC 1890, January
                 1996.

   [RFC2474]     Nichols, K., Blake, S., Baker, F. and D. Black,
                 "Definition of the Differentiated Services Field (DS
                 Field) in the IPv4 and IPv6 Headers", RFC 2474,
                 December 1998.

   [RFC2581]     Allman, M., Paxson, V. and W. Stevens, "TCP Congestion
                 Control", RFC 2581, April 1999.

   [RFC2597]     Heinanen, J., Baker, F., Weiss, W. and J. Wroclawski,
                 "Assured Forwarding PHB Group, RFC 2597, June 1999.

   [RFC2914]     Floyd, S., "Congestion Control Principles", BCP 41, RFC
                 2914, September 2000.

   [RFC2990]     Huston, G., "Next Steps for the IP QoS Architecture",
                 RFC 2990, November 2000.

   [RFC3042]     Allman, M., Balakrishnan, H. and S., Floyd, "Enhancing
                 TCP's Loss Recovery Using Limited Transmit", RFC 3042,
                 January 2001.

   [RFC3168]     Ramakrishnan, K., Floyd, S. and D. Black, "The Addition
                 of Explicit Congestion Notification (ECN) to IP", RFC
                 3168, September 2001.

   [RFC3246]     Davie, B., Charny, A., Bennet, J.C.R., Benson, K., Le
                 Boudec, J.Y., Courtney, W., Davari, S., Firoiu, V. and
                 D. Stiliadis, "An Expedited Forwarding PHB (Per-Hop
                 Behavior)", RFC 3246, March 2002.

   [RFC3448]     Handley, M., Floyd, S., Pahdye, J. and J. Widmer, "TCP
                 Friendly Rate Control (TFRC): Protocol Specification",
                 RFC 3448, January 2003.

   [RSVP]        Resource Reservation Setup Protocol (rsvp), Concluded
                 Working Group, URL
                 "http://www.ietf.cnri.reston.va.us/html.charters/
                 OLD/rsvp-charter.html".

   [RTTWeb]      Web Page on Round-Trip Times in the Internet, URL
                 "http://www.icir.org/floyd/rtt-questions.html"

   [S03]         H. Schulzrinne, private communication, 2003.

   [RFC3551]     Schulzrinne, H. and S. Casner, "RTP Profile for Audio
                 and Video Conferences with Minimal Control", RFC 3551,
                 July 2003.

   [Vonage]      Vonage, URL "www.vonage.com".

   [W98]         J. Woodward, Speech Coding, Communications Research
                 Group, University of Southampton, 1998.  URL
                 "http://www-mobile.ecs.soton.ac.uk/speech_codecs/",

10.  Appendix - Sending Rates with Packet Drops

   The standard way to estimate TCP's average sending rate S in packets
   per round-trip as a function of the packet drop rate would be to use
   the TCP response function estimated in [PFTK98]:

      S = 1/(sqrt(2p/3) + K min(1,3 sqrt(3p/8)) p (1 + 32 p^2))   (1)

   for acks sent for every data packet, and the RTO set to K*RTT.

   The results from Equation (1) are given in the second column in
   Tables 1 and 2 below.  However, Equation (1) overestimates TCP's
   sending rate in the regime with heavy packet drop rates (e.g., of 30%
   or more).  The analysis behind Equation (1) assumes that once a
   single packet is successfully transmitted, TCP's retransmit timer is
   no longer backed-off.  This might be appropriate for an environment
   with ECN, or for a TCP connection using fine-grained timestamps, but
   this is not necessarily the case for a non-ECN-capable TCP connection
   without timestamps.  As specified in [RFC2988], if TCP's retransmit
   timer is backed-off, this back-off should only be removed when TCP
   successfully transmits a new packet (as opposed to a retransmitted
   packet), in the absence of timestamps.

   When the packet drop rate is 50% or higher, for example, many of the
   successful packet transmissions can be of retransmitted packets, and
   the retransmit timer can remain backed-off for significant periods of
   time, in the absence of timestamps.  In this case, TCP's throughput
   is determined largely by the maximum backoff of the retransmit timer.
   For example, in the NS simulator the maximum backoff of the
   retransmit timer is 64 times the un-backed-off value.  RFC 2988
   specifies that "a maximum value MAY be placed on RTO provided it is
   at least 60 seconds."  [Although TCP implementations vary, many TCP
   implementations have a maximum of 45 seconds for the backed-off RTO
   after dropped SYN packets.]

   Another limitation of Equation (1) is that it models Reno TCP, and
   therefore underestimates the sending rate of a modern TCP connection
   that used SACK and Limited Transmit.

   The table below shows estimates of the average sending rate S in
   packets per RTT, for TCP connections with the RTO set to 2 RTT for
   Equation (1).

   These estimates are compared with simulations in the third, fourth,
   and fifth columns, with ECN, packet drops for TCP with fine-grained
   timestamps, and packet drops for TCP without timestamps respectively.
   (The simulation scripts are available from
   http://www.icir.org/floyd/VoIP/sims.)  Each simulation computes the

   average sending rate over the second half of a 10,000-second
   simulation, and for each packet drop rate, the average is given over
   50 simulations.  For the simulations with very high packet drop
   rates, it is sometimes the case that the SYN packet is repeatedly
   dropped, and the TCP sender never successfully transmits a packet.
   In this case, the TCP sender also never gets a measurement of the
   round-trip time.

   The sixth column of Table 1 shows the average sending rate S in
   packets per RTT for an experiment using a 4.8-RELEASE FreeBSD
   machine.  For the low packet drop rates of 0.1 and 0.2, the sending
   rate in the simulations is higher than the sending rate in the
   experiments; this is probably because the TCP implementation in the
   simulations uses Limited Transmit [RFC3042].  With Limited Transmit,
   the TCP sender can sometimes avoid a retransmit timeout when a packet
   is dropped and the congestion window is small.  With high packet drop
   rates of 0.65 and 0.7, the sending rate in the simulations is
   somewhat lower than the sending rate in the experiments.  For these
   high packet drop rates, the TCP connections in the experiments would
   often abort prematurely, after a sufficient number of successive
   packet drops.

   We note that if the ECN marking rate exceeds a locally-configured
   threshold, then a router is advised to switch from marking to
   dropping.  As a result, we do not expect to see high steady-state
   marking rates in the Internet, even if ECN is in fact deployed.

    Drop
   Rate p  Eq(1)  Sims:ECN  Sims:TimeStamp  Sims:Drops  Experiments
   ------  -----  --------  --------------  ----------  -----------
     0.1   2.42    2.92      2.38            2.32       0.72
     0.2    .89    1.82      1.26            0.82       0.29
     0.25   .55    1.52       .94             .44       0.22
     0.35   .23    .99        .51             .11       0.10
     0.4    .16    .75        .36             .054      0.068
     0.45   .11    .55        .24             .029      0.050
     0.5    .10    .37        .16             .018      0.036
     0.55   .060   .25        .10             .011      0.024
     0.6    .045   .15        .057            .0068     0.006
     0.65   .051   .          .033            .0034     0.008
     0.7    .041   .06        .018            .0022     0.007
     0.75   .034   .04        .0099           .0011
     0p.8    .028   .027       .0052           .00072
     0.85   .023   .015       .0021           .00034
     0.9    .020   .011       .0011           .00010
     0.95   .017   .0079      .00021          .000037

   Table 1: Sending Rate S as a Function of the Packet Drop Rate p,
            for RTO set to 2 RTT, and S in packets per RTT.

   The table below shows the average sending rate S, for TCP connections
   with the RTO set to 10 RTT.

    Drop
   Rate p  Eq(1)  Sims:ECN  Sims:TimeStamp  Sims:Drops
   ------  -----  --------  --------------  ----------
    0.1    0.97    2.92       1.67          1.64
    0.2    0.23    1.82        .56           .31
    0.25   0.13     .88        .36           .13
    0.3    0.08     .61        .23           .059
    0.35   0.056    .41        .15           .029
    0.4    0.040    .28        .094          .014
    0.45   0.029    .18        .061          .0080
    0.5    0.021    .11        .038          .0053
    0.55   0.016    .077       .022          .0030
    0.6    0.013    .045       .013          .0018
    0.65   0.010    .          .0082         .0013
    0.7    0.0085   .018       .0042
    0.75   0.0069   .012       .0025         .00071
    0.8    0.0057   .0082      .0014         .00030
    0.85   0.0046   .0047      .00057        .00014
    0.9    0.0041   .0034      .00026        .000025
    0.95   0.0035   .0024      .000074       .000013

   Table 2: Sending Rate as a Function of the Packet Drop Rate,
            for RTO set to 10 RTT, and S in packets per RTT.

11.  Security Considerations

   This document does not itself create any new security issues for the
   Internet community.

12.  IANA Considerations

   There are no IANA considerations regarding this document.

13.  Authors' Addresses

   Internet Architecture Board
   EMail:  iab@iab.org

   Internet Architecture Board Members
   at the time this document was published were:

   Bernard Aboba
   Harald Alvestrand (IETF chair)
   Rob Austein
   Leslie Daigle (IAB chair)
   Patrik Faltstrom
   Sally Floyd
   Jun-ichiro Itojun Hagino
   Mark Handley
   Geoff Huston (IAB Executive Director)
   Charlie Kaufman
   James Kempf
   Eric Rescorla
   Mike St. Johns

   This document was created in January 2004.

14.  Full Copyright Statement

   Copyright (C) The Internet Society (2004).  This document is subject
   to the rights, licenses and restrictions contained in BCP 78 and
   except as set forth therein, the authors retain all their rights.

   This document and the information contained herein are provided on an
   "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE
   REPRESENTS OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE
   INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR
   IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF
   THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
   WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Intellectual Property

   The IETF takes no position regarding the validity or scope of any
   Intellectual Property Rights or other rights that might be claimed
   to pertain to the implementation or use of the technology
   described in this document or the extent to which any license
   under such rights might or might not be available; nor does it
   represent that it has made any independent effort to identify any
   such rights.  Information on the procedures with respect to
   rights in RFC documents can be found in BCP 78 and BCP 79.

   Copies of IPR disclosures made to the IETF Secretariat and any
   assurances of licenses to be made available, or the result of an
   attempt made to obtain a general license or permission for the use
   of such proprietary rights by implementers or users of this
   specification can be obtained from the IETF on-line IPR repository
   at http://www.ietf.org/ipr.

   The IETF invites any interested party to bring to its attention
   any copyrights, patents or patent applications, or other
   proprietary rights that may cover technology that may be required
   to implement this standard.  Please address the information to the
   IETF at ietf-ipr@ietf.org.

Acknowledgement

   Funding for the RFC Editor function is currently provided by the
   Internet Society.

RFC 3714 - IAB Concerns Regarding Congestion Control for Voice T

RFC 3714 - IAB Concerns Regarding Congestion Control for Voice T

Or Display the document by number

User Contributions:

Comment about this RFC, ask questions, or add new information about this topic:

RFC 3714 - IAB Concerns Regarding Congestion Control for Voice T

RFC 3714 - IAB Concerns Regarding Congestion Control for Voice T

Search the RFC Archives

Or Display the document by number

User Contributions:

Comment about this RFC, ask questions, or add new information about this topic: