Contracting Wide-area Network Topologies to Solve Flow Problems Quickly

Firas Abuzaid†⇤, Srikanth Kandula†, Behnaz Arzani†, Ishai Menache†, Matei Zaharia⇤, Peter Bailis⇤

Microsoft Research†and Stanford University⇤

Abstract–

Many enterprises today manage traffic on their

wide-area networks using software-defined traffic engineer-

ing schemes, which scale poorly with network size; the solver

runtimes and number of forwarding entries needed at switches

increase to untenable levels. We describe a novel method,

which, instead of solving a multi-commodity flow problem

on the network, solves (1) a simpler problem on a contrac-

tion of the network, and (2) a set of sub-problems in parallel

on disjoint clusters within the network. Our results on the

topology and demands from a large enterprise, as well as on

publicly available topologies, show that, in the median case,

our method nearly matches the solution quality of currently

deployed solutions, but is

8⇥

faster and requires 6

⇥

fewer

FIB entries. We also show the value-add from using a faster

solver to track changing demands and to react to faults.

1 Introduction

Wide-area networks (WANs), which connect locations across

the globe with high-capacity optical fiber, are an expensive

resource [7,35,36,38]. Hence, enterprises seek to carefully

manage the traffic on their WANs to offer low latency and

jitter for customer-facing applications [28,62,69] and fast

response times for bulk data transfers [46,56].

The state-of-the-art approach used in several enterprises

today [35,36,38] is to compute optimal routing schemes for

the current demand by solving global multi-commodity flow

problems [7,35,36,38]; the global flow problems are re-solved

periodically, since demands may change or links may fail,

and the computed routes are encoded into switch forwarding

tables using software-defined networking techniques [7].

As network sizes grow, solving multi-commodity flow prob-

lems on the entire network becomes practically intractable.

As noted in [36], the “algorithm run time increased super-

linearly with the site count,” which led to “extended periods

of traffic blackholing during data plane failures, ultimately

violating our availability targets,” as well as “scaling pressure

on limited space in switch forwarding tables.” This problem

is unlikely to go away: anecdotal reports indicate that WAN

Contract

network

Allocate flow on

contracted

network

occasionally

Network Clusters

Demands

Flow

Allocation

Demand

History

Paths

(periodically; e.g.,

every few min)

Figure 1: NCFlow’s workflow.

Cluster

Figure 2:

The original network on the left is divided into clusters, shown

with different background colors. The contracted network is on the right.

footprints today are already over

10⇥

larger than the few tens

of sites that were considered in prior work [35,36], since

enterprises have built more sites to move closer to users.

In this paper, we seek to retain the benefits of global traffic

management for large WAN networks without requiring ex-

cessively many forwarding entries at switches or prohibitively

long solver runtimes. Also, by using a faster solver, WAN

operators can reduce loss when faults occur and carry more

traffic on the network by tracking demand changes.

Our solution is motivated by the observation that WAN

topologies and demands are concentrated: the topology typi-

cally has well-connected portions separated by a few, lower-

capacity edges, and more demand is between nearby datacen-

ters. This is likely due to multiple operational considerations:

(1) submarine cables have become shared choke points for

connectivity between continents (see Figure 3), (2) the con-

nectivity over land follows the road or rail networks along

which fiber is typically laid out, and (3) enterprises build

datacenters close to users, then steer traffic to nearby datacen-

ters [12,62,69]. Therefore, more capacity and demand are

available between nearby nodes; an analysis of data from a

large enterprise WAN in §2supports this observation.

We leverage this concentration of capacity and demand

to decompose the global flow problem into several smaller

problems, many of which can be solved in parallel. As shown

USENIX Association 18th USENIX Symposium on Networked Systems Design and Implementation 175

Contracting Wide-area Network Topologies to Solve Flow Problems Quickly, Lecture notes of Network Design

Related documents

Partial preview of the text

Download Contracting Wide-area Network Topologies to Solve Flow Problems Quickly and more Lecture notes Network Design in PDF only on Docsity!

Firas Abuzaid†⇤^ , Srikanth Kandula†^ , Behnaz Arzani†^ , Ishai Menache†^ , Matei Zaharia⇤^ , Peter Bailis⇤

Microsoft Research†^ and Stanford University⇤

1 Introduction

2 Background and Motivation

3 NCFlow

8 clusters x, y,x 6 = y, fxy 4 ,arg max Â

s.t. Â

fk  f 2 x,Ksy , 8 s 2 x; Â

Â

3.1 Basic Flow Allocation

"+^ .,%&'

3.4 Choosing clusters and paths

3.5 Setting up switch forwarding entries

4 Implementing NCFlow

5 Evaluation

5.1 Methodology

5.3 Effect of Design Choices

5.4 NCFlow on Real-World Traffic

5.5 Tracking Changing Demands

5.6 Handling Failures with NCFlow

6 Discussion

References

B.2 Proof that the heuristic in §3.2 leads to

feasible flow allocations

B.3 Proof of optimality for algorithm in §3.

given some sufficient conditions

Theorem 3. Given a set of paths P that can be used by flows,

flow allocated on a set of paths P can also be allocated by

use fewer clusters next. Let S be a set of nodes such that every

path in P contains at most one contiguous sequence of the

nodes in S. For example, the set {u, v} satisfies this property

if every path in P has neither u nor v, just u but not v (no

such set S into a cluster would allow the method in Figure 6 to

allocate the same flow as MaxFlow using the paths in P.

pair of clusters, any set of paths P on the actual graph would

C Data-plane details for NCFlow

C.1 Actions at the NCFlow controller, after

each allocation

For the former case, let Pst be the path set to target t for

i Psz i

p in the set Pwt is the value of Âr 2 Kxt f 2 y,,rp in iteration i divided

Pwzi is the value of Âr 2 Kxz f 2 y,,rp divided by the total value over

C.2 Details on switch forwarding entries

D Definitions of NoMoreFlow

Li,k dk Â

Excess i Â

volume dk , the demand will suffer loss; we use Li,k to denote

scenario losses Li,k in ascending order; (2) starting at index

demand k ’s flow to be d k Li b ,k , the demand minus the loss

F.2 Comments on benchmarking TEAVAR

equations and variables increases by |S | ⇤ | P |. The path set is

at least as large as the node pairs, i.e., | P | > N 2 where N is the

scenarios; then |S | ⇠ M 2 where M is the number of edges.

G Additional Experiments

G.1 Breakdown of NCFlow ’s Performance

G.2 Alternate clustering methods

G.3 Effect on path latency