Duality in sta ng problems: Between holding costs and waiting ...2 The traditional cost formulation...

Duality in staffing problems:Between holding costs and waiting constraints

Seung Bum SohNorthwestern University, [email protected]

Itai GurvichNorthwestern University, [email protected]

There are two alternative ways to capture the tension between capacity expenses and customer-delay costs in

staffing problems. The cost paradigm assigns a price tag to customer delay and optimizes the combined costs

of staffing and waiting. The constraint paradigm, in contrast, replaces the waiting-time cost with constraints

and seeks to minimize staffing costs subject to these constraints. The duality of these two formulations is

important for both the implementation of delay costs through constraints (e.g., specifying constraints in

a contract) and for the reverse engineering of the dollar value that a provider, solving a given constraint

formulation, assigns implicitly to customer delay.

In the single-class queue, this duality is a simple matter: the optimal trade-off of capacity and delay can be

implemented via a staffing problem with, for example, average waiting constraints. Given the waiting-time

constraints that a provider uses, we can figure out the underlying implicit delay costs.

In the multiclass case—where one must determine both the optimal staffing and the optimal

prioritization—things become more involved. Strictly convex costs can be reliably implemented by any

strictly convex constraints. Linear waiting constraints, while common in practice, do not provide a “safe”

implementation of any simple cost structure. They can be made safe, however, by augmenting them with

a variance constraint. When seeking to reverse engineer constraints to costs, strictly convex constraints are

straightforward. Linear constraints are, however, not uniquely reversible, and strictly concave constraints

cannot be an implementation of any strictly increasing waiting costs.

Finally, since strictly convex costs have multiple implementations through constraints, it is desirable to

propose a “best” implementation. We numerically study the robustness of different implementations.

1. Introduction

Determining capacity (or staffing) is a routine and central task in the day-to-day operations of

service systems. Fundamentally, the staffing problem represents a tradeoff between the cost of

capacity and the cost (or revenue loss) that is attributed to customer delays. The greater the

capacity relative to demand, the less customers have to wait.

In modeling the staffing decision as an optimization problem, the capacity cost has an obvious

representation: a dollar amount per server-hour is multiplied by the number of servers assigned to

that hour. Waiting time, however, can be modeled into the staffing problem through either waiting

cost or waiting constraints.

1

2

The traditional cost formulation assigns a dollar value to customers’ waiting time and places this

cost in the objective function as a counterbalance to the staffing cost. The problem to be solved is

then,

Minimize Staffing cost + Waiting cost. (cost formulation)

This formulation captures explicitly the tradeoff between capacity and waiting costs. As the staffing

increases and, consequently, staffing costs increase, waiting costs decrease.

It is more common for practitioners, however, to impose a constraint on a waiting-time-related

metric, namely, to solve the problem of minimizing capacity subject to a so-called Quality of Service

(QoS) constraint.

Minimize Staffing cost

Subject to waiting-time constraints(constraint formulation)

There may be multiple reasons for the prevalence of constrained formulations in practice: (i) Cus-

tomers’ disutilities from waiting are difficult to pin down. It is only recently that empirical papers

in operations have estimated such disutilities; see, e.g., Allon et al. (2011) and Aksin et al. (2013);

(ii) Industry standards often dictate the “acceptable” waiting time. These standards are expressed

as constraints: average speed of answer (ASA) must be less than 1 minute, 80% of customers must

wait less than 20 seconds, etc. In some cases, the standards are internally imposed (such as door-

to-doctor time limits in emergency rooms) or enforced by government-agency regulations; see, e.g.,

Allon (2012).

In the service-operations literature, the two formulations have been typically studied separately;

see §2. Our objective in this paper is to examine the duality of these two in service systems with

multiple classes of customers. Conceptually, we ask two questions:

(i.) A prescriptive-implementation question (the operations researcher view): Although the

provider might have a good idea of the dollar value of customer delay—so that, in principle, it

is possible to solve the cost formulation—operations (such as contracting with an outsourcer)

might require specifying constraints rather than waiting costs. What form, we ask, should these

constraints take to capture the correct tradeoff between staffing and delay costs? And how are

these recommendations consistent (or not) with formulations used in practice?

In the newsvendor (inventory) setting, for example, the costs of overstock and understock can

be used to specify a service level that can, in turn, be contracted. In a similar spirit, we ask which

Quality of Service (QoS) constraints should be contracted to reflect the customer-delay costs.

(ii.) A reverse-engineering question (the econometrician’s view): What does a choice of

constraints—e.g., ASA smaller than 20 seconds—say about the implicit cost structure of a provider,

specifically about the dollar value assigned to customer delay relative to the cost of server time?

3

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

20

25

30

35

40

45

50

55

60

33 34 35 36 37 38 39 40 41

Average waitin

g tim

e (m

inutes)

Total cost

Number of servers

Reverse engineering the M/M/N queue

cw/cs=1

cw/cs=3

cw/cs=5

Average wait

0.13 minutes

Figure 1 Reverse engineering for an M/M/N queue

In the newsvendor setting, by observing the service level that the provider chooses, Olivares et al.

(2008) impute the implicit overstock and understock costs in a healthcare setting; see also Cohen

et al. (2003) for a supply-chain setting.

It is useful to turn first to the single class M/M/N queue—a markovian queue with one class of

customers served by one group of identical servers. Let WN be the steady-state waiting time when

the staffing level is N and consider the two problems

minN≥0 N :

s.t. E[f(WN)]≤ w,(constraint) min

N≥0N +ληE[f(WN)]. (cost)

In the first problem, there is a constraint on a metric f of the waiting time, and in the latter,

there is a cost—a customer who waits w imposes a cost of ηf(w) and this is multiplied by the

number λ of customers who arrive per hour. The implementation and reverse engineering here are

as follows: (i) given η, how should we set w in the constraint formulation to generate the same

outcome as the cost formulation? And (ii) given w, what can we say about η?

For illustration, consider the special case f(w) = w. Suppose that the mean service time is six

minutes and the arrival rate is λ= 300 customers per hour. Let cw and cs be the cost for one hour

of customer wait and for one hour of agent time, respectively. Consider the staffing problem

minN

csN + cwλE[W ].

Figure 1 displays the total cost as a function of the staffing level N for three values of relative

cost, η= cw/cs. For cw/cs = 3, the optimal solution is N∗ = 37, and the average waiting time under

this solution is 0.13 minutes (roughly 10 seconds). A service firm that has c2/cs = 3 and wishes

4

to outsource its call-center operations can guarantee that the outside provider’s staffing decision

reflects its tradeoff correctly by contracting the Average Speed of Answer (ASA) constraint problem

min N

s.t. E[W ]≤ 10 seconds.

Suppose, instead, that we do seek to infer the ratio cs/cw, knowing that a service provider uses

the ASA formulation with the 10 seconds target. Given the arrival rate λ and the mean service

time, we can find this implicit cost ratio by simple trial and error. As the figure shows, for instance,

η= 1 is too small (the optimal staffing is 35 and the resulting ASA is greater than 10 seconds) and

η= 5 is too large. Due to the integrality of N , there will be a range of values in the neighborhood

of η= 3 that do the job.

With the exception of this single class queue—in which the duality of cost and constraint is

straightforward—the two classes of staffing problems have been typically studied in isolation from

each other; see §2. Our paper is fundamentally about this duality—about the prescriptive trans-

lation of costs to constraints and about the reverse engineering of constraints to waiting-time

costs—going beyond the single class queue.

We consider the simplest of such multiclass systems—the so-called V model with a common

service rate; see Figure 2. Here, the cost parameters are vectors that determine which class’s waiting

time is more costly—class-1 waiting time might cost 1$ per hour while that of class 2 may be 2$

per hour—as are the constraints–requiring, for example, a 20-second average delay for one class of

customers, but allowing a 40-second delay for another class.

Figure 2 V-model

5

The operational implication of this heterogeneity of costs and constraints is that different cost

parameters lead optimally to different prioritization rules: for one cost structure it may be optimal

to use a static priority rule, while for another it may be optimal to use a more elaborate dynamic

rule. To relate a cost formulation to a constraint formulation, one must compare both components

of their solution—the staffing level (the number of servers) and the prioritization rule.

Let us revisit the ASA formulation. For a provider with a set I = {1, . . . , I} of customer classes,

the immediate generalization is given by the problem on the left of

min N

s.t. E[Wi]≤wi, i∈ I,

π ∈Π,N ∈Z+,

(ASA constraints)min N +

∑i ciλiE[Wi]

π ∈Π,N ∈Z+.(linear costs)

We prove that the ASA constraint (with w > 0) cannot serve as a perfect implementation of

linear or strictly convex costs. A provider with such customer-delay costs that writes a contract

with ASA (linear) constraints alone cannot guarantee that the outsourcer decision will represent

the desired tradeoff between capacity and delay costs. Thus, the ASA constraint, while common

in practice, is not a safe implementation.

Suppose, alternatively, that we observe the ASA constraints that a firm uses and seek to identify

its (implicit) delay costs. The mathematical intuition that dualization via Lagrange multipliers

should allow us to map constraints to costs (reverse engineer) is only partly correct; see Remark 2.

It turns out that multiple (very) different cost structures are consistent with the ASA formulation,

which means, in particular, that one cannot impute the originating cost structure. The formulation

is simultaneously consistent with strictly convex delay costs and some discontinuous and non

monotone costs.

Let us flesh out our setup. We restrict attention to family of power functions f(x) = xa. The

single parameter a captures whether the cost (or constraint) is strictly convex, strictly concave, or

linear.

Our formalization of the constraint and cost formulations is

min N

s.t. E [W ai ]≤wi, i∈ I,

π ∈Π,N ∈Z+,

(constraint)min N +

∑i∈I λiciE [W b

i ]

π ∈Π,N ∈Z+.(cost)

N above is the number of servers and π is the prioritization rule. Note that Wi depends on both of

these decisions. With a= 1, we have the ASA constraints. Strictly convex (concave) corresponds

to a > 1 (a < 1). Similarly, the exponent b captures the convexity/concavity of the cost. To avoid

6

trivialities, we assume that classes are truly heterogeneous, i.e., wi 6=wj for all i, j, and ci 6= cj for

all i 6= j (otherwise the classes can be merged).1

Our main results are as follows:

i. Strictly convex constraints are reversible to strictly convex waiting costs, and strictly convex

costs are perfectly implementable through convex constraints and have an imperfect implementation

as linear constraints. This means, for instance, that contracting convex constraints guarantees that

the solution used by the outsourcer will reflect the waiting vs. staffing tradeoff of the provider.

Linear (ASA) constraints are not guaranteed to reflect this tradeoff. It turns out they leave too

much freedom.

ii. A linear (ASA) constraint shares solutions with different (fundamentally different) waiting-

cost functions—one that is strictly convex and another cost that is not even monotone in the

customer waiting time. It is, thus, difficult to pin down the implicit delay costs that support the

constraints. In fact, the ASA constraint formulation is a true hybrid. It shares solutions with both

strictly convex constraint formulations (a> 1) and with strictly concave formulations (a< 1).

Linear waiting costs can be implemented trivially by degenerate version of any of convex, concave,

or linear constraints.

iii. Concave constraints (a < 1) with w > 0 do not share solutions with either linear, concave,

or convex costs and thus cannot be reversed within this family. The econometrician trying to

impute the delay costs will have to look elsewhere, possibly to discontinuous and non monotone

waiting-cost functions.

iv. Choosing the “best” implementation: We prove that a strictly convex cost formulation has

multiple implementations through constraints. This multiplicity of implementations begs the ques-

tion of which constraints to choose. Take, for instance, two implementations of a quadratic cost

problem through constraints: one with a= 2 and an appropriately chosen target vector w and the

other with a= 3 and a target vector w. Implementation here means that both constraint problems

will have the same solution set as the originating cost problem. Is there a reason to prefer one

implementation over the other? To compare these, it makes sense to introduce robustness consid-

erations. We find that the higher the a that is used, the less sensitive is the outcome to mistakes

in specifying the target: a 10% error in w will have a lesser effect on the outcome than a 10% error

in w.

Similarly, suppose one is comparing two service providers that differ in their actions (staffing

and prioritization) and wishes to impute their underlying (implicit) delay costs. If one assumes

1 It is possible also to consider different values of the exponent for different classes, i.e, bi and ai for class i. However,in optimal solutions only the smallest exponent will matter, as classes with higher exponents will be given the staticpriorities.

7

that the delay cost is cubic (b= 3), small differences between the firms’ actions translate into small

differences in (reversed) coefficients relative to using a quadratic (b= 2) delay structure: the higher

the exponent, the less sensitive are the reversed coefficients.

2. Literature Review

The staffing of queues with multiple servers is a thoroughly studied topic. The study of staffing

and prioritization of multiple customer classes is somewhat more recent.

A central challenge in solving these problems is finding the optimal prioritization of customers.

Multiple papers study the prioritization of multiple classes, for given staffing levels, with the

objective of minimizing various types of delay-related costs. Some examples are Atar et al. (2004),

Atar (2005), Tezcan and Dai (2010), Perry and Whitt (2009), Mehrotra et al. (2012), Bassamboo

et al. (2006). Some papers in this group study networks with multiple server pools, which requires,

in addition to prioritization, specifying the routing to servers.

Staffing and prioritization for the constraint formulation is studied, for example, in Wallace and

Whitt (2005), Pot et al. (2008), Gurvich et al. (2008), Pang and Perry (2014) and Jouini et al.

(2010). The recent paper Chan et al. (2014) lies in the intersection of the cost and constraint

formulations. It develops a heuristic for dualizing the constraints to costs and for optimizing the

routing given those costs. This first step can subsequently be used for staffing optimization.

We use the simplest multiclass queue to study the relationship between costs and constraints.

This mapping, as explained in the introduction, is conceptually straightforward for the single-class

case. One of the first papers to relate the formulations in this setting (and, indeed, one of the first

to study the optimization of the single class queue) is Borst et al. (2004); see Examples 2.1 and

2.2 therein.

An earlier exploration of the connection between delay constraints and delay costs appears in

Soh and Gurvich (2014). The paper studies the role of the well-known family of Generalized cµ

(Gcµ) rules—developed by Van Mieghem (1995) for the minimization of convex holding costs—in

solving a typical staffing problem, specifically, the Target Service Factor (TSF) formulation

min N

s.t. P{Wi >wi} ≤ αi, i∈ I,

π ∈Π,N ∈Z+,

where wi,wi are the acceptable waiting time (AWT) targets and α1, α2 are the Service Levels (SL).

The authors prove that no Gcµ rule can be optimal for the constraint problem—only certain non

monotone and discontinuous rules work. Thus, the TSF constraint problem, while widely used in

practice, does not provide a mechanism to implement monotone increasing costs. The gap, however,

8

can be closed by adding a service-level-differentiation (SLD) constraint: a formulation with both

TSF and SLD constraints can serve as an implementation of “reasonable” cost structures. Our

result about the augmentation of the ASA constraint formulation with a variance constraint is in

a similar spirit; see Remark 1.

Finally, there is an important connection between our work and that of Milner and Olsen (2008).

Fundamentally, both papers are about how to implement certain “intentions” into quality-of-service

constraints. Theirs is a context of outsourcing, where a single customer class is outsourced to an

outside provider that caters, in turn, to multiple clients and hence is operating a multiclass queue.

They show how certain contracted constraints can lead to undesirable consequences relative, say,

to serving the customers in-house. Our setting is different—we compare two multiclass settings

(one with delay costs and the other with constraints). In this context of multiclass queues, we

seek to formalize what “undesirable” means by explicitly relating delay costs to quality-of-service

constraints.

Staffing problems or general cost-minimization problems are typically difficult to solve and some

of the analytical works cited above resort to asymptotic approximations where one solves the

problem in the “limit”; solutions to the limit problem are identified as “asymptotically optimal” if

the gap between the real optimal solution and the suggested one becomes negligible as the system

size grows. We also follow this method.

The remainder of this paper is organized as follows: The model and analysis framework are

detailed in §3. The duality results appear in §4. These results build on explicitly solving both the

staffing and constraint problems in §5. The proofs of the main duality results appear in §6, and a

numerical study of robustness appears in §7.

3. Model and analysis framework

We consider a set I = {1, . . . , I} of customer classes all requiring processing from a single pool of

servers; see Figure 2. Arrivals follow I independent Poisson processes with rate λi for class i and

we let λ=∑

i∈I λi be the aggregate arrival rate.

Service time is exponentially distributed with a common mean (normalized for convenience)

of 1. The number of servers is denoted by N . The prioritization rule is denoted by π. If steady

state exists under the prioritization rule π and the staffing level N , the steady-state class-i waiting

time and queue length are denoted as WN,πi and QN,π

i . We restrict the prioritization rule π to be

admissible in the following sense:

Definition 1. (admissible policies) A prioritization rule π is admissible if:

1. There is no blocking: All incoming customers are eventually served.

2. Customers within the same class are served in a First-Come-First-Served (FCFS) manner.

9

3. The rule is work conserving.

Let Π be the family of admissible policies.

Work conservation implies, in particular, that the total number of customers in the system behaves

as the total number in a single-class M/M/N queue with the arrival rate λ=∑

i∈I λi.

The constrained formulation is written as

minN,π

N

s.t. E[(WN,πi

)a]≤wi, i∈ I, (1)

π ∈Π, N ∈Z+,

and the cost formulation as

minN,π

N +∑i∈I

λiciE[(WN,πi

)b](2)

s.t. π ∈Π, N ∈Z+.

Since a and b separate between convex, concave, or linear constraints/costs, they will be central

to our results. For ease of reference, we use the term “a-constraint problem” to refer to (1) with

exponent a and “b-cost problem” when referring to (2) with exponent b. To avoid trivialities, we

assume true heterogeneity throughout, i.e, that ci 6= cj and that wi 6=wj for all i 6= j.

Many-server analysis: Control and staffing problems for multiclass queues are difficult to solve

exactly. Much of the literature resorts to many-server approximations where one optimizes (instead

of the original system) an approximate, more tractable setting and justifies this by proving that

what is optimal for the approximate system is nearly optimal for the real system. Building on that

literature, we conduct our analysis in the approximate system. We use three characteristics of the

approximate system:

I. Number-in-system distribution: Recall that the steady-state total number of customers,

QNΣ , in our system is identical to that of an M/M/N queue. This distribution is approximated by

a continuous distribution (see Halfin and Whitt (1981)):

P{QN

Σ >x}≈(

1 +βΦ(β)

φ (β)

)−1

e−βx, (3)

where β = (N −λ)/√λ. In particular, we treat capacity as continuous and the requirement N ∈Z+

is replaced by N ≥ 0.

II. Ratio rule: Since QNΣ is invariant to the priority rule, controlling the system requires dis-

tributing this total queue between the I classes in the best way given the problem formulation. In

10

large systems, one can, with relative precision, meet targeted ratios. A longest-queue-first rule, for

example, guarantees that the total queue QΣ is distributed evenly among the I classes, i.e., that

QNi ≈

1

IQN

Σ .

Likewise, a simple generalization of the longest-queue-first rule can be used to generate arbitrary

fixed ratios pi, i∈ I, between the individual queues, i.e., (replacing 1/I with pi)

Qi (t)≈ pi ·QΣ (t) , (4)

where∑

i∈I pi = 1. We refer to this as a fixed ratio rule. A further generalization, also used here,

allows for general ratio functions, p(x) = (p1(x), . . . , pI(x)).

Targeted ratios p are implemented in the real system by using a simple tracking rule whereby a

server that becomes available at time t chooses queue i to serve where

Qi (t)− pi(QN

Σ (t))>QΣ (t)Qj (t)− pj

(QN

Σ (t))QN

Σ (t) .

for all i 6= j. Note that the special case p1 ≡ 1 and pi ≡ 0 for all i 6= 1 results in a static priority

rule. The fact that such a simple rule generates (4) is proved for relatively general (including

discontinuous) ratios in Soh and Gurvich (2014). Building on this previous literature, we will

assume that a tracking rule can be used to achieve targeted ratios.

III. Pathwise Little’s law: Little’s law relates the steady-state waiting time to the average

queue E[Qi] = λiE[Wi]. In many-server analysis, a stronger, sample-path form of Little’s lawQi (t)≈λiWi (t) holds under FCFS-within-class. We will make use of this property as well.

Thus, in what follows, we analyze the approximate system, which is one where the characteristics

I–III above hold. In particular, a solution to the staffing problems consists of N (and consequently

β = (N −λ)/√λ) and a ratio function p.

Two solutions to a staffing problem are equivalent if their β components are the same and their

ratio functions are (almost everywhere) identical. The solution to a problem is then said to be

unique if there is a single staffing component β and a single (up to almost everywhere equivalence)

ratio function p.

4. Duality results

Definition 1. (implementation) Given a,w and b, c, we denote by SC(b, c) and SQ(a,w),

respectively, the set of optimal solutions for the cost formulation with parameters (b, c) and the

constraint formulation with parameters (a,w) (Q stands here for Quality-of-Service constraints).

We say that a b-cost problem is perfectly implementable by an a-constraint problem if for each c

there exists a w such that SQ(a,w) = SC(b, c). We call it weakly implementable by an a-constraint

formulation if for each c there exists a w such that SC(b, c)⊆SQ(a,w) but inclusion is strict.

11

In other words, perfect implementation guarantees that any optimal solution that the optimizer

of the constraint formulation may choose reflects the tradeoffs inherent to the originating cost

formulation. Under a weak implementation, this optimizer may choose a solution that is consistent

with the originating waiting-capacity tradeoff but also one that is inconsistent with it.

Theorem 1. (implementation of costs as constraints)

1. The convex cost formulation (b > 1) has a unique solution (with a fixed ratio rule as its

priority component) and is perfectly implementable by a strictly convex constraint (a > 1). It is

weakly implementable by linear constraints (a= 1).

2. Optimal solutions to concave or linear cost formulations have a static-priority prioritization

component. They are implementable by degenerate (with wi = 0 for all but one class) convex,

concave, or linear constraints.

In implementing strictly convex (b, c) costs via a constraint formulation with exponent a ≥ 1

(including a= 1), the target w is chosen so that

wi/wj =

(cicj

)− ab−1

; (5)

see Propositions 4 and 5 for detailed solutions to the cost and constraint formulations. The unique

optimal solution to both the cost formulation and its implementation through constraints has, as

its prioritization component, a fixed ratio rule: a class j whose queue exceeds the fraction

pj =λjc− 1b−1

j∑i∈I λic

− 1b−1

i

(6)

of the total queue has priority over classes whose queues do not; see Proposition 4. In other words

the target queue for class j is a linear function of the total queue length, qΣ, as schematically

captured on the left-hand side of Figure 3. Static priority (as in item 2 of the theorem) is a special

case where the ratio is 0 for the high-priority classes.

By Theorem 1, the ASA constraint, while common in practice, is not a safe implementation of a

b-cost problem with any b≥ 1. The ASA formulation has a fixed-ratio solution Gurvich and Whitt

(2007, Theorem 5.1) that is shares with the originating strictly convex cost problem. However,

it also has additional—and very different–optimal solutions of which the ratio function follows a

bang-bang rule, as on the right Figure 3: it assigns the entire queue to one class up to a threshold

q∗ and then switches and assigns all the queue to the other class.

Consider, for example, the case of two classes and with quadratic waiting costs (b = 2) and

equal coefficients (c1 = c2 = c). For each value of c, we compute the optimal waiting costs W(c)

and construct the implementation through ASA constraints by computing the appropriate targets

w = (w1,w2). How good is the implementation is captured by whether its solution reflects the

12

0q*

Total queue qΣ

p1(qΣ)

0 Total queue qΣ

p1(qΣ)

Figure 3 Tracking policies: (left) fixed ratio; (right) bang-bang ratio

tradeoff between staffing and delay in the originating cost problem. By the very definition of

implementation, the staffing level will be the same as in the originating cost problem. Moreover, if

the optimizer of the constraint problem uses the fixed-ratio optimal solution, the resulting waiting

cost will be identical to those of the originating cost problem. This is not the case if the optimizer

uses the optimal bang-bang solution. In Figure 4, we plot the optimal waiting cost W(c) in the

originating cost formulation versus the waiting cost induced by the bang-bang solution to the ASA

implementation.

0

500

1000

1500

2000

2500

3000

3500

4000

4500

5000

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Waitin

g cost

c=c1=c2

Imperfection of an ASA implementation of convex costs

Optimal waiting cost

"Possible'' waiting‐cost outcome

λ1=200, λ2=300

Figure 4 The possible downside of ASA constraints as a representation for convex waiting costs

In summary, while seemingly reasonable so as to capture the relative importance of customers

(via the ratios of wi/wj), the ASA formulation is heavily “underspecified”. It shares solutions with

convex costs and constraints but also has other solutions. These additional solutions (which it also

13

shares with the concave constraint problem) do not capture the tradeoffs of any b-cost formulation.

If the provider wishes to contract ASA constraints, it must add additional terms to avoid such

outcomes.

Remark 1. (augmenting ASA with a variance constraint) Item 1 of Theorem 1 shows that

quadratic costs can be safely implemented via quadratic constraints. Yet, as mean and standard

deviation are more common operational measures (relative to, say, the second moment), it may be

more appealing, instead, to use an ASA formulation augmented by a constraint on the waiting-time

variance. This, it turns out, is feasible.

Fix b = 2 and cost coefficients c = (ci, i ∈ I). Take the perfect implementation of this cost

problem as a constraint problem with a= 2 and the appropriate targets (see the Proof of Theorem

1) w(2) = (w(2)i , i∈ I),

minN,π N

s.t. E[(WN,πi

)2]≤w(2)

i , i∈ I,

π ∈Π, N ≥ 0.

(2nd moment)

This problem shares its unique solution with the originating cost problem. Now let w(1)i =E[WN,π

i ]

be the average waiting time under this unique solution, and write the ASA + Variance formulation:

minN,π N

s.t. E[WN,πi

]≤w(1)

i , i∈ I,

V ar(WN,πi )≤ vi :=w

(2)i − (w

(1)i )2, i∈ I,

π ∈Π, N ≥ 0.

(ASA+Var)

Recalling how w(1)i was defined, it is evident that this constraint problem has the same optimal

objective-function value as the second-moment constraint problem and, moreover, that they share

the (unique) optimal prioritization rule. Thus, the ASA+Var formulation provides a safe implemen-

tation for the quadratic cost problem. In fact, since all strictly convex costs are outcome equivalent

(see Corollary 3), this formulation provides a safe implementation for any b-cost formulation with

b > 1.

Definition 2. (reverse engineering) We say that a a-constraint formulation is strongly

reversible to a b-cost formulation if for every w there exists a c such that SC(b, c) = SQ(a,w). We

say that it is weakly reversible to a b-cost formulation if (i) for every w there exists a c such that

SC(b, c)⊆SQ(a,w) and the inclusion is strict.

In other words, a constraint formulation is strongly reversible if we can find a cost formulation

that leads to exactly the same decisions. It is weakly reversible if there is no cost formulation that

shares all of its solutions: there may be another cost formulation that shares with the constraint for-

mulation the remaining solutions in which case the constraint is consistent with two fundamentally

distinct cost structures.

14

Theorem 2. (reverse engineering)

1. A strictly convex constraint (a> 1) is reversible to a b-cost formulation for any b > 1 (includ-

ing, in particular, b= a).

2. The linear constraint (a= 1) is weakly reversible to a b-cost formulation for each b > 1.

3. The concave constraint (a< 1) cannot be reversed (weakly or strongly) to a b-cost formulation

(regardless of the value of b).

For the econometrician, Theorem 2 says that if the call center uses strictly convex constraints,

it will be possible to estimate coefficients for strictly convex costs as there is, for each b, a unique

inversion. If, instead, the call center uses linear constraints, imposing a linear or strictly convex

cost structure for estimation is not necessarily the correct route—the “implicit” costs might not

be convex. Indeed, as we have seen in the discussion following Theorem 1, the linear constraint

formulation has some solutions that are consistent with quadratic costs. Its other solutions (which

have a bang-bang prioritization component) are consistent with non monotone and discontinuous

cost functions. Finally, if the call center uses concave constraints—the econometrician must look

beyond simple cost models.

The following is a direct corollary of Theorems 1 and 2.

Corollary 3 (multiplicity of representations). All a-constraint problems with a > 1

are equivalent to each other as are all b-cost problems with b > 1.

Constraint: For any (a,w) and a′ (a,a′ > 1) there exists a w′ such that SQ(a,w) = SQ(a′,w′).

Cost: For any (b, c) and b′ (b, b′ > 1), there exists a c′ such that SQ(a,w) = SQ(a′,w′).

In all cases, the rule component of the optimal solution is a fixed-ratio function.

The multiplicity of implementations through constraint of a single cost problem means that one

must make a choice on which formulation to use; we return to this question in §7.

Summary: The schematic Figure 5 summarizes our findings thus far. On the left-hand side,

we have the “world” of cost formulations—one can think of this as the space of parameters (b, c)

in the cost formulation. On the right, we have the world of constraint formulations—the space of

parameters (a,w). Each of these can be divided into its convex, linear, and concave subspaces. The

diamond shape is the world of outcomes—this is the space of staffing and prioritization rule pairs.

A b-cost is implementable by an a-constraint if there is a path between them in the graph, i.e.,

there is a solution that solves both of them. The fact that there is a path between strictly convex

costs and strictly convex constraints (passing through a single outcome) captures graphically item

1 of both Theorems 1 and 2. The double line connecting convex costs and constraints to fixed-

ratio solutions reflects Corollary 3, i.e, that multiple instances of convex problems share identical

solutions.

15

Concave Concave

Linear

Convex

Costs ConstraintDecisions (staffing and priorities)

Convex

Linear

Static priorities Fixed ratio Bang bang

Figure 5 Summary of implementability and reverse-engineering results

Linear constraints have paths connecting them (through fixed ratio solutions) to convex costs

but also to bang-bang solutions (denoted by black circles) that correspond to non monotone and

discontinuous costs and, in particular, to none of the b-costs. Concave constraints also have some of

these bang-bang solutions. This captures the weak implementability of costs as linear constraints

and the weak reversability of the latter. Per item 2 of Theorem 1, linear and concave costs have

static priority solutions and are implementable by degenerate versions of convex, linear, or concave

constraints (this is captured by the dashed line).

Remark 2. (on dualization via Lagrange multipliers) Considering a constraint problem

such as

minN,π

N

s.t. E[(WN,πi

)a]≤wi, i∈ I,π ∈Π, N ≥ 0,

a mathematically intuitive way to generate the reverse engineered costs is to dualize the constraints

via Lagrange multipliers to obtain a dual problem

g(η) := minN,π

N +∑i∈I

ηi

(E[(WN,πi

)a]−wi) : π ∈Π, N ≥ 0,

and consider the problem g= maxη≥0 g(η). It can be shown that when a> 1, this dual problem has

a unique solution η∗ and that the consequent waiting-cost problem is

g(η∗) := minN,π

N +∑i∈I

c∗iλiE[(WN,πi

)a]: π ∈Π, N ≥ 0,

16

where c∗i = η∗i /λi shares the optimal solution (staffing and prioritization) of the original strictly

convex constraint problem. Although this is not our method of derivation, our proposed reverse

(when one sets b= a) has c∗ as its coefficients.

For a ≤ 1, Theorem 2 shows that this straightforward dualization cannot generate the desired

results. If a = 1, item 2 of that theorem states that there is no reverse with b = 1 (except for

the trivial one with ci ≡ c for all i). If a < 1, even that option is not available. The concave cost

formulation shares no solutions with the concave constraint formulation.

A central building block in the proof of Theorems 1 and 2 is the full characterization of the optimal

solutions to both the cost and constraint formulations. We turn to this task in the next section

and complete the proofs of the duality theorems in §6.

5. Optimal solutions

Proposition 4. Suppose b > 1. The solution for the cost formulation (2) is unique and has the

ratio component

p∗j (qΣ) =λjc− 1b−1

j∑i∈I λic

− 1b−1

i

· qΣ.

The optimal staffing level is N(β∗) = λ+β∗√λ where β∗ is the unique solution to

√λ−

(I∑i=1

λic− 1b−1

i

)1−b

·Γ(b+ 1) ·

((β/√λ)b{

1 + βΦ(β)

φ(β)

})′((

β/√λ)b{

1 + βΦ(β)

φ(β)

})2 = 0.

Proof: By “pathwise Little’s law”,

N +∑i∈I

λiciE[(WN,pi

)b]= N +

∑i∈I

λiciE

[(QN,pi

λi

)b]= N +

∑i∈I

λ1−bi ciE

[(QN,pi

)b].

The optimal distribution p(·) of the total queue is derived by solving the following for all values

of qΣ:

min∑i∈I

λ1−bi ci (pi)

b

s.t.∑i∈I

pi = qΣ, (7)

pi (qΣ)≥ 0, i∈ I.

17

Denote the solution (as a function of qΣ) as p∗(qΣ). Using the KKT condition, it is easily verified

that the optimal solution (which is unique due to convexity) must satisfy that(λ1−bi ci (p

∗i (qΣ))

b)′

=

bλ1−bi ci (p

∗i (qΣ))

b−1are identical for all i ∈ I. The following function p∗ uniquely satisfies this

requirement (within the family of positive functions):

p∗j (qΣ) =

(λ1−bj cj

)− 1b−1∑

i∈I

(λ1−bi ci

)− 1b−1

· qΣ =λjc− 1b−1

j∑i∈I λic

− 1b−1

i

· qΣ.

Then

bλ1−bj cj

λjc− 1b−1

j∑i∈I λic

− 1b−1

i

· qΣ

b−1

= b

1∑i∈I λic

− 1a−1

i

b−1

(qΣ)b−1

,

is all the same for j ∈ I. Hence, p∗ is optimal, but our derivation of this function does not mean

that it is the unique optimal ratio function (there might be a function that is, for specific values

of qΣ, not optimal for (7)). Next, we establish that this can happen only over a set of values of qΣ

with measure 0.

Let p be another ratio function that is not equivalent to p∗. Then the measure of the set A=

{qΣ ≥ 0 : p∗ (qΣ) 6= p (qΣ)} is nonzero. For qΣ ∈A,∑

i∈I λ1−bi ci (p

∗i (qΣ))

b −∑

i∈I λ1−bi ci (pi (qΣ))

b> 0

and vice versa, because p∗ (qΣ) is the unique solution for problem (7).

We will show that there exists ε > 0 such that m (Bε)> 0 for

Bε =

{qΣ :

∑i∈I

λ1−bi ci (p

∗i (qΣ))

b−∑i∈I

λ1−bi ci (pi (qΣ))

b> ε

}

Define Bm (m= 1,2, ...) as follows:

Bm =

{qΣ :

1

2m+1<∑i∈I

λ1−bi ci (p

∗i (qΣ))

b−∑i∈I


b ≤ 1

2m.

}.

Also define B0 as

B0 =

{QΣ :

1

2<∑i∈I

λ1−bi ci (p

∗i (qΣ))

b−∑i∈I


b

}.

Evidently,⋃∞m=0Bm =A and the sets Bm’s are disjoint.

By countable additivity of the Lebesgue measure,

∞∑m=0

m (Bm) =m (A)> 0,

and it cannot be the case that m (Bm) = 0 for all m. Let m∗ be the minimum m with m (Bm)> 0.

Then 1/2m∗+1makes ε > 0 such that m (Bε)> 0 holds.

18

Now let δ :=m (Bε). Let q be a positive number that satisfies

m [0, q]>δ

2.

The cost difference between p∗ and p satisfies the following (where fN is pdf of QNΣ ):

(costdifference) =∑i∈I

λ1−bi ciE

[(QN,p∗

i

)b]−∑i∈I

λ1−bi ciE

[(QN,pi

)b]=∑i∈I

∫ ∞0

λ1−bi ci

(p∗i(QN

Σ

))bfN(QN

Σ

)dQN

Σ −∑i∈I

∫ ∞0

λ1−bi ci

(pi(QN

Σ

))bfN(QN

Σ

)dQN

Σ

≥∑i∈I

∫Bελ1−bi ci

(p∗i(QN

Σ

))bfN(QN

Σ

)dQN

Σ −∑i∈I

∫Bελ1−bi ci

(pi(QN

Σ

))bfN(QN

Σ

)dQN

Σ

≥∑i∈I

∫Bε∩[0,q]

λ1−bi ci

(p∗i(QN

Σ

))bfN(QN

Σ

)dQN

Σ −∑i∈I

∫Bε∩[0,q]

λ1−bi ci

(pi(QN

Σ

))bfN(QN

Σ

)dQN

Σ

≥ ε · fN (q) · δ2> 0,

where, for the last inequality, we used the fact that fN(q)> 0 for all q≥ 0; recall (3). We conclude

that p cannot be optimal and, in particular, that p∗ is unique.

For the optimal staffing level, notice that, for a given N , with the optimal function p∗, the

objective function value is given by

N +∑i∈I

λ1−bi ciE

[(QN,πi (∞)

)b]= N +

∑j∈I

λ1−bj cj

λjc− 1b−1

j∑i∈I λic

− 1b−1

i

b

E[(QN

Σ (∞))b]

= N +

(∑i∈I

λic− 1b−1

i

)1−b

E[(QN

Σ (∞))b]

= λ+β√λ+

(∑i∈I

λic− 1b−1

i

)1−bΓ(b+ 1)(

β/√λ)b{

1 + βΦ(β)

φ(β)

} .The first-order condition is then given by

√λ−

(∑i∈I

λic− 1b−1

i

)1−b

·Γ(b+ 1) ·

((β/√λ)b{

1 + βΦ(β)

φ(β)

})′((

β/√λ)b{

1 + βΦ(β)

φ(β)

})2 = 0.

The optimal β (and the optimal staffing level N = λ + β√λ) is characterized by the unique

solution to this equation supposing that the function is convex in β. To see this define the function

g (β) as

g (β) :=

((β/√λ)b{

1 + βΦ(β)

φ(β)

})′((

β/√λ)b{

1 + βΦ(β)

φ(β)

})2 .

19

Then g′ (β) = g1 (β)/g2 (β), where

g1 (β) := −4 ·β−2−b(2(b+ b2 + 2(b− 1)β2 +β4

)+β

(2b2 + 4b+ (4b− 1)β2 + 2β4

)2√

2exp(β2/2

)∫ ∞−β/√

2

exp(−x2

)dx

+β2(2 + b2 +β2 +β4 + b

(3 + 2β2

))4exp

(β2)·(∫ ∞−β/√

2

exp(−x2

)dx

)2

),

and

g2 (β) :=

(2 + 2

√2βexp

(β2/2

)∫ ∞−β/√

2

exp(−x2

)dx

)3.

Clearly, g1 (β)< 0 and g2 (β)> 0 for b > 1, so that g′ (β)< 0. We conclude that the second-order

condition for the objective function value is satisfied, as required, since

−

(∑i∈I

λic− 1b−1

i

)1−b

·Γ(b+ 1) · g′ (β)> 0,

which concludes the proof. �

Proposition 5. Suppose a> 1. The solution for (1) is unique and has the ratio component

pj (qΣ) =λjw

1/aj∑

i∈I λiw1/ai

· qΣ, (8)

and the optimal staffing is given by N(β∗) = λ+β∗√λ, where β∗ is the unique solution to

Γ(a+ 1)(β/√λ)a{

1 + βΦ(β)

φ(β)

} =n∑i

λiw1/ai .

Proof: By the pathwise Little law (see §3), (1) is equivalent to

minN,p

N

s.t. E[(QN,pi

)a]≤ λaiwi, i∈ I, (9)

π ∈Π,N ≥ 0.

N(β∗) and p∗ satisfy

E[(QN,pj

)a]=

(λjw

1/aj∑

i∈I λiw1/ai

)aE[(QN

Σ

)a]=

∫ ∞0

1

1 + βΦ(β)

φ(β)

·(β/√λ)

exp(−xβ/

√λ)xadx

=

(λjw

1/aj∑

i∈I λiw1/ai

)a( n∑i

λiw1/ai

)a= λajwj

20

To prove that (N(β∗), p∗) is optimal and unique, suppose, towards contradiction, that there exist

p and N ≤N(β∗) that satisfy the constraints in (9).

We are first going to show that (N(β∗), p∗) are optimal for the cost formulation

min N +∑i∈I

λ1−aciE [Qai ] , (10)

with the specific coefficients

cj =K ·w−a−1a

j , (11)

where K is

K =− 1

Γ(a+ 1)·

(∑j∈I

λiw1ai

)a−1((β/√λ)a{

1 + βΦ(β)

φ(β)

})2

((β/√λ)a{

1 + βΦ(β)

φ(β)

})′β=β∗

. (12)

Then

N +∑j∈I

λ1−ai cjE

[(QN,pj

)a]= N +

∑j∈I

λ1−a ·K ·w−a−1a

j E[(QN,pj

)a].

By Proposition 4, this problem has a unique ratio solution, given exactly by p∗ in (8),

pj (qΣ) =λjc− 1a−1

j∑i∈I λic

− 1a−1

i

· qΣ =K−

1a−1λjw

− 1a

j∑i∈IK

− 1a−1λiw

− 1a

i

· qΣ =λjw

1/aj∑

i∈I λiw1/ai

· qΣ,

and the objective function value under this optimal decision is

N +∑j∈I

λ1−aj cjE

[(QN,pj

)a]= N +

∑j∈I

λ1−aj Kw

−a−1a

j

λajwj(∑i∈I λiw

1/ai

)a ·E [(QNΣ

)a]= N +

n∑j=1

Kλjw

1/aj(∑

i∈I λiw1/ai

)a ·E [QaΣ]

= N +K

(∑i∈I

λiw1/ai

)1−a

· Γ(a+ 1)(β/√λ)a{

1 + βΦ(β)

φ(β)

} .The solution N = λ+β

√λ for the problem

λ+β√λ+K

(∑i∈I

λiw1/ai

)1−a

· Γ(a+ 1)(β/√λ)a{

1 + βΦ(β)

φ(β)

}must satisfy the first-order condition if the second-order condition is met:

√λ+K

(∑i∈I

λiw1/ai

)1−a

·Γ(a+ 1) ·

1(β/√λ)a{

1 + βΦ(β)

φ(β)

}′ = 0.

21

Plugging in K from (12), we see that β∗ (from N∗ = λ+β∗√λ) satisfies this first-order condition.

The second-order condition is easy to see as in Proposition 4. Also define g (β) as in Proposition 4

except for the change of the exponent b to a. Since g′ (β)< 0, the second-order condition is met.

Now let us return to the constraint formulation. By our assumption, that (N , p) is feasible for

(9) with N ≤N∗. Then (10) is now

N +∑j∈I

λ1−aj Kw

−a−1a

j E[(QN,pj

)a] ≤ N∗+∑j∈I

λ1−aj Kw

−a−1a

j λajwj

= N∗+K∑i∈I

λiw1/ai

= N∗+KΓ(a+ 1)(

β/√λ)a{

1 + βΦ(β)

φ(β)

}But the right-hand side is the optimal cost with the staffing level N∗ and the ratio function

p∗, which means that there are two different solutions to the cost-minimization problem (10),

contradicting Proposition (4). �

Proposition 6. Suppose a≤ 1. The optimal staffing for (1) is the same as that for the single-

class problem:

minN

N

s.t. E[(QN

Σ

)a]≤∑i∈I

λaiwi. (13)

N ≥ 0.

Proof: By the pathwise Little’s law (1) is equivalent to

minN,p

N

s.t. E[(QN,pi

)a]≤ λaiwi, i∈ I. (14)

π ∈Π, N ≥ 0.

Next, by the triangle inequality for p-norm(∑

i∈I xpi

)(1/p) ≤∑

i∈I xi, so that for all q≥ 0,(∑i∈I

qi

)a≤∑i∈I

qai .

Therefore, if N is feasible for (14), it must be the case that

E[(QN

Σ

)a]=E

[(∑i∈I

QN,pi

)a]

≤E

[∑i∈I

(QN,pi

)a]=∑i∈I

E[(QN,pi

)a]≤∑i∈I

λaiwi.

22

It is then evident that the staffing solution for (14) is greater than or equal to the one for

(13). Let us denote it by N(β∗) = λ+√λβ∗. Thus, if we can find a ratio function that, with the

staffing solution N(β∗), satisfies the constraints in (14), it must be that N(β∗) is optimal for this

constrained problem. We claim that the following function p∗ does the job:

p∗i (qΣ) =

0, 0≤ qΣ <xi−1,

qΣ, xi−1 ≤ qΣ ≤ xi,0, xi < qΣ,

(15)

where x0 = 0 and xi’s are defined recursively by∫ xi

xi−1

1

1 + β∗Φ(β∗)φ(β∗)

·(β∗/√λ)

exp(−xβ/

√λ)xadx= λaiwi.

The existence of xi’s follows from the fact that β∗, by its definition as the optimal solution to (13),

satisfies ∫ ∞0

1

1 + β∗Φ(β∗)φ(β∗)

·(β∗/√λ)

exp(−xβ∗/

√λ)xadx = E

[(QN,p

Σ

)a]=∑i∈I

λaiwi.

Applying the definition of p∗ in (15), we have that

E[(QN∗,p∗

i

)a]=

∫ ∞0

1

1 + β∗Φ(β∗)φ(β∗)

·(β∗/√λ)

exp(−xβ/

√λ)

(pi (x))adx

=

∫ xi

xi−1

1

1 + β∗Φ(β∗)φ(β∗)

·(β∗/√λ)

exp(−xβ/

√λ)xadx

= λaiwi,

and we conclude that, with a≤ 1, N∗ = λ+ β∗√λ and p∗ are feasible for (14) and, in particular,

optimal. Since (14) is equivalent to (1), (N(β∗), p∗) is optimal for the latter with a≤ 1. �

Proposition 7. Suppose b≤ 1. An optimal ratio rule for

minN,p

N +∑i∈I

λiciE[(WN,pi

)b]s.t. π ∈Π,

is given by the following ratio functions:

pi∗ (qΣ) = qΣ, and pi (qΣ) = 0, i 6= i∗,

where i∗ is a class that satisfies λ1−bi∗ ci∗ = minj∈I λ

1−bj cj. The optimal staffing is given by N(β∗) =

λ+β∗√λ, where β∗ is the optimal solution to

minβ≥0

N(β) +λ1−bi∗ ci∗E

[(QN(β)Σ

)b]= N(β) +λ1−b

i∗ ci∗Γ(b+ 1)(

β/√λ)b{

1 + βΦ(β)

φ(β)

} .

23

Proof: By the pathwise Little’s law,

N +∑i∈I

λiciE[(WN,pi

)b]= N +

∑i∈I

λiciE

[(QN,pi

λi

)b]= N +

∑i∈I

λ1−bi ciE

[(QN,pi

)b].

For fixed N , the optimal allocation of a total queue length qΣ is derived by solving for each qΣ:

min∑i∈I

λ1−bi cip

bi

s.t.∑i∈I

pi = qΣ,

pi ≥ 0, i∈ I.

It is easy to solve the problem using KKT conditions from the Lagrangian:

L=∑i∈I

λ1−bi cip

bi + s

(qΣ−

∑i∈I

pi

)−∑i∈I

tipi.

The use of the KKT method is justified by the regularity of linear constraints. s and ti’s denote

the Lagrangian multipliers (these may depend on the parameter qΣ). The conditions are

1. bλ1−bi cip

b−1i − s− ti = 0, i∈ I,

2. qΣ−∑

i∈I pi = 0,

3. tipi = 0,

4. ti ≥ 0.

Let I∗ be the set of i’s with pi > 0. Then for i∈ I∗, ti = 0 and bλ1−bi cip

b−1i = s. By condition 2,

∑i∈I

pi =∑i∈I∗

pi =∑i∈I∗

(s

bλ1−bi ci

) 1b−1

= qΣ,

and s is calculated to be

s=

∑i∈I∗(bλ1−b

i ci) 1

1−b

qΣ

1−b

,

so that

pi =

(bλ1−b

i cis

) 11−b

=(bλ1−b

i ci) 1

1−b qΣ∑i∈I

(bλ1−b

i ci) 1

1−b

for all i∈ I∗. Thus, the objective function is then

∑i∈I∗

λ1−bi cip

bi =

∑i∈I∗

λ1−bi ci

(bλ1−b

i ci) 1

1−b∑j∈I∗

(bλ1−b

j cj) 1

1−b

b

qbΣ =∑i∈I∗

λ1−bi ci

(bλ1−b

i ci) b

1−b(∑j∈I∗

(bλ1−b

j cj) 1

1−b)b qbΣ

24

=

∑i∈I∗

(bbλ1−b

i ci) 1

1−b(∑j∈I∗

(bλ1−b

j cj) 1

1−b)b qbΣ =

∑i∈I∗

(λ1−bi ci

) 11−b(∑

j∈I∗(λ1−bj cj

) 11−b)b qbΣ

=

(∑i∈I∗

λic1

1−bi

)1−b

qbΣ.

Notice that

(∑i∈I∗ λic

11−bi

)1−b

only increases if the number of elements in I∗ increases. Hence,

the cost is minimized when only one queue is positive (at least one must be positive when qΣ > 0

to meet the condition that∑

i pi = qΣ). By choosing i∗ that satisfies λ1−bi∗ ci∗ ≤ λ1−b

i ci for all i ∈ I,

the cost is minimized, which proves that the specified ratio function is optimal. �

6. Proofs of dualization theorems6.1. Proof of Theorem 1

Proof of 1. Consider the problems (18) and (17). We will show that given a, b > 1 and c, we can

find w such that SQ(a,w) = SC(b, c).Let the optimal ratio functions pcost and pconst be again as in Propositions (4) and (5); see (19).

Suppose a, b, and ci’s are given. Define wi’s to be

wi =Kconst · c− ab−1

i . (16)

Then

pconstj (qΣ) =λjw

1/aj∑

i∈I λiw1/ai

·QΣ =λj

(Kconst · c

− ab−1

j

)1/a

∑i∈I λi

(Kconst · c

− ab−1

i

)1/a·QΣ

=λj · c

− 1b−1

j∑i∈I λic

− 1b−1

i

·QΣ = pcostj (QΣ) .

It remains to show that with this choice of w, the staffing levels are the same. The optimal staffing

for (17) is given by N(βcost), where βcost is the unique solution to

1 +

(∑i∈I

λic− 1b−1

i

)1−b

·Γ(b+ 1) ·

1((β/√λ))b{

1 + βΦ(β)

φ(β)

}′

= 0.

See Proposition 4. Define

Kconst =1(∑n

i λi · c− 1b−1

i

)a · Γ(a+ 1)(βcost/

√λ)a{

1 +βcostΦ(βcost)φ(βcost)

} .By Proposition (5), the optimal staffing for (18) is the unique solution β to

Γ(a+ 1)((β/√λ))a{

1 + βΦ(β)

φ(β)

} =

(n∑i

λiw1/ai

)a.

25

We claim that βcost is this solution when wi is set as in (16). Indeed,(n∑i

λiw1/ai

)a=

(n∑i

λi (Kconst)

1a · c

− 1b−1

i

)a

= Kconst ·

(n∑i

λi · c− 1b−1

i

)a=

Γ(a+ 1)(βcost/

√λ)a{

1 +βcostΦ(βcost)φ(βcost)

} .Thus, we find that with this choice of wi, both the ratio and staffing components are identical as

required.

Proof of 2. Any a-constraint formulation trivially lets static priority solutions by letting wi = 0

for all i except for one class.

�

6.2. Proof of Theorem 2

Proof of 1. Using the pathwise Little’s law, Wi is replaced by Qi/λi, so that the cost formulation

(2) is equivalent to

min N +∑i∈I

λ1−bi ciE

[(QN,pi

)b]s.t. p∈P, N ≥ 0. (17)

Similarly, (1) is equivalent to

minN,p

N

s.t. E[(QN,pi

)a]≤ λaiwi, i∈ I, (18)

p∈P,N ≥ 0,

and we focus on these two problems.

Consider the case a, b > 1. Let the optimal ratio functions be pcost and pconst for (17) and (18),

respectively. Then, by Propositions 4 and 5, we have the optimal ratio functions

pcostj (QΣ) =λjc− 1b−1

j∑i∈I λic

− 1b−1

i

·QΣ and pconstj (QΣ) =λjw

1/aj∑

i∈I λiw1/ai

·QΣ. (19)

We will first show that for an arbitrary choice of a > 1, b > 1, and wi’s (wi > 0), there exist ci’s

such that SC(a,w) = SQ(b, c). Define ci’s as

ci =Kcost ·(

1

wi

) b−1a

, (20)

26

where Kcost will be defined in (21) below. Then

pcostj (QΣ) =λjc− 1b−1

j∑i∈I λic

− 1b−1

i

·QΣ =

λj

(Kcost ·

(1wi

) b−1a

)− 1b−1

∑i∈I λi

(Kcost ·

(1wi

) b−1a

)− 1b−1

=λj ·(

1wi

)−1/a

∑i∈I λi ·

(1wi

)−1/a=

λj ·w1/aj∑

i∈I λi ·w1/ai

= pconstj (QΣ) .

Both formulations share the same ratio function and we have found the desired c (recall that

this ratio function is unique by Propositions 4 and 5). It remains to show that the staffing levels

are the same.

By Proposition 5, the staffing for (1) (and, in particular, for the equivalent (18)) is given by

N const(βconst) = λ+βconst√λ, where βconst is the unique solution to

Γ(a+ 1)((β/√λ))a{

1 + βΦ(β)

φ(β)

} =

(n∑i

λiw1/ai

)a.

Define

Kcost =

√λ

Γ(b+ 1) ·(∑n

j=1 λjw1/aj

)1−b ·

(((

β/√λ))b{

1 + βΦ(β)

φ(β)

})2

(((β/√λ))b{

1 + βΦ(β)

φ(β)

})′β=βconst

. (21)

The objective function value of (17) with ci as in (20) and with the optimal ratio function

pcost = pconst is given for a given staffing N by

N +n∑j=1

λ1−bj Kcost ·

(1

wj

) b−1a

·E[(QN,pj

)b]

= N +n∑j=1

λ1−bj Kcost ·

(1

wj

) b−1a

·

(λjw

1/aj∑

i∈I λiw1/ai

)b·E[(QN

Σ

)b]

= N +n∑j=1

λ1−bj Kcost ·

(1

wj

) b−1a

·

(λjw

1/aj∑

i∈I λiw1/ai

)b· Γ(b+ 1)((

β/√λ))b{

1 + βΦ(β)

φ(β)

}= N +

n∑j=1

Kcost ·λjw

1/aj(∑

i∈I λiw1/ai

)b · Γ(b+ 1)((β/√λ))b{

1 + βΦ(β)

φ(β)

}= N +Kcost ·

(n∑j=1

λjw1/aj

)1−b

· Γ(b+ 1)((β/√λ))b{

1 + βΦ(β)

φ(β)

} .

27

The second-order condition is verified as in the proof of Proposition 4:

√λ−Kcost ·

(n∑j=1

λjw1/aj

)1−b

·Γ(b+ 1) ·

(((β/√λ))b{

1 + βΦ(β)

φ(β)

})′(((

β/√λ))b{

1 + βΦ(β)

φ(β)

})2 = 0.

It can be now verified that βconst is a solution (and hence the unique solution) to this first-order

condition and thus also optimal for the cost formulation as required.

Proof of 2. Suppose that more than one of the wi’s are strictly positive. From Propositions 6

and 7, only static priority policies are optimal and hence only one customer class should have a

positive queue in b-cost formulations with b≤ 1. Then no linear constraint formulation can be an

implementation of b-cost formulation with b≤ 1 as the optimal solutions for the linear constraint

formulation have the expected queue of each class positive. By Proposition 6, the optimal staffing

for a-constraint formulation with a≤ 1 is given by the solution for (13). For a= 1,

E[(QN

Σ

)a]=E

[(∑i∈I

QN,πi

)a]=∑i∈I

E[(QN,πi

)a]=∑i∈I

λaiwi.

By the constraints, ∑j∈I,j 6=i

E[(QN,πj

)a]≤ ∑j∈I,j 6=i

λaiwi

and

E[(QN,πi

)a]≥ λaiwi.This applies to all i ∈ I, and hence no customer class queue can have a zero expected queue;

thus, the linear constraint formulation is not an implementation of b-cost formulation with b≤ 1.

In b-cost formulation with b > 1, the unique solution is FQR as is shown in Proposition 4. The

linear constraint formulation also has FQR as a solution (see Gurvich et al. (2008)) and hence is

an implementation of b-cost formulation with b > 1.

Proof of 3. For a < 1 we show that all the ratio policies are bang-bang rules: at each value of

qΣ (except maybe for a set of measure zero), the queue of one class is positive while others are 0.

Suppose this is not the case, i.e., one the optimal prioritization policy for

minN,π

N

s.t. E[(WN,πi

)a]≤wi, i∈ I, (22)

π ∈Π, N ∈Z+,

has a tracking function pi such that pi (qΣ)> 0 and pj (qΣ)> 0 for some qΣ with positive measure.

Define A, M1 and M2 as

A := {qΣ : pi (qΣ)> 0, pj (qΣ)> 0} ,

28

Mi :=

∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(pi (qΣ))

adqΣ,

Mj :=

∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(pj (qΣ))

adqΣ.

Then define B ⊂A to satisfy

Mi :=

∫qΣ∈B

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(pi (qΣ) + pj (qΣ))

adqΣ.

Now define a new ratio function p′ (qΣ), which is a modification of p (qΣ) as follows:

p′i (qΣ) = pi (qΣ) + pj (qΣ) for qΣ ∈B,

p′j (qΣ) = 0 for qΣ ∈B,

p′i (qΣ) = 0 for qΣ ∈A∩Bc,

p′j (qΣ) = pi (qΣ) + pj (qΣ) for qΣ ∈A∩Bc.

For the original ratio function p (qΣ),∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)((pi (qΣ))

a+ (pi (qΣ))

a)dqΣ =M1 +M2.

But for p′ (qΣ),∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)((p′i (qΣ))

a+ (p′i (qΣ))

a)dqΣ

=

∫qΣ∈B

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)((p′i (qΣ))

a+ (p′i (qΣ))

a)dqΣ

+

∫qΣ∈A∩Bc

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)((p′i (qΣ))

a+ (p′i (qΣ))

a)dqΣ

=

∫qΣ∈B

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ


adqΣ

+

∫qΣ∈A∩Bc

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ


adqΣ

=

∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ


adqΣ

<

∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)((pi (qΣ))

a+ (pi (qΣ))

a)dqΣ

= Mi +Mj. (23)

The inequality above follows from the simple inequality xa1 + xa2 > (x1 +x2)a

where a < 1 and

x1, x2 > 0.

29

Since ∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(p′i (qΣ))

adqΣ

=

∫qΣ∈B

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(p′i (qΣ))

adqΣ

+

∫qΣ∈A∩Bc

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(p′i (qΣ))

adqΣ

=

∫qΣ∈B

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(p′i (qΣ))

adqΣ + 0

=

∫qΣ∈B

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ


adqΣ

= Mi,

the following holds by (23):∫qΣ∈A

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(p′j (qΣ)

)adqΣ <Mj

Recalling the definition of Mj and that p (qΣ) = p′ (qΣ) for qΣ /∈A, we have∫ ∞0

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(p′j (qΣ)

)adqΣ

<

∫ ∞0

1

1 + βΦ(β)

φ(β)

· β√λ· exp

(− β√

λqΣ

)(pi (qΣ))

adqΣ,

and, therefore, E[(QN,p′

j

)a]< E

[(QN,pj

)a]. Note that E

[(QN,p′

k

)a]= E

[(QN,pk

)a]for all k ∈ I ∩

{j}c. Let w′j :=E[(QN,p′

j

)a]. Then p′ satisfies the constraints of the problem:

minN,π

N

s.t. E[(WN,πi

)a]≤wi, i∈ I ∩{j}c,E[(WN,πj

)a]≤w′j,π ∈Π, N ∈Z+.

By Proposition 6, the optimal staffing for the above problem is lower than the one for (22). This

contradicts to the assumption that p (qΣ) is an optimal solution for (22) since p′ (qΣ) can satisfy

the constraints at a lower staffing costs. We conclude that any optimal policy should be bang-bang

rules.

As no solution for the b-constraint formulation has this extreme bang-bang rule, the a-constraint

formulation with a< 1 cannot be an implementation of any b-cost formulation. �

30

7. A numerical study of robustness

Recall (Corollary 3) that, with strictly convex constraints or strictly convex waiting costs, multiple

formulations are equivalent in terms of their solutions. If one wants to implement given costs

through constraints, there is a plethora of options to choose from. Similarly, a given constraint can

be an implementation of multiple “originating” costs, in which case results of reverse engineering

may differ depending on the model the econometrician chooses to estimate. We next study the

sensitivity of formulations to the cost and target parameters c and w as a way to guide to the choice

of constraint (specifically, that of a) for implementation and that of cost structure (specifically, the

value of b) for reverse engineering. In our examples, we use a system with two customer classes,

i.e., with I = {1,2}.

Implementation through constraints

By Corollary 3, we can choose w and w so that the following two problems (with a= 2 and a= 3),

min N

s.t. E[W 2i ]≤ wi, i∈ I,

π ∈Π,N ∈Z+,

min N

s.t. E[W 3i ]≤ wi, i∈ I,

π ∈Π,N ≥ 0,

have the same unique solution. To obtain a meaningful comparison, we turn to a question of

robustness. We ask which formulation is more sensitive to misspecification of the targets w and w.

Our base case has w= (0.01,0.02) and w= (0.027,0.008)

The staffing for these two equivalent formulations is given by β = 0.4, so that, with λ = 500,

N = λ+ 0.4√λ = 508. For each of the values of δ ∈ {2,3}, we perturb the targets w1 and w1 by

multiplying both by values δ ∈ [0.93,1.07]. We recompute the optimal value of β corresponding

to the perturbed parameters—that is, we solve the staffing problem for the δ-perturbed values to

arrive at the staffing constant βδ (β = β1).

Figure 7 displays βδ against δ for each of the cases a = 2 and a = 3. Evidently, the cubic for-

mulation is less sensitive to the specification of the target, and the formulation with a= 2 is more

sensitive to “mistakes” in specifying the targets.

We have repeated this experiment for various initial values of w (and hence β), obtaining the

same qualitative insight. When choosing between two equivalent constraint formulations, picking

a higher (rather than smaller) power is “safer” in terms of robustness to misspecification of the

constraints.

Reverse engineering

Next, we consider the effect of the exponent b in the cost formulation. Our departure point is

a constraint formulation that a service provider is using and has a > 1. Its optimal solution is

31

0.9975

0.9985

0.9995

1.0005

1.0015

0.993 0.995 0.997 0.999 1.001 1.003 1.005 1.007

Sensitivity of staffing (β) to perturbations of the target w

a=2

a=3

Figure 6 Sensitivity of staffing (β) to perturbations of the target w

given by a staffing level and a fixed ratio rule (see item (i) of Theorem 2). The “econometrician”

observes these actions—the staffing level and the prioritization rule—and chooses a cost structure,

specifically, a b value. He then seeks to identify the coefficients ci, i∈ I such that the b-cost problem

has the provider’s action as its solution—this is the action of reverse engineering the costs.

Since there are two customer classes, the service provider’s decision is fully specified by the

staffing level N and the ratio of class 1, p1 (the ratio of class 2 is then p2 = 1− p1). Given the pair

(N,p1) and fixing b, we can find cost coefficients c1 and c2 such that (N,p1) is the optimal solution

to the cost-minimization problem:

minN,π

N +∑i∈I

λic∗iE[(WN,πi

)b]: π ∈Π, N ≥ 0.

The coefficients ci, i= 1,2 depend, of course, on b: given two different actions, (N,p1) and (N , p),

and fixing b, we can compute the differences in the cost coefficients. By subsequently varying b, we

can see how changes in providers’ actions map into changes in the “dualized” cost coefficients c1

and c2.

For this experiment, we take the initial firm’s action to be the ratio p1 = 0.4 (and p2 = 1− 0.4 =

0.6), and the staffing level β = 0.4 (with λ= 500, we have N = 500+0.4 ·√

500≈ 508). The dualized

coefficients are given by c = (1.024,0.682) for b = 2 and c = (4.291,1.907) for b = 3. That is, the

cost formulations

min N +∑

i∈I λiciE [W 2i ]

π ∈Π,N ∈Z+

andmin N +

∑i∈I λiciE [W 3

i ]

π ∈Π,N ∈Z+

have the same solution given by the fixed ratios p1,1− p1 and by N = 508.

32

We now (multiplicatively) perturb the actions of the provider—specifically, the values of β and

p1—to see how this affects the reversed coefficients. We perturb β by values in the range [0.98,1.02]

so that βα = αβ (again, β = β1) and we do the same for p1. We re-compute the corresponding

parameters for each combination of β and p1. That is, for each combination (β,p1), we compute

the reverse coefficients (c1, c2) for b= 2 and b= 3. We then capture the change relative to the base

by the metric

max{c1/c1, c2/c2, c1/c1, c2/c2}.

The results, for the cross-sections p1 ∈ {0.98p1, p1,1.02p1}, are displayed in Figure 7. When there

is no perturbation (i.e., p1 = p1 and β = β) the value of the metric is 1.

0.98

1

1.02

1.04

1.06

1.08

1.1

1.12

1.14

1.16

0.98 0.99 1 1.01 1.02Staffing (β) perturbation

b=2

1

0.981.02

Ratio (p1) perturbation

0.98

1

1.02

1.04

1.06

1.08

1.1

1.12

1.14

1.16

0.98 0.99 1 1.01 1.02Staffing (β) perturbation

b=3

0.98

1.02

1

Ratio (p1) perturbation

Figure 7 Sensitivity of reversed coefficients to actions with varying β

One can observe that the sensitivity is consistently greater for b = 3. That is, changes in the

provider’s actions (staffing and prioritization solutions) lead to greater changes in the dualized

cost coefficients when b is larger. We repeated these experiments for various base values β and

p1 and observed the same pattern. When building a structural model and seeking to impute the

waiting-cost coefficients, the reversed coefficients are more sensitive to provider actions when the

assumed structure has a higher value for the cost exponent b. Put differently, if one compares two

(otherwise identical) providers whose actions N,p differ only slightly, differences in costs will be

difficult to identify with small values of b.

8. Concluding remarks

The literature on staffing and prioritization of service systems has traditionally been focused on

solving given problems. One specifies costs or constraints and seeks to find optimal capacity and

priority-and-routing prescriptions.

33

Arguably, before asking how we might solve a given problem, we should be asking which problem

we should be solving in the first place. In this paper, we try to contribute to the study of this

largely open question.

Practitioners typically solve constraint formulations, and these, it seems, should be grounded in

some beliefs about the cost of delaying customers. In this paper we take a family of formulations—

both cost and constraint—and try to understand (a) how one should implement given costs as

constraints and (b) if there is a unique way to reverse engineer: given constraints that a provider

is using, can we figure out the implicit costs that the provider assigns to customer delay?

We find that, in the presence of multiple customer classes, the answers to both of these questions

are subtle. Fundamentally, the challenge lies in the complex structure of optimal prioritization

solutions to the staffing problem. The questions of duality require going beyond looking at the

surface of the formulation and examining these solutions in detail. Only when one fully maps the

spectrum of optimal solutions for the different problems, can one understand why the reasonable

ASA formulation (as well as strictly concave constraints) can generate solutions that cannot be

associated with (and hence cannot be the implementation of) any reasonable costs—convex or

concave.

While we believe that the essence of formulation duality is captured by the simple power functions

we studied here, one may wish to consider more general families. In fact, expanding the scope of

cost functions that are well understood is not a purely mathematical pursuit. To give a fuller view

of the relationship between costs and constraints one must first understand what the reasonable

cost structures are for customer delay—rooted in the psychology of wait and in empirical evidence.

With these cost functions in hand, one can then ask (i) how does one implement these through

constraints and (ii) how do constraints that are typical in practice reflect such delay costs?

References

Aksin, Z., B. Ata, S.M. Emadi, C-L Su. 2013. Structural estimation of callers’ delay sensitivity in call centers.

Management Science 59(12) 2727–2746.

Allon, G. 2012. Regulating quality of service by law in israel. URL http://operationsroom.wordpress.

com/2012/12/17/regulating-quality-of-service-by-law-in-israel/.

Allon, G., A. Federgruen, M. Pierson. 2011. How much is a reduction of your customers’ wait worth? an

empirical study of the fast-food drive-thru industry based on structural estimation methods. Manu-

facturing & Service Operations Management 13(4) 489–507.

Atar, R. 2005. Scheduling control for queueing systems with many servers: Asymptotic optimality in heavy

traffic. The Annals of Applied Probability 15(4) 2606–2650.

Atar, R., A. Mandelbaum, M.I. Reiman. 2004. Scheduling a multi class queue with many exponential servers:

asymptotic optimality in heavy traffic. The Annals of Applied Probability 14(3) 1084–1134.

34

Bassamboo, A., J.M. Harrison, A. Zeevi. 2006. Design and control of a large call center: Asymptotic analysis

of an LP-based method. Operations Research 54 419–435.

Borst, S., A. Mandelbaum, M. Reiman. 2004. Dimensioning large call centers. Operations Research 52(1)

17–34.

Chan, W., G. Koole, L’Ecuyer. 2014. Call center routing policy using call waiting and agent idle times.

Manufacturing & Service Operations Management Forthcoming.

Cohen, M.A, T.H. Ho, Z.J. Ren, C. Terwiesch. 2003. Measuring imputed cost in the semiconductor equipment

supply chain. Management Science 49(12) 1653–1670.

Gurvich, I., M. Armony, A. Mandelbaum. 2008. Service level differentiation in call centers with fully flexible

servers. Management Science 54(2) 279–294.

Gurvich, I., W. Whitt. 2007. Service-level differentiation in many-server service systems: A solution based

on fixed-queue-ratio routing. Operations Research 29 567–588.

Halfin, S., W. Whitt. 1981. Heavy-traffic limits for queues with many exponential servers. Operations

Research 29(3) 567–588.

Jouini, O., A. Pot, G. Koole, Y. Dallery. 2010. Online scheduling policies for multiclass call centers with

impatient customers. European Journal of Operational Research 207(1) 258 – 268.

Mehrotra, V., K. Ross, G. Ryder, Y.P. Zhou. 2012. Routing to manage resolution and waiting time in call

centers with heterogeneous servers. Manufacturing & Service Operations Management 14(1) 66–81.

Milner, J.M., T.L. Olsen. 2008. Service level agreements in call centers: Perils and prescriptions. Management

Science 54(2) 238–252.

Olivares, M., C. Terwiesch, L. Cassorla. 2008. Structural estimation of the newsvendor model: an application

to reserving operating room time. Management Science 54(1) 41–55.

Pang, G., O. Perry. 2014. A logarithmic safety staffing rule for contact centers with call blending. Management

Science Forthcoming.

Perry, O., W. Whitt. 2009. Responding to unexpected overloads in large-scale service systems. Management

Science 55(8) 1353–1367.

Pot, A., S. Bhulai, G. Koole. 2008. A simple staffing method for multiskill call centers. Manufacturing &

Service Operations Management 10(3) 421–428.

Soh, S.B., I. Gurvich. 2014. Call-center staffing: Service-level-differentiation and Gcµ rules. Working Paper

.

Tezcan, T., J.G. Dai. 2010. Dynamic control of N-systems with many servers: Asymptotic optimality of a

static priority policy in heavy traffic. Operations Research 58(1) 94–110.

Van Mieghem, J. A. 1995. Dynamic scheduling with convex delay costs: The generalized cµ rule. The Annals

of Applied Probability 5 809–833.

35

Wallace, R.B., W. Whitt. 2005. A staffing algorithm for call centers with skill-based routing. Manufacturing

& Service Operations Management 7(4) 276–294.

Date post:	20-Dec-2020
Category:	Documents
Upload:	others
View:	1 times
Download:	0 times

Duality in sta ng problems: Between holding costs and waiting ...2 The traditional cost formulation...

Documents