Efﬁcient Multi-Granularity Service Compositionimprove the computational cost. This paper is...

Efficient Multi-Granularity Service Composition

Lina Barakat, Simon Miles, Iman Poernomo, Michael LuckDepartment of Informatics, King’s College London

London, UK{lina.barakat, simon.miles, iman.poernomo, michael.luck}@kcl.ac.uk

Abstract—Dynamic composition of services provides theability to build complex distributed applications at run time bycombining existing services, thus coping with a large varietyof complex requirements that cannot be met by individual ser-vices alone. However, with the increasing amount of availableservices that differ in granularity (amount of functionality pro-vided) and qualities, selecting the best combination of servicesbecomes very complex. In response, this paper addresses thechallenges of service selection, and makes a twofold contri-bution. First, a rich representation of compositional planningknowledge is provided, allowing the expression of multipledecompositions of tasks at arbitrary levels of granularity.Second, two distinct search space reduction techniques areintroduced, the application of which, prior to performingservice selection, results in significant improvement in selectionperformance in terms of execution time, which is demonstratedvia experimental results.

Keywords-service selection; pruning; composite plans;

I. INTRODUCTION

Service Oriented Architecture (SOA) is becoming thedominant paradigm for enterprise development, permittinga flexible, compositional approach to integrating and imple-menting business processes via the combination of compu-tational services. Web services offer a particularly attractiveframework for implementing SOAs: exposing and integrat-ing the functionality of (possibly disparate) implementationbases via a common protocol (such as WSDL); and suggest-ing horizontal business process implementations in whichservices are loosely coupled and independent from eachother (within, for example, a BPEL based workflow engine).

These two aspects of the framework raise the open ques-tion of next generation service brokerage, in which webservices may be selected automatically to suit the func-tional requirements of a business process, but to meet someform of optimisation requirement with respect to cost andservice level agreements. For example, a financial tradingapplication might require a currency conversion service: ourSOA might select the cheapest currency conversion serviceprovider (out of a directory of hundreds) that meets desirableefficiency demands, with this selection varying over timeaccording to changes in the service directory.

Since the emergence of web service technology, thiskind of adaptive approach to brokerage was considered apossibility (it was one of the motivations behind offeringdirectory standards such as the UDDI). In the case of a

single-service architecture, picking the best service out of100 is a trivial task. However, as we move deeper into thejungles of enterprise development, SOAs are increasing inscale and the problem of optimal service selection suffersfrom combinatorial explosion.

Despite active research, current service composition ap-proaches still have several limitations. One is that theprevailing representation of compositional knowledge asgraph-based workflows [1] limits candidate services forcomposition to those that satisfy the functional granularityrequirements of the workflow’s atomic tasks, neglectingrelevant services with different granularity. For instance, twoatomic tasks might be better achieved with a coarse-grainedservice that combines their functionalities than with twoindividual finer-grained ones (e.g., due to the better qualityof the service or the reduced communication overhead). Toillustrate, consider a scenario in which Lina wants to goon holiday, while minimising the overall price regardless ofother qualities. To achieve this, she contacts a travel softwareagent with the request plan holiday, for which the agentgenerates a plan with the following sub-tasks: book flights,book hotel, book sightseeing, and apply for visa. The agentsearches for the cheapest service to execute each subtaskof the generated plan, which are (for instance) services sa,sb, sc, and sd, respectively. Now, suppose a service se

combines the two functionalities of booking a hotel andbooking sightseeing with a special offer on price. Here,the generated composite service (sa, sb, sc, sd) might not bethe cheapest, and replacing sb and sc with se might resultin a composite service with better quality (cheaper price).In response, this paper proposes a richer representation ofthe compositional knowledge that allows the expression ofnot only alternative composition plans, but also the differenthierarchical levels of the tasks involved.

A second challenge is how to allocate services to thecomposition plan’s tasks so that the resulting combina-tion satisfies the user’s quality constraints while optimisingoverall utility. This global optimal web service selectionproblem increases in complexity with the number of avail-able services that offer similar functionalities, but differ intheir qualities. Although many selection algorithms havebeen proposed to address this problem (e.g. [2]–[6]), theseusually do not scale well when the number of candidateservices becomes very large. To address this, two types of

A: Plan holiday

OR

num ≠ 0(pr, ex) = (40,15)

num ≠ 0(pr, ex) = (25,15)

num ≠ 0(pr, ex) = (30,40)

num ≠ 0(pr, ex) = (15,20)

num ≠ 0(pr, ex) = (25,20)

num = 0

E: book hotel & sightseeing

D: Apply for visa

B: Book flights

C: Book trains F: Book hotel

G: Book sightseeing

num ≠ 0(pr, ex) = (20,30)

Figure 1. Planning knowledge hierarchy for plan holiday task

pruning, namely task-based pruning and plan-based pruning,are presented in this paper, aiming to reduce the number ofpossible combinations of services that need to be consideredfor composition; thus when applied prior to selection, theyimprove the computational cost.

This paper is organised as follows. A formal model ofthe web service selection problem is provided in SectionII. Sections III and IV present our search space reductiontechniques and algorithms, while Section V evaluates theireffectiveness through experimental results. Related work andconclusion are discussed in Sections VI and VII.

II. FORMAL REPRESENTATION

In what follows: upper-case identifiers denote sets; lower-case identifiers denote functions and variables; identifierswith an upper-case first letter denote relations; and (V,E)represents a graph, where V is the set of nodes (vertices),and E is the set of edges.

A. Planning Knowledge Model

The planning knowledge for a particular objective canbe represented as a hierarchy of tasks, with the root beingthe goal task to be achieved. Each task is annotated withsemantic descriptions specifying the functional requirementsof the suitable services, e.g. in terms of OWL-S inputs,outputs, preconditions and effects. The sub-tasks (of eachcomposite task) are partially ordered and can be modelledas a directed acyclic graph (dag), and there may be morethan one way to decompose a task, resulting in alternativedecomposition graphs. A planning knowledge hierarchy forthe goal task plan holiday is shown in Figure 1.

Formally, the planning knowledge hierarchy can be repre-sented as a quintuple, (T, Ta, tr, tf , tg), where: T is a finiteset of the tasks involved; Ta ⊂ T × T is an aggregationrelation between tasks, hierarchically decomposing coarse-grained tasks into finer-grained ones, with (ti, tj) ∈ Taindicating that ti is the aggregating (parent) task whiletj is the aggregated (child) task; tr is the root of thehierarchy, representing the goal task to be achieved, suchthat ∀t ∈ T, (t, tr) /∈ Ta; tf : T → FD is a functionalitydescription function, which assigns to each task t ∈ T asemantic specification of its functional requirements, where

A

C FD

B ED(a) (b)

(e)

C ED(c)

GB FD(d) G

Figure 2. Alternative abstract plans for achieving A of Figure 1

FD is the set of all functionality descriptions; and finallytg : T → 2G maps each non-leaf task t ∈ T to a set of dags,each representing an alternative decomposition of task t, andG is the set of all dags that can be formed from the tasksin T , such that ∀t ∈ T, ∀g = (V,E) ∈ tg(t), g satisfies thefollowing: V ⊂ T , s.t. ∀v ∈ V, (t, v) ∈ Ta; and E ⊂ V ×Vis a strict partial order on the tasks in V , where (vi, vj) ∈ Eindicates that task vi should be executed before task vj . Forexample, the planning knowledge hierarchy in Figure 1 canbe represented as a tuple (T, Ta, tr, tf , tg):

T = {A, B, C, D, E, F, G}Ta = {(A, B), (A, C), (A, D), (A, E), (E, F), (E, G)}tr = A

tg(A) ={

({B, D, E}, {(B, D), (D, E)}),({C, D, E}, {(C, D), (D, E)})

}tg(E) = {({F, G}, {(F, G)})}tg(B) = tg(C) = tg(D) = tg(F) = tg(G) = ∅

Based on this model, alternative abstract plans may beavailable for a particular task, specifying required sub-tasksand their ordering constraints. For ease of definition, weintroduce two functions. expnode : G → 2G maps a dagto a set of graphs, each resulting from replacing a nodein the original dag with one of its decomposition graphs,so that ∀g = (V,E) ∈ G, expnode(g) = {gexp ∈ G |∃vexp ∈ V,∃gch ∈ tg(vexp), gexp = rplnd(g, vexp, gch)},where function rplnd(g, vexp, gch) replaces node vexp in gwith the graph gch (e.g. Figure 2(d) shows Figure 2(b) withnode E expanded). expgrph : G → 2G gives all possibleexpansions of a graph, such that ∀g ∈ G, expgrph(g) ={gexp ∈ G | ∃gnode ∈ expnode(g), gexp = gnode ∨ gexp ∈expgrph(gnode)}. Now, the alternative abstract plans forachieving a given task t ∈ T , according to the planningknowledge hierarchy, can be defined using the functionabspln : T → 2G, assigning to each task its correspondingabstract plans such that ∀t ∈ T, abspln(t) = {g ∈ G | g =({t}, ∅) ∨ g ∈ expgrph(({t}, ∅))}. For example, task A inFigure 1 has five possible abstract plans, shown in Figure 2.

When multiple services offer similar functionality, theycan be distinguished according to their quality of service(QoS) attributes (the non-functional properties of services).Formally, knowledge of these attributes comprises a tuple,(AN, AV , adm, adr), where: AN is the set of all qualityattribute names; AV is the set of all possible quality attributevalues (the union of the domains of all quality attributes);adm : AN → 2AV is a domain function, mapping each

quality attribute to its corresponding domain (possible valuesof this attribute); and adr : AN → {inc, dcr} is a directionfunction, associating each quality attribute with either inc,an increasing direction (quality increases as attribute valueincreases) or dcr, a decreasing direction (quality decreasesas attribute value increases). For simplicity, henceforth weassume that adr(a) = dcr for all quality attributes.

The space of available services is a tuple, (S, sn, sv, sf),where: S is the set of available services; sn : S → 2AN

assigns to each service its quality attribute names; sv : S ×AN → AV ∪{undefined} assigns to each service its qualityattribute values, such that ∀s ∈ S,∀a ∈ sn(s), sv(s, a) ∈adm(a) and ∀a /∈ sn(s), sv(s, a) = undefined; and sf :S → FD maps a web service to a semantic description ofits functionality e.g., OWL-S or WSDL-S.

B. Service Discovery ModelThe candidate services for each task in the hierarchy is a

function cnd : T → 2S , such that ∀t ∈ T, cnd(t) = {s ∈S | mtch(sf(s), tf(t))}, where mtch returns true if thefunctional description of the service semantically matchesthe functional requirements of the task (e.g., mtch can bealigned to a simple/complex semantic search in a UDDIregistry). Based on this, a graph of tasks can be instantiatedto a graph of services by replacing each task t in the graphwith a service s ∈ cnd(t) that provides the functionalityrequired. Formally, the possible instances of a task graphcan be defined as a function ins : G→ 2Gs , mapping a dagto a set of graphs resulting from replacing the task nodesin the original graph with a particular combination of theircandidate services, where Gs is the set of all dags that canbe formed from the services in S. For example, if in Figure1, cnd(F) = {s1, s2} and cnd(G) = {s3, s4}, then the taskgraph g = ({F,G}, {(F,G)}) has four possible instances:

ins(g) = {({s1, s3}, {(s1, s3)}), ({s1, s4}, {(s1, s4)}),({s2, s3}, {(s2, s3)}), ({s2, s4}, {(s2, s4)})}

Now, the instantiation of each task’s abstract plans canbe defined as a function actpln : T → 2Gs , which maps atask to a set of actual plans (instances of the task’s abstractplans), each representing an alternative composite servicefor achieving the task, such that ∀t ∈ T, actpln(t) = {ps =(Vs, Es) ∈ Gs | ∃p = (V,E) ∈ abspln(t), ps ∈ ins(p)}

The value of a particular quality attribute for a compositeservice is an aggregation of the corresponding quality valuesfor the component services, where the type of the aggrega-tion function (e.g., summation, min/max) depends on theattribute. Formally, the values of the quality attributes for acomposite service can be defined as a function:

cv : Gs ×AN → AV ∪ {undefined}such that ∀gs = (Vs, Es) ∈ Gs,∀a ∈ AN,

[cv(gs, a) = undefined]∨[cv(gs, a) = aggrs∈Vs

(sv(s, a)) ∈ adm(a)]

cv(gs, ex) =∑

s∈nodes(mxpth(gs))

(sv(s, ex))

cv(gs, rel) =∏

s∈Vs

(sv(s, rel))

cv(gs, thr) = mins∈Vs

(sv(s, thr))

Figure 3. Examples of aggregation functions

where aggr is some aggregation function that depends on theattribute considered. For example, the aggregation functionsfor the quality attributes execution time (ex), reliability(rel), and throughput (thr) are defined in Figure 3. Here,function nodes maps a graph to its corresponding nodes;and function mxpth(gs) returns the path with the longestexecution time in the graph gs (i.e. a path pth from astart node to a destination node in gs with the maximum∑s∈nodes(pth)

(sv(s, ex))).

In addition to specifying the goal task to be achieved, auser might impose constraints on the values of the quality at-tributes and weight different quality attributes to reflect theirrelative importance to the user. Formally, a user’s request canbe defined as a triple, (rt, rc, rw), where: rt ∈ T is the goaltask to be accomplished; rc : AN → AV ∪ {undefined}represents the QoS constraints specified by the user forthe task rt, and maps a quality attribute to an upper user-defined bound for its value, such that ∀a ∈ AN, rc(a) ∈adm(a) ∪ {undefined}, with rc(a) = undefined indicatingthere are no restrictions on the value of attribute a, andrc(a) 6= undefined indicating that rc(a) is the maximumallowed value for attribute a; and finally, rw : AN → [0, 1]assigns to each quality attribute a user-defined weightingfactor to represent the user’s preferences regarding differentquality attributes, s.t.

∑a∈AN rw(a) = 1.

Example: Suppose there is a request to achieve taskA such that the user is equally interested in maximisingreliability and minimising execution time, and the highestprice (pr) the user is willing to pay is 50. This request canbe represented as a triple (rt, rc, rw): rt = Arc(pr) = 50; rc(a) = undefined,∀a 6= prrw(a) = 0.5,∀a ∈ {rel, ex}; rw(a) = 0, otherwise

Based on the user-defined attribute weights, the qualityvalues of a service (atomic or composite) can be aggre-gated into a single value, representing the overall utilityof the service for this user. Specifically, the utility of anatomic service s ∈ cnd(t) is a function su(t, s) ∈ [0, 1]s.t. su(t, s) =

∑a∈AN (rw(a) ∗ tmx(t,a)−sv(s,a)

tmx(t,a)−tmn(t,a) ), wheretmx(t, a)/tmn(t, a) returns the max/min value offered forthe quality attribute a by task t’s candidate services (i.e.max

s∈cnd(t)(sv(s, a))/ min

s∈cnd(t)(sv(s, a))). Similarly, the utility of

a composite service ps ∈ actpln(rt) is a function cu(ps) ∈

[0, 1] s.t. cu(ps) =∑

a∈AN (rw(a)∗ rmx(a)−cv(ps,a)rmx(a)−rmn(a) ), where

rmx(a)/rmn(a) returns the max/min value offered for thequality attribute a by the requested task’s actual plans:

rmx(a) = maxp∈abspln(rt)

(aggrt∈nodes(p)(tmx(t, a)))

rmn(a) = minp∈abspln(rt)

(aggrt∈nodes(p)(tmn(t, a)))

Here, aggr is as in the composite service quality model (thisfunction is increasing in the interval [0,∞)).

Service selection involves finding the best combination ofservices to achieve the requested task, that both satisfies theuser’s constraints and maximises the overall utility. Formally,the satisfactory composite services that meet the user’sconstraints can be defined as SCS = {ps ∈ actpln(rt) |∀a ∈ AN, [(rc(a) 6= undefined)⇒ (cv(ps, a) ≤ rc(a))]}

Now, the solution composite service psol for the user’srequest is that satisfying the following: psol ∈ SCS; andcu(psol) = max

ps∈SCS(cu(ps)).

III. SEARCH SPACE REDUCTION

Since the number of possible actual plans (service com-binations) for achieving the requested task can be large,the time to solve the service selection problem can beexponential. If web service based architectures becomethe technology underlying large enterprise systems, serviceselection risks combinatorial explosion and the efficiencyof selection solution must be addressed. The complexity isaffected by the number of tasks per abstract plan n, thenumber of candidate services per task m, and the numberof alternative abstract plans k. For instance, given n = 5,m = 1000, and k = 4, the number of possible actual plansis k×mn = 4× 10005. Consequently, reducing this searchspace (the possible combinations of services) is essentialto reduce the computational cost of selection. To achievethis, we present task-based pruning and plan-based pruning,which focus on reducing the number of candidate services mand the number of alternative abstract plans k, respectively.

A. Task-based Pruning

Task-based pruning aims to improve the efficiency of ser-vice selection by reducing the number of candidate servicesthat need to be considered for each task, and as a resultreducing the number of possible combinations of services.For instance, in the case where there are 10 sub-tasks, eachwith 100 candidate services, removing one candidate servicefrom any sub-task will eliminate 1009 possible combinationsfrom the problem search space. Two types of task-basedpruning, namely constraint-based pruning and domination-based pruning are described below.

1) Constraint-based Pruning: Constraint-based pruningaims to eliminate all candidate services that do not meetthe specified user constraints, since all possible combina-tions of services that include them will also violate theconstraints, and thus are not worth considering. Formally,

∀t ∈ T , service s ∈ cnd(t) should not be considered forcomposition according to constraint-based pruning iff ∃a ∈AR, sv(s, a) > rc(a), where AR = {a ∈ AN | rc(a) 6=undefined} is the set of constrained quality attributes.

2) Domination-based Pruning: Domination-based prun-ing reduces the search space by filtering out uninterestingservices from the set of candidates services for each task.Such uninteresting services are dominated by another can-didate service for the same task, with a service s1 beingdominated by another service s2 iff s1 is worse than s2 forall quality attributes. For example, a flight booking servicewith price:20 and execution time:10 dominates a service withprice:30 and execution time:20. Formally, ∀t ∈ T, ∀s1, s2 ∈cnd(t), s1 is dominated by s2 ⇔

[∀a ∈ AN, sv(s1, a) ≥ sv(s2, a)]∧[∃a ∈ AN, sv(s1, a) > sv(s2, a)]

Dominated services are not potential candidates for theoptimal solution of the service selection problem because,given two composite services:

gs = (Vs = {s1, s2, ..., si, ..., sn}, Es)g′s = (V ′s = {s1, s2, ..., s

′i, ..., sn}, E′s)

where service si is dominated by s′i and, since the aggre-gation functions in the composite service quality model areincreasing in the interval [0,∞), the following is satisfied:∀a ∈ AN, cv(g′s, a) ≤ cv(gs, a). In other words, the qualityvalues of the composite service g′s are either equal to orbetter than the quality values of the composite service gs, forall quality attributes. Thus, given that gs satisfies the qualityconstraints, g′s will also meet the same constraints, but withbetter utility. Hence, keeping only non-dominated servicesas candidates for each task will still guarantee finding theoptimal solution for the composition problem (when such asolution exists) while reducing the computational cost.

Since the best (non-dominated) services for each task areindependent of any particular user request, they need notbe determined at request time. Instead, these services canbe pre-computed for each task in the planning knowledgehierarchy, and continuously modified in response to changessuch as the addition of a new service, the deletion of anexisting service, or changes in the quality values of a service.

Although this request-independent pruning (ri-pruning)keeps, for each task, only the best candidate services re-garding all quality attributes, some might still be dominatedby others when considering a particular request, and thusare not candidates for this request’s optimal solution. Forinstance, suppose the candidate services for a particular taskafter such domination-based pruning are as follows, whereeach service is annotated with attributes: price, executiontime, availability, and reliability.

price execution time availability reliabilitys1 10 20 40 10s2 30 40 10 40s3 5 25 30 20s4 6 27 20 15

Now, suppose the user is interested in a composite servicethat meets constraint cex on execution time and has the min-imum price, regardless of availability and reliability. Here,service s4 is dominated by service s3 since s3 has lowerprice and shorter execution time. Thus, for each compositeservice satisfying cex and containing s4, another satisfactorycomposite service with better utility (lower price) can befound by replacing s4 with s3. Similarly, s1 dominates s2.Hence only s1 and s3 should be considered for composition.

Based on this example, further reduction in the searchspace can be achieved by removing, from the set of request-independent non-dominated candidate services for each task,those candidates that are dominated by another candidateaccording to the current request. A service s1 is dominatedby another service s2 with respect to a certain request iff s1

is worse than s2 for all the constrained quality attributes andthe utility value. This additional request-dependent domina-tion pruning (rd-pruning) results in considerable reduction inthe number of candidate services without affecting the abilityto find the optimal solution. Formally, ∀t ∈ T, ∀s1, s2 ∈cnd(t), s1 is request-based dominated by s2 ⇔

(∀a ∈ AR, sv(s1, a) ≥ sv(s2, a))∧(su(t, s1) ≤ su(t, s2))∧([∃a ∈ AR, sv(s1, a) > sv(s2, a)] ∨ [su(t, s1) < su(t, s2)])

B. Plan-based Pruning

As the composite service provider might have a numberof possible alternative plans in its planning knowledgehierarchy to achieve the requested functionality, the goal ofplan-based pruning is to reduce this planning search spaceby eliminating those plans that will not lead to a satisfactorysolution for the current request. More specifically, it tries toidentify the abstract plans, all of whose available instances(all possible combinations of services) are guaranteed toviolate the quality constraints. In order to determine suchunsatisfactory plans without first performing the costly pro-cess of service selection, we annotate each task t ∈ Tin the planning knowledge hierarchy with two items: thenumber of its candidate services; and its ideal quality values,which can be represented as a function tiv(t, a) that assignsto each quality attribute a ∈ AN the best value offeredfor this attribute among all task t’s candidate services (i.e.tiv(t, a) = tmn(t, a)).

To illustrate the benefit of such information, consider theplanning knowledge hierarchy of Figure 1, where the idealquality values of each task correspond to price and executiontime. Here, there are five alternative plans for performingtask A: plan1: A; plan2: B-D-E; plan3: C-D-E; plan4: B-D-F-G; and plan5: C-D-F-G. Now, suppose the user wantsto achieve task A with the constraint that the cost should beless than 80. Since plan1 has no available candidate services,it can be eliminated immediately from the planning searchspace. Based on the ideal quality values of tasks B, C, D, E,

F and G, the best possible prices offered by plan2, plan3,plan4, and plan5 are 90, 75, 100, and 85, respectively. As theonly plan to satisfy the price constraint, only plan3 shouldbe considered for composition with the other plans filteredout from the current request’s planning search space.

Formally, the set of plans to be considered to performa requested task according to plan-based pruning can bedefined as PLN = {p = (V,E) ∈ abspln(rt) | [∀t ∈V, cnd(t) 6= ∅] ∧ [∀a ∈ AR, aggrt∈V (tiv(t, a)) ≤ rc(a)]},where aggr is as in the composite service quality model.

IV. SERVICE SELECTION

Several methods have been proposed to tackle the globaloptimal service selection problem. Here, we model thisas a multi-constrained optimal path selection problem ina directed graph (sometimes called multi-constrained QoSrouting), which involves finding a path, from the start nodeto the destination node in a network, satisfying a set of path-level QoS constraints while being optimal with respect tosome cost function. Modelling service selection in this wayallows us to cope with all alternative abstract plans at once,without applying selection to each plan separately.

Several algorithms have been proposed as an attempt tosolve the QoS routing problem. Here we adopt the multi-constrained Bellman-Ford algorithm [7], which we firstintroduce, and then show how it can be updated to solveour service selection problem.

A. Multi-constrained Bellman-Ford Algorithm

Given a set of path-level QoS constraints, the multi-constrained Bellman-Ford algorithm stores in each node vof the search graph the set of optimal paths discovered sofar from the start node to v, with a path being consideredoptimal regarding the constrained quality attributes if noother path in the graph (from the same source to the samedestination) is better for all of these attributes. The optimalpaths from the start node to all other nodes are generatedby traversing the graph edges, and for each edge (u, v)comparing the optimal paths already recorded in v withthose obtained by concatenating the optimal paths in u withthe edge (u, v). As a result of processing all graph edges|V |−1 times (|V | is the number of nodes in the graph), thedestination node contains all optimal paths from the startnode to the destination, of which, those that satisfy the QoSrequirements are considered solution paths.

B. Selection Algorithm

The first step in our service selection algorithm is togenerate the plan paths graph, where each path correspondsto an alternative abstract plan for achieving the requestedtask (all abstract plans are assumed to have sequentialstructure). For example, the plan paths graph for the planningknowledge of Figure 1 is provided in Figure 4 (in which thestart and end nodes are additional nodes, which are ignored

C

B ED

A

F G

start dest

Figure 4. Plan paths graph

when instantiating the graph paths). Based on the idea of theBellman-Ford algorithm, each node v in the plan paths graphstores the optimal instances (optimal composite services)among all possible instances of the paths discovered so farfrom the start node to v. For example, node D in Figure 4records the optimal instances of the paths BD and CD. Inorder to maximise utility, the concept of optimal paths inthe original Bellman-Ford algorithm is updated so that aninstance at node v is considered optimal if no other possibleinstance of a path between the source node and v has bothbetter values for all the constrained attributes and betterutility. Moreover, to reduce the number of optimal instances,only those satisfying the quality constraints are maintainedin each node. After traversing all graph nodes in topologicalorder, the solution is the optimal composite service that hasthe best utility at the destination node.

In order to meet plan-based pruning needs, we maintainin each node v of the plan paths graph, the set of its validpredecessors, which is the set of paths from the start nodeto v that, when combined with at least one path from v tothe end node, will lead to a satisfactory abstract plan withrespect to the plan-based pruning. In other words, a pathpstart→v is a valid predecessor of v iff there exists at leastone path pv→dest such that pstart→v+ pv→dest ∈ PLN .Based on these valid predecessors, processing the edgescan be updated so that, when traversing a particular edge(u, v), only the optimal composite services stored in u thatare instances of v’s valid predecessors are considered. Forexample, given that PLN = {BDE,CDE, CDFG}, thevalid predecessors of the nodes in Figure 4 are provided inTable I. Hence, when processing edge (D,F ), all the optimalcomposite services stored at node D that are instances of thepath BD should be ignored (BD is not a valid predecessorof node F ), and only those that are instances of the path CDshould be considered for joining with node F ’s candidateservices. This service selection algorithm is provided inAlgorithm 1. Function pcnd(u) at line 3 of process-edgeprocedure returns the candidate services of node u thatsurvive task-based pruning (pcnd(u)=cnd(u) when no taskpruning is performed).

V. EXPERIMENTS AND RESULTS

This section evaluates the effectiveness of the pruningtechniques presented, focusing mainly on assessing the gainin performance, in terms of computation time. For thispurpose, the candidate services of each task are generated

Algorithm 1 WS-Selection-Algorithm1: generate plan paths graph gPK = (VPK , EPK)2: assign to each node v ∈ VPK/{vstart} its valid predecessors

validpred(v)3: sort the nodes in VPK topologically4: store the empty instance at vstart and empty service at vdest

5: for each v ∈ VPK/{vdest} taken in topological order do6: if v = vstart or validpred(v) 6= ∅ then7: for each (v, u) ∈ EPK do8: process-edge(v,u)9: inssolution is the instance at vdest with the highest cu

Procedure 2 process-edge(v,u)1: for each pu ∈ validpred(u) do2: if v is the end node of pu then3: for each s ∈ pcnd(u) do4: for each optimal instance insv ∈ ins(pu) at node v

do5: if ∀a ∈ AR, cv(insv + s, a) ≤ rc(a) then6: check-instance-optimality(insv + s,u)

Procedure 3 check-instance-optimality(ins,u)1: flag ← 12: for each optimal instance insu at node u do3: if ∀a ∈ AR, cv(insu, a) is-better-than cv(ins, a) and

cu(insu) is-better-than cu(ins) then4: flag ← 05: break6: else if ∀a ∈ AR, cv(ins, a) is-better-than cv(insu, a) and

cu(ins) is-better-than cu(insu) then7: remove insu from the instances at u8: if flag=1 then9: add ins to the instances at u

randomly (each candidate service is assumed to have 5quality attributes). Request quality constraints and qualityweights are also set to random values. For simplicity, eachparent task in the planning knowledge hierarchy is assumedto have one decomposition graph, and all quality attributesare assumed to be additional (aggr is the sum function).

To evaluate the performance of domination-based pruningbefore applying service selection, the computation time ofthe algorithm (in relation to both the number of candidateservices per task, and the number of the constrained qualityattributes) was compared in three cases: no task pruning(the basic algorithm), only ri-pruning, and both ri-pruningand rd-pruning. Here, the number of candidate services pertask was varied between 100 and 1000, while the number oflevels in the hierarchy and the number of child tasks for each

Table IVALID PREDECESSORS FOR NODES IN FIGURE 4 (S = START NODE)

Node A B C D E F G destValid ∅ S S SB SBD SCD SCDF SBDEPred. SC SCD SCDE

SCDFG

0

10

20

30

40

50

60

70

80

90

100

0 100 200 300 400 500 600 700 800 900 1000

com

pu

tati

on

tim

e (

ms)

number of candidate services per task

(a) Domination Pruning Results

no task pruning

request-independent domination

request-dependent domination

0

10

20

30

40

50

60

0 100 200 300 400 500 600 700 800 900 1000

com

pu

tati

on

tim

e (

ms)


(b) Constraint Pruning Results

without constraint pruning

with constraint pruning

0

5

10

15

20

25

30

35

0 100 200 300 400 500 600 700 800 900 1000

com

pu

tati

on

tim

e (

ms)


(c) Plan Pruning Results

without plan pruning

with plan pruning

0

20

40

60

80

100

120

140

160

180

1 2 3 4 5

com

pu

tati

on

tim

e (

ms)

number of quality constraints

(d) Domination Pruning Results

request-independent domination

request-dependent domination

0

20

40

60

80

100

120

1 2 3 4 5

com

pu

tati

on

tim

e (

ms)

number of quality constraints

(e) Constraint Pruning Results

without constraint pruning

with constraint pruning

0

20

40

60

80

100

120

1 2 3 4 5 6

com

pu

tati

on

tim

e (

ms)

number of child tasks per parent task

(f) Plan Pruning Results

without plan pruning

with plan pruning

Figure 5. Evaluating the search space reduction techniques

parent were fixed at 3 and 4, respectively (i.e. the number oftasks in each abstract plan ≤ 16). The execution time of eachalgorithm was averaged over 20 different graph instances,and 20 different randomly-generated requests. The results,provided in Figure 5(a), indicate that both domination typessignificantly outperform the basic algorithm whose execu-tion time increases dramatically with the increasing numberof candidate services. Moreover, we observe that rd-pruningachieves better performance than ri-pruning, especially asthe number of candidate services grows.

To further compare the performance of the two domi-nation approaches with respect to the number of qualityconstraints, the number of candidate services per task wasfixed at 1000, while the number of the constrained attributeswas varied between 1 and 5 (5 is the total number of qualityattributes). As before, the reported computation times areaveraged (over 20 graph instances and 10 requests). Asshown in Figure 5(d), both algorithms require more timewhen the problem becomes more constrained because, asthe number of constraints increases, so does the number ofparameters with respect to which the optimal instances arecomputed, resulting in more optimal instances being storedat each graph node.

Although rd-pruning is generally more efficient than ri-pruning, the latter is more efficient than the former whenthe number of quality constraints is maximal (5 in ourcase). This is expected because, when the number of qualityconstraints is equal to the total number of quality attributes,no service will be rd-dominated by another service afterapplying ri-pruning to each task. Hence, rd-pruning in thiscase only results in adding unnecessary overhead to overallperformance (rd-pruning is performed at request time), with-

out achieving further reduction in the service search space.To study the improvement in execution time as a result of

constraint-based pruning, the time of the selection algorithmis compared in two cases: domination-based pruning alone;and both domination-based pruning and constraint-basedpruning. Again, the effects of the numbers of candidates andconstraints are considered, with the experimental settingsbeing the same as previously.

The results in Figure 5(b) show that constraint-based prun-ing in combination with domination-based pruning performsbetter than domination-based pruning alone. Clearly, the gainin performance increases with the number of quality con-straints, as shown in Figure 5(e) because, in over-constrainedproblems, more candidate services are likely to violate thequality constraints, and thus are pruned prior to applying theselection algorithm, resulting in computation time reduction.

Finally, to assess the effectiveness of plan-based prun-ing, task-based pruning (domination-based pruning andconstraint-based pruning) alone was compared with thecombination of task-based pruning and plan-based pruning.Figures 5(c) and 5(f) show the running time of the algo-rithms, varying the number of candidate services per taskand the number of child tasks per parent task, respectively.As expected, the inclusion of plan-based pruning resultsin an improvement in performance: the computation timereduction increases as the problem becomes more complex(due to an increased number of candidate services, or anincreased number of alternative abstract plans).

VI. RELATED WORK

In recent years, several methods have been proposed totackle the global optimal service selection problem, which

involves assigning actual services to a workflow’s abstracttasks such that the resulting composite service meets theimposed global-level QoS constraints while maximising agiven utility function. One possible way to solve this prob-lem is by adopting the Integer Programming (IP) techniquesto find the best assignments of services to the abstract tasks,as presented by Zeng et al. [2], and Ardagna et al. [6].Another approach to the same problem, based on geneticalgorithms, is proposed by Canfora et al. [3]. Yu et al. [5]introduce two alternative models for the service selectionproblem, as a multi-dimension multi-choice knapsack prob-lem and as a multi-constrained optimal path problem, andheuristic algorithms are proposed for both models to achievebetter performance. Another QoS-routing based solution ispresented by Li et al. [4], where the H MCOP heuristicalgorithm is adopted to find the optimal aggregation ofservices while addressing the correlation issues betweenthem. Our search space reduction techniques can be seenas a preprocessing step to these algorithms to improvetheir performance. This is done by ensuring that servicecombinations that are guaranteed not to be optimal or toviolate the imposed constraints are not considered in theselection process.

Attempting to reduce the number of candidate servicesfor a task is not new, and has been investigated by otherresearches. For instance, Qi et al. [8] suggest a heuristic localoptimisation method that reduces the number of candidatesper task by assigning to each task different sets of task-level quality constraints, and then keeping for each constraintset the satisfactory service with the best utility. Unlike thisheuristic pruning, our task-based pruning does not affectthe ability to find an optimal solution. Similarly to ourapproach, Alrifai et al. [9] use the notion of dominance toselect representative web services for the workflow’s tasks.However, as opposed to that work, we present a request-based domination pruning, where the dominance betweenservices is determined with respect to user’s request, thusresulting in more uninteresting services being pruned foreach task. Domination-based ranking of services is alsoadopted by Skoutas et al. [10] to select the best services fora particular request. That work, however, only ranks servicesbased on their functional suitability for the request withoutaddressing the QoS issues nor the composition problemconsidered in this paper.

Finally, the concept of task decomposition presented inour planning knowledge model is very similar to taskdecomposition in HTN planning [11], and to compositeprocess decomposition in OWL-S process ontology [12].The novelty of our formalism, however, is that it mapstasks at arbitrary levels of granularity to actual services, thusconsidering all possibilities to achieve the requested goal,unlike the existing approaches where only atomic tasks aremapped to actual services.

VII. CONCLUSION

In this paper, we have presented an approach to dynamicservice composition, where planning knowledge consists ofhierarchies of tasks, each with multiple alternative decom-positions. In addition, task-based pruning and plan-basedpruning techniques have been introduced to reduce the costof instantiating plan tasks with actual services in a way thatsatisfies the QoS requirements. To evaluate the effectivenessof these techniques, a service selection algorithm was pre-sented, and the results show that a significant gain in perfor-mance is achieved through applying our pruning techniquesas preprocessing before the selection algorithm. Future workwill involve testing the pruning techniques using real datasets, resolving trust issues in service selection, and inves-tigating other search space reduction techniques, including:context-based pruning and coordination-based pruning.

REFERENCES

[1] E. Sirin, B. Parsia, and J. Hendler, “Template-based composi-tion of semantic web services,” in AAAI Fall Symp. on Agentsand the Semantic Web, 2005, pp. 85–92.

[2] L. Zeng, B. Benatallah, A. H. H. Ngu, M. Dumas,J. Kalagnanam, and H. Chang, “QoS-aware middleware forweb services composition,” IEEE Trans. Softw. Eng., vol. 30,no. 5, pp. 311–327, May 2004.

[3] G. Canfora, M. D. Penta, R. Esposito, and M. L. Villani,“An approach for QoS-aware service composition based ongenetic algorithms,” in Proc. 2005 Genetic and EvolutionaryComputation Conference, 2005, pp. 1069–1075.

[4] L. Li, J. Wei, and T. Huang, “High performance approach formulti-QoS constrained web services selection,” in Proc. 5thInt. Conf. on Service-Oriented Computing, 2007, pp. 283–294.

[5] T. Yu, Y. Zhang, and K. Lin, “Efficient algorithms for webservices selection with end-to-end QoS constraints,” ACMTrans. Web, vol. 1, no. 1, May 2007.

[6] D. Ardagna and B. Pernici, “Adaptive service compositionin flexible processes,” IEEE Trans. Softw. Eng., vol. 33, pp.369–384, Jun. 2007.

[7] X. Yuan and X. Liu, “Heuristic algorithms for multi-constrained quality of service routing,” IEEE/ACM Trans.Netw., vol. 10, pp. 244–256, Apr. 2002.

[8] L. Qi, Y. Tang, W. Dou, and J. Chen, “Combining localoptimization and enumeration for QoS-aware web servicecomposition,” in Proc. 2010 IEEE International Conferenceon Web Services, 2010, pp. 34–41.

[9] M. Alrifai, D. Skoutas, and T. Risse, “Selecting skylineservices for QoS-based web service composition,” in Proc.19th Int. Conf. on world wide web, 2010, pp. 11–20.

[10] D. Skoutas, D. Sacharidis, A. Simitsis, and T. Sellis, “Rankingand clustering web services using multicriteria dominancerelationships,” IEEE Trans. Services Comput., vol. 3, pp. 163–177, Jul. 2010.

[11] E. Sirin, B. Parsia, D. Wu, J. Hendler, and D. Nau, “HTNplanning for web service composition using SHOP2,” WebSemant., vol. 1, pp. 377–396, Oct. 2004.

[12] D. Martin, M. Burstein, J. Hobbs, O. Lassila, D. McDermott,S. McIlraith, S. Narayanan, M. Paolucci, B. Parsia, T. Payne,E. Sirin, N. Srinivasan, and K. Sycara, “OWL-S: Semanticmarkup for web services,” W3C, W3C Member Submission,2004.

Date post:	21-Apr-2020
Category:	Documents
Upload:	others
View:	1 times
Download:	0 times

Efﬁcient Multi-Granularity Service Compositionimprove the computational cost. This paper is...

Documents