Algorithm Selection for Preferred Extensions Enumeration

Algorithm Selection for PreferredExtensions Enumeration

Federico Cerutti, Massimiliano Giacomin, Mauro Vallati

COMMA-2014 — Wednesday 10th September, 2014

Background on Dung’s AF

Current Approaches for Preferred Extensions Enum.Features

Empirical ResultsConclusions

Background

Definition

Given an AF Γ= ⟨A,R⟩, with R⊆A×A:

a set S⊆A is conflict–free if @ a,b ∈ S s.t. a→ b;

an argument a ∈A is acceptable with respect to a set S⊆A if ∀b ∈A s.t.b→ a, ∃ c ∈ S s.t. c→ b;

a set S⊆A is admissible if S is conflict–free and every element of S isacceptable with respect to S;

a set S⊆A is a complete extension, i.e. S ∈ ECO(Γ), iff S is admissible and∀a ∈A s.t. a is acceptable w.r.t. S, a ∈ S;

a set S⊆A is the grounded extensions, i.e. S ∈ EGR(Γ), iff S is the minimal(w.r.t. set inclusion) complete set;

a set S⊆A is a preferred extension, i.e. S ∈ EPR(Γ), iff S is a maximal (w.r.t. setinclusion) complete set.

Background on Dung’s AF

Current Approaches forPreferred Extensions Enum.

FeaturesEmpirical Results

Conclusions

AspartixM: [Dvorák et al., 2011]

Expresses argumentation semantics in Answer Set Programming(ASP);Tests for subset-maximality exploiting the metasp optimisationfrontend for the ASP-package gringo/claspD;Database of the form:

{arg(a) | a ∈A} ∪ {defeat(a,b) | ⟨a,b⟩ ∈R}

Example of program for checking the conflict–freeness:

πcf = { in(X)← not out(X), arg(X);out(X)← not in(X), arg(X);← in(X), in(Y ), defeat(X,Y )}.

NAD-Alg: [Nofal et al., 2014]

Several improvements in [Nofal et al., 2014]

PrefSAT: [TAFA2013]

ΠΓ is a boolean formula such that eachsatisfying assignment of the formulacorresponds to a complete extension.

SCC-P: [KR2014]

SCC-P: [KR2014]

SCC-P: [KR2014]

AspartixM // NAD-Alg // PrefSAT [DARe2014]

% Solved Average CPU-TimeDensity 25% 50% 75% 25% 50% 75%

AspartixM 98.3 100.0 100.0 56.5 14.7 10.0NAD-Alg 100.0 100.0 100.0 18.9 0.2 0.2PrefSAT 100.0 100.0 100.0 5.1 1.6 2.2

% Solved Average CPU-TimeDensity RAND RAND

AspartixM 98.9 34.0NAD-Alg 93.9 70.6PrefSAT 100.0 4.2

PrefSAT // SCC-P [KR2014]720 AFs varying |SCCΓ| in 5:5:45. Size of SCCs N(µ= 20 : 5 : 40, σ = 5);

attacks among SCCs N(µ= 20 : 5 : 40, σ = 5)

40

50

60

70

80

90

100

0 10 20 30 40 50

|SCCΓ|

IPC value (normalised) for SCC-P and SAT-P when 5 ≤ |SCCΓ| ≤ 45

SCC-P SAT-P

For |SCCΓ|= 35, Md(SCC-P) = 8.81,Md(SAT-P) = 8.53, z=−0.35, p= 0.73;

Background on Dung’s AFCurrent Approaches for Preferred Extensions Enum.


Conclusions

Features from an Argumentation Graph Γ= ⟨A,R⟩— improvement from [ECAI2014]

Directed Graph (26 features)

Structure:

# vertices ( |A| )# edges ( |R| )# vertices / #edges ( |A|/|R| )# edges / #vertices ( |R|/|A| )densityaverage

Degree: stdevattackers max

min#averagestdevmax

SCCs:

min

Structure:

# self-def# unattackedflow hierarchyEulerianaperiodic

CPU-time: . . .

Undirected Graph (24 features)

Structure:

# edges# vertices / #edges# edges / #verticesdensity

Degree:

averagestdevmaxmin

SCCs:

#averagestdevmaxmin

Structure: Transitivity

3-cycles:

#averagestdevmaxmin

CPU-time: . . .

How Hard is to Get the Features?

Direct Graph Features (DG) Undirect Graph Features (UG)Class CPU-Time # feat Class CPU-Time # feat

Mean stdDev Mean stDevGraph Size 0.001 0.009 5 Graph Size 0.001 0.003 4Degree 0.003 0.009 4 Degree 0.002 0.004 4SCC 0.046 0.036 5 Components 0.011 0.009 5Structure 2.304 2.868 5 Structure 0.799 0.684 1

Triangles 0.787 0.671 5

Average CPU-time, stdev, needed for extracting the features of a given class.


Features

Empirical Results

Conclusions

Protocol: Some Numbers

|SCCΓ| in 1 : 100;|A| in 10 : 5,000;|R| in 25 : 270,000 (Erdös-Rényi, p uniformly distributed) ;Overall 10,000 AFs.

Cutoff time of 900 seconds (value also for crashed, timed-out or ranout of memory).

EPMs both for Regression (Random forests) andClassification (M5-Rules) using WEKA;Evaluation using a 10-fold cross-validation approach on a uniformrandom permutation of instances.

Result 1: Best Features for Prediction

Solver B1 B2 B3AspartixM number of arguments density of directed graph size of max. SCCPrefSAT density of directed graph number of SCCs aperiodicityNAD-Alg density of directed graph CPU-time for density CPU-time for EulerianSCC-P density of directed graph number of SCCs size of the max SCC

Determined by a greedy forward search based on the Correlation-based Feature Selection (CFS)attribute evaluator.

AF structure SCCs CPU-time for feature extraction

Result 2: Predicting (log)Runtime

RSME of Regression (Lower is better)B1 B2 B3 DG UG SCC All

AspartixM 0.66 0.49 0.49 0.48 0.49 0.52 0.48PrefSAT 1.39 0.93 0.93 0.89 0.92 0.94 0.89NAD-Alg 1.48 1.47 1.47 0.77 0.57 1.61 0.55SCC-P 1.36 0.80 0.78 0.75 0.75 0.79 0.74

s

∑ni=1

�

log10( bti )− log10( yi )�2

n

AF structure SCCs CPU-time for feature extraction Undirect Graph

Result 3: Best Features for Classification

C-B1 C-B2 C-B3number of arguments density of directed graph min attackers

Determined by a greedy forward search based on the Correlation-based Feature Selection(CFS) attribute evaluator.

AF structure Attackers

Result 4: Classification, i.e. Selecting the BestSolver for a Given Γ= ⟨A,R⟩

Classification (Higher is better)|A| density min attackers DG UG SCC All

Accuracy 48.5% 70.1% 69.9% 78.9% 79.0% 55.3% 79.5%Prec. AspartixM 35.0% 64.6% 63.7% 74.5% 74.9% 42.2% 76.1%Prec. PrefSAT 53.7% 67.8% 68.1% 79.6% 80.5% 60.4% 80.1%Prec. NAD-Alg 26.5% 69.2% 69.0% 81.7% 85.1% 35.3% 86.0%Prec. SCC-P 54.3% 73.0% 72.7% 76.6% 76.8% 57.8% 77.2%

AF structure Attackers Undirect Graph SCCs

Result 5: Algorithm Selection

Metric: Fastest(max. 1007)

AspartixM 106NAD-Alg 170PrefSAT 278SCC-P 453EPMs Regression 755EPMs Classification 788

Metric: IPC(max. 1007)

NAD-Alg 210.1AspartixM 288.3PrefSAT 546.7SCC-P 662.4EPMs Regression 887.7EPMs Classification 928.1

IPC: scale of (log)relative performance



Conclusions

Conclusions

50 featuresBest algorithm selection accuracy: 80%Algorithm selection performance: 2 times better than the bestsingle-solver; 3 times better than the second-best single-solver;Consistency with previous explorations: density is among the mostinformative features.

Undirect Graph Features:AspartixM, often around [800 . . . 899] seconds — predictable?;NAD-Alg, often time-out — predictable?;PrefSAT, SAT-solver performance?;SCC-P, little difference, significant?.

Future Work

Exploiting additional useful features;Relevant semantics-dependent features (comparison with othersemantics);Combining algorithms in portfolios;

Scheduling;Ordering;Solver combination;

Addressing problems other than enumeration (e.g. non-emptiness. . . ).

Acknowledgements

The authors would like to acknowledge the use of the Universityof Huddersfield Queensgate Grid in carrying out this work.

References I

[Dvorák et al., 2011] Dvorák, W., Gaggl, S. A., Wallner, J., and Woltran, S. (2011).Making Use of Advances in Answer-Set Programming for Abstract Argumentation Systems.In Proceedings of the 19th International Conference on Applications of Declarative Programming and Knowledge Management (INAP2011).

[Nofal et al., 2014] Nofal, S., Atkinson, K., and Dunne, P. E. (2014).Algorithms for decision problems in argument systems under preferred semantics.Artificial Intelligence, 207:23–51.

Date post:	18-Dec-2014
Category:	Education
Upload:	federico-cerutti
View:	50 times
Download:	2 times

Algorithm Selection for Preferred Extensions Enumeration

Education