Introduction to PCP

11

22

IntroductionIntroduction

In this lecture we’ll cover:Definition of PCP Prove some classical hardness of approximation results

Review some recent ones

33

Review: Decision, Review: Decision, Optimization ProblemsOptimization Problems

A A decisiondecision problem problem is is

a a Boolean Boolean function function ƒ(X)ƒ(X), or alternatively, or alternatively

a a languagelanguage L L {0, 1} {0, 1}** comprising all strings for comprising all strings for which which ƒƒ is is TRUETRUE:: L = { X L = { X {0, 1} {0, 1}* * | ƒ(X) }| ƒ(X) }

An An optimizationoptimization problem problem is is

a function a function ƒ(X, Y)ƒ(X, Y) which, given which, given XX, is to be maximized , is to be maximized (or minimized) over all possible (or minimized) over all possible YY’s: ’s: maxmaxyy[ ƒ(X, Y) ][ ƒ(X, Y) ]

A A thresholdthreshold version of version of max-ƒ(X, Y)max-ƒ(X, Y) is is

the the languagelanguage LLtt of all strings of all strings XX for which there exists for which there exists YY such that such that ƒ(X, Y) ƒ(X, Y) t t

[transforming an optimization problem into [transforming an optimization problem into decision]decision]

44

Review: The Class NPReview: The Class NP

The classical definition of the class The classical definition of the class NPNP is: is:

A language A language L L {0, 1} {0, 1}* * belongsbelongs to the class to the class NPNP, if there exists a Turing machine , if there exists a Turing machine VVLL [referred to as a [referred to as a verifierverifier] ] such thatsuch that

X X L L there exists a there exists a witnesswitness YY such that such that VVLL(X, Y)(X, Y) acceptsaccepts, in time , in time |X||X|O(1)O(1)

That is, That is, VVLL can verify a membership-proof of can verify a membership-proof of XX in in L L in time polynomial in the length ofin time polynomial in the length of X X

55

Review: NP-HardnessReview: NP-Hardness

A language A language LL is said to be is said to be NP-hardNP-hard if an if an efficient (polynomial-time) procedure for efficient (polynomial-time) procedure for LL can be utilized to obtain an efficient can be utilized to obtain an efficient procedure for any procedure for any NP-languageNP-language

This definition allows efficient reduction This definition allows efficient reduction that use the more general, that use the more general, CookCook reduction. An efficient algorithm, reduction. An efficient algorithm, translating any translating any NPNP problem to a single problem to a single instance of instance of LL - thereby showing that - thereby showing that LL NP-hardNP-hard - is referred to as - is referred to as KarpKarp reduction reduction

66

Review: Characterizing NPReview: Characterizing NP

ThmThm [Cook,Levin][Cook,Levin]:: For For LL NP NP there’s an algorithm there’s an algorithm that, on input that, on input XX, constructs, in time , constructs, in time |X||X|O(1)O(1), a set , a set of of local-constraintslocal-constraints (Boolean functions) (Boolean functions)

L,X L,X = { = { 11ll } }

over variables over variables yy11,...,y,...,ymm s.t.: s.t.:

1.1. each of each of 11ll depends on depends on o(1)o(1) variables variables

2.2. X X L L there exists an assignment there exists an assignment A: { yA: { y11, ..., y, ..., ym m } } { 0, 1 } { 0, 1 } satisfying all satisfying all L,XL,X

[ note that [ note that mm and and ll must be at most polynomial in must be at most polynomial in |X||X| ] ]

77

NP characterizationNP characterization

y1y1

y2y2

yiyi

ym-1ym-1

ymym

1111

jjjj

llll

TT

TT

TT

TT

TT

FF

TT

FF

TT

TT

FF

TT

FF

TT

If X X L, L, all of

the local tests aresatisfied

88

Approximation - Some Approximation - Some DefinitionsDefinitions

DefDef: : gg-approximation-approximation

A A g-approximationg-approximation of a maximization (similar of a maximization (similar for minimization) function for minimization) function ff, is an algorithm , is an algorithm that on input that on input XX, outputs , outputs f’(X)f’(X) such that: such that:

f’(X) f’(X) f(X)/g(|X|). f(X)/g(|X|).

DefDef: : PTASPTAS (poly-time approximation scheme) (poly-time approximation scheme)

We say that a maximization function We say that a maximization function ff, has a , has a PTASPTAS, if for every , if for every gg, there is a polynomial , there is a polynomial ppgg and a and a g-approximationg-approximation for for ff, whose running , whose running time is time is ppgg(|X|)(|X|)

99

Approximation - NP-hardApproximation - NP-hard?? We know that by using Cook/Karp reductions, we We know that by using Cook/Karp reductions, we

can show many decision problems to be can show many decision problems to be NP-hardNP-hard..

Can an approximation problem be Can an approximation problem be NP-HardNP-Hard??

One can easily show, that if there is One can easily show, that if there is gg,for which ,for which there is a there is a g-approximating g-approximating for for TSPTSP, , P=NPP=NP..

1010

Characterization of NPCharacterization of NP

ThmThm [Cook,Levin][Cook,Levin]:: For For LL NP NP there’s an there’s an algorithm that, on input algorithm that, on input XX, constructs, in time , constructs, in time |X||X|O(1)O(1), a set of , a set of local-constraintslocal-constraints (Boolean (Boolean functions)functions)

L,X L,X = { = { 11ll } }

over variables over variables yy11,...,y,...,ymm s.t.: s.t.:

1.1. each of each of 11ll depends on depends on o(1)o(1) variables variables

2.2. X X L L there exists an assignment there exists an assignment A: { yA: { y11, ..., y, ..., ym m } } { 0, 1 } { 0, 1 } satisfying all satisfying all L,XL,X

AS,ALMSS

PCP

X X L L assignment assignment A: { y1, ..., ym } A: { y1, ..., ym } { 0, 1 } { 0, 1 } satisfies satisfies < ½ < ½ fraction of fraction of L,XL,X

1111

PCP NP characterizationPCP NP characterization

y1y1

y2y2

yiyi

ym-1ym-1

ymym

1111

jjjj

llll

TT

FF

TT

FF

TT

If X X L, L, at least

half of the local tests aren’t

satisfied !

1212

Probabilistically Checkable Probabilistically Checkable ProofsProofs

Hence, Hence, Cook-LevinCook-Levin theorem states that a verifier theorem states that a verifier can efficiently verify membership-proofs for any can efficiently verify membership-proofs for any NPNP language language

PCPPCP characterization of characterization of NPNP, in contrast, states , in contrast, states that a membership-proof can be verified that a membership-proof can be verified probabilisticallyprobabilistically by choosing randomly one by choosing randomly one local-constraintlocal-constraint,, accessing the accessing the small setsmall set of variables it depends of variables it depends

on,on, accept or reject accordinglyaccept or reject accordingly

erroneouslyerroneously accepting a non-member only accepting a non-member only with with small probabilitysmall probability

1313

Gap ProblemsGap Problems

A A gap-problemgap-problem is a maximization is a maximization (or (or minimization) minimization) problem problem ƒ(X, Y)ƒ(X, Y), and two , and two thresholds thresholds tt11 > t > t22

XX must be accepted if must be accepted if maxmaxYY[ ƒ(X, Y) ] [ ƒ(X, Y) ] t t11

XX must be rejected if must be rejected if maxmaxYY[ ƒ(X, Y) ] [ ƒ(X, Y) ] t t22

other other XX’s may be accepted or rejected ’s may be accepted or rejected (don’t care)(don’t care)

(almost a decision problem, relates to (almost a decision problem, relates to approximation)approximation)

1414

Reducing gap-Problems to Reducing gap-Problems to Approximation ProblemsApproximation Problems

Using an efficient approximation algorithm for Using an efficient approximation algorithm for ƒ(X, Y)ƒ(X, Y) to within a factor to within a factor gg,,one can efficiently solve the corresponding one can efficiently solve the corresponding gap problem gap problem gap-ƒ(X, Y)gap-ƒ(X, Y), as long as , as long as tt1 1 // tt2 2 > g> g22

Simply run the approximation algorithm.Simply run the approximation algorithm.The outcome clearly determines which side of The outcome clearly determines which side of the gap the given input falls in.the gap the given input falls in.

(Hence, proving a gap problem (Hence, proving a gap problem NP-hardNP-hard translates to its approximation version, for translates to its approximation version, for appropriate factors )appropriate factors )

1515

gap-SAT

DefDef: gap-SAT[D, v, : gap-SAT[D, v, ]] is as follows: is as follows: InstanceInstance: a set : a set = { = { ll } } of Boolean-of Boolean-

functions functions (local-constraints)(local-constraints) over variables over variables yy11,...,y,...,ymm of of range range 22VV

LocalityLocality: each of : each of 11ll depends on at most depends on at most DD variablesvariables

Maximum-Satisfied-FractionMaximum-Satisfied-Fraction is the fraction of is the fraction of satisfied by an assignment satisfied by an assignment A: { yA: { y11, ..., y, ..., ym m } } 2 2vv

if this fraction if this fraction

= 1= 1 accept accept

<< reject reject DD, , vv and and may be a function of may be a function of l l

1616

The PCP HierarchyThe PCP Hierarchy

DefDef:: L L PCP PCP[ [ DD, , VV, , ] ] if if LL is efficiently reducible to is efficiently reducible to gap-SATgap-SAT[ [ DD, , VV, , ] ]

ThmThm [AS,ALMSS] [AS,ALMSS] NP NP PCP[ O(1), 1, ½] PCP[ O(1), 1, ½] [ The PCP characterization theorem above ][ The PCP characterization theorem above ]

ThmThm [ RaSa ] [ RaSa ] NP NP PCP[ O(1), m, 2 PCP[ O(1), m, 2-m-m ] ] for for m m loglogcc n n for some for some c > 0c > 0

ThmThm [ DFKRS ][ DFKRS ] NP NP PCP[ O(1), m, 2 PCP[ O(1), m, 2-m-m ] ] for for m m loglogcc n n for any for any c < 1c < 1

ConjectureConjecture [BGLR][BGLR] NP NP PCP[ O(1), m, 2 PCP[ O(1), m, 2-m -m ] ] for for m m log n log n

1717

Optimal CharacterizationOptimal Characterization

One cannot expect the error-probability to be less One cannot expect the error-probability to be less than exponentially small in the number of bits each than exponentially small in the number of bits each local-test looks atlocal-test looks at since a random assignment would make such a since a random assignment would make such a

fraction of the local-tests satisfiedfraction of the local-tests satisfied One cannot hope for smaller than polynomially One cannot hope for smaller than polynomially

small error-probabilitysmall error-probability since it would imply less than one local-test since it would imply less than one local-test

satisfied, hence each local-test, being rather easy satisfied, hence each local-test, being rather easy to compute, determines completely the outcometo compute, determines completely the outcome

[ the BGLR conjecture is hence optimal in that [ the BGLR conjecture is hence optimal in that respect]respect]

1818

Approximating MAX-IS is NP-hardApproximating MAX-IS is NP-hard

We will reduce We will reduce gap-SATgap-SAT to to gap –Independent-Set. gap –Independent-Set.

Given an expression Given an expression = { = { ll } } of Boolean-of Boolean-functions over variables functions over variables yy11,...,y,...,ymm of range of range 22VV, Each , Each of of 11ll depends on at most depends on at most DD variables, we variables, we must determine whether all the functions can be must determine whether all the functions can be satisfied or only a fraction less than satisfied or only a fraction less than ..

We will construct a graph, We will construct a graph, GG , that has an , that has an independent set of size independent set of size rr there exists an there exists an assignment, satisfying assignment, satisfying r r of the local-constraints of the local-constraints yy11,...,y,...,ymm..

1919

((q,rq,r-)-)co-partite Graph co-partite Graph G=(QG=(QR, R, E)E)

Comprise Comprise q=|Q|q=|Q| cliques of size cliques of size r=|R|r=|R|:: E E {(<i,j{(<i,j11>, <i,j>, <i,j22>) | i>) | iQ, jQ, j11,j,j2 2 R}R}

qq

2020

qq

Gap Independent-SetGap Independent-Set

InstanceInstance:: an an (q,r)(q,r)-co-partite -co-partite graph graph G=(qG=(qR, E)R, E)

ProblemProblem:: distinguish distinguish betweenbetween GoodGood: : IS(G) = qIS(G) = q BadBad: every set : every set I I V V s.t. s.t. |I||I|

> > qq contains an edge contains an edge

ThmThm:: IS( r, IS( r, ) ) is NP-hard as long as is NP-hard as long as r r ( 1 / ( 1 / ))cc for some constantfor some constant c c

2121

gap-SAT gap-SAT gap-IS gap-IS

y1y1

y2y2

yiyi

ym-1ym-1

ymym

1111

jjjj

llll

TT

TT

TT

TT

TT

FF

TT

FF

TT

ll

Construct a graphConstruct a graph G G that has that has 11 clique clique i i ,,in which in which 11 vertex vertex satisfying assignment for satisfying assignment for ii

2222


y1y1

y2y2

yiyi

ym-1ym-1

ymym

1111

jjjj

llll

TT

TT

TT

TT

TT

FF

TT

FF

TT

ll

Two vertices are connected if the assignments Two vertices are connected if the assignments they represent are inconsistentthey represent are inconsistent

2323

LemmaLemma::

(G(G) = k) = k (independent set of size k) (independent set of size k) X X L L

(There is an assignment that satisfies k clauses) (There is an assignment that satisfies k clauses) Consider an assignment Consider an assignment AA satisfying k clauses. For each satisfying k clauses. For each

clause clause ii consider consider AA's restriction to 's restriction to ii‘s variables ‘s variables

The corresponding The corresponding kk vertexes form an independent set vertexes form an independent set

in in GG

Any independent set of size Any independent set of size kk in in GG implies an implies an

assignment satisfying assignment satisfying kk of of 11ll


Hence: Gap-IS is NP hard, and IS is NP-Hence: Gap-IS is NP hard, and IS is NP-hard to approximate!hard to approximate!

2424

Each of the following theorems gives a hardness of Each of the following theorems gives a hardness of approximation result of Max-IS:approximation result of Max-IS:

ThmThm [AS,ALMSS] [AS,ALMSS] NP NP PCP[ O(1), 1, ½] PCP[ O(1), 1, ½] ThmThm [ RaSa ] [ RaSa ] NP NP PCP[ O(1), m, 2 PCP[ O(1), m, 2-m-m ] ] for for m m

loglogcc n n for some for some c > 0c > 0 ThmThm [ DFKRS ][ DFKRS ] NP NP PCP[ O(1), m, 2 PCP[ O(1), m, 2-m-m ] ] for for m m

loglogcc n n for any for any c > 0c > 0 ConjectureConjecture [BGLR][BGLR] NP NP PCP[ O(1), m, 2 PCP[ O(1), m, 2-m -m ] ] for for

m m log n log n

Hardness of approximation of Hardness of approximation of Max-ISMax-IS

2525

Assuming the PCP theorem, we will show that if Assuming the PCP theorem, we will show that if PPNP,NP, Max-3SatMax-3Sat does not have a does not have a PTAS: PTAS:

Theorem: There is a constant Theorem: There is a constant C>0 C>0 so that computing so that computing (1+c) approximations to (1+c) approximations to Max-3SatMax-3Sat is is NP-hardNP-hard

Hardness of approximation forHardness of approximation forMax-3SATMax-3SAT

2626


SAT formulaSAT formulaEquivalent Equivalent

3SAT 3SAT formulaformula

y1y1

y2y2

yiyi

ym-1ym-1

ymym

variablesvariables

1111

jjjj

llll

CC11CC11 11CCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 jjCCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 llCCkk

CCkkCC22CC22 CC33

CC33

Given an instance of gap-SAT, Given an instance of gap-SAT, = { = { ll } }, we will , we will

transform each of the transform each of the ii‘s into a ‘s into a 3-SAT3-SAT expression expression ii..

2727


Given an instance of gap-SAT, Given an instance of gap-SAT, = { = { ll } }, there are , there are O(n)O(n) functions functions i i . Each of the . Each of the ii‘s depends on up to ‘s depends on up to D=O(1)D=O(1) variables variables..

y1y1

y2y2

yiyi

ym-1ym-1

ymym

1111

jjjj

llll

CC11CC11 11CCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 jjCCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 llCCkk

CCkkCC22CC22 CC33

CC33

Hence each function can be represented as Hence each function can be represented as a CNF formula a CNF formula ii::

ijj

i D ,21C

(a conjunction of (a conjunction of 2^D 2^D clauses, each of size at most clauses, each of size at most DD))Note that the number of clauses is still constant.Note that the number of clauses is still constant.Overall, we build a CNF formula: a conjunction of Overall, we build a CNF formula: a conjunction of i i (one for or each local test)(one for or each local test)..

2828


Now rewrite every D-clause as a group of 3-clauses to Now rewrite every D-clause as a group of 3-clauses to obtain a 3-CNF:obtain a 3-CNF:

y1y1

y2y2

yiyi

ym-1ym-1

ymym

1111

jjjj

llll

CC11CC11 11CCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 jjCCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 llCCkk

CCkkCC22CC22 CC33

CC33

)(^)^()()...( 12412112121

jzjkjkkkk zlzzllzlllll

Note that this is still a constant blow up in the number Note that this is still a constant blow up in the number of clauses.of clauses.

2929


y1y1

y2y2

yiyi

ym-1ym-1

ymym

1111

jjjj

llll

CC11CC11 11CCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 jjCCkk

CCkkCC22CC22 CC33

CC33

CC11CC11 llCCkk

CCkkCC22CC22 CC33

CC33

In case In case is NOT satisfyable, some constant fraction of the is NOT satisfyable, some constant fraction of the are not satisfied, and for each, at least one clause are not satisfied, and for each, at least one clause in in ii isn’t satisfied. isn’t satisfied.

3030

Conclusion:Conclusion:In case the original In case the original SATSAT formula formula isn’t satisfied, a constant isn’t satisfied, a constant

number of number of 3SAT3SAT formula formula ii are not satisfied, and for each at are not satisfied, and for each at least one clause isn’t satisfied.least one clause isn’t satisfied.

Because each Because each ii contains a constant number of clauses, contains a constant number of clauses, altogether a constant number of clauses in the resulting altogether a constant number of clauses in the resulting 3SAT3SAT aren’t satisfied.aren’t satisfied.

This provides a gap, and hence This provides a gap, and hence 3SAT3SAT cannot be approximated to cannot be approximated to within some constant unless within some constant unless P=NPP=NP!!

Hardness of approximation Hardness of approximation forfor

Max-3SATMax-3SAT

3131

The PCP theorem has ushered in a new era of hardness The PCP theorem has ushered in a new era of hardness of approximation results. Here we list a few:of approximation results. Here we list a few:

We showed that We showed that Max-CliqueMax-Clique ( and equivalently ( and equivalently Max-Independent-Set Max-Independent-Set ) do not has a ) do not has a PTASPTAS. It is . It is known in addition, that to approximate it with a known in addition, that to approximate it with a factor of factor of nn1-1- is hard unless is hard unless co-RP = NPco-RP = NP..

Chromatic Number Chromatic Number - - It is It is NP-HardNP-Hard to approximate it to approximate it within a factor of within a factor of nn1-1- unless unless co-RP = NPco-RP = NP. There is a . There is a simple reduction from simple reduction from Max-CliqueMax-Clique which shows that which shows that it is it is NP-HardNP-Hard to approximate with factor to approximate with factor nn. .

Chromatic Number for 3-colorable graphChromatic Number for 3-colorable graph - - NP-HardNP-Hard to approximate with factor to approximate with factor 5/3-5/3- (i.e. to (i.e. to differentiate between 4 and 3). Can be differentiate between 4 and 3). Can be approximated within approximated within O(nO(nloglogO(1) O(1) n).n).

More Results Related to PCPMore Results Related to PCP

3232

Vertex Cover Vertex Cover – Very easy to approximate within a – Very easy to approximate within a factor of 2. factor of 2. NP-HardNP-Hard to approximate it within a to approximate it within a factor of 4/3.factor of 4/3.

Max-3-Sat Max-3-Sat – Known to be approximable within a – Known to be approximable within a factor of factor of 8/78/7. . NP-HardNP-Hard to approximate within a to approximate within a factor of factor of 8/7-8/7- for every for every >0>0

Set Cover Set Cover - - NP-HardNP-Hard to approximate it within a to approximate it within a factor of factor of ln nln n. Cannot be approximated within . Cannot be approximated within factor factor (1-(1-))ln nln n unless unless NP NP Dtime(n Dtime(nloglognloglogn))..


3333

Maximum Satisfying Linear Sub-SystemMaximum Satisfying Linear Sub-System - - The problem: Given a linear system The problem: Given a linear system Ax=bAx=b (A is (A is n x mn x m matrix ) in field matrix ) in field FF, find the , find the largest number of equations that can be largest number of equations that can be satisfied by some satisfied by some xx..

If all equations can be satisfied the problem is in If all equations can be satisfied the problem is in PP..

IfIf F=Q NP-Hard F=Q NP-Hard to approximate by factor to approximate by factor mm. Can . Can be approximated be approximated in O(m/logm)in O(m/logm)..

If If F=GF(q)F=GF(q) can be approximated by factor can be approximated by factor q q (even a random assignment gives such a factor). (even a random assignment gives such a factor). NP-HardNP-Hard to approximate within to approximate within q-q-. Also . Also NP-HardNP-Hard for equations with only 3 variables.for equations with only 3 variables.

For equations with only 2 variables. For equations with only 2 variables. NP-HardNP-Hard to to approximated within 1.0909 but can be approximated within 1.0909 but can be approximated within 1.383 approximated within 1.383


Date post:	23-Jan-2016
Category:	Documents
Upload:	glenda
View:	32 times
Download:	0 times

Introduction to PCP

Documents