new algorithms and lower bounds for monotonicity testing of boolean functions

46
New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions Rocco Servedio Joint work with Xi Chen and Li-Yang Tan Columbia University Institute for Advanced Study March 2014

Upload: amity

Post on 24-Feb-2016

62 views

Category:

Documents


0 download

DESCRIPTION

Rocco Servedio Joint work with Xi Chen and Li-Yang Tan Columbia University. New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions. Institute for Advanced Study March 2014. Simplest question about a Boolean function: Does it have some property P ?. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Rocco Servedio

Joint work with Xi Chen and Li-Yang TanColumbia University

Institute for Advanced StudyMarch 2014

Page 2: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Property Testing

All Boolean functions

Property P

Query access to unknown f on any input x

With as few queries as possible, decide if

Far from P

Simplest question about a Boolean function: Does it have some property P?

f has Property P vs. f does not have Property P

Unreasonable: Easy lower bound of

f is far from having Property P

Page 3: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Rules of the game

1. If f has Property P, accept w.p. > 2/3.

2. If f is ε-far from having Property P, reject w.p. > 2/3.

3. Otherwise: doesn’t matter what we do.

All Boolean functions

Property P Far from P

Query access to unknown f on any input x

Page 4: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

This work: P = monotonicity

Page 5: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Monotone Far from monotone

Well-studied problem: [GGR98, GGL+98, DGL+99, FLN+02, HK08, BCGM12, RRS+12, BBM12, BRY13, CS13, …]

but still significant gaps in our understanding.

This work: P = monotonicity

Page 6: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Background and prior results

Page 7: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

For all monotone functions g: A monotone function is one that satisfies:

Monotone Far from monotone

Quick reminder about monotonicity

“Flipping an input bit from 0 to 1 cannot make f go from 1 to 0”

Page 8: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Monotone

1

01

1

01

Far from monotone

0n 0n

1n 1n

Page 9: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

First test that comes to mind

Pick random edge

y = 1 1 1 1 0 0 1 0 1 0 1 0 0 1

x = 1 1 1 1 0 0 1 1 1 0 1 0 0 1

Reject!

f(x) = 0

f(y) = 1

f(x) = 1

f(y) = 1

f(x) = 0

f(y) = 0

f(x) = 1

f(y) = 0

Repeat

? x

y

Call such an edge a “violating edge”

Page 10: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Theorem [Goldreich et al. 1998] If f is ε-far from monotone, Ω(ε/n) fraction of edges are violations.

Therefore tester will reject within O(n/ε) queries.

An observation and a theorem

Reject!

f(x) = 0

f(y) = 1

Call such an edge a “violating edge”

Simple observation: If f is monotone, tester never rejects.

Question: If f is ε-far from monotone, how likely to catch violating edge?

? x

y

Tester = Sample random edges, check for violations.

Page 11: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

An exponential gap between upper and lower bounds

Goldreich et al. [FOCS 1998]

Introduced problem, gave tester with O(n) query complexity.

Fischer et al. [STOC 2002]

Any non-adaptive tester must make Ω(log n) queries. Therefore, any adaptive tester must make Ω(log log n) queries.

Chakrabarty-Seshadhri [STOC 2013]

O(n7/8)-query non-adaptive tester!

[GGR98, DGL+99, HK08, BCGM12, RRS+12, BRY13, CS13, …]

Page 12: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Lower bound:Any non-adaptive tester must make queries.

Therefore, any adaptive tester must make queries.

Exponential improvements over Ω(log n) and Ω(log log n) lower bounds of Fischer et al. (2002)

Upper bound:There is a non-adaptive tester that make

queries.

Polynomial improvement over O(n7/8) upper bound of Chakrabarty and Seshadhri (2013)

Our results

Page 13: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Rest of talk

The lower bound• Main idea of the argument in a simpler setting

• Key technical ingredient: Multidimensional Central LimitTheorems

• Generalizing the lower bound: Testing monotonicity on hypergrids

Page 14: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Yao’s minimax principle

Lower bound against randomized algorithms

Tricky distribution over inputs to deterministic algorithms

implies

Page 15: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Yao’s principle in our setting

Monotone Far from Monotone

Distribution Dno supported on far from monotone functions

Indistinguishability. For all T = deterministic tester that makes o(n1/5) queries,

Distribution Dyes supported on monotone functions

Page 16: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Dno : = −1 with prob 0.1, 7/3 with prob 0.9

Main Structural Result: IndistinguishabilityAny deterministic tester that makes few queries cannot tell Dyes from Dno

Key property: , .

Dyes : = uniform from {1,3}

Our Dyes and Dno distributions

Both supported on Linear Threshold Functions (LTFs) over {−1,1}n:

Verify: Dyes LTFs are monotone, Dno LTFs far from monotone w.h.p.

Page 17: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Claim. For all T = deterministic tester that makes q = o(n1/5) queries,

Indistinguishability: starting small

q = 1 query

Page 18: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Non-trivial proof of a triviality

Claim. Let T = deterministic tester that makes 1 query, on string z. Then:

(*)

vs.

Tester sees:

Page 19: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Central Limit Theorems. Sum of many independent “reasonable” random variables converges to Gaussian of same mean and variance.

Main analytic tool (Baby version):

Page 20: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Goal: Upper bound

Recall key property:

Page 21: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Claim. Let T = deterministic tester that makes 1 query. Then:

We just proved:

Plenty of room to spare!Would be happy with < 0.1

Page 22: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

q queries instead of 1

Multidimensional CLTs. Sum of many independent “reasonable” q-dimensional random variables converge to q-dimensional

Gaussian of same mean and variance.

Main analytic tool (Grown-up version):

Page 23: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Multidimensional CLTs

[Mossel 08, Gopalan-O’Donnell-Wu-Zuckerman 10]

Lower bound

[Valiant-Valiant 11]

Main technical work: Adapting multidimensional CLT for Earth Mover Distance to get

Page 24: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Claim. For all T = deterministic tester that makes q = o(n1/5) queries,

Let’s prove the real thing:

Page 25: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Setting things up

Arrange the q queries of tester T in a q x n matrix Q

i-th query string

Recall: Tester’s goal is to distinguish

versus

Page 26: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

What does the tester see?

Goal: Upper bound

Tester sees:

(coordinate-wise sign)

Page 27: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Goal: Upper bound

Random variables supported on 2q orthants of Rq

union of orthants“roughly equal weight on any union of orthants”

Page 28: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Fixed

from product distribution over Rn sum of n independent vectors in Rq

Ditto:

Page 29: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Goal: Two sums of n independent vectors in Rq are “close”

where closeness = roughly equal weight on any union of orthants.

The final setup

Furthermore, since

suffices to show each are close to q-dimensional Gaussian with matching mean and covariance matrix.

Page 30: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Valiant-Valiant Multidimensional CLT

Sum of many independent “reasonable” q-dimensional random variables is close to q-dimensional Gaussian of same mean and variance.

with respect to Earth Mover Distance:

Minimum amount of work necessary to “change one PDF into the other one”,

where work := mass x distance

Technical lemma:Closeness in EMD roughly equal

weight on any union of orthants

Page 31: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Valiant-Valiant Multidimensional CLT

Page 32: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Key technical lemma

Closeness in EMD Roughly equal weight on any union of orthants

Page 33: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Let’s consider the contrapositive:for all unions of orthants

≥ 0.1 unit of mass must be moved out of

Recall: work = mass x distance

Slight technical obstacle: What if this 0.1 unit of mass only moves a tiny distance?

Solution: Then G has ≥ 0.1 weight on red boundary.

Not possible thanks to anti-concentration of G

If we want to make S look like G,

Page 34: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

In slightly more detail

For all r > 0, define Br := radius r boundary around

Therefore:

, has to be moved out of

Br

Page 35: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

We just proved

Gaussian anti-concentration

Valiant-ValiantCLT

Page 36: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

The details

t=O(1) in our setting

= in our setting

Optimal choice of makes RHS

Pick , and RHS is o(1) as desired.

Page 37: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Indistinguishability. For all T = deterministic tester that makes q = o(n1/5) queries,

Recap

Lower bound:Any non-adaptive tester must make queries.

Therefore, any adaptive tester must make queries.

Page 38: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Testing monotonicity of Boolean functions over general hypergrid domains

Page 39: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Boolean functions over hypergrids

Monotone Far from monotone

All Boolean functionsAll Boolean-valued functions over hypergrid

Testing monotonicity of Boolean functions over hypergrids

Page 40: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Lower bound: *Any non-adaptive tester for testing monotonicity of

.must make queries.

* only for even m

Proof by reduction to m=2 case (Boolean hypercube)

Page 41: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

A useful characterization

Theorem. [Fischer et al. 2002]

There exists ε .

mn vertex-disjoint pairs such that and .

is ε-far from monotone

“violating pair”

(Upward direction is easy)

0

1

0

1

0

1

0

1

0

1

0

10

1

Page 42: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Given any , define as follows:

Reducing to m = 2

numbers in [m] bits in {0,1}

Easy: If f is monotone then so is F.

Remains to argue:If f is ε-far from monotone then so is F.

Exists ε2n vertex-disjoint pairs in {0,1}n that are violations w.r.t. f

Exists ε .

mn vertex-disjoint pairs in [m]n that are violations w.r.t. F

Useful characterization

Useful characterization

Each violating pair (m/2)n vertex-disjoint violating pairs

suffices to show:

Page 43: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

f(x) = 0

f(y) = 1

1

0 S(x)

S(y){0,1}n

[m]n

|S(x)| = |S(y)| = (m/2)n

where

Easy end game: exhibit order-preserving bijection between S(x) and S(y)

Page 44: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Conclusion

A polynomial lower bound for testing monotonicity of Boolean functions

Main technical ingredient: multidimensional central limit theorems Proof extends to testing monotonicity over general hypergrid domains

(when m is even)

Open Problem: Polynomial lower bounds against adaptive testers?

Lower bound:Any non-adaptive tester must make queries.

Therefore, any adaptive tester must make queries.

Page 45: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

What I didn’t talk about

An improved one-sided, non-adaptive tester

Builds on ingredients from [Chakrabarty and Seshadhri 2013]: trade off edge tester versus “path tester”

We give a different “path tester” with an improved analysis

Open Problems: can we exploit adaptivity or two-sided error to obtain more efficient testers?

Upper bound:There is a non-adaptive tester that make

queries.

Page 46: New Algorithms and Lower Bounds for Monotonicity Testing of Boolean Functions

Thank you!