kevin matulef mit ryan o’donnell cmu ronitt rubinfeld mit rocco servedio columbia

28
Kevin Matulef MIT Ryan O’Donnell CMU Ronitt Rubinfeld MIT Rocco Servedio Columbia

Upload: cornelius-ridout

Post on 15-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Kevin MatulefMIT

Ryan O’DonnellCMU

Ronitt RubinfeldMIT

Rocco ServedioColumbia

= Property Testing

Linear Threshold Functions =

Main Theorem:

There is a poly(1/) query,

nonadaptive,

two-sided error

property testing algorithm for being a halfspace.

Given black-box access to ,

f a halfspace ) alg. says YES with prob. ¸ 2/3;

f -far from all halfspaces ) alg. says YES with prob. · 1/3.

Halfspaces are testable.

Motivation

• “Usual” property testing motivation…?

• ‘precursor to learning’ motivation makes some sense

• Not many poly(1/)-testable classes known.

• Core test is 2-query:

f a halfspace ) Pr[ f passes] ¼ c

f -far from all halfspaces ) Pr[ f passes] · c − poly()

• Local tests really characterize the class:

“Halfspaces maximize this quadratic form,

and anything close to maximizing is close to a halfspace.”

2-query test

Promise: f is balanced

1. Pick to be -correlated inputs.

are such that

2. Test if

Thm: f a halfspace ) Pr[ f passes] ¸

f -far from all halfspaces ) Pr[ f passes] ·

uniform

1

1

−1

−1

1

−1

−1

1

i.e.

2-query test

Promise: f is balanced

1. Pick to be -correlated inputs.

2. Test if

Thm: f a halfspace ) Pr[ f passes] ¸

f -far from all halfspaces ) Pr[ f passes] ·

uniformGaussian

The truth about the Boolean test

2-queryGaussian

test

non-balancedcase

Boolean,

“low-influences”

version

testingfor low

influences

“cross-testing” two

low-influence halfspaces

stitching together halfspaces,

LP bounds

junta-testing[FKRSS’02]

Gaussian testing setting

• Domain:

• Class to be tested is

Halfspaces:

• thought of as having

Gaussian distribution:

Each coord 1,…, n distributed

as a standard N(0,1) Gaussian

• Unknown

Gaussian

Facts about Gaussian space

• Rotationally invariant

• The r.v. has distribution N(0, ).

• With overwhelming probability,

• Hence essentially same as uniform distribution on the sphere.

• “ are -correlated n-dim. Gaussians:”

are i.i.d. “-correlated 1-dim. Gaussians:”

– draw , set

(proof: = , which has same distribution as

by rotational symmetry)

Why Gaussian space?

You: “Ryan, why are you hassling us with all this Gaussian stuff?

I only care about testing on {−1, 1}n.”

Me: “Sorry, you have to be able to solve this problem first.”

But also: Much nicer setting because of rotational invariance.

might really be a function

in disguise.

[class of halfspaces $ class of halfspaces]

Intuition for the test

Q: Which subset of half of the [sphere/Gaussian space]

maximizes probability of vectors landing in same side?

the test

Intuition for the test

A: Halfspace, for each value of 2 [0,1]. (And each value of ½.)

(Gaussian: [Borell’85]; Sphere: [Feige-Schechtman’99], others?)

the test

But does this characterize halfspaces?

Q: If a set passes the test with probability close to that of a

halfspace, is it itself close to a halfspace?

A: Not known, in general.

But: We will show that this is true when is close to 0.

The “YES” case

Suppose f is a balanced halfspace.

1. By spherical symmetry, we can assume

2. Thus iff .

3. This probability is

[Sheppard’99]

the test

Pr[ f passes] ?

The “NO” case

Suppose is any balanced function.

Def: Given , define their “correlation” to be

“Usual Fourier analysis thing”:

where f = 0 is the “constant part” of f , f = 1 is the “linear part” of f , etc.

the test

Pr[ f passes] ?

Def:

any expressible as:

The “NO” case the test

Analyzing the pass probability

Fact:

Cor:

the test

for all i.

The tail part,

Analyzing the pass probability

What is the “constant part” of f ?

Prototypical constant function is

Fact:

= 0 in our case, since I promised f

balanced.

the test

Analyzing the pass probability

What is the “linear part” of f ?

A linear function looks like

Fact:

Cor:

the test

Let’s write in place of

Analyzing the pass probability

But:

(since f is §1-valued)

the test

Gaussian facts

The “NO” case completed

with equality iff

I.e., for any f :

if is close to , then f is close to being a halfspace.

In particular, with a little more analytic care, one concludes:

the test

(in fact, the sgn of its linear part)

)

The truth about the Boolean test

2-queryGaussian

test

non-balancedcase

Boolean,

“low-influences”

version

testingfor low

influences

“cross-testing” two

low-influence halfspaces

stitching together halfspaces,

LP bounds

junta-testing[FKRSS’02]

Boolean version

The “NO” case

Let’s PgUp and see what needs to change!

Analyzing the pass probability

But:

(since f is §1-valued)

the test

???False: is possible: f (x) = x1.

Idea

But:

(since f is §1-valued)

False: is possible: f (x) = x1.

???

Idea

???

Central Limit Theorem: If each is “small”, say , (with error bounds) then

is “close” in distribution to .

“ i th influence ”

Germ of remainder of proof:

1. Possible to test if all i’s small

2. for at most i’s

Open directions

1. this result

+ “Every lin. thresh. fcn. has a low-weight approximator” [Servedio

’06]

= we understand Boolean halfspaces somewhat thoroughly.

Can we use this to solve some more open problems?

2. Which classes of functions testable?

Consider the class “isomorphic to Majority;”

i.e.,

Another chunk of the paper shows an lower bound!

(# queries depends only on