improved approximation algorithms for bipartite correlation … · correlation bi-clustering...

38
Improved Approximation Algorithms for Bipartite Correlation Clustering Nir Ailon Noa Avigdor-Elgrabli Edo Liberty Anke van Zuylen Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Upload: others

Post on 24-Sep-2020

9 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Improved Approximation Algorithms for BipartiteCorrelation Clustering

Nir Ailon Noa Avigdor-Elgrabli Edo Liberty Anke van Zuylen

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 2: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Correlation clustering

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 3: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Input for correlation clustering

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 4: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Output of correlation clustering

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 5: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Cost of a correlation clustering solution

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 6: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Cost of a correlation clustering solution

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 7: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Correlation clustering results

approx const running time

Bansal, Blum, Chawla ≈ 20, 000 Ω(n2)

Demaine, Emanuel,Fiat, Immorlica

4 log(n) LP

Charikar, Guruswami,Wirth

4 LP

Ailon, Charikar,Newman, Alantha

2.5 LP

Ailon, Charikar,Newman, Alantha

3 O(m)

Ailon, Liberty < 3 O(n) + cost(OPT )

n and m are the number of nodes and edges in the graph.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 8: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Correlation bi-clustering

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 9: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Input for correlation bi-clustering

The input is an undirected unweighted bipartite graph.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 10: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Output of correlation bi-clustering

The output is a set of bi-clusters.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 11: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Cost of a correlation bi-clustering solution

The cost is the number of erroneous edges.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 12: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Correlation bi-clustering results

approx const running time

Demaine, Emanuel,Fiat, Immorlica

O(log(n))∗ LP

Charikar, Guruswami,Wirth

O(log(n))∗ LP

Noga Amit 12 LP

This work 4 LP (deterministic)

This work 4 O(m) (randomized)

* The first two results hold for general weighted graph.

n and m are the number of nodes and edges in the graph.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 13: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Consider the following graph

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 14: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Choose `1 uniformly at random from the left side.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 15: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Add the neighborhood of `1 to the cluster

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 16: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

For each other node on the left (`2) do the following:

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 17: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

w.p. min(|R1,2|/|R2|, 1) add `2 to the cluster if |R1,2| ≥ |R1|.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 18: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Here `2 joins the cluster because R1,2 ≥ R1.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 19: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Let’s consider another example

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 20: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Let’s consider another example

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 21: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Since |R1,2|/|R2| = 1/2 with probability 1/2 we decide what to do with `2

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 22: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Since |R1,2| < |R1| that decision should be to make `2 a singleton

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 23: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Otherwise (w.p. 1/2) we decide nothing about `2 and continue.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 24: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

We remove the clustered nodes from the graph and repeat.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 25: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

PivotBiCluster

Lemma

Let OPT denote the best possible bi-clustring of G.Let B be a random output of PivotBiCluster. Then:

EB∼PivotBiCluster [cost(B)] ≤ 4cost(OPT )

Let’s see how to prove this...

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 26: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Tuples, bad events, and violated pairs

A “bad event” (XT ) happens to tuple T = (`1, `2,R1,R1,2,R2).

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 27: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Tuples, bad events, and violated pairs

We “blame” bad event XT for the violated (red) pairs, E[cost(T )|XT ] = 3.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 28: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Tuples, bad events, and violated pairs

Since every violated pair can be blamed on (or colored by) one bad eventhappening we have:

EB∼PivotBiCluster [cost(B)] ≤∑T

qT · E[cost(T )|XT ]

where qT denotes the probability that a bad event happened to tuple T .

Note: the number of tuples is exponential in the size of the graph.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 29: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Proof sketch

1 We have (previous slide)

ALG ≤∑T

qT · E[cost(T )|XT ]

2 Write the dual linear program

OPT ≥∑T

β(T ) s.t. constrains on β(T )

3 Set a feasible solution β(T )← qT f (T ).

4 Show that:

E[cost(T )|XT ] + E [cost(T )|XT ] ≤ 4(f (T ) + f (T ))

5 Which gives

ALG ≤∑T

qT · E[cost(T )|XT ] ≤ 4∑T

qT f (T ) ≤ 4 · OPT

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 30: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

The linear program

In a bad square, any clustering must err at least once.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 31: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

The linear program

Let x`,r be equal 1 if the clustering errs on pair (`, r) and 0 otherwise.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 32: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

The linear program

For r2 ∈ R2 and r1,2 ∈ R1,2 we have x`1,r2 + x`1,r1,2 + x`2,r2 + x`2,r1,2 ≥ 1

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 33: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

The linear program

Since each tuple corresponds to |RT2 | · |RT

1,2| bad squares, we get thefollowing constraint:

∀ T :∑

r2∈RT2 ,r1,2∈RT

1,2

(x`T1 ,r2

+ x`T1 ,r1,2+ x`T2 ,r2

+ x`T2 ,r1,2

)=

∑r2∈RT

2

|RT1,2| · (x`T1 ,r2

+ x`T2 ,r2) +

∑r1,2∈RT

1,2

|RT2 | · (x`T1 ,r1,2

+ x`T2 ,r1,2)

≥ |RT2 | · |RT

1,2|

Minimizing the cost corresponds to a minimization over∑

x`,rand subject to x`,r ≥ 0.

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 34: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

The linear program

The dual of which is as follows:

max∑T

β(T )

s.t. ∀ (`, r) ∈ E :∑T : `T2 =`,r∈RT

2

β(T )

|RT2 |

+∑

T : `T1 =`, r∈RT1,2

β(T )

|RT1,2|

+∑

T : `T2 =`, r∈RT1,2

β(T )

|RT1,2|≤ 1

and ∀ (`, r) 6∈ E :∑

T : `T1 =`, r∈RT2

1

|RT2 |β(T ) ≤ 1

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 35: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Feasibilty

Lemma

The solutionβ(T ) = qT · f (T )

is a feasible solution to the dual of the linear program when:

f (T ) = min|RT1,2|, |RT

2 |min

1,

|RT1,2|

min|RT1,2|, |RT

1 |+ min|RT1,2|, |RT

2 |

Lemma

For any tuple T ,

E[cost(T )|XT ] + E[cost(T )|XT ] ≤ 4 ·(f (T ) + f (T )

).

We will not show these here, they are very technical...

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 36: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Conclusion and discussion

PivotBiCluster LP based

Running time O(m)Solves LP onn3 constraints

Takes advantage of bipartiteness Yes no

Approximation factor 4 4

Symmetric No Yes

Deterministic No Yes

Can a combinatorial algorithm bit the 4 approximation factor?

Maybe it should take advantage of the symmetry?

Can this algorithm be derandomized?

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 37: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Things I’ll be happy discuss

(Some of which I don’t know the answers for...)

The rest of the proof

Why we believe the analysis must use tuples and bad squares are notenough

The running time of the algorithm (easily seen to be O(m))

An LP based 4-approximation deterministic algorithm

Is there a tight bad example for which the algorithm achieves theapproximation bound

Is there a combinatorial and deterministic algorithm with this bound(or better)

Is there an approximation hardness result? (for what factor)

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen:

Page 38: Improved Approximation Algorithms for Bipartite Correlation … · Correlation bi-clustering results approx const running time Demaine, Emanuel, Fiat, Immorlica O(log(n)) LP Charikar,

Thank you

Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty, Anke van Zuylen: