solving graph compression via optimal transportvgarg/otcslides.pdf · solving graph compression via...

14
Solving graph compression via optimal transport Vikas Garg, Tommi Jaakkola CSAIL, MIT

Upload: others

Post on 25-Apr-2020

37 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Solving graph compression via optimal transport                                                                        Vikas  Garg,  Tommi  Jaakkola                                                                                                          CSAIL,  MIT    

Page 2: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Graph compression

 

•  Improved  performance    •  reduced  storage  requirements  •  faster  algorithms  •  removal  of  spurious  features  for  downstream  tasks  

•  summarizaCon/beFer  visualizaCon    

Image  source:      Houw    Liong  The,  Lecture  12  on  graph  mining.                

Page 3: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Standard approach 1: Compression via coarsening

Image  source:  Aydin  Buluc,  SIAM  Journal  on  ScienCfic  CompuCng,  2011.    

Find  a  matching,  merge  the  matched  verCces,  and  repeat.          

Page 4: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Standard approach 2: Compression via sparsification

Image  source:  Daniel  Spielman,    Workshop  on  Algorithms  for  Modern  Massive  Datasets.      

Keep  the  verCces  intact,  and  delete  edges  instead.    

Page 5: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

What’s missing?

•  ExisCng  approaches  try  to  preserve  the  graph  spectrum,  cuts  etc.    •  oblivious  to  aFribute  informaCon,  e.g.,  graph  labels  and  node  features  

•  Compression  criteria  not  given  as  an  opCmizaCon  problem  •  less  suitable  for  robustly  linking  with  downstream  tasks  

Page 6: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Optimal transport cost on a graph

1/8                                      

1/8                                      

1/4                                      

 1/4                                      

1/8                                       1/8                                      

1/8                                      

1/2                                        

3/8                                      

Suppose  we  fix  an  iniCal  distribuCon  and  a  target  distribuCon  on  the  nodes.          

c1  c2  

c3  

c4  

c5  

c6  

Cost  =  1/8  *  (c1  +  c2  +  c6)    

What’s  the  minimum  cost  to  transfer  mass  if  we  only  allow  flow  along  the  edges?  

 Example:    Cost  associated  with  the  indicated  flow  (not  necessarily  opCmal)  

c3  

c4  

Note  that  here  the  target  distribuCon  is  over  a  smaller  support  (only  3  nodes)  

Page 7: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

How do we compute the optimal transport (OT) cost?

                                                                                                                           :        

Previously  known  for  directed  graphs  only.  We  extend  to  the  undirected  seeng.    

Page 8: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Outline of our approach

• Define  OT  on  a  (directed/annotated)  graph  •  cost  depends  on  specified  prior  informaCon  

•  e.g.  importance  of  nodes  and  their  labels  or  aFributes  •  thus  can  be  informed  by  the  downstream  task  

• OpCmize  the  target  distribuCon  (its  support)  •   using  a  regularized  OT  cost  as  the  criterion  •  key  step:  we  show  how  subgraph  selecCon  is  found    (yet  to  be  illustrated)  

Page 9: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Challenge: Target distribution is not known

• Combinatorial  problem!  Need  to  compute  opCmal  cost  relaCve  to  every  target  distribuCon  over  the  specified  size  of  support  

1/8                                      

1/8                                      

1/4                                      

 1/4                                      

1/8                                       1/8                                      

1/8                                      

1/2                                        

3/8                                      

c1  c2  

c3  

c4  

c5  

c6  

1/8  

3/8   1/2  

3/8   1/8  

1/2  c3  

c4  

c2  

c5  

c1  

c2  

Page 10: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Optimization formulation

   OpCmal  transport  cost                  in  dual  form  

RegularizaCon  

Control  the  size  of  target  support  

Page 11: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Solving the optimization •  Non-­‐convex  due  to  sparsity  constraint      •  Introduce  Boolean  variables          and  deduce                                                                  

                                   •  Relax  each  coordinate  of        to  [0,  1]  and  solve.  Perform  rounding  to  have  at  most  k  verCces.  The  spanned  subgraph  is  our  compressed  graph.  

 

       

Page 12: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

     Performance  on  standard  graph  datasets  

Page 13: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

     QualitaCve  results  on  syntheCc  and  real  data  

Page 14: Solving graph compression via optimal transportvgarg/OTCSlides.pdf · Solving graph compression via optimal transport !!!!!Vikas!Garg,!Tommi!Jaakkola!!!!!CSAIL,!MIT!!

Conclusion • A  new  principled  approach  to  compressing  graphs    • Prior  informaCon  can  be  seeded  easily  •  Suitable  for  downstream  tasks  such  as  classificaCon    •  InteresCng  direcCons  

•  complement  encoding  (compression)  with  decoding  (decompression)  •  expand  the  framework  to  allow  addiConal  constraints  (e.g.  requiring  the  compressed  graph  to  be  connected)  

•  higher  order,  structured  graph  compression