Google Fusion Tables: Web-Centered Data Management and Collaboration Hector Gonzalez, Alon Y. Halevy, Christian S. Jensen, Anno Langen, Jayant Madhavan, Rebecca Shapley, Warren Shen, Jonathan Goldberg-Kidon Google Inc. Proceedings of the 2010 international conference on Management of data (SIGMOD '10)




Introduction

• Cloud, Web, powerful PC devices
• How would we design data management functionality for today's connected world?


Introduction

• The design goals of Fusion Tables
• The functionality that supports this design
• A companion paper covers the architecture and implementation:
– Google Fusion Tables: Data Management, Integration and Collaboration in the Cloud. Proceedings of the Symposium on Cloud Computing, 2010


Design Foundations

• Replace traditional database management?
• Applications into the cloud?
• Underlying principles?
– Small set of guiding principles
– Pay-as-you-go


Design Foundations

• New Applications
– Ecologists in the rain forests of Costa Rica
– Circle of Blue
– Current status of health clinics
– The International Coffee Organization
– Epidemiologist
– Visualizing data for a senator
– MTBGuru
– Dairy farm in Brazil, managed from Thailand and California


Design Foundations

Underlying Principles
• Provide Seamless Integration with the Web
– Public datasets for search engines
– Visualization on the Web
– Powerful collaboration
• Emphasize Ease of Use


Design Foundations

Underlying Principles
• Provide Incentives for Sharing Data; data owners worry about:
– Loss of attribution
– Misuse and corruption of their data
– Others not being able to find the data easily
• Facilitate Collaboration
– Discuss and comment


Data Management with Fusion Tables

• Data Acquisition
– Upload a file
– Ease of use, fewer steps
– No schema or types to declare up front
– The system assigns a data type to each column
– Users can specify data types if they so desire
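
The upload flow above (no declared schema, system-assigned column types, optional user override) can be sketched roughly as follows. This is an illustrative sketch only, not Fusion Tables' actual inference logic: the type names ("number", "date", "text"), the single ISO date format, and the helper names `infer_type` and `infer_schema` are all assumptions made for the example.

```python
import csv
import io
from datetime import datetime


def _is_number(value):
    """True if the string parses as a float."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def _is_date(value):
    """True if the string is an ISO date (assumed format: YYYY-MM-DD)."""
    try:
        datetime.strptime(value, "%Y-%m-%d")
        return True
    except ValueError:
        return False


def infer_type(values):
    """Guess a column type from its non-empty string values."""
    values = [v.strip() for v in values if v.strip()]
    if not values:
        return "text"
    if all(_is_number(v) for v in values):
        return "number"
    if all(_is_date(v) for v in values):
        return "date"
    return "text"


def infer_schema(csv_text, overrides=None):
    """Infer a type per column; user-supplied overrides win (hypothetical helper)."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, body = rows[0], rows[1:]
    schema = {}
    for i, name in enumerate(header):
        schema[name] = infer_type([r[i] for r in body if i < len(r)])
    schema.update(overrides or {})
    return schema


SAMPLE = "clinic,beds,opened\nNorth,12,2009-03-01\nSouth,7,2010-01-15\n"
print(infer_schema(SAMPLE))
print(infer_schema(SAMPLE, overrides={"beds": "text"}))
```

The user never declares a schema; the optional `overrides` argument models the "if they so desire" step, where an explicit choice replaces the guessed type.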


Data Management with Fusion Tables

• Data Sharing and Collaboration
– Attribution and export
– Search
– Sharing and integration
– Discussions


Data Management with Fusion Tables

• Data Sharing and Collaboration
– Sharing and integration
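
The slide does not spell out how integration works. One common reading is combining two shared tables on a column they have in common. The sketch below illustrates that idea as a plain inner join over lists of dict rows; the function name `merge_tables`, the sample tables, and the rule that the left table wins on conflicting columns are assumptions for illustration, not the Fusion Tables implementation.

```python
def merge_tables(left, right, key):
    """Inner-join two lists of dict rows on a shared key column."""
    # Index the right-hand table by key so each left row is matched in O(1).
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)

    merged = []
    for lrow in left:
        for rrow in index.get(lrow[key], []):
            # Left-hand values take precedence when both tables share a column.
            merged.append({**rrow, **lrow})
    return merged


# Hypothetical tables owned by two different collaborators.
clinics = [
    {"clinic": "North", "beds": 12},
    {"clinic": "South", "beds": 7},
]
staffing = [
    {"clinic": "North", "doctors": 3},
    {"clinic": "East", "doctors": 1},
]

print(merge_tables(clinics, staffing, key="clinic"))
```

Only rows whose key appears in both tables survive the join, so "South" and "East" are dropped here.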


Data Management with Fusion Tables

• Data Sharing and Collaboration
– Discussions


Data Manipulation and Visualization

• Table, Map, Intensity map, Line, Bar, Pie, Scatter, Timeline


Data Manipulation and Visualization

• HTML snippet


Fusion Tables API

• Platform for data management and collaboration
• Lets developers extend the platform
• API for creating, inserting, deleting, and updating rows in a table
• Authenticated through the existing methods used for all Google properties
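
To make the row operations listed above concrete, here is a minimal in-memory sketch of a table that supports insert, update, and delete. It does not reproduce the real Fusion Tables API: the `Table` class, its method names, and the integer row IDs are hypothetical, and authentication is not modeled at all.

```python
from itertools import count


class Table:
    """Hypothetical in-memory stand-in for a hosted table."""

    def __init__(self, name, columns):
        self.name = name
        self.columns = list(columns)
        self.rows = {}          # row_id -> {column: value}
        self._ids = count(1)    # simple increasing row IDs

    def _check(self, values):
        """Reject values for columns the table does not have."""
        unknown = set(values) - set(self.columns)
        if unknown:
            raise KeyError(f"unknown columns: {sorted(unknown)}")

    def insert(self, **values):
        """Add a row; missing columns default to None. Returns the new row ID."""
        self._check(values)
        row_id = next(self._ids)
        self.rows[row_id] = {c: values.get(c) for c in self.columns}
        return row_id

    def update(self, row_id, **values):
        """Overwrite the given columns of an existing row."""
        self._check(values)
        self.rows[row_id].update(values)

    def delete(self, row_id):
        """Remove a row by ID."""
        del self.rows[row_id]


clinics = Table("clinics", ["clinic", "beds"])
north = clinics.insert(clinic="North", beds=12)
south = clinics.insert(clinic="South")
clinics.update(south, beds=7)
clinics.delete(north)
print(clinics.rows)
```

A hosted service would expose the same operations over the network and check the caller's credentials before each one; the sketch keeps only the data-manipulation part.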


Related Work

• Several online database management tools exist
– ManyEyes (many-eyes.com)
– DabbleDB (dabbledb.com)
– Socrata (socrata.com)
– Factual (factual.com)
• Fusion Tables
– Emphasizes the collaboration aspects of data management and handles larger datasets


Conclusions

• Enable a much larger class of users to
– manage their data
– keep it integrated with their other online activities
• Make it easy for data owners to publish data on the Web
• Make it easier for users to discover data
• Provide
– more expressive data modeling
– query capabilities
– adequate performance on larger datasets