massachusetts institute of technology cambridge,...
TRANSCRIPT
Massachusetts Institute of Technology!Cambridge, MA!
http://bigdata.csail.mit.edu/!
data…is practically everywhere!
health/!medical!
social!finance!
retail!government!
entertainment!
learning!transportation/logistics! science/!
environment!
…AND THE AMOUNT OF DATA IS GROWING AT 50% PER YEAR! - according to IDC!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
insurance!
Generating common questions:
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
• What platforms? infrastructure? algorithms?
• How do we leverage different types of data – inside our organization and external data?
e.g. structured data, video streams, images, social media (twitter), natural language (voice, text), measurements, click streams…
• How do we make “big data” accessible? - not just a handful of data analysts but across my organization - data provenance – how do I trust this data to make decisions
• How manage security and privacy concerns?
etc….
across different sectors including medical, finance, government, science, retail, insurance, industrial and manufacturing…!
bigdata@csail: focus
Computation Algorithms Applications Privacy
+ + +
photo credits: Google, Andrew Lo/MIT, hGraph/Involution Studios!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
CAPTURE!DISTRIBUTED DATA STORAGE & STREAMING![Stonebraker, Madden..]
CLOUD PLATFORMS ![Agarwal, Devadas, Amarsinghe..]
MASSIVE SCALE!
DATA ANALYTICS!BIG ALGORITHMS![Indyk, Roni>, Edelman..]
BIG MACHINE LEARNING![Jaakkola, O’Reilly, Fisher..]
Images Text LocaHons Measurements Videos Web clickstreams Tweets Voice…
INSIGHT!!
VISUALIZATION!& HCI![Karger, Miller, Oliva, Keel…]
BIG UNDERSTANDING![Torralba, Freeman [Images], Barzilay, Katz [Language], Glass [Speech]…]
Finance Medical Science Energy Intelligence EducaHon Retail Sports Entertainment TransportaHon Business Insurance…
SECURITY & PRIVACY!
[Zeldovich, Kaashoek, Clark, Sollins..]
APPLICATIONS!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
!Bring together leading researchers in the field and major players in industry to identify and tackle the unique and fundamental challenges of big data!
bigdata@csail: mission
RESEARCH! We will develop tools, algorithms, designs and prototypes of reusable systems that will solve the big data problem.!
CONVENE!Big Data is not just algorithms or databases – it requires a collaborative effort across MIT working with partners in academia, industry and government. We will bring together thought leaders to map out the future of Big Data.!
EDUCATE/!IMPACT!
We will continuously seek opportunities to engage and educate students, researchers, industry, government, and the public on Big Data.!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
State of the Initiative
research!
convene!
• Funded 7 big data seed projects (2012/2013)!• Foster new research projects with our partners!• Developing “Big Data Partners” to enable access to big data sets for research!• Facilitate interactions with students and researchers!
• Workshops on Big Challenges (topics identified by members)!• Big Data Integration (April 2013)!• Big Data Privacy (June 2013) – co-sponsored by Sloan Foundation!
• Annual meeting (Nov 13-14 2013)!• “Big Data Lecture” series at MIT (Fall 2012/Spring 2013)!• Member meetings and visits!• Big Data space at CSAIL!
impact/!educate!
• MIT “Big Data Challenge” (pilot: City of Boston transportation)!• “Big Data Living Lab” at MIT!• bigdata@csail member website!• Big Data Industry Visitor program!• Big Data Tools and Demos!• Meeting/Workshop reports/articles/white papers!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
bigdata@csail Founding members!
Members!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
Research – seed funding for big data projects
• CFP: ʻCall for Proposalʼ 2x per year (Fall/Spring)!– research project grants!– small grants for seminar series; competitions; course projects; etc.!
• bigdata.csail.mit.edu/proposals!
Projects (2012/2013):!• Boris Katz, Interfacing with Big Data Repositories!• Bill Freeman, Learning online from 25 Million Images!• Tim Berners-Lee, Growing Big Linked Data from Seed!• Una-May OʼReilly, Genetic Programming Machine Learning in a DBMS!• Alex (Sandy) Pentland, A Testbed for Trusted Use of Big Data!• David Karger, Masses Data: Empowering Big Dataʼs Users!• Aude Olivia, Memorability as a Quantification of Utility of Information!
bigdata@csail: data partnerships!
!We partner with different organizations to make data sets available for research here at MIT -- for exploring new ideas; testing out theories and new algorithms, systems and tools; student projects and challenges; and ultimately, for demonstrating the impact of Big Data with real world data.!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
MIT Big Data Challenge!• Pilot: Big Data + Transportation - City of Boston (summer 2013)!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
Big Data: a Living Lab at MIT!• Big Data will change the world: How?!• Proposal: We want to create a living lab at MIT to allow the community
to access, sharing, and use data about itself. !• Why? To explore technical issues around integration, privacy,
visualization, and performance, as well as social implications of big data.!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
MIT DATA HUB!ORGANIZATIONAL!
DATA!PERSONAL!DATA!
PUBLIC !DATA!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
people
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
Big Data requires a new generation of technologies to store, manage, analyze, share, and understand the huge quantities of data we are now capable of collecting…!
…join bigdata@CSAIL to learn more!
MIT COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE LABORATORY
example research projects…
http://bigdata.csail.mit.edu/research!