fairhair.ai: delivering outside insight from billions of ... · fairhair.ai: delivering outside...

1
Special Seminar Thursday, September 28, 2017 10:00 - 11:30 a.m. GHC 4405 Fairhair.AI: Delivering Outside Insight from billions of online sources with the Meltwater Data Science platform” ABSTRACT Meltwater Outside Insight product suite is currently being used by over 25,000 business customers worldwide, including 50% of Fortune 500 companies. Fairhair.AI is the platform behind the scenes responsible for generating all the insights. This presentation covers a high-level overview of Fairhair.AI and its major components: an AI engine for unsupervised full-site web extraction; pipelines for natural language processing analyzing 100’s of millions of new documents everyday across 20 languages; and, last but not the least, a knowledge graph construction that systematically mines facts from all the ingested data. We are democratizing Insight building by opening up our data, our platform, and our interoperable NLP pipelines to developer communities and potential academic collaboration. Aditya Jami , Senior Director of Engineering, Meltwater Aditya Jami is the Senior Director of Engineering at Meltwater and oversees all AI and Big Data initiatives by teams comprised of experienced researchers from top universities (Oxford, Stanford, etc.), big data engineers with prior industry experience at Netflix, Walmart, Yahoo, MongoDB, and committers of popular open source projects. He was the chief software architect of a cross- university joint project called RoboBrain, a large-scale computational system that learns from publicly available Internet resources, computer simulations, real-life robot trials, and was listed in the Top 10 Breakthrough technologies by MIT TechReview 2016. He built the first version of Chaos Monkey (later extended to Simian Army) that ran on Netflix production cloud and its open source version is widely adapted by many companies. Prior to that at Yahoo, he worked on Datahighway, a real time data platform that collected and analyzed 300 Billion events with a hardware footprint of 500K nodes to power its news feed recommendation and behavioral advertising models.

Upload: others

Post on 24-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Fairhair.AI: Delivering Outside Insight from billions of ... · Fairhair.AI: Delivering Outside Insight from billions of online sources with the Meltwater Data Science platform”

Special Seminar Thursday, September 28, 2017

10:00 - 11:30 a.m. GHC 4405

“ Fairhair.AI: Delivering Outside Insight from billions of online sources with the Meltwater Data Science platform”

ABSTRACT Meltwater Outside Insight product suite is currently being used by over 25,000 business customers worldwide, including 50% of Fortune 500 companies. Fairhair.AI is the platform behind the scenes responsible for generating all the insights. This presentation covers a high-level overview of Fairhair.AI and its major components: an AI engine for unsupervised full-site web extraction; pipelines for natural language processing analyzing 100’s of millions of new documents everyday across 20 languages; and, last but not the least, a knowledge graph construction that systematically mines facts from all the ingested data. We are democratizing Insight building by opening up our data, our platform, and our interoperable NLP pipelines to developer communities and potential academic collaboration.

Aditya Jami, Senior Director of Engineering, Meltwater

Aditya Jami is the Senior Director of Engineering at Meltwater and oversees all AI and Big Data initiatives by teams comprised of experienced researchers from top universities (Oxford, Stanford, etc.), big data engineers with prior industry experience at Netflix, Walmart, Yahoo, MongoDB, and committers of popular open source projects. He was the chief software architect of a cross-university joint project called RoboBrain, a large-scale computational system that learns from publicly available Internet resources, computer simulations, real-life robot trials, and was listed in the Top 10 Breakthrough technologies by MIT TechReview 2016. He built the first version of Chaos Monkey (later extended to Simian Army) that ran on Netflix production cloud and its open source version is widely adapted by many companies. Prior to that at Yahoo, he worked on Datahighway, a real time data platform that collected and analyzed 300 Billion events with a hardware footprint of 500K nodes to power its news feed recommendation and behavioral advertising models.