tivit foresight seminar 2010: from data to intelligence
FROM DATA TO INTELLIGENCE SRA - PROPOSAL
Pauli Kuosmanen
CTO
Tivit
4.10.2010
DATA AVALANCHE IS REALITY
FINNISH ICT “COMPETENCE BUILDING”
Telecom
Computational
Basic competences
Business
Cloud Software
Next Media
Services in RTE
Future Internet
From Data to Intelligence
Device and Interoperability
DATA RESERVES
2009 2010
ICT
NOT ONLY DATA VOLUMES GROW
• Information complexity is also increasing greatly
• Most data (and data constructs) cannot be comprehended by humans directly
• We need development in data mining, knowledge discovery in databases, data understanding technologies, hyperdimensional visualization, AI/Machine-assisted discovery, data fusion, …
DATA IS HETEROGENEOUS
BIG DATA
• The term big data, from software engineering and computer science, describes datasets that grow so large that they become awkward to work with using on-hand database management tools (Wikipedia)
• Data size is a moving target; current limits are on the order of terabytes, exabytes and zettabytes of data
• It seems like big data is also big business
– Companies are using their data assets to aim their products and services with increasing precision
• A paper by the late Jim Gray of Microsoft says that, compared to the cost of moving bytes around, everything else is free. That also applies to data processing.
• So let's work toward two parallel goals:
1. To have a large number of the world's big data reserves stored in Finland
2. To become the most skilled at digging deep and getting our hands dirty in the data in order to generate new value
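The point about the cost of moving bytes can be made concrete with a back-of-envelope calculation. The sketch below is only an illustration: the dataset sizes, the 1 Gbit/s link speed and the helper name `transfer_days` are assumptions chosen for the example, not figures from the talk or from Jim Gray's paper.

```python
# Back-of-envelope sketch: how long does it take just to move a dataset?
# Sizes and link speed are illustrative assumptions, not figures from the talk.

def transfer_days(size_bytes: float, link_bits_per_second: float) -> float:
    """Return the days needed to move size_bytes over a fully used link."""
    seconds = size_bytes * 8 / link_bits_per_second
    return seconds / 86_400  # 86,400 seconds in a day

TERABYTE = 10**12
PETABYTE = 10**15
GIGABIT_PER_SECOND = 10**9  # assumed dedicated link

for label, size in [("1 TB", TERABYTE), ("1 PB", PETABYTE)]:
    days = transfer_days(size, GIGABIT_PER_SECOND)
    print(f"{label} over 1 Gbit/s: {days:.2f} days")
```

Under these assumptions a petabyte needs roughly three months of uninterrupted transfer, which is one way to read the argument for processing data close to where it is stored.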
INTELLIGENCE EXTRACTION PROCESS
Data Gathering
Data Farming:
Storage, Archiving, Indexing, Metadata, Ontologies, Data Fusion, Interoperability, …
Data Mining:
Pattern or correlation search, clustering analysis, automated classification, outlier / anomaly searches, hyperdimensional visualization, association rule learning, …
Data Understanding
New Knowledge
New Intelligence
Technical Challenges
Methodological Challenges
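As a small illustration of one item in the data mining step, the outlier / anomaly search, here is a minimal sketch using a z-score rule. The z-score rule, the threshold of 2.0, the function name `zscore_outliers` and the sample readings are all assumptions made for the example; the slides do not prescribe any particular method.

```python
from statistics import mean, pstdev

def zscore_outliers(values, threshold=2.0):
    """Return values lying more than `threshold` standard deviations from the mean."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        return []  # all values identical, nothing can stand out
    return [v for v in values if abs(v - mu) / sigma > threshold]

# Hypothetical sensor readings; one value is clearly off.
readings = [10.1, 9.8, 10.3, 10.0, 9.9, 10.2, 25.0, 10.1]
print(zscore_outliers(readings))
```

A rule this simple breaks down on heterogeneous or high-dimensional data, since a single extreme value also inflates the mean and standard deviation it is judged against.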
CONCLUSION OF THE EVALUATION
BUT, BUT ...
• Show me the money
• Show me the start-ups
• We have both, I know, but we can do a lot better!
NEW SRA
• To be launched if wide enough interest is shown, especially from companies
• Logica has promised to be the Steering Company
• Contact:
– Jukka Ahtikari
– Development Director
– Karvaamokuja 2, 00381 Helsinki
– +358 40 844 9322
CONTENT OF THE SRA (1/2)
Data Gathering
Data Farming:
Storage, Archiving, Indexing, Metadata, Ontologies, Data Fusion, Interoperability, …
Data Mining:
Pattern or correlation search, Clustering analysis, Automated classification, Outlier / anomaly detection, Hyperdimensional visualization, Association rule learning, …
Data Understanding
New Knowledge
New Intelligence
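Association rule learning, one of the data mining techniques listed above, rests on two simple measures: support and confidence. The sketch below computes both on a toy set of shopping baskets. The basket data and the function names are hypothetical and serve only as an illustration.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Estimate how often the consequent appears when the antecedent does."""
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0  # antecedent never occurs, so the rule has no evidence
    return support(transactions, set(antecedent) | set(consequent)) / base

# Hypothetical shopping baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
print(support(baskets, {"bread", "milk"}))       # 2 of 4 baskets
print(confidence(baskets, {"bread"}, {"milk"}))  # 2 of the 3 bread baskets
```

Rule-mining algorithms mostly differ in how they avoid enumerating every candidate itemset; the two measures themselves stay the same.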
CONTENT OF THE SRA (2/2)
• Several application fields
– Governmental data
– Traffic data
– Genomic data
– Video data
– Teleoperator data
– Personal lifelong data
– Health data
– Environmental data
– Master data
– ...