Stratio Platform Overview v4.1

Posted on 07-Jul-2015

Category: Engineering

DESCRIPTION

Stratio is a Big Data platform based on Spark. It is 100% open source and enterprise-ready. At Stratio we are pure Spark, since we consider it the only technology on the market able to combine analysis of stored data with real-time streaming data in a single query. We are unique in integrating Spark processing with the main NoSQL databases: Cassandra, MongoDB, Elasticsearch, ...

TRANSCRIPT

SELECT * FROM tweets WHERE lucene = '{
    filter : { type : "range", field : "time", lower : "2014/04/25", upper : "2014/05/1" },
    query  : { type : "phrase", field : "body", values : ["big", "data"] },
    sort   : { fields : [ { field : "retweets", reverse : true } ] }
}';
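
As a hedged illustration, a search clause like the one in this query could be assembled programmatically instead of being written by hand. The sketch below uses only the Python standard library. The helper name build_lucene_query is hypothetical and not part of any Stratio API, the date format is an assumption about how the "time" field is mapped, and the sketch assumes the index accepts standard quoted-key JSON in place of the relaxed syntax shown on the slide. It also rejects a range whose lower bound falls after its upper bound, since such a range matches nothing.

```python
import json
from datetime import datetime

# Assumption: the "time" field uses a year/month/day pattern like the slide's values.
DATE_FORMAT = "%Y/%m/%d"


def build_lucene_query(table, lower, upper, words, sort_field):
    """Return a CQL SELECT string with a JSON search clause (hypothetical helper)."""
    # Guard against an empty range: the lower bound must not come after the upper bound.
    if datetime.strptime(lower, DATE_FORMAT) > datetime.strptime(upper, DATE_FORMAT):
        raise ValueError("lower bound %r is after upper bound %r" % (lower, upper))

    search = {
        "filter": {"type": "range", "field": "time", "lower": lower, "upper": upper},
        "query": {"type": "phrase", "field": "body", "values": list(words)},
        "sort": {"fields": [{"field": sort_field, "reverse": True}]},
    }
    # CQL string literals use single quotes; a literal single quote is escaped by doubling it.
    literal = json.dumps(search).replace("'", "''")
    # Note: the table name is interpolated as-is, so it must come from trusted code.
    return "SELECT * FROM %s WHERE lucene = '%s';" % (table, literal)


if __name__ == "__main__":
    # Illustrative values only.
    print(build_lucene_query("tweets", "2014/04/25", "2014/05/1", ["big", "data"], "retweets"))
```

Building the clause from a dictionary keeps the quoting consistent and makes the range check explicit before the statement ever reaches Cassandra.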

CASSANDRA

Kafka

STRATIO DEEP

readClob, readCSV, readLine, readMultiLine, readAvro, readJson

addCurrentTime, addLocalHost, geoIP, findReplace, split

generateUUID, decompress, if, extractJsonPaths, detectMimeType

xquery, extractURIComponents, xslt, grok (regular expressions)

exec, spooling, SNMP

Kite Software Development Kit
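
To give a rough idea of how commands with names like readLine, addCurrentTime, grok and generateUUID can be chained into an ingestion pipeline, here is a minimal sketch in plain Python. It is not the Kite SDK API, which is Java-based and driven by configuration files. The function names only mirror the command names listed above, and the record layout (a dictionary with a "message" field), the log pattern and the choice to drop non-matching records are all assumptions made for illustration.

```python
import re
import time
import uuid


def read_line(text):
    """Yield one record per non-empty input line (loosely mirrors readLine)."""
    for line in text.splitlines():
        if line.strip():
            yield {"message": line}


def add_current_time(records, field="timestamp"):
    """Attach the current time in milliseconds to each record."""
    for record in records:
        yield dict(record, **{field: int(time.time() * 1000)})


def grok(records, pattern):
    """Copy named regex groups into the record; in this sketch, non-matching records are dropped."""
    regex = re.compile(pattern)
    for record in records:
        match = regex.search(record["message"])
        if match:
            yield dict(record, **match.groupdict())


def generate_uuid(records, field="id"):
    """Attach a random UUID string to each record."""
    for record in records:
        yield dict(record, **{field: str(uuid.uuid4())})


def run_pipeline(text):
    """Chain the steps; each one consumes the previous step's records lazily."""
    records = read_line(text)
    records = add_current_time(records)
    records = grok(records, r"(?P<level>INFO|WARN|ERROR) (?P<body>.*)")
    records = generate_uuid(records)
    return list(records)


if __name__ == "__main__":
    for rec in run_pipeline("INFO started\nnot a log line\nERROR disk full"):
        print(rec)
```

Each step is a generator that takes and yields records, so steps can be reordered or removed without touching the others, which is the general idea behind a command chain of this kind.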