stratio platform overview v4.1
DESCRIPTION
Stratio is a Big Data platform based on Spark. It is 100% open source and enterprise - ready. In Stratio we are Pure Spark, since it is the only technology in the market able to combine stored data analyses with real time streaming data, all in the same query. We are unique in integrating Spark processing with the main NoSql databases: Cassandra, MongoDB, ElasticSearch, ...TRANSCRIPT
•
•
•
•
•
•
•
•
•
•
•
SELECT * FROM tweets WHERE lucene=
'{
filter :
{
type : "range",
field : "time",
lower : "2014/04/25",
upper : "2014/04/1"
},
query :
{
type : "phrase",
field : "body",
values : ["big", "data"]
},
sort :
{
fields: [ {field:"retweets”, reverse:true} ]
}
}';
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
CASSANDRA
Kafka
STRATIO DEEP
STRATIO DEEP
•
•
•
•
•
•
•
readClobreadCSVreadLinereadMultiLinereadAvroreadJson
addCurrentTimeaddLocalHostgeoIPfindReplaceSplit
generateUUIDdecompressIfextractJsonPathsdetectMimeType
xqueryextractURIComponentsxsltGrok (regular expressions)
exec
spooling SNMP
Kite SoftwareDevelopment Kit