
Discussion on Time Series Data

Group Name: WG2
Source: Qi Yu, Mitch Tseng - Huawei Technologies, Co. LTD.
Meeting Date: 2015-07-01
Work Item: WI-0033


Introduction

• The definition of Time Series Data in TS-0011:

Time series data is a sequence of data points, typically consisting of successive measurements made over a time interval.

• The following two requirements have been agreed, and the related WI-0033 "Supporting Time Series Data" was agreed at TP#17 as well.
– The oneM2M System shall be able to collect, store time series data.
– The oneM2M System shall be able to detect and report the missing data in time series.
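The second requirement can be illustrated with a minimal sketch. It assumes the reporting period is known in advance and that received timestamps are already sorted; the function name `find_missing` and the sample times are hypothetical and not part of any oneM2M specification.

```python
from datetime import datetime, timedelta

def find_missing(timestamps, period, tolerance=timedelta(0)):
    """Return the expected report times for which no data point was received.

    timestamps: sorted list of datetime values of the received data points
    period:     expected interval between consecutive points (assumed known)
    tolerance:  allowed lateness before a point counts as missing
    """
    missing = []
    for prev, curr in zip(timestamps, timestamps[1:]):
        expected = prev + period
        # Walk forward one period at a time until we reach the next real point.
        while curr - expected > tolerance:
            missing.append(expected)
            expected += period
    return missing

# Hypothetical example: reports expected every minute, 08:03 and 08:04 are absent.
start = datetime(2015, 7, 1, 8, 0)
received = [start + timedelta(minutes=m) for m in (0, 1, 2, 5, 6)]
gaps = find_missing(received, timedelta(minutes=1))
print([g.strftime("%H:%M") for g in gaps])  # ['08:03', '08:04']
```

How missing points would actually be reported (for example through a notification) is left open here, as the slides do not specify it.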

© 2015 oneM2M Partners<Document number>



Introduction

• Time series data exists widely across many verticals.
– Periodic monitoring data from a product line in the industrial domain
– Heartbeat and blood pressure reported periodically in eHealth
– Location information of a car reported in ITS

• The characteristics of time series data: (v1,t1),(v2,t2),(v3,t3)……
– Chronological
– Dynamic
– Unbounded (the series grows without end)
– In most cases, a small amount of data in each occurrence


Use of Time Series Data

Example: Collecting location data of taxis in Beijing.

Each taxi should report its location information to the platform once every minute. This information includes the GPS data and the time when the data was generated. The taxi company can use these data for analyses such as the following:

• Average miles the taxi is driven daily or monthly.
• Historical location track of the taxi in a specific month.
• ……

The platform should store the data; however, how the data are used depends on the requirements of the application.
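As an illustration of the first analysis, the sketch below sums the great-circle distance between consecutive GPS reports per calendar day. It is only one possible approach: the haversine formula, the tuple layout `(time, latitude, longitude)`, and the function names are assumptions made for this example, and distances are in kilometres rather than miles.

```python
import math
from collections import defaultdict
from datetime import datetime

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, an approximation

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def daily_distance_km(points):
    """Sum distances between consecutive reports that fall on the same day.

    points: list of (datetime, lat, lon) tuples sorted by time (hypothetical layout).
    """
    totals = defaultdict(float)
    for (t1, la1, lo1), (t2, la2, lo2) in zip(points, points[1:]):
        if t1.date() == t2.date():
            totals[t1.date()] += haversine_km(la1, lo1, la2, lo2)
    return dict(totals)

# Hypothetical one-minute reports from a single taxi.
track = [
    (datetime(2015, 7, 1, 8, 0), 39.9000, 116.4000),
    (datetime(2015, 7, 1, 8, 1), 39.9050, 116.4000),
    (datetime(2015, 7, 1, 8, 2), 39.9100, 116.4000),
]
print(daily_distance_km(track))
```

A monthly average would then be the mean of the daily totals; which analysis is run remains the application's decision, as stated above.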


Issues


Figure: structure of the <container> and <contentInstance> resources (multiplicity in parentheses)

<container>
    creator (0..1)
    maxNrOfInstances (0..1)
    maxByteSize (0..1)
    maxInstanceAge (0..1)
    currentNrOfInstances (1)
    currentByteSize (1)
    locationID (0..1)
    ontologyRef (0..1)
    <latest> (1)
    <oldest> (1)
    <contentInstance> (0..n)
    <subscription> (0..n)
    <container> (0..n)
    <semanticDescriptor> (0..n)

<contentInstance>
    contentSize (1)
    ontologyRef (0..1)
    content (1)
    contentInfo (0..1)
    creator (0..1)
    semanticDescriptor (0..n)

1. Time series data need to be updated whenever new data arrive. However, a <contentInstance> cannot be modified once created.

2. The current <content> cannot reflect the time when the data were collected by the device. Whenever new data arrive, a new contentInstance resource needs to be created, which adds data redundancy in attributes such as resourceType, parentID….

Attributes of <contentInstance> | Multiplicity
resourceType | 1
resourceID | 1
resourceName | 1
parentID | 1
labels | 0..1 (L)
expirationTime | 1
creationTime | 1
lastModifiedTime | 1
stateTag | 1
announceTo | 0..1 (L)
announcedAttribute | 0..1 (L)
creator | 0..1
contentInfo | 0..1
contentSize | 1
ontologyRef | 0..1
content | 1


Issues

• Location information (GPS) of Beijing's taxis

There were nearly 67,000 taxis as of 2014. In general, each taxi generates one GPS data point every minute, so the total number of data transactions approaches 100 million per day (67000*24*60 ≈ 96.5 million).

– On the order of 100 million <contentInstance> resources have to be created per day. That is, we must organize and manage these new resources in addition to a huge amount of historical data resources.

– At least 4 attributes (resourceType, resourceName, parentID, stateTag) are also created for each location value in a <contentInstance> resource. This causes large data redundancy.

– Another issue to consider is constrained devices, which have limited resources.
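The estimate above can be checked with a few lines of arithmetic. The figures (67,000 taxis, one report per minute, at least 4 redundant attributes per resource) come from the slide; the variable names are only illustrative.

```python
TAXIS = 67_000                 # nearly 67,000 taxis (figure from the slide)
REPORTS_PER_DAY = 24 * 60      # one GPS report per taxi per minute
REDUNDANT_ATTRS = 4            # resourceType, resourceName, parentID, stateTag

# One <contentInstance> resource is created per report.
resources_per_day = TAXIS * REPORTS_PER_DAY
print(resources_per_day)       # 96480000, i.e. about 96.5 million

# Lower bound on attribute values stored in addition to the location itself.
redundant_values_per_day = resources_per_day * REDUNDANT_ATTRS
print(redundant_values_per_day)  # 385920000
```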


Option 1


Option 1 is to define a new resource for time series data. Its list attribute stores the data together with the time when the device generated the data.

The list is a complex data type. It stores a chronological series of arrays, each of which contains the data and the time when the data was produced, for example product line monitoring data:

(value1, time1),(value2, time2),(value3, time3),(value4, time4),.....


Option 1 (cont.)

Message flow between Originator and Receiver:

1. Originator to Receiver: Update Request (list=(valueN, timeN))

2. The Receiver shall insert this array into the list attribute based on time.

3. Receiver to Originator: Update Response

Resulting list: (value1, time1)(value2, time2)………(valueN, timeN)

Newly arriving time series data should be appended or inserted into the list attribute based on its time information.

The existing UPDATE operation allows modifying attributes or creating previously non-existing attributes of the resource type, and it replaces the attribute as a whole. Option 1 therefore requires changing the existing UPDATE operation to support inserting an array into the list.
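The Receiver behaviour in step 2 could look roughly like the sketch below. It assumes the list attribute is held in memory as Python tuples `(value, time)` with comparable time values; `insert_by_time` is a hypothetical helper, not an operation defined by oneM2M.

```python
import bisect

def insert_by_time(series, value, time):
    """Insert (value, time) into `series`, keeping it ordered by time.

    series: list of (value, time) tuples already sorted by time.
    Points with an equal time are placed after the existing ones.
    """
    times = [t for _, t in series]          # extract the sort keys
    idx = bisect.bisect_right(times, time)  # position that preserves ordering
    series.insert(idx, (value, time))
    return series

# Hypothetical list attribute with a late-arriving point for time 2.
lst = [(10, 1), (30, 3)]
insert_by_time(lst, 20, 2)   # inserted in the middle
insert_by_time(lst, 40, 4)   # appended at the end
print(lst)  # [(10, 1), (20, 2), (30, 3), (40, 4)]
```

Unlike the whole-attribute replacement of the existing UPDATE operation, only the new array is carried in the request, which is the behavioural change Option 1 would require.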


Option 2

Option 2 is to use the existing contentInstance resource and add a new attribute. The contentGenerationTime attribute is the time when the corresponding content was generated by the M2M Node.

Figure: <contentInstance> with the new attribute (multiplicity in parentheses)

<contentInstance>
    contentSize (1)
    ontologyRef (0..1)
    content (1)
    contentInfo (0..1)
    creator (0..1)
    semanticDescriptor (0..n)
    contentGenerationTime (0..1)

Each newly arriving time series data point leads to the creation of a new contentInstance, which causes redundancy.


Option 3

Option 3 is to define a simple resource type <timeSeriesData> which includes the data, the time, and the necessary attributes. Other necessary attributes, such as ACP and labels, refer to the parent resource.

Attributes of <timeSeriesData> | Multiplicity | RW/RO/WO | Description | <timeSeriesDataAnnc> Attributes
resourceType | 1 | RO | See clause 9.6.1.3 where this common attribute is described. | NA
resourceID | 1 | RO | See clause 9.6.1.3 where this common attribute is described. | NA
resourceName | 1 | WO | See clause 9.6.1.3 where this common attribute is described. | MA
parentID | 1 | RO | See clause 9.6.1.3 where this common attribute is described. | NA
creationTime | 1 | RO | See clause 9.6.1.3 where this common attribute is described. | NA
Time | 1 | WO | This attribute contains the time when the data was collected from the M2M Node. | OA
Data | 1 | WO | This attribute contains the data collected from the M2M Node. | OA

Each newly arriving time series data point leads to the creation of a new timeSeriesData resource.
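To make the reduced attribute set concrete, the proposed resource can be modelled as a small immutable record. This is only an illustrative sketch: the field names follow the attribute table for <timeSeriesData>, while the Python types and all sample values are assumptions.

```python
from dataclasses import dataclass, fields

@dataclass(frozen=True)  # frozen: the record cannot be modified after creation
class TimeSeriesData:
    resourceType: str   # RO, common attribute
    resourceID: str     # RO, common attribute
    resourceName: str   # WO, common attribute
    parentID: str       # RO, common attribute
    creationTime: str   # RO, time the resource was created on the platform
    Time: str           # WO, time the data was collected from the M2M Node
    Data: str           # WO, the data collected from the M2M Node

# Hypothetical instance; every value here is made up for illustration.
point = TimeSeriesData(
    resourceType="timeSeriesData",
    resourceID="tsd-0001",
    resourceName="taxi-location-0001",
    parentID="taxi-42",
    creationTime="20150701T080005",
    Time="20150701T080000",
    Data="39.9000,116.4000",
)
attribute_count = len(fields(TimeSeriesData))
print(attribute_count)  # 7
```

Keeping Time separate from creationTime preserves the moment of measurement even when a report reaches the platform late.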


Comparison of Solutions


Option | Analysis
Option 1 | A new resource is defined, but the existing UPDATE operation needs to be changed.
Option 2 | A new attribute is defined. Each new time series data point leads to the creation of a new contentInstance resource. Option 2 causes data redundancy, which is a concern especially for constrained devices with limited resources.
Option 3 | A new resource with limited attributes is defined.


Way Forward

• Based on the comparison of the solutions, define a new resource type with limited attributes to support time series data.


Thanks
