1 the use of scanner data on non-food products [email protected] statistics norway

12
1 The use of scanner data on The use of scanner data on non-food products non-food products [email protected] Statistics Norway

Upload: lawrence-robbins

Post on 20-Jan-2016

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

1

The use of scanner data on The use of scanner data on non-food productsnon-food products

[email protected]

Statistics Norway

Page 2: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

2

SD use in the Division of price statistics

1997

2001

2005

HICP/CPI: First contact with grocery chains

SD from all grocery chains in HICP/CPI

“Full” scale use of SD in HICP/CPI for pharmaceutical products

First use of SD from grocery chains in PPP work

PPP food survey fully based on SD

2003

2009

SD from the first pharmacy chain in HICP/CPI

2010

SD from three pharmacy chains in HICP/CPI

2012

“Full” scale use of SD in HICP/CPI for Food

2013

New calculation method at elementary level for Food

SD from four petrol chains in HICP/CPI

2005 2013

SD from the first petrol chain in HICP/CPI

2003

Page 3: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

SD on non-food products• Non-food products are included in these COICOP groups:

- 0213: Beer

- 0220: Tobacco

- 0454: Solid fuels

- 0531: Major household appliances whether electric or not

- 0540: Glassware, tableware and household utensils

- 0552: Small tools and miscellaneous accessories

- 0561: Non-durable household goods

- 0721: Spare parts and accessories for personal transport equipment

- 0722: Fuels and lubricants for personal transport equipment

- 0931: Games, toys and hobbies

- 0933: Gardens, plants and flowers

- 0934: Pets and related products

- 0952: Newspapers and periodicals

- 0954: Stationery and drawing materials

- 1111: Restaurants, cafes

- 1112: Canteens

- 1213: Other appliances, articles and products for personal care

• SD on non-food products is received from:

- grocery and kiosk chains

- petrol chains

Page 4: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

Current production system of SD on non-food products• Statistics Norway have build an application to deal with non-food products:

– One GTIN matched to one representative item– If a GTIN is missing, then it’s replaced.– An automatically suggest of a replacement according to product group and

turnover.– If the replaced GTIN is of different quality, indirect quality adjustment is

made.

• Weakness:– If turnover is twisted towards other GTINs, re-coding/re-matching should

be done.– Resource demanding to follow changing turnover figures.– If we don’t control the matching over time we may follow

“unrepresentative” GTINs.

• Statistics Norway want to use more of the SD on non-food products.

Page 5: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

ENVA-classification(EAN Norges Varegruppestandard )

• GS1 Norway has developed their own classification system called ENVA (not brick):

- Gives information about which product group an GTIN belongs to.

- Takes into account whether the product is fresh, canned or frozen.

- Is applied only by the grocery chains.

• Advantages with ENVA-classification:

- Secure a mutual understanding about product groups.

- Makes it possible to compare groups from different chains.

Page 6: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

ENVA-classification

• Statistics Norway doesn’t receive ENVA-classification from all the chains:

- The biggest chain since 2001.

- One chain started in May 2015.

- The last chain will be able during 2016.

• Statistics Norway wants to utilize the ENVA-classification to define non-food products at our unofficial COICOP-6 level.

- Mostly a direct link between ENVA-classification and COICOP.

- Try to reduce the use of text searches.

Page 7: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

ENVA-classification

02.2.0 Tobacco

COICOP-5 COICOP-6 COICOP-6 ENVA-group

02.2.0.1 Cigarettes

02.2.0.1 00556 Marlboro' cigarettes 00556 Cigarettes 2835 Cigarettes02.2.0.1 00739 Prince Rounded taste'

02.2.0.2 Cigars

02.2.0.2 00737 Bellmann' Cigarill 00737 Cigars 2834 Cigars

02.2.0.3 Other tobacco products

02.2.0.3 00740 Tiedemanns Gul 3 MIXTURE' 00740 Rolling tobacco 2837 Other tobacco products02.2.0.3 00741 GullSnitt' rolling tobacco02.2.0.3 00743 Oliver Twist' chewing tobacco 00743 Chewing tobacco 2837 Other tobacco products02.2.0.3 00744 General' snuff (loose leaf) 00744 Snuff 2837 Other tobacco products02.2.0.3 00745 Rizzla' cigarette paper, pack 00745 Cigarettes paper 2837 Other tobacco products02.2.0.3 01102 moccaMINT', snuff, portions

Old classification New classification

Page 8: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

ENVA-classificationGTIN Product description ENVA-group ENVA-name

42068037 PARAMOUNT RED 20PK 2835 Cigarettes42068099 PARAMOUNT GOLD 20PK 2835 Cigarettes42221920 LUCKY STRIKE ADDITIVE FREE RED 20PK 2835 Cigarettes42226192 LUCKY STRIKE ADDITIVE FREE BLUE 20PK 2835 Cigarettes57072630 CAMEL SILVER 20PK 2835 Cigarettes

57076461 CAMEL DOUBLE MINT & GREEN 20 20PK 2835 Cigarettes57076461 CAMEL DOUBLE MINT & GREEN 20PK 2835 Cigarettes59040200 PRINCE CLICK & REFRESH BLUE 20PK 2835 Cigarettes59045175 PRINCE BLACK CLICK & REFRESH 20PK 2835 Cigarettes59047681 PRINCE MENTHOL CLICK & BOOST 20STK 2835 Cigarettes59048152 LUCKY STRIKE CLICK & ROLL 20STK 2835 Cigarettes70114706 PRINCE RICH TASTE 20PK 2835 Cigarettes70114720 PRINCE RICH TASTE 100'S 20PK 2835 Cigarettes70114737 PRINCE ROUNDED TASTE 20PK 2835 Cigarettes70114751 PRINCE ROUNDED TASTE 100'S 20PK 2835 Cigarettes70114768 PRINCE GOLDEN TASTE 20PK 2835 Cigarettes70114782 PRINCE GOLDEN TASTE 100'S 20PK 2835 Cigarettes70114805 PRINCE MENTHOL TASTE 20PK 2835 Cigarettes73101635 BENSON & HEDGES GOLD 20PK 2835 Cigarettes73103479 MARLBORO SILVER 20PK 2835 Cigarettes

73113966 CAMEL NATURAL FLAVOR 20PK 2835 Cigarettes76124457 KENT ORIGINAL 20PK 2835 Cigarettes76124464 KENT ORIGINAL 100'S 20PK 2835 Cigarettes76124471 KENT MENTHOL GREEN 20PK 2835 Cigarettes76124495 KENT WHITE INFINA 20PK 2835 Cigarettes76149108 CAMEL FILTER 20PK 2835 Cigarettes76149122 CAMEL BLUE 20PK 2835 Cigarettes

Page 9: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

ENVA-classificationGTIN Product description ENVA-group ENVA-name

000030105034 VEC.COL SHOCK E.NAVY MAYBELLINE 3453 MASCARA000030105409 BIG EYES VERY BLACK MAYBELLINE 3453 MASCARA000030105416 BIG EYES BROWNISH MAYBELLINE 3453 MASCARA000030108387 COLOSSAL VERY BLACK MAYBELLINE 3453 MASCARA000030108493 COLOSSAL VERY BLACK WP MAYBELLINE 3453 MASCARA000030114319 COLOSSAL L.BLACK MAYBELLINE 3453 MASCARA

3600522050438 VOLUMINOUS LEBLACK LOREAL 3453 MASCARA3600522218791 EVC.MASC MILLION LASH EXCESS LOREAL3453 MASCARA3600522332107 FALSE L. WINGS BLACK LOREAL 3453 MASCARA3600522367635 FALSE L. WINGS WP LOREAL 3453 MASCARA3600522387459 VO.CO. MASCARA M.MANGA MAYBELLINE3453 MASCARA3600522616252 MILL.LASH SO COUTURE LOREAL 3453 MASCARA3600522745815 F.LASH.WINGS I.BLACK LOREAL 3453 MASCARA3600530164158 V.EXPRESS TURBO BOOST LOREAL 3453 MASCARA3600530775279 ILLEGAL LENG. BLACK MAYBELLINE 3453 MASCARA3600530775286 ILLEGAL LENG. BROWN MAYBELLINE 3453 MASCARA3600530775774 ILLEGAL LENG. WTP MAYBELLINE 3453 MASCARA

3600530880713 ILLG. LENGHT GL. BLACK MAYBELLINE 3453 MASCARA3600530910946 BROW MASC. ME.BROWN MAYBELLINE 3453 MASCARA3600530910960 BROW MASC. DA.BROWN MAYBELLINE 3453 MASCARA3600531044848 BIG EYES REBEL BLACK MAYBELLINE 3453 MASCARA4084200857103 GR.LASH VERY BLACK MAYBELLINE 3453 MASCARA5021044013332 V.EXPRESS BLACK LOREAL 3453 MASCARA5021044013455 V.EXPRESS BLACK WATERPROOF LOREAL 3453 MASCARA

000030074576 V.EXPR COLO BLACK MAYBELLINE 3453 MASCARA000030079236 V.EXPR COLO WTP MAYBELLINE 3453 MASCARA000030079847 V.EXPR COLO 100% BLACK MAYBELLINE 3453 MASCARA000030086272 V.EXP..FALS BLACK LOREAL 3453 MASCARA

Page 10: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

ENVA-classification

05.6.1.2 Other non-durable household articles05.6.1.2_00148 Taper candle, 10 pack

05.6.1.2 Other non-durable household articles

05.6.1.2_00148 CandleCOICOP-7 ENVA-group ENVA-name05.6.1.2_00148_1 Candle rings 3340 Candle rings05.6.1.2_00148_2 Antique candle 3344 Antique candle05.6.1.2_00148_3 Outdoor candle 3345 Outdoor candle05.6.1.2_00148_4 Cake candle 3346 Cake candle05.6.1.2_00148_5 Block candle 3347 Block candle05.6.1.2_00148_6 Tealight 3348 Tealight05.6.1.2_00148_7 Taper candle 3349 Taper candle05.6.1.2_00148_8 Chandelier candle 4664 Chandelier candle05.6.1.2_00148_9 Single candle 4665 Single candle05.6.1.2_00148_10 Globe candle 4666 Globe candle05.6.1.2_00148_11 Scented candle 4667 Scented candle

Page 11: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

The new production system for SD on non-food products • An automatically routine that connect the GTIN with COICOP-6 groups for

non-food products each month.

- Mostly directly linking between new GTIN and COICOP through the ENVA-classification. (grocery chains)

- Directly inking between new GTIN and COICOP through chains’ own classification. (petrol chains)

- Text searches in some COICOP groups.

- Manual checks of the automatically linking.

• Make a basket of non-food products in December each year.

• Calculates unweighted Jevons indices and the percentage changes from basis month, for COICOP-6 groups.

• If the replaced GTIN is of different quality, indirect quality adjustment must be made.

• If no replacement is done, then impute missing prices.

Page 12: 1 The use of scanner data on non-food products Pia.Ronnevik@ssb.no Statistics Norway

The challenge with non-food products

• In some COICOP groups we have different data sources:

- Scanner data

- Questionnaires filled out by the stores

(- Web scraping)

• How to combine different sources?

- One index for each data source?

• How to weight these indices together?

- Turnover?

- Other sources?