svenska yle metadata and data first

28
Data first and linked data at the Swedish speaking Yle Mikael Hindsberg, concept developer svenska.yle.fi @mickhinds | [email protected] 27.5.2015 Background image: CC BY-SA http://commons.wikimedia.org/wiki/User:Mschel

Upload: micke-hindsberg

Post on 28-Jul-2015

85 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Svenska Yle metadata and data first

Data first and linked data at the Swedish speaking Yle

Mikael Hindsberg, concept developer svenska.yle.fi @mickhinds | [email protected]

27.5.2015Background image: CC BY-SA http://commons.wikimedia.org/wiki/User:Mschel

Page 2: Svenska Yle metadata and data first

Linked data

We now link content at Svenska.yle.fi over:

• Organizational borders

• Content managment systems

• Different languages

• Different media types (text/video/audio)

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Page 3: Svenska Yle metadata and data first

Linked data

WHY?

and

HOW?

… have we done this

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

"I Wonder". CC BY - http://commons.wikimedia.org/wiki/File:I_Wonder.jpg#mediaviewer/File:I_Wonder.jpg

Page 4: Svenska Yle metadata and data first

Organization

Svenska Yle is a miniature of Yle

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

• We are a small agile unit who can pilot many things within the company

Page 5: Svenska Yle metadata and data first

Organization

One tv-channel and two radio channels- And the web

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Page 6: Svenska Yle metadata and data first

Big disruption in digitalizing media

Media convergence

The audience is changing the way they consume media – FAST!

Page 7: Svenska Yle metadata and data first

How to handel this?

Unified platform, Drupal7, 2012

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

News, current affairs, sports, entertainment, lifestyle, recipes, health, science, debate – all in one hierarchically flat platform.

MUST ALSO BE UNIFORM AS DATA!

Page 8: Svenska Yle metadata and data first

Referer trafic

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

- Machine readability- Semantic metadata - Linked data- Open data- SEO

The content must be able to live independently from the platform!

Page 9: Svenska Yle metadata and data first

Content – i.e. articles – rule!

60% of the trafic starts from an article, 29% from the main page. 11% covers everything else!

Page 10: Svenska Yle metadata and data first

Mobile over 50%

Week Desktop Mobile Tablet21/2014 57 % 27 % 16 %21/2015 46 % 37 % 17%

Page 11: Svenska Yle metadata and data first

Metadata is the key

• Content, platform and distribution are exploding in diversity.

• We need to be able to serve the web with our content as data.

• Semantically rich ontologies with public URI’s• We use Finto (Finnish ontology and thesauri service) and

Freebase >> Wikidata ca. Aug. 2015• Linked data

• Map relation’s between content to link them together • Data graph

• This graph gives structure, recommendations, search engine optimization, new knowledge, global intercompability

= Data first

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Page 12: Svenska Yle metadata and data first

The components• Finnish ontology and thesauri library

(FINTO) www.finto.fi • Frebase www.freebase.com • Drupal module for annotation:

https://www.drupal.org/project/yild

• + journalists do the base annotation• complemented with automatic annotation

• New module can utilize wikipedia, geonames > almost all open metadata repositories: YILD – Yle Integrator for Linked Data https://www.drupal.org/project/yild

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Page 13: Svenska Yle metadata and data first

YILDYle Integrator for Linked Datahttps://www.drupal.org/project/yild + also check out PoolParty-extension https://www.drupal.org/project/yild_poolparty

Page 14: Svenska Yle metadata and data first

Yle-API• On top of our metadata sits an

Yle API-layer (API = application programming interface)

• Meta-API ties together all Yle-metadata to a graph

• API-calls in JSON(-LD)• Compatible with Schema.org, EBUCore och Dublin

Coren

• http://developer.yle.fi/tutorials.html

CC BY-SA http://commons.wikimedia.org/wiki/User:Pbroks13

Page 15: Svenska Yle metadata and data first

Roadmap for Yle API

SYND/FYND DRUPAL 7

Programs API

Articles API

Meta API

Image API

IMS

Arena API

Login API

Weather API

NewsGuard

ProgramGuide

Finto

FreebaseScores

APIArpa

Metrics API

Page 16: Svenska Yle metadata and data first

The opening of Yle-APIWe have taken the first steps to start opening up our API’s by publishing the Programmes API.

When the Articles- and meta-API’s are opened 3rd party developers can build own versions of most of our services

• http://developer.yle.fi/tutorials.html

Page 17: Svenska Yle metadata and data first

Annotation – journalists vs. algorithms

Journalists:

+ abstraction+ logic

- inconsistancy in both quality and quantity- poor attention to detail

Algortihms:

+ attention to detail+ consistancy

- lack of human logic- lack of languag knowledge - idioms- great sense of detail

Page 18: Svenska Yle metadata and data first

Uutisvahti – mobile applikationNews application for pushing news stories through metadata >> http://yle.fi/uutisvahti/

Page 19: Svenska Yle metadata and data first

Linked data

We can now link content (data) over borders of:

• Organization

• Publishing systems (CMS)

• Language (language neutral, works with Finnish, Swedish and English)

• Media type (text, video, audio, images)

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Page 20: Svenska Yle metadata and data first

Example

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Linking over language

Termsidor

Swedis

h

Finnish

Page 21: Svenska Yle metadata and data first

Example Recomendations

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Page 22: Svenska Yle metadata and data first

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

• Based on semantic tags• Easy to add more

attributes like metrics• Can be improved by

algorithms, like cos-similarity

• Must be careful to not make too exact recommendations > boring

• Serendipitet, show the audience what they didn’t know they wanted to know

RecomendationsExample

Page 23: Svenska Yle metadata and data first

Linking different media types new information from the graph

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

Example

Page 24: Svenska Yle metadata and data first

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel

How?It’s really quite simple Below note to developer who built it.

Page 25: Svenska Yle metadata and data first

ExampleMedia recommendation, when the article was new:

Page 26: Svenska Yle metadata and data first

ExampleThe same content after 3 months. The graph lives!

Page 27: Svenska Yle metadata and data first

Hur?Still demands quite complicted api-calls must optimize

Page 28: Svenska Yle metadata and data first

Thank you! Questions?

Mikael ’Micke’ Hindsberg

twitter.com/mickhinds

svenska.yle.fi utveckling.ylebloggen.fi

www.slideshare.net/mickhinds

Background image: CC BY SA http://commons.wikimedia.org/wiki/User:Mschel