rNews: Embedded Data for the News Industry, by Evan Sandhaus
DESCRIPTION
From SMX East 2013 - Structured Data Superstars - rNews Embedded Data for the News Industry by Evan Sandhaus of the NY Times
TRANSCRIPT
rNews: EMBEDDED DATA FOR THE NEWS INDUSTRY
Evan Sandhaus
NYTimes
@kansandhaus
#SMX #22C
October 2, 2013
Agenda
Why we need rNews
Intro to rNews
Benefits of rNews
Road to rNews
Schema.org integration
Adoption
NYT Case Study
Discussion
...And 50 Others
Why we need Semantic Markup
The Burning Question
STORY
PHOTO
Story components which are obvious to a person…
STORY
PHOTO
...are not so obvious to a machine.
The Problem Of Structured Data: Continued
Label     Type    Value
id        number  1248069162607
Headline  text    New Web Code Draws Concern...
Byline    text    By TANZINA VEGA
Date      date    20101010
Body      text    In the next few years, a powerful...
Length    number  1123
Tag       text    Privacy
Tag       text    Computers and the Internet
Tag       text    Web Browsers
<html> <head> <title> New Web Code Draws Concern... </title> </head>
<body>
<div> New Web Code Draws Concern... </div>
<div> By TANZINA VEGA </div>
<div> October 10, 2010 </div>
<div> In the next few years, a powerful... </div>
</body> </html>
Data Tier, Logic Tier, Display Tier
Content is very well structured on the Data Tier, but all of this
structure is lost in translation to the Display Tier.
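A minimal sketch of how that loss happens, assuming a hypothetical record dict and a hypothetical render_display_tier function (neither is from the talk; only the field values come from the slide). The typed, labeled record goes in, and unlabeled <div>s come out:

```python
from datetime import datetime
from html import escape

# Hypothetical data-tier record, using the field values shown on the slide.
record = {
    "id": 1248069162607,
    "headline": "New Web Code Draws Concern...",
    "byline": "By TANZINA VEGA",
    "date": "20101010",
    "body": "In the next few years, a powerful...",
    "length": 1123,
    "tags": ["Privacy", "Computers and the Internet", "Web Browsers"],
}


def format_date(yyyymmdd):
    """Turn the stored date (YYYYMMDD) into the human-readable display form."""
    parsed = datetime.strptime(yyyymmdd, "%Y%m%d")
    return f"{parsed:%B} {parsed.day}, {parsed.year}"


def render_display_tier(rec):
    """Render the record as anonymous <div>s, as in the slide's page markup."""
    visible = [rec["headline"], rec["byline"], format_date(rec["date"]), rec["body"]]
    divs = "".join(f"<div>{escape(text)}</div>" for text in visible)
    title = escape(rec["headline"])
    return f"<html><head><title>{title}</title></head><body>{divs}</body></html>"


page = render_display_tier(record)
print(page)
```

Nothing in the resulting page says which <div> is the headline, the byline, or the date, and the id, length, and tags never reach the page at all.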
The Problem Of Structured Data: Continued
<html> <head> <title> New Web Code Draws Concern... </title> </head>
<body>
<div> New Web Code Draws Concern... </div>
<div> By TANZINA VEGA </div>
<div> October 10, 2010 </div>
<div> In the next few years, a powerful... </div>
</body> </html>
Display Tier = ?
Search engines, social
networks, aggregators and
other sites only see the
Display Tier, and cannot
leverage the underlying
structure of the data.
The Problem Of Structured Data: Continued
Without structured data, search engines, social
networks and other sites cannot attractively format
links back to our site, potentially decreasing
referral traffic.
With Structured
Data
No Structured
Data
The Case of the Missing Structured Data
Semantic Markup Standards
Microformats: First, Simple, Rigid
RDFa: Official, Complex, OpenGraph
Microdata: Unofficial, Flexible, Schema.org
JSON: Official, Developers, External
rNews
rNews Defined
rNews is a data model for embedding
machine-readable publishing metadata in
web documents and a set of suggested
implementations.
slightly shorter
rNews is a data model
for embedding machine-readable publishing
metadata in web documents
Headline
Byline
Tags
Creator
...
and a set of suggested implementations
RDFa: Today
Microdata: Today
JSON: Maybe?
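The slide lists a JSON implementation only as a possibility. As a rough sketch of what one might look like, the snippet below assumes JSON-LD style "@context" and "@type" keys (an assumption, not something the slides specify) and a hypothetical to_json_article helper. The property names and sample values are taken from the Microdata working example in this deck:

```python
import json


def to_json_article(headline, date_created, language, word_count):
    """Build a JSON-serializable dict using schema.org NewsArticle property names.

    The "@context"/"@type" keys follow JSON-LD conventions; that choice is an
    assumption made for illustration only.
    """
    return {
        "@context": "http://schema.org",
        "@type": "NewsArticle",
        "headline": headline,
        "dateCreated": date_created,
        "inLanguage": language,
        "wordCount": word_count,
    }


article = to_json_article(
    "Allies Are Split on Goal and Exit Strategy in Libya",
    "2011-03-23",
    "en-US",
    879,
)
print(json.dumps(article, indent=2))
```

The point of the sketch is that the data model stays the same across serializations; only the syntax that carries the property names changes.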
rNews - Class Diagram
rNews - Working Example
HTML5 Microdata
<!DOCTYPE HTML>
<html itemscope itemtype="http://schema.org/NewsArticle">
<head>
  <style type="text/css">@import url(css/iptc_times2.css);</style>
  <meta itemprop="dateCreated" content="2011-03-23"/>
  <meta itemprop="description" content="The questions about the command..."/>
  <meta itemprop="inLanguage" content="en-US"/>
  <meta itemprop="thumbnailUrl" content="http://graphics8.nytimes.com/images/common/icons/t_wb_75.gif"/>
  <meta itemprop="genre" content="Current"/>
  <meta itemprop="id" content="1248069687395"/>
  <meta itemprop="version" content="2"/>
  <meta itemprop="publishingPrinciples" content="http://www.nytco.com/press/ethics.html"/>
  <meta itemprop="wordCount" content="879"/>
</head>
<body>
  <div style="height:900px" class="article">
    <div class="a_column">
      <div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
      <div itemprop="alternativeHeadline" class="rider">NATO Takes Command</div>
      <div itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject">
        <img itemprop="URL" class="image" src="img/libya_sample_reuters.jpg"/>
        <div class="image_credit">Credit:
          <span itemprop="creator" itemscope itemtype="http://schema.org/Person">
            <span itemprop="name">Goran Tomasevic</span>
          </span>
          /
          <span itemprop="sourceOrganization" itemscope itemtype="http://schema.org/Organization">
            <span itemprop="name">Reuters</span>
            <meta itemprop="tickerSymbol" content="NYSE TRI"/>
          </span>
        </div>
      </div>
      <!-- ... -->
    </div>
  </div>
</body>
</html>
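To illustrate how a consumer such as a crawler might read this markup back, here is a simplified sketch built on Python's standard html.parser module. The ItempropCollector class is hypothetical and is not a conforming Microdata parser: it flattens nested itemscope items and only handles the content, src, and plain-text cases that appear in the example above.

```python
from html.parser import HTMLParser


class ItempropCollector(HTMLParser):
    """Collect (itemprop, value) pairs in document order.

    Simplification: nested itemscope items are flattened, so a Person's
    "name" appears as a top-level pair instead of under "creator".
    """

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._pending = None  # itemprop still waiting for its text content

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        prop = attrs.get("itemprop")
        if prop is None:
            return
        if "content" in attrs:        # <meta itemprop=... content=...>
            self.pairs.append((prop, attrs["content"]))
        elif "src" in attrs:          # <img itemprop=... src=...>
            self.pairs.append((prop, attrs["src"]))
        elif "itemscope" not in attrs:
            self._pending = prop      # value is the element's text

    def handle_data(self, data):
        text = data.strip()
        if self._pending and text:
            self.pairs.append((self._pending, text))
            self._pending = None


sample = """
<div itemscope itemtype="http://schema.org/NewsArticle">
<meta itemprop="wordCount" content="879"/>
<div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
<span itemprop="creator" itemscope itemtype="http://schema.org/Person">
<span itemprop="name">Goran Tomasevic</span>
</span>
</div>
"""

collector = ItempropCollector()
collector.feed(sample)
for prop, value in collector.pairs:
    print(prop, "=", value)
```

Unlike the unlabeled <div>s shown earlier in the deck, each value here arrives with its property name attached, which is what lets a machine tell a headline from a byline.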
rNews Benefits
Or Why You Should Care About rNews
Benefit #1: Superior Algorithmically-Generated Links
Using structured data, search engines, social
networks and other sites can attractively format
links back to our site, potentially increasing referral
traffic.
With Structured
Data
No Structured
Data
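As a sketch of the difference, assume a hypothetical format_link function on the referring site that receives whatever metadata it could extract from the page. The function name, the fallback behavior, and the example.com URL are illustrative assumptions; the property names and sample values come from the working example in this deck.

```python
def format_link(url, metadata=None):
    """Return the display lines for a link, richer when metadata is available."""
    metadata = metadata or {}
    headline = metadata.get("headline")
    if not headline:
        return [url]  # no structured data: only the bare URL can be shown
    lines = [headline]
    if metadata.get("description"):
        lines.append(metadata["description"])
    if metadata.get("thumbnailUrl"):
        lines.append(f"[image: {metadata['thumbnailUrl']}]")
    lines.append(url)
    return lines


url = "http://example.com/libya-article"  # placeholder URL for illustration
bare = format_link(url)
rich = format_link(url, {
    "headline": "Allies Are Split on Goal and Exit Strategy in Libya",
    "description": "The questions about the command...",
    "thumbnailUrl": "http://graphics8.nytimes.com/images/common/icons/t_wb_75.gif",
})
print("\n".join(bare))
print("\n".join(rich))
```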
Benefit #2: Superior Tool Support
Vertical search
Commenting Platforms
Rights Management
Benefit #3: Better Analytics
The Way to rNews
rNews - Timeline
September 2010 - rNews proposed to IPTC at fall meeting
March 2011 - rNews draft version 0.1 approved by IPTC at summer meeting
March - May 2011 - IPTC solicits feedback on draft standard
June 2011 - IPTC to vote on revised standard at summer meeting
October 2011 - rNews 1.0 approved by IPTC
November 2011 - Start of rNews implementation on nytimes.com
Engaging Our Community
And Then
Schema.org
This class contains derivatives of
IPTC rNews properties. rNews is a
data model of publishing metadata
with serializations currently
available for RDFa as well as
HTML5 Microdata. More
information about the IPTC and
rNews can be found at rnews.org.
rNews
Thank You!
See more presentations at: http://www.slideshare.net/SearchMarketingExpo