xml what is it and why do we care?

37
XML: What is it and why do we care? California Agriculture University of California Agriculture and Natural Resources Communication Services and Internet Technology

Upload: janetlouisewhite

Post on 21-Jun-2015

322 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Xml  what is it and why do we care?

XML: What is it and why do we care?

California Agriculture University of California Agriculture and Natural Resources Communication Services and Internet Technology

Page 2: Xml  what is it and why do we care?

Or how a mild-mannered journal . . .

(the University of California’s peer-reviewed journal of agriculture, and natural and human resources research) . . .

University of California | Peer-reviewed Research and News in Agricultural, Natural and Human Resources

Growing bigger, better:Artisan olive oil comes of age

California AgricultureJANUARY–MARCH 2011 • VOLUME 65 NUMBER 1

University of California | Peer-reviewed Research and News in Agricultural, Natural and Human Resources

Down on the farm: Agritourism on the rise

Also: Free trade, constructed wetlands, rice nitrogen, smart sprayers, almond irrigation

California AgricultureA P R I L – J U N E 2 0 1 1 • V O L U M E 6 5 N U M B E R 2

University of California | Peer-reviewed Research and News in Agricultural, Natural and Human Resources

Food as medicine

Can what we eat help cure what ails us?

California AgricultureJULY–SEPTEMBER 2011 • VOLUME 65 NUMBER 3

Page 3: Xml  what is it and why do we care?

. . . can have superpowers on the Web*

*especially the semantic Web

Page 4: Xml  what is it and why do we care?

Readers today are problem solvers

Tunisia, January 2011

Page 5: Xml  what is it and why do we care?

They seek relevant information,

Page 6: Xml  what is it and why do we care?

repurpose and recombine it . . .

Page 7: Xml  what is it and why do we care?

and find a solution. .

Page 8: Xml  what is it and why do we care?

Readers will go to great lengths to get the information they need.

As government tried to block protests, the people went from

YouTube, to Facebook, to Twitter

Page 9: Xml  what is it and why do we care?

Information wants to be free* . . .

Information wants to be expensive*

*Stewart Brand, Hackers Convention 1984

Page 10: Xml  what is it and why do we care?

The tension will not go away Readers expect more – discoverable,

accessible content that can be viewed on the "container of choice" – iPhone, Android, Kindle,

iPad, laptop . . .

Page 11: Xml  what is it and why do we care?

At the same time, faculty authors need “impact”

as shown by "metrics" in both the the popular

Page 12: Xml  what is it and why do we care?

and scholarly worlds. Luckily the Web helps us do both.

Page 13: Xml  what is it and why do we care?

As land-grant universities, we have

• quality content, • some of the best science in the world, and • a long tradition of open-access information, both print and Web

Page 14: Xml  what is it and why do we care?

Our content is free in another way now . . . it is no longer

restricted to a container

Page 15: Xml  what is it and why do we care?

Once we have the content in digital form, we can render it in various chunks and containers

–books, ebooks, PDFs, blogs, websites, journals, ezines, even good old print

page turners, presence on databases

Page 16: Xml  what is it and why do we care?

But with so many formats and devices -- and readers with

different devices -- how can we do this?

Page 17: Xml  what is it and why do we care?

?

Discoverable Accessible

Actively linked Flexible -- rendered across platforms and devices

Visible in the scholarly world

We need a multi-tasking tool to create content that is

Page 18: Xml  what is it and why do we care?

XML = “extensible markup language”

•  a language for documents containing structured information;

•  Including both content and markers, tags identifying what role that content plays.

•  For example:

<title>Huckleberry Finn</title> <author>Mark Twain</author>

Page 19: Xml  what is it and why do we care?

Transforms to HTML on the Web

Page 20: Xml  what is it and why do we care?

XML metadata behind it

Page 21: Xml  what is it and why do we care?

Active links on the Web

•  References •  Adam KL. Community Supported Agriculture 2006. Fayetteville, AR:ATTRA-National Sustainable

Agriculture Information Service. www.attra.org/attra-pub/PDF/csa.pdf •  Altieri MA. Agroecology: The Science of Sustainable Agriculture 1995. Boulder, CO:Westview Pr. 448p. •  Anderson-Wilk M. Does community-supported agriculture support conservation?. J Soil Water Conserv.

2007. 62(6):126-7A. •  DeLind LB. Ferguson AE. Is this a women's movement? The relationship of gender to community-

supported agriculture in Michigan. Hum Organ. 1999. 58(2):190-200. •  Feenstra G. CSAs: The consumer-farmer connection. Cal Ag. 1994. 48(5):8- DOI: 10.3733/ca.v048n05p8

[CrossRef] •  Flora CB. Bregendahl C. The role of collaborative community supported agriculture: Lessons from Iowa.

Leopold Ctr Prog Rep. 2007. 16:44-7. •  Galt RE. Counting and mapping community-supported agriculture in the United States and California:

Contributions from critical cartography/GIS. ACME: Int E-J Crit Geogr. 2011. 10(2):131-62. •  Hinrichs CC. Embeddedness and local food systems: Notes on two types of direct agricultural market. J

Rural Stud. 2000. 16(3):295-303. DOI: 10.1016/S0743-0167(99)00063-7 [CrossRef] •  Klonsky K. Siebert J. Organic agricultural production in California. California Agriculture: Dimensions and

Issues 2004. Berkeley, CA:Giannini Foundation. p. 241-56.

Page 22: Xml  what is it and why do we care?

XML tagging for one reference •  <ref id="R8"><nlm-citation citation-type="journal"> •  <person-group person-group-type="author"> •  <name><surname>Hinrichs</surname> •  <given-names>CC</given-names></name></person-group> •  <article-title>Embeddedness and local food systems: Notes on two

types of direct agricultural market</article-title> •  <source>J Rural Stud</source> •  <year>2000</year> •  <volume>16</volume> •  <issue>3</issue> •  <fpage>295</fpage><lpage>303</lpage> •  <pub-id pub-id-type="doi">10.1016/S0743-0167(99)00063-7</pub-

id> •  </nlm-citation></ref>

Page 23: Xml  what is it and why do we care?

Why is XML so useful?

•  Machine readable •  Flat file – no style •  Talks to relational databases – backbone

of semantic Web •  A standard for that can be recognized by

eReaders and mobile devices, databases and platforms – and poured into different containers

Page 24: Xml  what is it and why do we care?

What “flavor” of XML ?

•  There are different kinds of XML. •  Each is a DTD or document type

definition, a set of rules defining content and structure.

Page 25: Xml  what is it and why do we care?

For maximum functionality: NLM DTD *

•  Established in 2003 by the National Library of Medicine, the world’s largest biomedical library

•  Designed for content posted to Pub Med Central – a digital archive of biomedical and life sciences journal literature.

•  Provides free access to full text of articles

* Known also as Journal Article Tag Suite (JATS)

Page 26: Xml  what is it and why do we care?

Now the de facto international standard of scholarly publication

It is popular among aggregators, and house style for Atypon and Allen Press, recommended by HighWire and Public Library of Science.

Page 27: Xml  what is it and why do we care?

Flavor?

•  Each “flavor” OF XML includes elements, attributes, and entities, and defines relationships among them.

•  There are code checkers on line that enable you to test your code for – proper structure (Is there an end tag for every

start tag?) and – conformation to the standard.

Page 28: Xml  what is it and why do we care?

Functionality -- most important

•  XML enables content to be channeled into myriad formats and devices: web sites, digital edition on laptop, application for smartphone, reading devices, eReaders, YouTube or Podcast if video or audio files.

•  It enables content to be indexed by scholarly databases including Thomson ISI Web of Science

Page 29: Xml  what is it and why do we care?

Another option: XML metadata + PDF

•  PDFs are true visual representations of how text is meant to look

•  PDFs are easy to read because they retain formatting

•  They include vector graphics that scale cleanly in any device

Page 30: Xml  what is it and why do we care?

California Agriculture has posted full text in XML and PDF format back to our first issue

in 1946.

Beginning in July 2011: California Agriculture went electronic version of record.

Page 31: Xml  what is it and why do we care?

We now deliver XML directly to databases. They display it as soon as we signal new content.

Page 32: Xml  what is it and why do we care?

One favorable result: Greater visibility for faculty authors June ‘11: Impact factor for 2010 doubled to .92

Page 33: Xml  what is it and why do we care?

. . . and continued upward

Page 34: Xml  what is it and why do we care?

. . . with a favorable long-term trend.

Page 35: Xml  what is it and why do we care?

We are charting and analyzing popular metrics

Page 36: Xml  what is it and why do we care?

Steps to XML

1. Hire a web editor (someone familiar with XML flavors, transforms, and CSS).

2. Get a friendly, helpful IT programmer to build a database, create transforms and calls

3. Hire vendor to digitize your backfile (our vendor is Aptara) 4. Learn to maintain and update your site with editorial staff and

designers.

Check out helpful websites: JATS-Con 2011 PPTs and some videos go to:

http://www.ncbi.nlm.nih.gov/books/NBK57252/ NLM DTD and PubMed Central code checker http://www.ncbi.nlm.nih.gov/pmc/pub/validation/

Page 37: Xml  what is it and why do we care?

Making your content discoverable to readers and internet databases, aggregators, browsers . . .

whether in Ghana . . . Or Iowa