archiving the mobile web frank mccown, monica yarbrough, & keith enlow computer science dept...

23
Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Upload: ashlyn-bridges

Post on 18-Dec-2015

223 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Archiving the Mobile Web

Frank McCown, Monica Yarbrough, & Keith EnlowComputer Science Dept

Harding University

WADL 2013Indianapolis, IN

July 25, 2013

Page 2: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Mobile vs. Stationary Web

Page 3: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Mobile Web-Related Markup Languages

http://en.wikipedia.org/wiki/File:Mobile_Web_Standards_Evolution_Vector.svg

Smartphone era

Page 4: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Two Types of Mobile Web

Feature Phone Web Smartphone Web

cHTML (iMode), WML, WAP, etc.

XHTML, HTML5, etc.

Page 5: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013
Page 6: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Serving Up Mobile Sites

1. Responsive web design• Same HTML content to desktop and mobile• CSS media queries alter appearance

<!-- CSS media query on a link element --><link rel="stylesheet" media="(max-width: 800px)" href="example.css" />

<!-- CSS media query within a style sheet --><style>@media (max-width: 600px) { .sidebar { display: none; }}</style>

Page 7: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Example of Responsive Web Design

Page 8: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Serving Up Mobile Sites

1. Responsive web design• Same HTML content to desktop and mobile• CSS media queries alter appearance

2. Redirect mobile user agent to mobile site• Client-side redirection• Server-side redirection

Page 9: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Client-Side Redirection• JavaScript detects mobile user agent

// From www.harding.eduvar ua = navigator.userAgent.toLowerCase(); if (queryString.match('version=mobile') || ua.match(/IEMobile|Windows CE|NetFront|PlayStation|like Mac OS Z|MIDP|UP\.Browser|Symbian|

Nintendo|BlackBerry|mobile/i)) { if (!ua.match('ipad')) { if (window.location.pathname.match('.html')) window.location = window.location.pathname.replace('.html', '.m.html'); else window.location = window.location.pathname + 'index.m.html'; }}

Page 10: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Client-Side Redirection

Page 11: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Server-Side Redirection• Server routes mobile user agent to different page

Apache Example:RewriteEngine OnRewriteBase /

RewriteCond %{HTTP_USER_AGENT} (android|bb\d+|meego).+mobile|avantgo|badda\/|blackberry|blazer|etc…|zte\-) [NC]RewriteRule ^$ http://detectmobilebrowser.com/mobile [R,L]

https://developers.google.com/webmasters/smartphone-sites/details

Page 12: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Server-Side Redirection

Page 13: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Serving Up Mobile Sites

1. Responsive web design• Same HTML content to desktop and mobile• CSS media queries alter appearance

2. Redirect mobile user agent to mobile site• Client-side redirection• Server-side redirection

3. User-agent content negotiation• Dynamically serving different HTML for the same URL

Page 14: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

User-Agent Content Negotiation

• Server serves up different content for same URL• Use Vary: User-Agent

header in response• Best method for serving content

quickly

Page 15: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Archiving Mobile Sites

1. Responsive web design• Easy: Crawl like normal• Use client tools to view page formatted for mobile

2. Redirect mobile user agent to mobile site• Need to crawl with mobile user agent• Need JavaScript-enabled crawler to handle client-side

redirection

3. User-agent content negotiation• Need to crawl with mobile user agent• Need to distinguish mobile vs. desktop for same URL

Page 16: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

How are we doing archiving mobile sites

so far?

Page 17: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013
Page 18: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Earliest archived

page

Page 19: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Earliest 2007 archived page: WML

Page 20: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Finally some news!

Page 21: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Really???

Page 22: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Great…

Page 23: Archiving the Mobile Web Frank McCown, Monica Yarbrough, & Keith Enlow Computer Science Dept Harding University WADL 2013 Indianapolis, IN July 25, 2013

Only desktop version is archived!