Gathering Statistics at FCLA
ICOLC – Philadelphia, March 28, 2006
Michele Newberry
Or ……
March 28, 2006 ICOLC, Philadelphia, Newberry 2
Statistics Hell!!!
ABC-CLIO
Log on to the Stats interface at: http://serials.abc-clio.com/reports/
  ID: <xxx>@ufl.edu PW: <nnnn>
  http://serials.abc-clio.com/reports/start&_formname=loginfo&appname=reports&loginname=<xxx>@ufl.edu&password=<nnnn>&forgotten_password=0
Choose “Select All” for Institutions.
Choose a reporting period (one or multiple months).
Choose an Output Type (I choose “Excel Friendly HTML”).
Click “Run Report”.
  http://serials.abc-clio.com/reports/go/ABC-Clio-Serials-Reports&_appname=reports&_operation=DoReport&addlcid=I00271&addlcid=I00267&addlcid=I00268&addlcid=I00269&addlcid=I00270&addlcid=I00637&addlcid=I00725&addlcid=I00652&addlcid=I00697&addlcid=I00764&startmonth=20051001&stopmonth=20051101&outputtype=C
  outputtype: C for CSV, E for Excel-friendly HTML, H for HTML
  startmonth, stopmonth: in YYYYMMDD
Save Page As HTML file – “FCLAReportMMYY.html”, e.g., FCLAReport0905.html.
Upload to FCLA website and transcribe total annual searches into the master spreadsheet.
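The report URL on this slide is regular enough to be assembled by a small helper. The following is only a sketch: the base address and parameter names are copied from the slide, while the function name `build_report_url` is invented here, and whether the vendor site still accepts this form is an assumption.

```python
from urllib.parse import urlencode

# Base address and parameter names are copied from the slide; the helper
# itself is illustrative and not part of any ABC-CLIO tooling.
REPORT_BASE = "http://serials.abc-clio.com/reports/go/ABC-Clio-Serials-Reports"


def build_report_url(institution_ids, start_month, stop_month, output_type="E"):
    """Return a report URL for the given institutions and date range.

    start_month and stop_month use YYYYMMDD; output_type is C, E, or H.
    """
    if output_type not in {"C", "E", "H"}:
        raise ValueError("output_type must be C (CSV), E (Excel HTML) or H (HTML)")
    params = [("_appname", "reports"), ("_operation", "DoReport")]
    # One addlcid parameter per institution, as in the slide's example URL.
    params += [("addlcid", inst) for inst in institution_ids]
    params += [
        ("startmonth", start_month),
        ("stopmonth", stop_month),
        ("outputtype", output_type),
    ]
    # The slide joins the path and the first parameter with "&", not "?".
    return REPORT_BASE + "&" + urlencode(params)


url = build_report_url(["I00271", "I00267"], "20051001", "20051101", "C")
print(url)
```

Passing all ten institution codes from the slide reproduces the "Select All" choice.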
BePress
BePress stats are VERY complicated to gather.
Go to the admin URL: http://www.bepress.com/cgi/myaccount.cgi
Log in with the user ID and password for each school.
Copy the URL for each school’s report and paste it into the browser to get an “Open” or “Save As…” popup window.
Open the file in Excel, and make the following modifications:
  Add a line above line one that says: BePress Usage Report (Arial, 14 point)
  Bold, italicize, and color purple the next line (Full-Text Downloads….)
  Delete the usage data for each title prior to January of the current year (these reports keep data from the creation of any given university’s account), and then move the data for the current year to the left so that the January data is in Column B.
  Resize Column A so that the full title of each journal is viewable
  Merge & Center these two header lines over the width of the report
Once finished, “Save As Web Page”, naming by school & year, e.g., “famu_2005.html”
Repeat for each school.
Upload to FCLA website and transcribe total annual searches into the master spreadsheet.
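The column-trimming step (drop everything before January of the current year so that January lands in Column B) could be sketched as below. This assumes the month columns are labelled "YYYY-MM", which is a guess; the real BePress export may label them differently, and the sample titles and counts are made up.

```python
def keep_current_year(header, rows, year):
    """Drop month columns before January of `year`, keeping column A (titles).

    Assumes month columns are labelled "YYYY-MM" (an assumption, not
    something the slide specifies).
    """
    prefix = f"{year}-"
    first = next(
        (i for i, label in enumerate(header) if i > 0 and label.startswith(prefix)),
        None,
    )
    if first is None:
        raise ValueError(f"no month columns found for {year}")

    def trim(row):
        # Column A (the journal title) stays; the current year moves to column B.
        return [row[0]] + row[first:]

    return trim(header), [trim(row) for row in rows]


# Made-up sample data for illustration only.
header = ["Title", "2004-11", "2004-12", "2005-01", "2005-02"]
rows = [["Journal A", 3, 4, 5, 6], ["Journal B", 0, 1, 2, 3]]
new_header, new_rows = keep_current_year(header, rows, 2005)
print(new_header)
print(new_rows)
```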
CSA
Log in to CSA Illumina Usage at: http://mars3.csa.com/usage/ou_login.aspx
Log in with ID: <xxx> and PW: <nnnn>
Choose either Live Reporting or Emailed Reports, depending on how current the data needs to be (explained on the site); normally choose Live for current data.
Under “Consortium Reports”, select a date range (one or multiple months), and run the report.
“Save As…” HTML, naming the file csa_fcla_MMYY.html, e.g., csa_fcla_0905.html
Upload to FCLA website and transcribe total annual searches into the master spreadsheet.
EBSCO
Log in to EBSCO Admin at: http://eadmin.epnet.com/eadmin/login.aspx
  ID: <xxxx> PW: <nnnn>
Click on the “Reports & Statistics” tab, then configure the report. Normally choose:
  By Database, Consortium: ALL
  Level: Site
  Date range: one or multiple months
  Include: All Sites
  Fields to show: Sessions, Searches, Total Full Text Requests (any other fields are fine as well, but those are the required fields)
Then either Show, E-mail, or Schedule this report to be run.
“Save As…” HTML, naming the file ebscoYYYY_MM.html, e.g., ebsco2005_08.html.
Upload to FCLA website and transcribe total annual searches into the master spreadsheet.
Gale / IAC InfoTrac
Log in to Gale InfoTrac Config at: http://infotrac.galegroup.com/itconfig/fcla_000
  ID: <xxxx> PW: <nnnn>
Click on Reports in the navigation bar.
Under “Consortium”, select E-mail Gale or COUNTER reports, or set up a monthly report.
Normally we choose the Gale report, as the COUNTER report does not display stats in the searches-by-university-and-month style that FCLA prefers. So choose Gale report, and select a date range.
Under Gale Standard Use Reports, check “Usage Summary”, “Usage by Database”, “Library Location”.
Choose Format=“Comma Separated Values”; Compression=“None” and Attachment=“Yes”.
Recipient: enter the e-mail address for the report and click “Get Report” to have it sent via e-mail.
Once received, format it to resemble the existing reports at: http://www.fcla.edu/FCLAinfo/stats/iac/iac.html
Save as HTML with a filename of gale_MM_YYYY.html, e.g., gale_09_2005.html.
Upload to FCLA website and transcribe total annual searches into the master spreadsheet.
LexisNexis – Academic, Congressional & Statistical
LexisNexis stats can be gathered by accessing their website at: http://www3.lexisnexis.com/aur/signon.html
  ID: <XXXX> PW: <NNNN>
Once there, you can view HTML or download CSV reports. HTML reports cannot be downloaded.
FCLA worked with LexisNexis to get an FTP account through which we download the HTML reports.
FTP Setup Info:
  FTP Host - ftp.lexisnexis.com
  Login – <XXXX>
  Password – <NNNN>
Download all of the new reports for Academic, Congressional, Statistical, and the Rollup reports.
Rename the files as institutionYYMM.html, e.g., famu0508.html.
Upload to the FCLA website for each product and transcribe total annual searches into the master spreadsheet.
ProQuest
Log in to ProQuest Local Admin at: http://lad.proquest.com/ladweb
  ID: <XXXX> PW: <NNNN>
Click on the tab (or link) for Select Report Type: Database Activity – Detail
  Delivery Method: Download or Email now
  Show items with zero usage: Yes
  Include sub-accounts in this report: Yes
Select a date range for the usage period (one or multiple months).
Click Create Report.
Save as HTML as pqYYMM.html, e.g., pq0905.html.
Upload to FCLA website and transcribe total annual searches into the master spreadsheet.
NOTE: There is also a cumulative report for ProQuest Digital Dissertations usage that is emailed once a month in HTML format and uploaded upon arrival as pqdd_stats.html.
RLG
Log in to RLG Stats at: http://reports.rlg.org
  Invoice Account Code (IAC): <XXXX>
Access via two reports:
  6. Union Cat and Citation Files – Searches for Month
  7. Other Info Resources – Search Activity for Month
Aggregate stats by institution manually due to the shared IAC.
Do not post on the FCLA website.
Transcribe total annual searches into master spreadsheet.
Standard & Poor’s
Log in to S&P NetAdvantage at: http://www.netadvantage.standardandpoors.com/NASApp/NetAdvantage/usage/Usage.do
  ID: <XXXX> PW: <NNNN>
S&P reports are only available for single month/single institution. 10 reports must be downloaded per month.
If multiple institutions are selected, the report gives an aggregated total, rather than a breakdown by site, which FCLA needs.
Select Month, Year and Institution, then click “Show Report”.
Click “Printer Friendly” to generate a new window with the full report, then “Save Page As” HTML, as s&p_institution_MM_YYYY.html, e.g., s&p_famu_09_2005.html
Repeat for all 10 reports.
Once all reports have been saved, all hyperlinks must be removed, as they point to resources that are not available from the posted page.
Upload to FCLA website and transcribe total annual searches into master spreadsheet.
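Removing the hyperlinks from ten saved pages a month is a natural candidate for a small script. The sketch below uses a regular expression rather than a full HTML parser, which is an assumption that the saved pages contain simple anchor tags; the sample markup is invented for illustration.

```python
import re

# Matches opening and closing anchor tags only, e.g. <a href="..."> and </a>.
_ANCHOR_TAG = re.compile(r"</?a\b[^>]*>", re.IGNORECASE)


def strip_links(html):
    """Remove <a ...> and </a> tags but keep the text between them."""
    return _ANCHOR_TAG.sub("", html)


# Invented sample markup; real S&P pages will look different.
sample = '<td><A HREF="/NASApp/x">Industry Surveys</A></td><td>12</td>'
print(strip_links(sample))  # <td>Industry Surveys</td><td>12</td>
```

The link text stays in place, so the posted report reads the same but no longer points at pages that require a vendor login.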
ValueLine
ValueLine statistics are emailed to FCLA directly from the vendor rep, in Excel format.
Save as valueline_MM_YYYY.html, e.g., valueline_09_2005.html
Upload to the FCLA website and transcribe total annual searches into master spreadsheet.
Wilson
Log in to WilsonWeb at: http://www.hwwstats.com/ng/
  Account Number: <NNNN> (then click Login)
  Password: <XXXX> (then click Continue)
Click “Database Usage” (COUNTER reports are available, but the reports under Database Usage are more appropriate to the kind of data that FCLA gathers monthly)
Select “Bill To Account” (Ship To Account will generate ZERO usage)
  Account: ALL (or you can run individual school reports)
  Product: ALL
  Detail Level: Complete Report
  Choose a date range (one or multiple months)
  Sort By: Number of Searches, then click “Submit”
Once the report is generated, save as a file (HTML) or email the file.
Save the HTML file as wilson_MM_YYYY.html, e.g., wilson_09_2005.html.
Upload to the FCLA website and transcribe total annual searches into master spreadsheet.
ACM
ACM provides usage stats only twice per year, after June 30 and after December 31.
Data is provided personally by the ACM rep. <inject humorous story here>
Files are delivered in HTML format, by school.
Add the following header:
  ACM Digital Library Usage Report for (school name) (Date Range of report)
Save the file as HTML (acm_school_year.html, e.g., acm_famu_2005.html).
Upload to FCLA website and transcribe total annual searches into master spreadsheet.
ABC-CLIO IAC InfoTrac
ACM LexisNexis: Academic | Congressional | Statistical
BePress ProQuest
CSA Usage Statistics RLG
Current Contents Connect Standard & Poors
EBSCOHost ValueLine
FirstSearch PerSearch Blocks WebLUIS Databases
FirstSearch WilsonWeb
Galenet
http://www.fcla.edu/system/intro.html
EBSCOHost Usage Reports
2006
January February
2005
December 2005 June 2005
November 2005 May 2005
October 2005 April 2005
September 2005 March 2005
August 2005 February 2005
July 2005 January 2005
EBSCOadmin Database Usage Report, January 2006
Site                     Database Name                       Sessions  Turn-aways  Searches  Total Full Text  Image/Video  Smart Link  Custom Link
FLORIDA GULF COAST UNIV  CINAHL                              763       0           2089      62               0            374         334
FLORIDA GULF COAST UNIV  Econlit                             435       0           473       12               0            0           1
FLORIDA GULF COAST UNIV  Pre-CINAHL                          399       0           516       0                0            3           2
FLORIDA GULF COAST UNIV  RILM Abstracts of Music Literature  428       0           458       0                0            0           0
FLORIDA INTL UNIV        CINAHL                              44        0           204       0                0            9           18
FLORIDA INTL UNIV        Econlit                             91        0           340       2                0            6           117
FLORIDA INTL UNIV        Pre-CINAHL                          48        0           313       0                0            1           13
FLORIDA INTL UNIV        RILM Abstracts of Music Literature  22        0           112       0                0            0           20
FLORIDA STATE UNIV       CINAHL                              267       0           908       46               0            0           343
FLORIDA STATE UNIV       Econlit                             100       0           378       5                0            0           32
FLORIDA STATE UNIV       Pre-CINAHL                          222       0           787       0                0            0           79
FLORIDA STATE UNIV       RILM Abstracts of Music Literature  205       0           620       0                0            0           74
The “SPREADSHEET”
Searches/Database/Institution  FAMU     FAU      FGCU     FIU      FSU      UCF      UF       UNF      USF      UWF      TOTAL
ABC    AmerHistory&Life        5383     204      1634     705      1976     2599     2729     1611     3683     967      21491
ABC    HistAbstracts           1966     122      747      83       1438     1116     1524     1169     1472     790      10427
ACM    ACM Dig Coll            539      2822     181      2305     7391     8625     4967     2333     3753     743      33659
APA    PsycInfo                2945     28188    2628     141342   149735   151612   154734   47226    186262   19757    854429
BEP    BEPress                 45       204      17       346      317      532      401      123      260      31       2276
Bowkr  BIP                     684      17840    5444     6509     21100    10336    13094    8368     17171    4443     104989
Bowkr  Ulrichs                 639      2907     2647     2641     7413     9123     5900     3297     20762    4375     59704
Bowkr  Bowker TOTAL            1323     20747    8091     9150     28513    19459    18994    11665    37933    8818     164693
CSA    CSA Package             22057    264347   91046    815871   1371814  262353   583005   225981   423753   212520   4272747
CSA    ATLA Religion           817      6403     1190     14468    36013    4330     11681    3695     10087    1424     90108
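The TOTAL column and the per-vendor subtotal rows of the spreadsheet are plain sums, so they can be recomputed rather than transcribed. A minimal sketch using the two Bowker rows from the slide (the function names are invented here):

```python
INSTITUTIONS = ["FAMU", "FAU", "FGCU", "FIU", "FSU", "UCF", "UF", "UNF", "USF", "UWF"]

# Search counts copied from the Bowker rows of the slide.
bowker = {
    "BIP": [684, 17840, 5444, 6509, 21100, 10336, 13094, 8368, 17171, 4443],
    "Ulrichs": [639, 2907, 2647, 2641, 7413, 9123, 5900, 3297, 20762, 4375],
}


def row_total(counts):
    """TOTAL column: searches for one database across all institutions."""
    return sum(counts)


def vendor_subtotal(rows):
    """Subtotal row: searches per institution across a vendor's databases."""
    return [sum(column) for column in zip(*rows.values())]


print(row_total(bowker["BIP"]))      # 104989
print(row_total(bowker["Ulrichs"]))  # 59704
subtotal = vendor_subtotal(bowker)
print(dict(zip(INSTITUTIONS, subtotal)))
print(sum(subtotal))                 # 164693
```

These results agree with the BIP, Ulrichs, and Bowker TOTAL rows shown on the slide.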
Automated COUNTER Harvester
Assumptions:
  Data can be retrieved either via ftp or via html.
  If via ftp, only the address is needed.
  If via html, one or more pages may need to be traversed.
Design:
  Each COUNTER source has a harvesting script associated with it. Each script is maintained in a separate file.
  There is a COUNTER database that holds the retrieved data. It also holds COUNTER source information.
COUNTER Harvester Preliminary Design
SOURCE Table:
  ID           Table key
  SOURCE       The name of the COUNTER source. This is the prime key of the table.
  SCRIPT       Fully qualified path to the script to be used to retrieve data from the source
  LAST_ACCESS  Date and time stamp of completion of the last harvesting of this source
  STATUS       Active – currently being harvested; Error – last harvest failed; Wait – last harvest succeeded, next harvest not yet initiated
  PERIOD       Frequency of harvest, in days
RESOURCES Table:
  ID           Table key
  NAME         The name of the resource, such as journal title, database name, service name
  TYPE         Journal, Database, Service
  PRINT_ISSN   ISSN of print version
  ONLINE_ISSN  ISSN of online version
  PUBLISHER    publisher
  PLATFORM     ?
STATISTICS Table:
  SOURCE ID    ID of source from SOURCE table
  START_DATE   Start date of reporting period, as yyyy-mm-dd
  END_DATE     End date of reporting period, as yyyy-mm-dd
  TYPE         ft_pdf – pdf; ft_html – html
  COUNT        Number of requests
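One way to make the three tables concrete is a SQLite schema. This is a sketch under assumptions, not the FCLA implementation: the slide does not name a database engine, the column types and constraints are guesses, "SOURCE ID" is read as a single foreign-key column, and the inserted row values are made up.

```python
import sqlite3

SCHEMA = """
CREATE TABLE source (
    id          INTEGER PRIMARY KEY,   -- table key
    source      TEXT NOT NULL UNIQUE,  -- name of the COUNTER source
    script      TEXT NOT NULL,         -- fully qualified path to the script
    last_access TEXT,                  -- completion time of the last harvest
    status      TEXT NOT NULL DEFAULT 'Wait'
                CHECK (status IN ('Active', 'Error', 'Wait')),
    period      INTEGER NOT NULL       -- harvest frequency, in days
);
CREATE TABLE resources (
    id          INTEGER PRIMARY KEY,
    name        TEXT NOT NULL,
    type        TEXT CHECK (type IN ('Journal', 'Database', 'Service')),
    print_issn  TEXT,
    online_issn TEXT,
    publisher   TEXT,
    platform    TEXT
);
CREATE TABLE statistics (
    source_id   INTEGER NOT NULL REFERENCES source(id),
    start_date  TEXT NOT NULL,         -- yyyy-mm-dd
    end_date    TEXT NOT NULL,         -- yyyy-mm-dd
    type        TEXT NOT NULL CHECK (type IN ('ft_pdf', 'ft_html')),
    "count"     INTEGER NOT NULL       -- number of requests
);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Made-up rows, for illustration only.
conn.execute(
    "INSERT INTO source (source, script, period) VALUES (?, ?, ?)",
    ("Gale", "/path/to/gale.xml", 30),
)
conn.execute(
    "INSERT INTO statistics VALUES (?, ?, ?, ?, ?)",
    (1, "2005-09-01", "2005-09-30", "ft_pdf", 42),
)

status = conn.execute("SELECT status FROM source WHERE id = 1").fetchone()[0]
total = conn.execute(
    'SELECT SUM("count") FROM statistics WHERE source_id = 1'
).fetchone()[0]
print(status, total)
```

Storing dates as yyyy-mm-dd text follows the format given for START_DATE and END_DATE and keeps them sortable as strings.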
COUNTER Harvester Preliminary Design Scripts
Are XML documents that specify a set of actions to be performed and the information necessary to perform the action.
Comments are formatted as XML, i.e., <!-- ...-->.
Special characters must be escaped.
Example: Gale
<CounterHarvesterScript>
  <debug/>
  <navigate>http://infotrac.galegroup.com/itconfig/fcla_000</navigate>
  <pause> 2 </pause>
  <navigate>http://infotrac.galegroup.com/itconfig/fcla_000?id=fclareports&amp;pass=fclareports</navigate>
  <pause> 2 </pause>
  <navigate>http://web6.infotrac.galegroup.com/infotrac_config/session/191/591/76110636w6/pg=ir!166&amp;ui3=CONSORT&amp;ui4=fcla&amp;ul=3&amp;un=0&amp;uo=9&amp;uiy=Y1&amp;uin=1&amp;uif=CSV&amp;uic=None&amp;uil=YES&amp;[email protected]&amp;uz=++Get+Report++</navigate>
  <alert> Statistics will be sent by email to [email protected]. </alert>
</CounterHarvesterScript>
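A harvester has to read such a script into an ordered list of actions before it can run them. The sketch below does only that parsing step with Python's standard `xml.etree.ElementTree`; the element names follow the Gale example, while the sample script, its example.org URLs, and the `load_actions` function are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical script for an imaginary vendor; "&" in the URL is escaped.
SCRIPT = """<CounterHarvesterScript>
  <!-- comments are ignored by the parser -->
  <navigate>http://stats.example.org/login?id=USER&amp;pass=SECRET</navigate>
  <pause> 2 </pause>
  <navigate>http://stats.example.org/report?fmt=CSV</navigate>
  <alert> Statistics will be sent by email. </alert>
</CounterHarvesterScript>"""


def load_actions(xml_text):
    """Return the script's actions as (name, argument) pairs, in order."""
    root = ET.fromstring(xml_text)
    if root.tag != "CounterHarvesterScript":
        raise ValueError(f"unexpected root element: {root.tag}")
    return [(element.tag, (element.text or "").strip()) for element in root]


actions = load_actions(SCRIPT)
for name, argument in actions:
    print(name, argument)
```

The parser returns the escaped `&amp;` as a plain `&`, so the URLs can be passed on unchanged; an unescaped `&` in the script would make parsing fail.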
Problems encountered
COUNTER data is treated as secure data, so creation of a web-walking robot is problematic.
Session data must be extracted at some point in the session and then inserted into URLs to be sent later in the session.
Reporting periods are defined in columns, one column per period. The column headers must be walked, looking for Total YTD after the monthly data column(s).
COUNTER compliance is based on an Excel worksheet geared for human consumption, not computer processing.
The various reports don’t start reporting periods in the same column:
  In Journal Report 1, periods start in column F
  In Journal Report 2, periods start in column G
Institution name may be in a single cell row or a column
Some csv files have information that must be skipped:
  Headings at the top of the sheet
  Subtotal and total lines interspersed and/or at the bottom of the sheet
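The parsing problems listed on these slides (heading lines to skip, period columns that start in a report-specific column, a Total YTD column that ends them, and total lines mixed in with the data) can be illustrated with a short sketch. The sample CSV is invented, including its ISSNs and counts, and the exact "Total YTD" label is taken from the slide rather than from a vendor file.

```python
import csv
import io

# Invented sample in the general style of a Journal Report 1 CSV.
SAMPLE = """Journal Report 1,Number of Successful Full-Text Article Requests
Example University
,Publisher,Platform,Print ISSN,Online ISSN,Jan-2006,Feb-2006,Total YTD
Total for all journals,,,,,30,12,42
Journal A,Pub X,Plat Y,1234-5678,,10,5,15
Journal B,Pub X,Plat Y,,2345-6789,20,7,27
"""


def parse_report(text, first_period_col, end_marker="Total YTD"):
    """Return {journal: {period: count}} from a COUNTER-style CSV."""
    rows = list(csv.reader(io.StringIO(text)))
    # Skip heading lines: the header row is the first one holding the marker.
    header_idx = next((i for i, r in enumerate(rows) if end_marker in r), None)
    if header_idx is None:
        raise ValueError(f"no header row containing {end_marker!r}")
    header = rows[header_idx]
    # Walk the column headers from the first period column up to the marker.
    periods = []
    for col in range(first_period_col, len(header)):
        if header[col] == end_marker:
            break
        periods.append((col, header[col]))
    data = {}
    for row in rows[header_idx + 1:]:
        # Skip blank lines and subtotal/total lines.
        if not row or row[0].strip().lower().startswith("total"):
            continue
        data[row[0]] = {label: int(row[col]) for col, label in periods}
    return data


# Column F is index 5 when counting from zero.
report = parse_report(SAMPLE, first_period_col=5)
print(report)
```

For Journal Report 2, where the slide says periods start in column G, the same function would be called with `first_period_col=6`.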
COUNTER Harvester Conclusion
COUNTER compliance is claimed by many but delivered by few.
In fact, as of this writing, we haven’t found a single csv that fully conforms.
Many are very different from the standard format.
Others, like EBSCO, are very close, but since they are not exact, they are still not amenable to machine processing.
Hope for the future - SUSHI
The SUSHI schema needs more specificity, e.g.:
  the format of dates is not specified in the schema, but different date formats may cause different servers to reject the request or, worse still, to fail.
Integrating SUSHI and csv-based data:
  In SUSHI, the reporting period is generalized with a start/end date.
  In the Excel-based standard, the column headings are of the form mm-yyyy.
  To place both into a common database requires normalization.
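The normalization mentioned here can be as simple as turning each mm-yyyy column heading into an explicit start and end date, which is the form a SUSHI-style reporting period already has. A minimal sketch, with a function name chosen for illustration:

```python
import calendar
from datetime import date


def period_from_heading(heading):
    """Convert an "mm-yyyy" column heading to (start_date, end_date)."""
    month_text, year_text = heading.strip().split("-")
    month, year = int(month_text), int(year_text)
    # monthrange returns (weekday of first day, number of days in the month).
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, 1), date(year, month, last_day)


start, end = period_from_heading("02-2006")
print(start.isoformat(), end.isoformat())  # 2006-02-01 2006-02-28
```

The ISO strings match the yyyy-mm-dd form used for START_DATE and END_DATE in the STATISTICS table.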
Tim J Stats- SUSHI.ppt