ER&L 2010: Usage Statistics for E-resources – is all that data meaningful?

Speaker: Sally R. Krash, vendor

Three options for collecting usage data: do it all yourself; gather and format the data to upload into a vendor's collection database; or have the vendor gather the data and send a report (Harrassowitz e-Stats). Surprisingly, the second option turned out to be more time-consuming than the first because the library's data didn't always match the vendor's data. The third is the easiest because the data comes straight from the library's subscription agent.

Evaluation: review cost data; set a cost-per-use cut-off point ($50, $75, $100, local ILL/DocDel costs, whatever fits); generate a list of all resources that fall beyond that point; use that list to determine cancellations (a rough sketch of the process is below). For citation databases, they want to see upward trends in use, not just cyclical spikes that average out from year to year.
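To make that cut-off process concrete, here's a rough sketch in Python – the titles, costs, use counts, and threshold are all invented; only the filtering logic matters:

```python
# Rough sketch of the cut-off evaluation described above.
# Titles, costs, use counts, and the threshold are invented for the example.
resources = [
    {"title": "Journal A", "annual_cost": 1200.00, "uses": 300},
    {"title": "Journal B", "annual_cost": 900.00, "uses": 6},
    {"title": "Database C", "annual_cost": 5000.00, "uses": 40},
]

CUTOFF = 75.00  # e.g., roughly what an ILL/DocDel transaction costs locally

for r in resources:
    r["cost_per_use"] = r["annual_cost"] / max(r["uses"], 1)

# Everything beyond the cut-off point goes on the cancellation review list.
candidates = [r for r in resources if r["cost_per_use"] > CUTOFF]
for r in sorted(candidates, key=lambda c: c["cost_per_use"], reverse=True):
    print(f"{r['title']}: ${r['cost_per_use']:.2f} per use")
```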

Future: Publishers, especially journal publishers, need to provide more turnaway reports. COUNTER JR5 will give more detail about article requests by year of publication. A combined COUNTER JR1 & BR1 report would also be welcome – librarians don't care about format, they just want download data. Full-text subscriptions need download counts, not just searches/sessions.

Speaker: Benjamin Heet, librarian

He is speaking about the University of Notre Dame's statistics philosophy. They collect JR1 full-text downloads – they're not into database statistics, mostly because federated searching skews those numbers. Impact factors and Eigenfactors are hard to evaluate. He asks, "can you make questionable numbers meaningful by adding even more questionable numbers?"

At first, he was downloading the spreadsheets monthly and making them available on the library website. He started looking for a better way, whether that was to pay someone else to build a tool or do it himself. He went with the DIY route because he wanted to make the numbers more meaningful.

Avoid junk in, junk out: whether a platform racks up HTML or PDF downloads depends on how it is set up, so the counts aren't directly comparable across platforms. Pay attention to outliers and watch for spikes that might indicate unusual use by a single individual (one way to flag them is sketched below). Reports often contain bad data or duplicate rows.
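One simple way to flag those spikes is to compare each month against the title's own average – the counts and the two-standard-deviation threshold here are arbitrary examples:

```python
# Flag months that are far above a title's typical monthly downloads.
# The counts and the 2-standard-deviation threshold are arbitrary examples.
from statistics import mean, stdev

monthly_downloads = {
    "Jan": 110, "Feb": 95, "Mar": 120, "Apr": 105,
    "May": 980,  # suspicious spike -- possibly one user bulk-downloading
    "Jun": 90,
}

values = list(monthly_downloads.values())
avg, sd = mean(values), stdev(values)

for month, count in monthly_downloads.items():
    if sd > 0 and (count - avg) / sd > 2:
        print(f"{month}: {count} downloads is an outlier (average {avg:.0f})")
```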

CORAL Usage Statistics – this locally developed program gives them a central location to store usernames & passwords. He downloads reports quarterly now, and the public interface allows other librarians to view the stats as readable reports.

Speaker: Justin Clarke, vendor

Harvesting reports takes a lot of time and carries some administrative cost. SUSHI is a vehicle for automating the transfer of statistics from one source to another (a rough sketch of a request is below). However, you still need to look at the data. Your subscription agent has a lot more data about a resource than just its use, and can combine the two to paint a broader picture of resource use.
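For the curious, a SUSHI request is essentially a SOAP message asking a vendor's server for a named COUNTER report. A rough sketch in Python – the endpoint URL and IDs are placeholders, and exact element names and required attributes vary by SUSHI version, so consult the vendor's documentation:

```python
# Rough sketch of a SUSHI (NISO Z39.93) request for a COUNTER JR1 report.
# The endpoint URL and IDs are placeholders; exact schema details vary
# by SUSHI version, so check the vendor's docs before relying on this.
import requests

SUSHI_ENDPOINT = "https://stats.example-vendor.com/sushi"  # hypothetical

envelope = """<?xml version="1.0" encoding="UTF-8"?>
<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"
               xmlns:sus="http://www.niso.org/schemas/sushi">
  <soap:Body>
    <sus:ReportRequest>
      <sus:Requestor><sus:ID>our-requestor-id</sus:ID></sus:Requestor>
      <sus:CustomerReference><sus:ID>our-customer-id</sus:ID></sus:CustomerReference>
      <sus:ReportDefinition Name="JR1" Release="3">
        <sus:Filters>
          <sus:UsageDateRange>
            <sus:Begin>2009-01-01</sus:Begin>
            <sus:End>2009-12-31</sus:End>
          </sus:UsageDateRange>
        </sus:Filters>
      </sus:ReportDefinition>
    </sus:ReportRequest>
  </soap:Body>
</soap:Envelope>"""

# Some servers also expect a SOAPAction header; text/xml is SOAP 1.1.
response = requests.post(SUSHI_ENDPOINT, data=envelope,
                         headers={"Content-Type": "text/xml"})
response.raise_for_status()
# The body is a COUNTER XML report -- the transfer is automated,
# but the data still needs a human eye.
print(response.text[:500])
```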

Harrassowitz starts with acquisitions data and matches the usage statistics to it. They also capture things like publisher changes and title changes. Cost per use is not as simple as dividing price by downloads – packages confuse the matter, as the toy example below shows.
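A toy example of why the division isn't simple: when titles come in a package, the per-title cost depends entirely on how you allocate the package price. All figures here are invented:

```python
# Toy example: cost per use when titles are bought as a package.
# All figures are invented; the point is that the allocation rule matters.
package_cost = 10000.00
uses = {"Title A": 900, "Title B": 90, "Title C": 10}

# Rule 1: split the package cost evenly across titles.
even_share = package_cost / len(uses)
# Rule 2: treat the package as one unit and divide by total use.
package_cpu = package_cost / sum(uses.values())

for title, n in uses.items():
    print(f"{title}: even-split ${even_share / n:.2f} per use "
          f"vs. package-level ${package_cpu:.2f} per use")
```

Split the cost evenly and the low-use title looks terrible; treat the package as one unit and every title looks identical. Neither answer is wrong, which is exactly the problem.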

High use could be the result of class assignments or of hackers/hoarders bulk-downloading. Low use might reflect a politically motivated purchase or support for a new department. Either way, you need cost as a reference point. Publisher pricing seems to have no rhyme or reason, and your price is not necessarily the list price. Multi-year analysis and subject-based analysis reveal local trends.

Rather than usage statistics, we need useful statistics.

ebook usage statistics

In a recent phone/web town hall discussion, Peter Shepherd, Project Director for COUNTER, mused about why publishers (and libraries) have not embraced the COUNTER Code of Practice for Books and Reference Works as quickly as they have the Code of Practice for Journals and Databases. His position is that we are paying customers and should have that information. My perspective: meh.

I would like to see ebook usage for items that we purchase as a subscription, but for items we own (i.e. one-time purchases with perpetual access), it's less of a concern for collection development. Licensed ebooks with annual subscriptions (like regularly updated encyclopedias or book packages) are more like online databases or ejournals than traditional paper books, so it shouldn't be difficult for publishers to implement the COUNTER Code of Practice for Books and Reference Works and provide usage information to those customers.

For books that are static and have no annual cost attached to them, there isn't much of a regular need to know what is being used. We keep track of re-shelving stats in order to manage a physical collection with space limitations, but those problems aren't replicated in an online environment. The usage of owned ebooks comes into play when we are justifying:
a. The purchase of those specific ebooks.
b. The purchase of future ebooks from that publisher.
c. The amount of money in the ebook budget.

Hopefully Mr. Shepherd, Project COUNTER, and vocal librarians will be able to convince the publishers of the value of providing usage information. When budgets are as tight as they are these days, having detailed information about the use of your subscription-based collection is essential for making decisions about what must be kept and what can be let go (or should be promoted more to the users). Of course, in less desperate times, knowing this information is also important for making adjustments to the library’s collection emphasis in order to meet the needs of the users.

gathering statistics

For the past couple of weeks, most of my work days have been spent tracking down and massaging usage statistics reports from the publishers of the online products we purchase. I am nearly halfway through the list, and I have a few observations based on this experience:

1. There are more publishers not following the COUNTER Code of Practice than publishers who are. Publishers in traditionally library-dominated (and in particular, academic-library-dominated) markets are more likely to provide COUNTER-compliant statistics, but even that is not a guarantee.

2. Some publishers provide usage statistics, even COUNTER-compliant usage statistics, but only for the past twelve months or some other short window. That would be acceptable only if the library had been saving the reports locally all along (see the archiving sketch after this list); otherwise, twelve months of data is not long enough to make informed decisions.

3. We are not trying to use these statistics to find out which resources to cancel. On the contrary, if I can find data that shows an increase in use over time, then my boss can use it to justify our annual budget request and maybe even ask for more money.
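As promised in point 2, a minimal sketch of filing each report locally so the history outlives a publisher's twelve-month window – the paths and filenames are purely illustrative:

```python
# Minimal sketch of archiving each downloaded report locally so the
# history outlives the publisher's window. Paths/naming are illustrative.
import shutil
from datetime import date
from pathlib import Path

ARCHIVE = Path("usage-stats-archive")

def archive_report(downloaded_file: str, publisher: str) -> Path:
    """Copy the report into publisher/year, stamped with today's date."""
    today = date.today()
    dest_dir = ARCHIVE / publisher / str(today.year)
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / f"{today.isoformat()}-{Path(downloaded_file).name}"
    shutil.copy2(downloaded_file, dest)
    return dest

# e.g., archive_report("JR1-2010-03.csv", "example-publisher")
```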

Update: It seems that the conversation regarding my observations is happening over on FriendFeed. Please feel free to join in there or leave your thoughts here.
