Anna Creech is a university librarian with two cats, glasses, comfortable shoes, and a fear of turning into a stereotype.

Topics

Archives

Shovers and Makers 2009: I’m a winner! (So are you.) shoversandmakers.net

#11

I’ll bet you thought I forgot about this whole 50 books thing. No, it’s just that once again, my intentions are much more noble than reality. I have also been rather poor at reporting on the books I’ve read this year, but most of the time, I assume I’m the only one who really cares about all this, anyway.

#11 is I’m Off Then: Losing and Finding Myself on the Camino de Santiago by Hape Kerkeling, translated by Shelley Frisch. This one landed on my doorstep the other week as the latest in a slow trickle of review books coming in from Library Journal. (You can search for my recent reviews, if you’re so inclined.) A little uncertain about it at first, I quickly found myself lost in the story and read it cover to cover in one sitting.

Kerkeling is a German comedy performer of some renown. Not being up on my European comedians (aside from nearly memorizing all of Eddie Izzard’s routines on YouTube), I hadn’t heard of the fellow before this book. After failing to track down any recording of a performance in English or with subtitles, I gave up. Considering that my German linguistic skills are virtually nil, I’m not surprised I hadn’t heard of him before. (If you are interested, Amazon has a short interview with him in English.)

The book is essentially the diary he wrote while hiking the Camino de Santiago in 2001. It’s not strictly a recording of events and people from the pilgrimage, but the stories he tells about his background and prior experiences add import to the things that happen to him on the trail. By the end of the story, I felt as though Kerkeling was a long-lost friend with whom I had recently reunited over a cup of coffee. In many ways, this book reminded me of Kelly Winters’ Walking Home, and that is a good thing.

Technorati Tags: , ,

Pandora Town Hall (Richmond, VA)

June 29, 2009
approx 100 attending
free t-shirts! free burritos from Chipotle!

Tim Westergren, founder of Pandora

His original plan was to get in a car & drive across country to find local music to add to Pandora, but it wasn’t quite as romantic as he thought it would be. On the way home, he planned a meetup on the fly using the Pandora blog, and since then, whenever he visits a new city, he organizes get together like this one.

Tim is a Stanford graduate and a musician, although he didn’t study it specifically. He spent most of his 20s playing in bands, touring around the country, but not necessarily as a huge commercial success. It’s hard to get on the radio, and radio is the key to professional longevity. Eventually, he shifted to film score composition, which required him to analyze music and break it down into components that represent what is happening on the screen. This generated the idea of a musical genome.

The Music Genome Project was launched in 2000 with some seed money that lasted about a year. Eventually, they ran out of money and couldn’t pay their 45 employees. They tried several different ways to raise money, but nothing worked until some venture investors put money into it in 2004. At that point, they took the genome and repurposed it into a radio (Pandora) in 2005.

They have never advertised — it has all been word of mouth. They now add about 65,000 new listeners per day! They can see profitability on the horizon. Pandora is mainly advertising supported. The Amazon commissions provide a little income, but not as much as you might think they would.

There are about 75,000 artists on the site, and about 70% are not on a major label. The song selection is not based on popularity, like most radio, but rather on the elements of the songs and how they relate to what the user has selected.

Playlists are initially created by the song or artists musical proximity to begin with, and then is refined as the user thumbs up or down songs. Your thumbs up and down effect only the station you are listening to, and it effects whatever the rest of the playlist was going to be. They use the over-all audience feedback to adjust across the site, but it’s not as immediate or personalized.

They have had some trouble with royalties. They pay both publishing and performer royalties per song. They operate under the DMCA, including the royalty structure. Every five years, a committee determines what the rate will be for the next year. In July 2007, the committee decided to triple the ratings and made it retroactive. It essentially bankrupted the company.

Pandora called upon the listeners to help them by contacting their congressional representative to voice opposition to the decision. Congress received 400,000 faxes in three days, breaking the structure on the Hill for a week! Their phones were ringing all day long! Eventually, they contacted Pandora to make it stop. They are now finishing up what needs to be done to bring the royalty back to something more reasonable. (Virtually all the staffers on Capitol Hill are Pandora users — made it easy to get appointments with congress members.)

Music comes to Pandora from a variety of sources. They get a pile of physical and virtual submissions from artists. They also pay attention to searches that don’t result in anything in their catalog, as well as explicit suggestions from listeners.

They have a plan to offer musicians incentives to participate. For example, if someone thumbs up something, there would be a pop-up that suggest checking out a similar (or the same) band that is playing locally. Most of the room would opt into emails that let them know when bands they like are coming to town. Musicians could see what songs are being thumbed up or down and where the listeners are located.

Listener suggestion: on the similar artists pages, provide more immediate sampling of recommendations.

What is the cataloging backlog? It takes about 8-10 weeks, and only about 30% of what is submitted makes it in. They select based on quality: for what a song is trying to do, does it do it well? They know when they’ve made a wrong decision if they don’t include something and a bunch of people search for it.

Pandora is not legal outside of the US, but many international users fake US zip codes. However, in order to avoid lawsuits, they started blocking by IP. As soon as they implemented IP blocking, they received a flood of messages, including one from a town that would have “Pandora night” at a local club. (The Department of Defense called up and asked them to block military IP ranges because Pandora was hogging the bandwidth!)

Why are some songs quieter than others? Tell them. They should be correcting for that.

The music genome is used by a lot of scorers and concert promoters to find artists and songs that are similar to the ones they want.

Could the users be allowed more granular ratings rather than thumbing up or down whole songs? About a third of the room would be interested in that.

Mobile device users are seeing fewer advertisements, and one listener is concerned that this will impact revenue. Between the iPhone, the Blackberry, and the Palm Pre, they have about 45,000 listeners on mobile devices. This is important to them, because these devices will be how Pandora will get into listener’s cars. And, in actuality, mobile listeners interact with advertisements four times as much as web listeners.

Tim thinks that eventually Pandora will host local radio. I’m not so sure how that would work.

Subscription Pandora is 192kbps, which sounds pretty good (and it comes with a desktop application). It’s not likely to get to audiophile level until the pipes are big enough to handle the bandwidth.

Variety and repetition is their biggest areas where they get feedback from listeners. The best way to get variety is to add different artists. If you thumb down an artist three times, they should be removed from the station.

They stream about 1/3 of the data that YouTube streams daily, with around 100 servers. Tim is not intimately familiar with the tech that goes into make Pandora work.

[The questions kept coming, but I couldn't stay any longer, unfortunately. If you have a chance to attend a Pandora Town Hall, do it!]

Technorati Tags: , ,

Kindle 2 is kind of cool, actually

My library (as in, the library where I work) has the good fortune of being blessed with both funds and leadership that allow us to experiment with some emerging technologies. When Amazon released the first version of the Kindle, we purchased one to experiment with. It was simply the latest in a long history of ebook readers that we had hoped to be able to incorporate into the library’s function on campus.

I took a turn at using the Kindle, and I was mightily unimpressed. The interface seemed very clunky, to the point of preventing me from getting into the book I tried to read. When the Kindle 2 was released and we received permission to purchase one, I was skeptical that it would be any better, but I still signed up for my turn at using it.

Last week, I was given the Kindle 2, and since it already had a book on it that I was half-way through reading, I figured I would start there. However, I was not highly motivated to make the time for it. Yesterday afternoon, I took the train up to DC, returning this morning. Four hours round trip, plus the extra time spent waiting at each station, gave me plenty of time to finish my book, so I brought the Kindle 2 with me.

I’m not going to gush about how I fell in love with the device, because I didn’t. However, I finished the book with ease before I arrived in DC, and out of shear boredom I pulled down a copy of another book that was already purchased on our library account. I was pleasantly surprised by how easy it was to go from one book to another without having to lug along several selections from my library “just in case” I ran out of something to read.

Right now, I’m at least a third of the way in on the second book, and I plan to finish reading it on the Kindle 2.

I don’t think I’ll end up buying one anytime soon, particularly since I’ve put a stop to buying new books until I’ve read more of the ones I own. However, I have a better understanding of those Kindle enthusiasts who rave about having their entire library (and more) at their fingertips. It’s pretty handy if you’re someone who often has time to kill away from your library.

Technorati Tags:

where I spend my time online

While I was at the reference desk this quiet afternoon, I attempted to catch up on scanning through Lifehacker. Their article about the Geek Chart app caught my eye. Microblogging, or at the very least, in the moment stream of consciousness sharing, has taken over a good portion of my online presence, leaving this venue for slightly more substantial (and infrequent) commentary. So, I decided to fill out the details needed to build my Geek Chart.


Anna’s Geek Chart

Looks like those of you who want a more regular dose of Anna will need to be following my Twitter and Flickr feeds (with some Delicious thrown in). For the rest of you, enjoy the lighter load on your feed reader.

Technorati Tags: , , , ,

NASIG 2009: What Color Is Your Paratext?

Presenter: Geoffrey Bilder, CrossRef

The title is in reference to a book that is geared towards preparing for looking for a new job or changing careers, which is relevant to what the serials world is facing, both personnel and content. Paratext is added content that prepares the audience/reader for the meat of the document. We are very good at controlling and evaluating credibility, which is important with conveying information via paratext.

The internet is fraught with false information, which undermines credibility. The publisher’s value is being questioned because so much of their work can be done online at little or no cost, and what can’t be done cheaply is being questioned. Branding is increasingly being hidden by layers like Google which provide content without indicating the source. The librarian’s problem is similar to the publisher’s. Our value is being questioned when the digital world is capable of managing some of our work through distributed organizational structures.

“Internet Trust Anti-Pattern” — a system starts out as being a self-selected core of users with an understanding of trust, but as it grows, that can break down unless there is a structure or pervasive culture that maintains the trust and authority.

Local trust is that which is achieved through personal acquaintance and is sometimes transitive. Global trust extends through proxy, which transitively extends trust to “strangers.” Local is limited and hard to expand, and global increases systemic risk.

Horizontal trust occurs among equals with little possibility of coercion. Vertical trust occurs within a hierarchy, and coercion can be used to enforce behavior, which could lead to abuse.

Internet trust is in the local and horizontal quadrant. Scholarly trust falls in the vertical and global quadrant. It’s no wonder we’re having trouble figuring out how to do scholarship online!

Researchers have more to read and less time to read it, and it’s increasing rapidly. We need to remember that authors and readers are the same people. The amazing ways that technology has opened up communication is also causing the overload. We need something to help identify credible information.

Dorothea Salo wrote that for people who put a lot of credibility in authoritative information, we don’t do a very good job of identifying it. She blames librarians, but publishers have a responsibility, too. Heuristics are important in knowing who the intended audience is meant to be.

If you find a book at a bargain store, the implication is that it is going to be substantially less authoritative than a book from a grand, old library. (There are commercial entities selling leather bound books by the yard for buyers to use to add gravitas to their offices and personal libraries.) Scholarly journals are dull and magazines are flashy & bright. Books are traditionally organized with all sorts of content that tells academics whether or not they need to read them (table of contents, index, blurbs, preface, bibliography, etc.).

If you were to black out the text of a scholarly document, you would still be able to identify the parts displayed. You can’t do that very well with a webpage.

When we evaluate online content, we look at things like the structure of the URL and where it is linked from. In the print world, citations and footnotes were essential clues to following conversations between scholars. Linking can do that now, but the convention is still more formal. Logos can also tell us whether or not to put trust in content.

Back in the day, authors were linked to printers, but that lead to credibility problems, so publishers stepped in. Authors and readers could trust that the content was accurate and properly presented. Now it’s not just publishers — titles have become brands. A journal reputation is almost more important than who is publishing it.

How do we help people learn and understand the heuristics in identifying scholarly information? The processes for putting out credible information is partially hidden — the reader or librarian doesn’t know or see the steps involved. We used to not want to know, but now we do, particularly since it allows us to differentiate between the good players and the bad players.

The idea of the final version of a document needs to be buried. Even in the print world (with errata and addendums) we were deluding ourselves in thinking that any document was truly finished.

Why don’t we have a peer reviewed logo? Why don’t we have something that assures the reader that the document is credible? Peer review isn’t necessarily perfect or the only way.

How about a Version of Record record? Show us what was done to a document to get it to where it is now. For example, look at Creative Commons. They have a logo that indicates something about the process of creating the document which leads to machine-readable coding. How about a CrossMark that indicates what a publisher has done with a document, much like what a CC logo will lead to?

Knowmore.org created a Firefox plugin to monitor content and provides icons that flags companies and websites for different reasons. Oncode is a way of identifying organizations that have signed a code of conduct. We could do this for scholarly content.

Tim Berners Lee is actively advocating for ways to overlay trust measures on the internet. It was originally designed by academics who didn’t need it, but like the internet anti-trust pattern, the “unwashed masses” have corrupted that trust.

What can librarians and publishers do to recreate the heuristics that have been effective in print? We are still making facsimiles of print in electronic format. How are we going to create the tools that will help people evaluate digital information?