230 likes | 368 Views
Collaborative Filtering: Possibilities for Digital Libraries. Janet Webster , OSU Libraries Jon Herlocker , OSU Computer Science Heather Pennington-Lehman , UW Information School Seikyung Jung , OSU Computer Science. Our driving questions.
E N D
Collaborative Filtering:Possibilities for Digital Libraries Janet Webster, OSU Libraries Jon Herlocker, OSU Computer Science Heather Pennington-Lehman, UW Information School Seikyung Jung, OSU Computer Science IAMSLIC 2003
Our driving questions • How to translate traditional library approaches to reference and resource discovery into the web environment? • Can collaborative filtering be successfully applied to digital libraries in a manner that improves the effectiveness of the library? IAMSLIC 2003
What is collaborative filtering? • Users evaluate and rate search results. • Indicates individual’s information need. • Used for others with similar information needs. • Responds to changing needs. • Examples • MovieLens http://www.movielens.org • Amazon.com IAMSLIC 2003
Why is CF relevant to libraries? • Electronic information is increasing. • How to provide satisfactory access to it? • Evolving definition of digital library • How to add value to our collections & services? • Harness our expertise • How to involve us in the web search process? IAMSLIC 2003
CF and OSU Libraries • Commitment to building natural resources digital library • Need to build relationship • Opportunity provided by Network for Earthquake Engineering Simulation (NEES) • Availability of expertise IAMSLIC 2003
Two Test Beds • OSU Libraries Recommender System • Integrate with existing library systems and traditions • Dealing with noisy and untrustworthy data • Compute, display & explain recommendations • Infer recommendations from user behavior • Evolve CF in a broader setting • Tsunami Digital Library • Explore collecting digital documents • Develop partnerships • Evolve CF in a circumscribed community IAMSLIC 2003
Tsunami Digital Library:Description • Portal for ‘high-quality’ tsunami information and data • Different interfaces for different audiences • Test bed for CF technology • Test bed for international collaboration • Model for distributed maintenance IAMSLIC 2003
Tsunami Digital Library:Information Characteristics • Authority • Access • Usability • Permanence IAMSLIC 2003
Tsunami Digital Library:Development • Encouraged to collaborate • Identify potential “high quality” sites • Develop preliminary interface • Conduct controlled experiment • Refine system • Conduct need assessment • Use parallel test bed IAMSLIC 2003
Tsunami Digital Library:Needs Assessment • Identify users. • Conduct pilot survey. • Broaden survey. • Integrate findings into system. • Get more funding. IAMSLIC 2003
Preliminary Survey Results • What tsunami resources do you use and how do you use them? • Specific sites • Usage patterns • Problems encountered IAMSLIC 2003
Preliminary Survey Results • What’s a digital library? • Description • Usage IAMSLIC 2003
Preliminary Survey Results • Would you be involved in building and maintaining a tsunami digital library? • If so, what should it have in it? IAMSLIC 2003
Emerging Issues • What’s in the TDL? • Collection policy • How can we help manage sites we “collect”? • Administrative component • Capturing orphaned sites • Building partnerships IAMSLIC 2003
Our driving questions • How to translate traditional library approaches to reference and resource discovery into the web environment? • Can collaborative filtering be successfully applied to digital libraries in a manner that improves the effectiveness of the library? IAMSLIC 2003
So, why is CF relevant? • Handle more electronic information • Harness our expertise • Shape direction of digital libraries IAMSLIC 2003
CIRCLE Research Team • Brandon Correy, undergraduate • Kevin Harris, undergraduate • Jon Herlocker, asst professor • Seikyung Jung, PhD candidate • Kristine Pence, undergraduate • Heather Pennington-Lehman, masters student • Michael Tichenor, undergraduate • Kami Vaniea, undergraduate • Janet Webster, assc professor IAMSLIC 2003
Funding • OSU Libraries Gray Chair for Innovative Technologies • Northwest Alliance for Computational Scientific Engineering • Georgia Pacific HMSC internship IAMSLIC 2003
More information • Silence of the Sleeper • http://www.gladwell.com/1999/1999_10_04_a_sleeper.htm • Recker & Walker, Collaborative Information Filtering • J of Interactive Learning Research 14 (1): 79-98 • Technical paper on our controlled experiment • http://eecs.oregonstate.edu/library/files/2003-17/jcdl2003.pdf • TDL http://dl.nacse.org/ • Library Recommender http://dl.nacse.org/osu IAMSLIC 2003
Contacts • Janet Webster • Oregon State University Libraries, Hatfield Marine Science Center • janet.webster@oregonstate.edu • Jon Herlocker • Oregon State University, School of Electrical Engineering & Computer Science • herlock@eecs.oregonstate.edu IAMSLIC 2003