+ All Categories
Home > Documents > Searching Featured Lists. About Mercer University.

Searching Featured Lists. About Mercer University.

Date post: 30-Mar-2015
Category:
Upload: chandler-stephenson
View: 215 times
Download: 2 times
Share this document with a friend
Popular Tags:
43
Searching Featured Lists Jeremy Brown Linda Chen Mercer University Libraries
Transcript
Page 1: Searching Featured Lists. About Mercer University.

Searching Featured Lists

Jeremy BrownLinda Chen

MercerUniversity Libraries

Page 2: Searching Featured Lists. About Mercer University.

About Mercer University

• Academic University• 11 Schools• 10,000+ students

Page 3: Searching Featured Lists. About Mercer University.

Featured Lists

• User Manual Page #107098• Supports review files of bib or item records• Do Not support other record types• Need to keep the list(s) from the Creating List

Page 4: Searching Featured Lists. About Mercer University.

2 Steps

• Create a list• Set up a WWWOption

Page 5: Searching Featured Lists. About Mercer University.

Step 1 – Create a List

Page 6: Searching Featured Lists. About Mercer University.

Step 1 - Creating List Search Screen

Page 7: Searching Featured Lists. About Mercer University.

Step 2 - WWWOption

Page 8: Searching Featured Lists. About Mercer University.

Step 2 - WWWOption

• User Manual Page # 106908•WWWOPTION: FEATURED_LIST

Page 9: Searching Featured Lists. About Mercer University.

Example 1

•WWWOPTION: FEATURED_LIST=[list]|[table_header]| [revfile_header]|[#recs_header]• FEATURED_LIST= 01,03,12,17

Page 10: Searching Featured Lists. About Mercer University.

Example 2

•WWWOPTION: FEATURED_LIST=[list]|[table_header]| [revfile_header]|[#recs_header]• FEATURED_LIST=01,03,12,17|SELECTED TITLES|

TOPIC|# TITLES

Page 11: Searching Featured Lists. About Mercer University.

Our WWWOption

FEATURED_LIST=[list]|[table_header]| [revfile_header]|[#recs_header]FEATURED_LIST=03,02,05,06,36|February 2012| Collections|# Titles

Page 12: Searching Featured Lists. About Mercer University.

Feature List

•What do we use it for? Monthly New Book Lists Special Projects Display Different Collections Anyone has other suggestion?

Page 13: Searching Featured Lists. About Mercer University.

Feature List

• This is ok, but..• Something is missing…

Cannot search the list

Page 14: Searching Featured Lists. About Mercer University.

Searching Feature Lists

Page 15: Searching Featured Lists. About Mercer University.

What We Wanted

•Menu of Feature Lists to Search• Keyword Search function

Page 16: Searching Featured Lists. About Mercer University.

What We Noticed

• Feature List URLs always contain “ftlist” Review file number

• Result Record URLs always contain Review file number Bibliographic record number

Page 17: Searching Featured Lists. About Mercer University.

Anatomy of a List URL

Page 18: Searching Featured Lists. About Mercer University.

A Result URL

Page 19: Searching Featured Lists. About Mercer University.

Anatomy of a Result URL

Page 20: Searching Featured Lists. About Mercer University.

This Sounded Indexable!

• Each URL contains the list review file• Each URL contains the bib number• Bibliographic records have lots of key words.

Page 21: Searching Featured Lists. About Mercer University.

Indexing WebPAC Feature Lists

Page 22: Searching Featured Lists. About Mercer University.

Technologies Used

• Nutch: http://nutch.apache.org/• SOLR: http://lucene.apache.org/solr/• jQuery/AJAX: http://jquery.com

Page 23: Searching Featured Lists. About Mercer University.

Flow Chart

Page 24: Searching Featured Lists. About Mercer University.

Crawl TheWebPAC

Page 25: Searching Featured Lists. About Mercer University.

What Does Nutch Do?

1. Read list of URLs2. Download one of the URLs3. Extract all the links on the page, add to list

of URLs4. Extract the full text of the page5. Repeat until done with URLs6. Send Pages to SOLR for indexing

Page 26: Searching Featured Lists. About Mercer University.

Set up WebPAC for Crawling

•We need to let our crawler look at the WebPAC• Name the crawler in

[nutch_home]/conf/nutch-site.xml

• Edit /iiidb/http/robots.txt• Add:

User-agent: MercuSpiderDisallow:

Page 27: Searching Featured Lists. About Mercer University.

Goals of Nutch URL Configuration

• Only want bibs from the featured lists•We do not want to crawl:

MARC view Save record screens Request item screens Probably others…

•We do not want to index the whole WebPAC

Page 28: Searching Featured Lists. About Mercer University.

Set up Nutch for Crawling the WebPAC

• Edit nutch/conf/regex-urlfilter.txt• Add:

-^http://library.mercer.edu/.*request.*-^http://library.mercer.edu/.*save.*-^http://library.mercer.edu/.*marc.*+^http://library.mercer.edu/search~S1.*ftlist

Page 29: Searching Featured Lists. About Mercer University.

Indexing withSOLR

Page 30: Searching Featured Lists. About Mercer University.

What Is SOLR?• A search engine in a box• Put documents in• Creates a full text index• Answers queries

Page 31: Searching Featured Lists. About Mercer University.

Set up Solr for Crawling the WebPAC• SOLR comes ready to run• It works well with Nutch• Easy to use as search engine back end

Page 32: Searching Featured Lists. About Mercer University.

The Single, Solitary SOLR Modification

• Add to solr-example/solr/conf/solrconfig.xml

<requestHandler name="/mercuFtlist" class="solr.StandardRequestHandler"> <lst name="defaults"> <int name="rows">10</int> <str name="echoParams">explicit</str> <str name="fl">url,id</str> </lst> <lst name="appends"> <str name="fq">(url:ftlist)+(url:\+\+ftlist/\+\+ftlist)</str> </lst> <lst name="invariants"> <bool name="facet">false</bool> </lst></requestHandler>

Page 33: Searching Featured Lists. About Mercer University.

SOLR Request Handler, Explained

•We only want the URL and ID in our results

•We only want to search items that appear in the feature list

<str name="fq">(url:ftlist)+(url:\+\+ftlist/\+\+ftlist)</str>

<str name="fl">url,id</str>

Page 34: Searching Featured Lists. About Mercer University.

Performance Hit (Millennium)

Page 35: Searching Featured Lists. About Mercer University.

Performance Hit (Web Server)

Page 36: Searching Featured Lists. About Mercer University.

Searching

Page 37: Searching Featured Lists. About Mercer University.

Architecture• Standard web form• AJAX calls to SOLR for results• AJAX results enhancement

Page 38: Searching Featured Lists. About Mercer University.

Search Form

Page 39: Searching Featured Lists. About Mercer University.

Search Form in the WebPAC

Page 40: Searching Featured Lists. About Mercer University.

Basic Results Display

Page 41: Searching Featured Lists. About Mercer University.

Enhanced Results

Page 42: Searching Featured Lists. About Mercer University.

More Information• Our Featured Lists:

http://library.mercer.edu/search~S1?/ftlist• Visit our Systems page:

http://libraries.mercer.edu/university-libraries/systems-department/iug-2012or: http://goo.gl/3q3QV

•MAJAX is located here:http://libx.org/majax/

Page 43: Searching Featured Lists. About Mercer University.

Thank You!


Recommended