Date post: | 05-Jan-2016 |
Category: |
Documents |
Upload: | jasmine-oconnor |
View: | 213 times |
Download: | 0 times |
MOODy :) Investigations into Massive Open Online Discovery at IU
Juliet Hardesty (@jlhardes)Courtney Greene McDonald (@xocg)
Bryan J Brown (@bryjbrown)
Digital Library Brown Bag | December 11, 2013Tweet it! #dlbb
http://www.flickr.com/photos/rogersmith/313323541/
You’ve heard of MOOCs…
We’d like to introduce you to MOODs …
MassiveOpen Online
Discovery
http://www.flickr.com/photos/danielito311/5847295876/
IUCAT: Blacklight discovery interface http://iucat.iu.edu
Digital Collections Search (beta) : Blacklight discovery interface
http://webapp1.dlib.indiana.edu/dcs/
IUB Library Web Site Search: Drupal + SolrCurrently in development (wireframe)
One Big Index in the Sky
(Solr)
http://www.flickr.com/photos/thukral/1983931186/
Phase One• Enable indexing & combined search of Indiana
University catalog, repository, digital collections, website data– Combine feeds of catalog (IUCAT) & digital
collections (DCS) data via MODS
– Single index: author, date, format, language, location, & subject
– Facets: author, date, format, & subject
Websi te Content
IU Collections Data (MODS)
IUCAT: MARC to MODS DCS: M
ODSMAGIC!
Phase Two
• Expose IU dataset for potential combination with other institutional datasets
IU Dataset
Pat Q ScholarI.M. Hacker
• Goal: Identify best practices for UX around discovery in the context of metadata• Surveyed institutions to identify approaches to Solr
indexing and end-user options
Solr Schemas Used
Solr/SolrMarc - modified; 23; 72%
Solr/Solr-Marc - un-
modified; 4; 13%
Other; 5; 16%Other
SolrMarc - modified
SolrMarc - unmodified
Apache Solr - modified
Apache Solr - unmodified
0 2 4 6 8 10 12 14 16
5
8
1
15
3
Solr Schemas
Solr Schema Fields Modified
<types> <fields> <copyField> <dynamicField> Other0
5
10
15
20
25
18
22 2119
3
Types of Fields Modified
config
index
callnumber_
map
compositi
on_era_
map
country
_map
format_
map
instrumen
t_map
langu
age_
mapOther
0
1
2
3
4
5
6
7
87 7
3
0 0
4
0
1
3
SolrMarc .properties modified (10 responses)
Reasons for Modifying Fields
Connections to original metadata
http://www.flickr.com/photos/cobalt/4191469239/
http://www.flickr.com/photos/arielle_kristina/4095456119/
What is being indexed?
Books
Article
s
Journals
Web
pages/
sites
3D objects
Photographs
Sound re
cord
ings
Moving i
mages
Even
ts
Games
(video
or board
)
Finding a
ids
Manuscr
ipts/co
rresp
ondence
0
2
4
6
8
10
12
14
16
18
20
Non full-text / Derivative size / Information onlyFull-text / Streaming / Original size
Metadata mapped to Solr/SolrMARC
Other
DC
PBCore
FRBR
METS
TEI
MODS
EAD
MARC
0 2 4 6 8 10 12 14 16
10
4
1
0
1
2
10
4
15
Discovery Layer
Blacklight; 10; 42%
VuFind; 6; 25%
Other; 8; 33%
Shared File Sets
Just what we asked for!
http://www.flickr.com/photos/annettepedrosian/2108145618/
Proof of Concept
Internship Goal:Explore possibilities of combining multiple
metadata feeds into one central index
Feed 1
Feed 2
Index?
QuestionsWhat are our data sources?
?
?
Index?
QuestionsWhat is Apache Solr, and how does it work?
IUCAT
Fedora
Solr?? ??
?
?
?
Librarian-friendly documentation is on the way!
QuestionsWhat’s the best way to get the data?
IUCAT
Fedora
Solr??
?
QuestionsWhat’s the “native” format?
IUCAT
Fedora
Solr?Z39.50
OAI-PMH
?
?
QuestionsWhat data should we index?
IUCAT
Fedora
Solr?Z39.50
OAI-PMH
MARCXML
MODS
?
Our custom Solr schema
QuestionsHow can we transform it?
IUCAT
Fedora
Solr
Z39.50
OAI-PMH
MARCXML
MODS
schema.xml
?
?
QuestionsHow can we automate the process?
IUCAT
Fedora
Solr
Z39.50
OAI-PMH
MARCXML
MODS
schema.xml
XSLT
XSLT
BatchIngest
Future Goals
?
FedoraSolr
?
OAI-PMH
?
MODSschema.xml
XSLT
XSLT
BatchIngest
IUCAT Z39.50
MARCXML
XSLT
? ?
?
XSLT
Future Goals
?
FedoraSolr
?
OAI-PMH
?
MODSschema.xml
XSLT
XSLT
BatchIngest
IUCAT Z39.50
MARCXML
XSLT
? ?
?
XSLT
Future Goals
?
FedoraSolr
?
OAI-PMH
?
MODSschema.xml
XSLT
XSLT
BatchIngest
IUCAT Z39.50
MARCXML
XSLT
? ?
?
XSLT
Plans Moving Forward
http://www.flickr.com/photos/usfws_alaska/7376551524/
Questions?Comments!
Kittens >^..^<
http://www.flickr.com/photos/notemily/5394289051/
Thank you!
Julie ([email protected])Courtney ([email protected])Bryan ([email protected])
More info (posters, more data, &c) at http://bit.ly/meta-lita-2013These slides will shortly be available via IU Scholarworks
THAN
K YOU
!!1!11!!