
MOODy :) Investigations into Massive Open Online Discovery at IU Juliet Hardesty (@jlhardes)...

Page 1: MOODy :) Investigations into Massive Open Online Discovery at IU Juliet Hardesty (@jlhardes) Courtney Greene McDonald (@xocg) Bryan J Brown (@bryjbrown)

MOODy :) Investigations into Massive Open Online Discovery at IU

Juliet Hardesty (@jlhardes)

Courtney Greene McDonald (@xocg)

Bryan J Brown (@bryjbrown)

Digital Library Brown Bag | December 11, 2013

Tweet it! #dlbb

http://www.flickr.com/photos/rogersmith/313323541/

Page 2:

You’ve heard of MOOCs…

We’d like to introduce you to MOODs …

Massive Open Online Discovery

http://www.flickr.com/photos/danielito311/5847295876/

Page 3:

IUCAT: Blacklight discovery interface http://iucat.iu.edu

Digital Collections Search (beta): Blacklight discovery interface http://webapp1.dlib.indiana.edu/dcs/

IUB Library Web Site Search: Drupal + Solr

Currently in development (wireframe)

Page 4:

One Big Index in the Sky

(Solr)

http://www.flickr.com/photos/thukral/1983931186/

Page 5:

Phase One

• Enable indexing & combined search of Indiana University catalog, repository, digital collections, website data

– Combine feeds of catalog (IUCAT) & digital collections (DCS) data via MODS

– Single index: author, date, format, language, location, & subject

– Facets: author, date, format, & subject
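
As a rough illustration of the Phase One plan, the sketch below builds a Solr select request that searches one combined index and asks for the four facets listed on this slide. The base URL, the core name `iu_combined`, and field names such as `author_facet` are assumptions made for the example and do not come from the slides; only the `q`, `rows`, `wt`, `facet`, and `facet.field` request parameters are standard Solr.

```python
from urllib.parse import urlencode

# Hypothetical location of the combined index; not taken from the slides.
SOLR_SELECT_URL = "http://localhost:8983/solr/iu_combined/select"

# Facets named on the slide; the "_facet" suffix is an assumed naming convention.
FACET_FIELDS = ["author_facet", "date_facet", "format_facet", "subject_facet"]


def build_search_url(user_query, rows=10):
    """Return a Solr select URL with faceting enabled on the assumed facet fields."""
    params = [
        ("q", user_query),
        ("rows", str(rows)),
        ("wt", "json"),
        ("facet", "true"),
    ]
    # facet.field is repeated once per facet, so a list of pairs is used instead of a dict.
    params += [("facet.field", name) for name in FACET_FIELDS]
    return SOLR_SELECT_URL + "?" + urlencode(params)


url = build_search_url("hoagy carmichael")
print(url)
```

Because every source lands in the same index, one request of this shape could return catalog and digital collections records together, with facet counts computed across both.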

Page 6:

Website Content

IU Collections Data (MODS)

IUCAT: MARC to MODS

DCS: MODS

MAGIC!

Page 7:

Phase Two

• Expose IU dataset for potential combination with other institutional datasets

IU Dataset

Pat Q Scholar

I.M. Hacker

Page 8:

• Goal: Identify best practices for UX around discovery in the context of metadata

• Surveyed institutions to identify approaches to Solr indexing and end-user options

Page 9:

View the survey at http://bit.ly/13qhCD7

Page 10:

Solr Schemas Used

Solr/SolrMarc - modified; 23; 72%

Solr/SolrMarc - unmodified; 4; 13%

Other; 5; 16%

Solr Schemas

Other; 5

SolrMarc - modified; 8

SolrMarc - unmodified; 1

Apache Solr - modified; 15

Apache Solr - unmodified; 3

Page 11:

Solr Schema Fields Modified

Types of Fields Modified

<types>; 18

<fields>; 22

<copyField>; 21

<dynamicField>; 19

Other; 3

SolrMarc .properties modified (10 responses)

config; 7

index; 7

callnumber_map; 3

composition_era_map; 0

country_map; 0

format_map; 4

instrument_map; 0

language_map; 1

Other; 3

Page 12:

Reasons for Modifying Fields

Connections to original metadata

http://www.flickr.com/photos/cobalt/4191469239/

http://www.flickr.com/photos/arielle_kristina/4095456119/

Page 13:

What is being indexed?

Books

Articles

Journals

Web pages/sites

3D objects

Photographs

Sound recordings

Moving images

Events

Games (video or board)

Finding aids

Manuscripts/correspondence

Legend: Non full-text / Derivative size / Information only; Full-text / Streaming / Original size

Page 14:

Metadata mapped to Solr/SolrMARC

Other; 10

DC; 4

PBCore; 1

FRBR; 0

METS; 1

TEI; 2

MODS; 10

EAD; 4

MARC; 15

Page 15:

Discovery Layer

Blacklight; 10; 42%

VuFind; 6; 25%

Other; 8; 33%

Page 16:

Shared File Sets

Just what we asked for!

http://www.flickr.com/photos/annettepedrosian/2108145618/

Page 17:

Proof of Concept

Internship Goal: Explore possibilities of combining multiple metadata feeds into one central index

Feed 1

Feed 2

Index?

Page 18:

Questions

What are our data sources?

?

?

Index?

Page 19:

Questions

What is Apache Solr, and how does it work?

IUCAT

Fedora

Solr?? ??

?

?

?

Librarian-friendly documentation is on the way!

Page 20:

Questions

What’s the best way to get the data?

IUCAT

Fedora

Solr??

?

Page 21:

Questions

What’s the “native” format?

IUCAT

Fedora

Solr?

Z39.50

OAI-PMH

?

?

Page 22:

Questions

What data should we index?

IUCAT

Fedora

Solr?

Z39.50

OAI-PMH

MARCXML

MODS

?
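
To make the harvesting question more concrete, here is a minimal sketch of how an OAI-PMH `ListRecords` request for MODS records could be built. It assumes the repository exposes an OAI-PMH endpoint that offers a `mods` metadata prefix; the endpoint URL and the function name are hypothetical and do not come from the slides. The `verb`, `metadataPrefix`, and `resumptionToken` arguments are part of the OAI-PMH protocol itself.

```python
from urllib.parse import urlencode

# Hypothetical OAI-PMH endpoint; the slides do not give the real one.
OAI_BASE_URL = "https://repository.example.edu/oai"


def list_records_url(metadata_prefix="mods", resumption_token=None):
    """Build a ListRecords request URL for the first or a follow-up page of results."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument: it is sent
        # with the verb only, without metadataPrefix.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return OAI_BASE_URL + "?" + urlencode(params)


first_page = list_records_url()
next_page = list_records_url(resumption_token="abc123")
print(first_page)
print(next_page)
```

A harvester would fetch the first URL, read any `resumptionToken` from the response, and keep requesting follow-up pages until no token is returned.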

Page 23:

Our custom Solr schema

Page 24:

Questions

How can we transform it?

IUCAT

Fedora

Solr

Z39.50

OAI-PMH

MARCXML

MODS

schema.xml

?

?

Page 25:

Questions

How can we automate the process?

IUCAT

Fedora

Solr

Z39.50

OAI-PMH

MARCXML

MODS

schema.xml

XSLT

XSLT

BatchIngest
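
The transform-and-ingest step can be sketched as follows. The slides name XSLT as the transformation tool; this example swaps in Python's standard `xml.etree.ElementTree` purely to keep the sketch self-contained, so it is not the presenters' method. The sample record, the Solr field names, and the function name `mods_to_solr_doc` are all assumptions. The MODS namespace and element names are the standard ones.

```python
import json
import xml.etree.ElementTree as ET

MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# Invented sample record, used only to exercise the mapping.
SAMPLE_MODS = """<mods xmlns="http://www.loc.gov/mods/v3">
  <titleInfo><title>Sample photograph</title></titleInfo>
  <name><namePart>Doe, Jane</namePart></name>
  <typeOfResource>still image</typeOfResource>
  <originInfo><dateIssued>1925</dateIssued></originInfo>
  <language><languageTerm type="text">English</languageTerm></language>
  <subject><topic>Campus life</topic></subject>
  <subject><topic>Architecture</topic></subject>
</mods>"""

# Assumed Solr field name -> path into the MODS record.
FIELD_PATHS = {
    "title": "mods:titleInfo/mods:title",
    "author": "mods:name/mods:namePart",
    "date": "mods:originInfo/mods:dateIssued",
    "format": "mods:typeOfResource",
    "language": "mods:language/mods:languageTerm",
    "subject": "mods:subject/mods:topic",
}


def mods_to_solr_doc(mods_xml, record_id):
    """Flatten one MODS record into a dict of multi-valued Solr fields."""
    root = ET.fromstring(mods_xml)
    doc = {"id": record_id}
    for field, path in FIELD_PATHS.items():
        values = [el.text.strip() for el in root.findall(path, MODS_NS) if el.text]
        if values:
            doc[field] = values
    return doc


doc = mods_to_solr_doc(SAMPLE_MODS, "sample-1")
# A JSON array of documents is one of the formats Solr's update handler accepts.
batch = json.dumps([doc], indent=2)
print(batch)
```

Running the same mapping over every harvested record and posting the resulting batches to Solr is one way the process could be automated.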

Page 26:

Future Goals

?

Fedora

Solr

?

OAI-PMH

?

MODS

schema.xml

XSLT

XSLT

BatchIngest

IUCAT Z39.50

MARCXML

XSLT

? ?

?

XSLT


Page 29:

Plans Moving Forward

http://www.flickr.com/photos/usfws_alaska/7376551524/

Page 30:

Questions?

Comments!

Kittens >^..^<

http://www.flickr.com/photos/notemily/5394289051/

Page 31:

Thank you!

Julie ([email protected])

Courtney ([email protected])

Bryan ([email protected])

More info (posters, more data, &c) at http://bit.ly/meta-lita-2013

These slides will shortly be available via IU Scholarworks

THANK YOU !!1!11!!

