EAD XML Search at the Library of Congress
Mary A. Lacy, Manuscript Division, Library of Congress
Michael Ferrando, Information Technology Services, Library of Congress
August 11, 2010
Why a new finding aids search system?
Older production system failing (size and number of finding aids)
Updated presentation needed to meet researcher and staff needs and expectations
Better platform to support links to digitized content
Decision to integrate finding aids and LC Online Catalog into LC XML data store
Application Decisions
Native XML data store platform for search, display, storage (eXist)
XQuery, jQuery, Java, and XSLT for search and display
EADs wrapped in METS objects, pulling data from finding aids and bibliographic/holdings records
Closer integration of EAD and LC Online Catalog
EAD controlaccess terms updated weekly from Catalog exports of associated collection‐level records
Addition of LCCN Permalinks to EADs
Hyperlinks from EAD index terms and LC Online Catalog
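As an illustration of the LCCN Permalink bullet above, here is a minimal sketch of how a permalink might be derived from an LCCN. It follows the published LCCN normalization rules (strip blanks, drop anything from the first slash onward, zero-pad the serial number after a hyphen to six digits). The function names and the `https://lccn.loc.gov/` base URL are assumptions made for this example, not a description of the code LC actually runs.

```python
def normalize_lccn(raw: str) -> str:
    """Normalize an LCCN following the published LCCN normalization rules."""
    # 1. Remove all blanks.
    lccn = raw.replace(" ", "")
    # 2. Remove the first forward slash and everything to its right.
    lccn = lccn.split("/", 1)[0]
    # 3. Remove the hyphen and left-pad the serial part to six digits.
    if "-" in lccn:
        prefix, serial = lccn.split("-", 1)
        if not serial.isdigit() or len(serial) > 6:
            raise ValueError(f"unexpected LCCN serial part: {serial!r}")
        lccn = prefix + serial.zfill(6)
    return lccn


def lccn_permalink(raw: str, base: str = "https://lccn.loc.gov/") -> str:
    """Build a permalink URL; the base URL is an assumption of this sketch."""
    return base + normalize_lccn(raw)


if __name__ == "__main__":
    # Sample values taken from the normalization rules' own examples.
    for sample in ("n78-890351", "85-2", "75-425165//r75"):
        print(sample, "->", lccn_permalink(sample))
```

A link built this way could be written into each EAD so that the finding aid points at its collection-level catalog record.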
Design Decisions
Consult special collections reference staff, archival catalogers, Web design staff
Implement keyword searching across all finding aids as well as within a single finding aid (KWIC for both)
Define requirements for full displays, browse lists, results list, URLs, including “you are here” anchors
Incorporate HTML meta tags (DC and MODS) to support web citation
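The keyword-in-context (KWIC) requirement above can be sketched in a few lines. The production system does this in XQuery against the XML data store; the Python below is only an illustration of the idea, and the function names, the `width` parameter, and the plain-text input are all assumptions of this example.

```python
import re
from typing import Dict, List, Tuple

Snippet = Tuple[str, str, str]  # (left context, matched keyword, right context)


def kwic(text: str, term: str, width: int = 30) -> List[Snippet]:
    """Return one snippet per case-insensitive whole-word match of term."""
    pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
    snippets = []
    for match in pattern.finditer(text):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        snippets.append((left, match.group(0), right))
    return snippets


def search_all(finding_aids: Dict[str, str], term: str, width: int = 30) -> Dict[str, List[Snippet]]:
    """Run kwic() over every finding aid and keep only those with hits."""
    results = {}
    for aid_id, text in finding_aids.items():
        hits = kwic(text, term, width)
        if hits:
            results[aid_id] = hits
    return results


if __name__ == "__main__":
    # Hypothetical finding aid text, invented for this example.
    aids = {
        "aid-1": "Letters from Lincoln to Seward, 1861. Lincoln family papers.",
        "aid-2": "Diaries and scrapbooks.",
    }
    for aid_id, hits in search_all(aids, "lincoln", width=12).items():
        for left, keyword, right in hits:
            print(f"{aid_id}: ...{left}[{keyword}]{right}...")
```

The same `kwic()` helper serves both cases named in the bullet: call it on one document for a search within a single finding aid, or through `search_all()` for a search across all of them.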
Where We Started
Where We Are Now
Search Finding Aids Page
Browse List
Keyword Search Options
Keyword Search Results
EAD Full Display: Overview Tab
EAD Full Display: Search Results Tab
EAD Full Display: Contents List Tab
EAD Full Display: Digitized Content Links
EAD Full Display: Index Terms Tab
Index Terms to “All Finding Aids” Browse
Browse List Search Results
Index Terms to LC Online Catalog Browse
EAD Full Display: Print/Download Tab
Workflow Steps Needed
Convert from EAD DTD to EAD schema
Determine “marc” dmdSec structure and metadata (from EAD, LC Online Catalog, boilerplate)
Refresh controlaccess from weekly LC Online Catalog exports
METS creation (including building structMap identifiers and the various dmdSecs)
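The METS creation step can be pictured with a small sketch. It wraps a schema-based EAD document in a METS object with a "marc" dmdSec and an "ead" dmdSec, then builds a structMap whose div identifiers are generated from the EAD component hierarchy. The identifier pattern (`c00001`, ...), the `field` elements inside the "marc" section, and the `MDTYPE="OTHER"` choice are hypothetical placeholders; the slides do not specify the actual LC structure.

```python
import re
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
EAD_NS = "urn:isbn:1-931666-22-9"  # namespace of the EAD 2002 schema
ET.register_namespace("mets", METS_NS)
ET.register_namespace("ead", EAD_NS)

# Matches EAD component elements: <c>, <c01>, <c02>, ...
COMPONENT = re.compile(r"\{%s\}c(\d\d)?$" % re.escape(EAD_NS))


def m(tag):
    """Qualify a tag name with the METS namespace."""
    return "{%s}%s" % (METS_NS, tag)


def e(tag):
    """Qualify a tag name with the EAD namespace."""
    return "{%s}%s" % (EAD_NS, tag)


def add_divs(ead_parent, mets_parent, counter):
    """Mirror nested EAD components as nested structMap divs."""
    for child in ead_parent:
        if COMPONENT.match(child.tag):
            counter[0] += 1
            title = child.findtext(e("did") + "/" + e("unittitle"), default="")
            div = ET.SubElement(mets_parent, m("div"),
                                ID="c%05d" % counter[0],  # hypothetical ID scheme
                                LABEL=title.strip())
            add_divs(child, div, counter)


def wrap_ead(ead_root, catalog_fields):
    """Return a METS element wrapping the EAD plus catalog-derived metadata."""
    mets = ET.Element(m("mets"))

    # dmdSec "marc": placeholder structure for catalog/boilerplate metadata.
    marc = ET.SubElement(mets, m("dmdSec"), ID="marc")
    marc_wrap = ET.SubElement(marc, m("mdWrap"), MDTYPE="OTHER")
    marc_data = ET.SubElement(marc_wrap, m("xmlData"))
    for name, value in catalog_fields.items():
        ET.SubElement(marc_data, "field", {"name": name}).text = value

    # dmdSec "ead": the finding aid itself.
    ead_sec = ET.SubElement(mets, m("dmdSec"), ID="ead")
    ead_wrap = ET.SubElement(ead_sec, m("mdWrap"), MDTYPE="EAD")
    ET.SubElement(ead_wrap, m("xmlData")).append(ead_root)

    # structMap: one root div, then one div per EAD component.
    struct = ET.SubElement(mets, m("structMap"))
    root_div = ET.SubElement(struct, m("div"), ID="c00000", DMDID="ead marc")
    dsc = ead_root.find(".//" + e("dsc"))
    if dsc is not None:
        add_divs(dsc, root_div, [0])
    return mets
```

Stable div identifiers of this kind are what make "you are here" anchors and per-component URLs possible in the display layer, which is presumably why the structMap identifiers are called out as a workflow step.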
LC EAD METS Object
LC EAD METS Object: structMap
METS dmdSec id=“marc” Mapping
Challenges
Limited support for eXist (security issues, indexing, memory, and database configuration)
Limited training and support for XQuery
Complicated finding aid display issues
Revision of LC web page templates to accommodate EAD’s XQuery/jQuery displays
Revision of bibliographic and holdings records to improve linking and browse lists
Integration into New LC XML Data Store
EAD finding aids and LC Online Catalog records will be merged into a single search and display application in LC XML data store using Mark Logic (beta release, fall 2010)
EAD finding aids will follow design principles and displays from stand‐alone search application
Stand‐alone application will continue
Future Enhancements
Better integration with digitized archival content (sample items, full gallery views, page‐turners)
Search enhancements based on user feedback
Improved PDF versions
Distribution of EAD METS objects
LC EAD RSS feeds
Finding Aids: XML Source Files
Questions?
General question?
Contact Mary Lacy ([email protected])
Technical question?
Contact Mike Ferrando ([email protected])
Watch for announcements by September 2010!