Post on 06-Dec-2014
Advanced Search & Indexing in Liferay 6.1
External sources integration
Filippo Frignocca | Marcello Torriani
Ariadne
November 16th, 2012
Intro & Agenda
● Search in intranet portals
● Business requirements: clustering, advanced indexing configuration, external sources integration
● Ariadne experience
● Liferay 6.1 indexing and searching solutions
● Our solution: Liferay 6.1, Solr 4.0, ManifoldCF
Search in intranet portal
Large amount of structured and unstructured content
Content can be anywhere: DB, filesystem, repository…
Users have to find information quickly and efficiently
Business requirements
Indexing & Searching in an enterprise environment means:
● Seamless integration in the SERP
● External sources indexing
● Clustered environment
● Scalability
● Need for advanced indexing features
Our past work in Ariadne
On Liferay 5.x:
● Deep customization: a new Web CMS
● Integration of a Content Repository
● ElasticSearch
● Integration of external content: Google Search Appliance
Liferay 6.1 and search
● OpenSearch
● Faceted search
● Document & Media search:
Liferay 6.1 supports searching both local and remote repositories.
The search capabilities, however, are limited to the facilities provided by the protocol.
Solr 4
● Schema aware
● Search functions
● Pluggable handlers and functions
● Cache
● Replication
● Statistics
SolrCloud
High scale, fault tolerant, distributed indexing and search
Collections (indexes) may span multiple SolrCores
ZooKeeper manages cluster configuration and coordination
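To make the idea of a collection spread over several cores more concrete, here is a minimal sketch of hash-based document routing and a fan-out search. It is only an illustration: the function names are hypothetical, the MD5-based hash is a stand-in and not the router SolrCloud actually uses, and the "shards" are plain in-memory lists.

```python
import hashlib


def shard_for(doc_id: str, num_shards: int) -> int:
    """Map a document id to a shard index with a stable hash (stand-in hash)."""
    digest = hashlib.md5(doc_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_shards


def route(doc_ids, num_shards):
    """Group document ids by the shard that would index them."""
    shards = {i: [] for i in range(num_shards)}
    for doc_id in doc_ids:
        shards[shard_for(doc_id, num_shards)].append(doc_id)
    return shards


def search(shards, predicate):
    """Query every shard and merge the partial results into one sorted list."""
    hits = []
    for docs in shards.values():
        hits.extend(d for d in docs if predicate(d))
    return sorted(hits)


if __name__ == "__main__":
    shards = route([f"doc-{i}" for i in range(10)], 3)
    print({shard: len(docs) for shard, docs in shards.items()})
    print(search(shards, lambda d: d.endswith("7")))
```

The point of the sketch is that the same id always lands on the same shard, so an update replaces the earlier copy, while a query has to visit every shard and merge the answers.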
ManifoldCF
Crawler framework for connecting source content repositories (SharePoint, Documentum, web sites, shares, ...) to indexers (Solr, Elasticsearch, …)
Resilient, incremental
Integrated with document security
Connectors:
● Authority connectors: map a user name to access rights on a repository
● Repository connectors: fetch documents from a repository
● Output connectors: push documents to an indexer
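The three connector roles and the incremental behaviour can be sketched as follows. This is a simplified model under stated assumptions, not ManifoldCF's API: all class and function names are hypothetical, the repository and the index are in-memory dictionaries, and security is reduced to a set of access tokens per document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Document:
    doc_id: str
    body: str
    version: int
    allowed_tokens: frozenset  # access tokens that may read this document


class RepositoryConnector:
    """Fetches documents from a source (here: an in-memory dict)."""

    def __init__(self, source):
        self.source = source

    def fetch(self):
        return list(self.source.values())


class OutputConnector:
    """Pushes documents to an indexer (here: an in-memory dict)."""

    def __init__(self):
        self.index = {}

    def push(self, doc):
        self.index[doc.doc_id] = doc


class AuthorityConnector:
    """Maps a user name to the access tokens that user holds."""

    def __init__(self, user_tokens):
        self.user_tokens = user_tokens

    def tokens_for(self, user):
        return frozenset(self.user_tokens.get(user, ()))


def run_job(repo, output, seen_versions):
    """Incremental crawl: push only documents that are new or changed."""
    pushed = 0
    for doc in repo.fetch():
        if seen_versions.get(doc.doc_id) != doc.version:
            output.push(doc)
            seen_versions[doc.doc_id] = doc.version
            pushed += 1
    return pushed


def visible(docs, authority, user):
    """Keep only the documents sharing at least one token with the user."""
    tokens = authority.tokens_for(user)
    return [d for d in docs if d.allowed_tokens & tokens]
```

Running `run_job` twice on an unchanged source pushes nothing the second time, which is the sense of "incremental" assumed here. Filtering by tokens at query time is one way to read "integrated with document security".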
ManifoldCF input connectors
Portal Assets and external CMIS-compliant repositories
UNIFIED INDEX(Solr)
External sources crawling(ManifoldCF)
Liferay 6.1 & Solr 4
Liferay integrates Solr 1.4
Upgrade to 4.0:
● Index Schema Definition
● Solr plugin fix to handle void snippet results
● Solr plugin configuration to connect to external Index Engine/Cluster
Schema Definition for Solr 4.0 – Liferay 6.1 integration
primary key: uid
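Because portal assets and crawled external documents end up in one index keyed by uid, their keys must not collide. The sketch below shows one possible way to keep them apart by prefixing each key with its source. The `source::local_id` format and both function names are assumptions made for illustration; they are not the uid format Liferay or ManifoldCF actually produce.

```python
def make_uid(source: str, local_id: str) -> str:
    """Build an index-wide unique key from a source name and a local id."""
    if not source or "::" in source:
        raise ValueError("source must be non-empty and must not contain '::'")
    return f"{source}::{local_id}"


def merge_into_index(index: dict, source: str, docs: dict) -> dict:
    """Add or replace documents in the unified index, keyed by uid."""
    for local_id, fields in docs.items():
        uid = make_uid(source, local_id)
        index[uid] = {"uid": uid, **fields}
    return index


if __name__ == "__main__":
    index = {}
    merge_into_index(index, "portal", {"42": {"title": "Intranet news"}})
    merge_into_index(index, "fileshare", {"42": {"title": "Budget.xls"}})
    print(sorted(index))
```

Two documents with the same local id from different sources stay separate, while re-indexing a document from the same source overwrites its previous entry.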
Fix to handle results with matches in all metadata, not only Content or Title
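The kind of fallback this fix implies can be sketched as below. The actual change to the Solr plugin is not shown in the slides, so this is only an assumed shape: the function name, the field names (`content`, `title`, `description`) and the dictionary layout of the highlighting data are all hypothetical.

```python
def pick_snippet(highlights, stored_fields,
                 preferred=("content", "title"), max_len=200):
    """Choose a snippet even when the match is outside content and title.

    highlights: field name -> list of highlighted fragments for one hit
    stored_fields: field name -> stored value for the same hit
    """
    # 1. Prefer a fragment from the usual fields.
    for name in preferred:
        fragments = highlights.get(name) or []
        if fragments:
            return fragments[0]
    # 2. Otherwise use the first fragment from any other metadata field.
    for fragments in highlights.values():
        if fragments:
            return fragments[0]
    # 3. Last resort: a plain excerpt, so the snippet is never missing.
    return (stored_fields.get("title") or "")[:max_len]


if __name__ == "__main__":
    hit = {"content": [], "description": ["crawled from <em>SharePoint</em>"]}
    print(pick_snippet(hit, {"title": "Quarterly report"}))
```

With this order, a hit that matched only in a metadata field still gets a highlighted snippet instead of an empty one.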
External Index Configuration
ManifoldCF output connector
ManifoldCF repository connector
ManifoldCF job scheduler
ManifoldCF job monitor
ManifoldCF output configuration
Custom plugin
To show external source search results in the Liferay Faceted Search Portlet we need to:
● Define one (or more) Asset Model Classes
● Configure a custom Indexer Class
● Implement a Hook to override the result page JSPs
● Properly configure the Faceted Search Portlet
Indexed and Model classes
Faceted Search jsp Hook
Faceted Search Configuration
Faceted Search Results
Work in progress
● Indexing and searching protected resources
● Optimize External Asset Management
● Tags and Categories consistency check
● Optimize Index schema mapping
● Entry detail link management for external results
Thank you!
www.ariadne.it
info@ariadne.it
@ariadnesrl
Marcello Torriani torriani@ariadne.it
Filippo Frignocca frignocca@ariadne.it