Date post: | 12-Apr-2017 |
Category: |
Technology |
Upload: | agnes-molnar |
View: | 788 times |
Download: | 0 times |
About Agnes Molnar• SharePoint Server MVP• Senior Solutions Consultant, BA Insight• Recognized blogger, speaker, writer
• Web: http://www.bainsight.com• Blog: http://aghy.hu• Email: [email protected]• Twitter: @molnaragnes
Search
Search connects people
to the information
they need to get their jobs done.
Search• „I know what I’m searching for and know how to
do that”
• „I know what I’m searching for but I don’t know how to do that”
• „I don’t know what I’m searching for”
• „Am I Searching?...”
Enterprise Search
• Enterprise – is no longer within the firewall• Relevance is critical• Search within the organization• „Transparent” Search• Search Driven Applications
Search Components
Source: http://searchpatterns.org
Search Based Application (SBA)
• Software Application • Built on a Search Engine backbone rather
than a database infrastructure • Purpose is not classic information
retrieval, but rather mission-oriented information access, analysis or discovery
SBA Examples
12
ChallengesUser ExperienceChallenges
Multiple search interfaces, systems, and logons; no unified search results
Data and ExpertiseChallenges
Files and email on desktops; structured and unstructured data silos; untapped expertise
Enterpriseand IT Challenges
Relevance and ranking; security, privacy & compliance; scalability, manageability & extensibility
Customizations for Search Driven ApplicationsBuilding on an extensible platform
Configure Extend CreateUser Context
LOB ConnectivityContent ProcessingBusiness languageFederation Sources
UI Look & Feel…..
Relevance ProfilesUI & Web PartsResult Rollup
Visual ElementsWorkflowsAnalytics
…..
Custom ElementsWork Environments
New Innovations….
Content Sources
Content Sources in SBAs
• Combine (join) data• Connect data
– Existing relationships in the source system– Newly discovered, cross-system relationships
• Aggregate data• Visualize data
Data Collection / Crawling
• Crawler: – Connects to the Content Source– Enumerates the content– Reads the content items– Extracts the metadata– Sends the collected info back to the Indexer
Data Collection / Crawling• Connector: Enables to access different types of content
• OOTB:– SharePoint– File Share– Web site– Exchange Public Folders– Custom Connectors– (Lotus Notes)– (Documentum)
Natural Language Processing• Crawl/Index Time
– Language Detection– Tokenization– Stemming and Lemmatization
• Query Time– Approximate Spelling– Phonetic Spelling– Word Truncation– Regular Expressions– Semantic Expansion– Rules-based Matching
Processing: Crawled and Managed Properties
• Crawled property: metadata extracted from the documents/items during the crawl.
• Managed property: can appear in refined searches and helps users perform more successful queries
Processing: Crawled and Managed Properties
Processing: Ranking• Ranking: produce results that are ordered according
to some computed relevancy score
• Dynamic: Based on weighted managed properties (title, body, social tags, etc.)
• Static: – File Type– Click through relevancy– Depth
Processing: Ranking
Processing: Relevance Tuning
User Interface• OOTB Web Parts
– Refinement Panel– Core Results Web Part
• Federation• People Search• Scopes• Custom Web Parts
– Visual Navigation– Mashups– Etc.
• Workflows – Act on Items Immediately
Search Federation
• Using remote index for queries• Location type:
– SharePoint Search index– FAST index– OpenSearch 1.0/1.1
Search Federation
Search Federation• Benefits:
– No resources needed for indexing– Custom Credentials– Usage restrictions– Prefix / Pattern match– Query Template
• {searchTerms} scope:Documents• {searchTerms} type:.doc type:.docx type:.docm
• BUT:– Live Internet connection is required– Bandwith– No control over results (order, relevance, etc.)– Separated Web Parts
Search Federation
Summary
• Search Based Applications?– Need to Aggregate Heterogeneous Content– Neet to Process Large Volume of Data– Need for Real Time Information– Need for Ad Hoc Reporting