+ All Categories
Home > Documents > A search engine is a web site that collects and organizes content from all over the internet Search...

A search engine is a web site that collects and organizes content from all over the internet Search...

Date post: 08-Jan-2018
Category:
Upload: noel-warner
View: 213 times
Download: 0 times
Share this document with a friend
40
Transcript
Page 1: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.
Page 2: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

GROUP MEMBERS

NIDA IRFAN AHSAN AKRAM USMAN BASHIR AMBER SHAHEEN GHULAM HASSAN

Page 3: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

SEARCH ENGINE

Page 4: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.
Page 5: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

INTRODUCTION A search engine is a web site that collects and

organizes content from all over the internet

Search engines look through their own databases of information in order to find what it is that you are looking for.

“Search engine” is the popular term for an Information Retrieval (IR) system.

Page 6: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

INTRODUCTION

Search engines match queries against an index that they create. The index consists of the words in each document, plus pointers to their locations within the documents. This is called an inverted file.

Page 7: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

INTRODUCTIONA search engine or IR system

comprises four essential modules:

A document processorA query processorA search and matching functionA ranking capability

Page 8: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

HIST

ORY

Page 9: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Archie -> First search tool for the Internet

Gopher -> Indexed plain text documents

Jughead -> Searched the files stored in Gopher index

Wandex -> First Web search engine systems

HISTORY

Page 10: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Alan Emtage created the first ever search engine called Archie in 1990.

Archie collects information by crawling the internet.

It matches the files it found with search queries and returned results from its database.

ArchieHISTORY

Page 11: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

HISTORY It was introduced in 1990 It is the protocol system It allows text files to be hierarchically

organized It is used to enable the simple sharing of

documents with which could be located in any part of the world

Gopher

Page 12: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

HISTORY Jughead

It was developed by Rhett Jones in 1993 Jug head is the search engine system for

the gopher protocol It searches the single server at a time It was officially released in 2006

Page 13: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

HISTORY Wandex

It was the first search engine introduced in 1993 by Matthew Gray

It was the first search engine that is usable for world wide web

Wandex itself was based on the world wide web wanderer which merely listed websites

Page 14: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Mod

ules

Page 15: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Search Engine Modules A document processor

A query processor

A search and matching function

A ranking capability

Page 16: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

The document processor prepares, processes, and inputs the documents, pages, or sites that users search against. The document processor performs some or all of the following steps:

Search Engine Modules Document Processor

Page 17: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Search Engine Modules Normalizes the document stream to a

predefined format. Breaks the document stream into desired

retrievable units. Isolates and metatags subdocument

pieces. Identifies potential index able elements in

documents.

Page 18: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Search Engine Modules Deletes stop words. Stems terms. Extracts index entries. Computes weights. Creates and updates the main

inverted file against which the search engine searches in order to match queries to documents.

Page 19: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Query Processor Query processing has seven possible

steps. System can cut these steps short and

proceed to match the query to the inverted file.

Document processing shares many steps with query processing.

Search Engine Modules

Page 20: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

More steps and more documents make the process more expensive for processing

Search system designers must choose what is most important to their users, time or quality.

Publicly available search engines usually choose time over very high quality, having too many documents to search against.

Search Engine Modules

Page 21: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Search Engine Modules

Step 1: TokenizingStep 2: ParsingStep 3: STOPLISTStep 4: StemmingStep 5: Creating the queryStep 6: Query expansionStep 7: Query term weighting

Page 22: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Search Engine Modules Search and matching function

Searching the inverted file for documents meeting the query requirements

It is the simple binary search It follows the simple document

representation, query representation and algorithm

After computing the similarity of each document in the subset of documents, the system presents an ordered list to the user

Page 23: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Type

s

Page 24: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Types of Search Engine Crawler-Based Search

Engines

Human-Powered Directories

Hybrid Search Engines" Or Mixed Results

Page 25: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Wor

king

Page 26: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Working of Search Engine

SpidersRobots

Page 27: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.
Page 28: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.
Page 29: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.
Page 30: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.
Page 31: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Adva

nta

ges

Page 32: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Advantages of Search Engine

It is easy and quick

It is a quick access to information

Save of time

It help to organize internet and individual websites

Page 33: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Good

Ch

oice

Page 34: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Choice of Search Engine The search system development or selection should be based primarily on the local needs Consider using freeware search engines, if your requirements are met by these. Consider if the web server you are using supports indexing and search, and if this is adequate for you

Page 35: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Choice of Search Engine

The IT Professionals should make an effort to keep themselves abreast of the current web technologies The features available within a tool

should be made use of properly to get maximum benefits

Page 36: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Conc

lusi

on

Page 37: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Search engine plays important role in accessing the content over the internet, it fetches the pages requested by the user.

It made the internet and accessing the information just a click away.

Conclusion

Page 38: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

ConclusionThe need for better search engines only

increases

The search engine sites are among the most popular websites.

Page 39: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.
Page 40: A search engine is a web site that collects and organizes content from all over the internet Search engines look through their own databases of.

Recommended