
How Google Works

Date post: 04-Dec-2014
Upload: rishabh-dev-singh
Description:
How Google works: PageRank technology, with examples explaining how PageRank is calculated.
Presented By.. Rishabh Dev Singh
Transcript
Page 1: How Google Works

Presented By.. Rishabh Dev Singh

Page 2: How Google Works

Contents..
• Uncover the secrets of Google.
• How it all works.
• Understanding the technology behind it: ‘PAGE RANK’.
• Why ‘PAGE RANK’ is a pioneering technology.
• References.

Page 3: How Google Works

Why is Google different?

Page 4: How Google Works
Page 5: How Google Works

Step 1…
• Exploring the web…

Page 6: How Google Works

Crawling…
• Special software known as “Googlebot” is used.

• It runs on a large number of computers to crawl the web.

• Googlebot starts from its last crawl status and looks for new sites, changes to existing pages, and invalid links.
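
The crawl step can be pictured with a small sketch. This is only an illustration of the idea, not how Googlebot is actually built: the `crawl` function, the `toy_web` dictionary and the `.example` addresses are all made up for the example, and the "web" is just an in-memory dictionary rather than real network requests.

```python
from collections import deque

def crawl(web, seeds):
    """Breadth-first crawl over a toy link graph (hypothetical helper).

    web   -- dict mapping a URL to the list of URLs it links to
    seeds -- URLs to start from, e.g. pages known from the last crawl
    Returns (visited, dead_links).
    """
    frontier = deque(seeds)      # pages waiting to be visited
    seen = set(seeds)            # pages already queued, to avoid repeats
    visited = []                 # pages that were found and read
    dead_links = []              # links that point to nothing
    while frontier:
        url = frontier.popleft()
        if url not in web:
            dead_links.append(url)   # invalid link: the page does not exist
            continue
        visited.append(url)
        for link in web[url]:
            if link not in seen:     # a page we have not met before
                seen.add(link)
                frontier.append(link)
    return visited, dead_links

# Made-up three-page web with one broken link.
toy_web = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["a.example", "gone.example"],
    "c.example": [],
}
visited, dead_links = crawl(toy_web, ["a.example"])
print(visited)     # ['a.example', 'b.example', 'c.example']
print(dead_links)  # ['gone.example']
```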

Page 7: How Google Works

Step 2…
• Organizing the data..

• Googlebot reports on the pages it visited, and the index is updated.

• The index is like the index at the back of a book.
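
The "back of the book" idea can be sketched as a word-to-pages lookup. This is a toy illustration under simple assumptions (words split on spaces, everything lower-cased); the `build_index` function and the sample pages are invented for the example and say nothing about Google's real index format.

```python
from collections import defaultdict

def build_index(pages):
    """Map each word to the set of pages that contain it (hypothetical helper).

    pages -- dict mapping a URL to the text found on that page
    """
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index

# Made-up pages for illustration.
pages = {
    "a.example": "how google works",
    "b.example": "how page rank works",
}
index = build_index(pages)
print(sorted(index["works"]))   # ['a.example', 'b.example']
print(sorted(index["google"]))  # ['a.example']
```

Looking up a word is then a single dictionary access, just as a reader jumps from a word in a book's index to the pages that mention it.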

Page 8: How Google Works

Step 3…
• Presenting the data..

• Google search doesn’t just dive into the index and fish around for what it needs.

• It makes use of the Knowledge Graph.

• Several factors are used to present the most relevant search results.

Page 9: How Google Works

Factors…
Some of the known factors..
• Type of content.
• Quality of content.
• Freshness of content.
• The user’s region.
• Legitimacy of the site.
• Name and address of the website.
• Search word synonyms.
• Social media promotions.
• How many links point to a particular web page.
• The value of those links.

Page 10: How Google Works

Page Rank…• Developed by Larry Page and Sergey Brin in 1998

• Trademark of Google

• Patented by Stanford University

• Backbone of Google Search technology

Page 11: How Google Works

UNDERSTANDING PAGE RANK

Page 12: How Google Works

Page Rank Technology..

• Ranks a page based on the number of other pages that link to it.

• Gives an indication of the relative importance of a page.

• Hence, an appropriate ‘SERP’ (Search Engine Results Page) listing.

• Calculated from the nature and number of ‘back links’.

Page 13: How Google Works

Definition of Page Rank..

• “We assume page A has pages T1…Tn which point to it. The parameter ‘d’ is a damping factor which can be set between 0 and 1. We usually set ‘d’ to 0.85. Also, C(A) is defined as the number of links going out of page A.

The PageRank of a page A is given as follows: PR(A) = (1-d) + d*(PR(T1)/C(T1) + … + PR(Tn)/C(Tn))

Note that the PageRanks form a probability distribution over web pages, so the sum of all web pages’ PageRanks will be one.”
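
To make the formula concrete, here is a minimal sketch that evaluates it for one page. The function name `page_rank_of` and the sample numbers are made up for illustration; each back link is given as a pair (PR(T), C(T)).

```python
def page_rank_of(backlinks, d=0.85):
    """Evaluate PR(A) = (1-d) + d*(PR(T1)/C(T1) + ... + PR(Tn)/C(Tn)).

    backlinks -- list of (PR(T), C(T)) pairs, one per page T linking to A
    d         -- damping factor, 0.85 as in the definition above
    """
    return (1 - d) + d * sum(pr / out_links for pr, out_links in backlinks)

# Hypothetical case: A is linked from T1 (PR 1.0, 2 outgoing links)
# and from T2 (PR 0.5, 1 outgoing link).
pr_a = page_rank_of([(1.0, 2), (0.5, 1)])
print(round(pr_a, 6))  # 1.0, since 0.15 + 0.85 * (0.5 + 0.5) = 1
```

Note how T1 passes on only half of its PR because it shares it between two outgoing links.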

Page 14: How Google Works

Calculating Page Rank...

Page 15: How Google Works

Definition Of Terms
• PR: shorthand for PageRank, the actual page rank of each page as calculated by Google.

• Back link: if page A links out to page B, then page B is said to have a ‘back link’ from page A.

Page 16: How Google Works

• The PR of each page depends on the PR of the pages pointing to it.

• We won’t know what PR those pages have until the pages pointing to them have had their PR calculated.

• …and so on.

PR(A) = (1-d) + d*(PR(T1)/C(T1) + … + PR(Tn)/C(Tn))

Page 17: How Google Works

Seems impossible to do this calculation…

Page 18: How Google Works

BUT THERE IS A SOLUTION

Page 19: How Google Works

• Page Rank can be calculated using a simple iterative algorithm.

• What we need to do:

• Remember each value we calculate.

• Repeat the calculations lots of times.

Page 20: How Google Works

How many times?

Page 21: How Google Works

Until the numbers stop changing much…
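
The "remember each value, repeat until the numbers stop changing much" recipe can be sketched as follows. This is one possible reading of it, not Google's implementation: the function name, the `tol` threshold of 1e-6 and the `max_rounds` safety limit are assumptions chosen for the example, and every linked page is assumed to appear as a key in `links`.

```python
def iterate_page_rank(links, d=0.85, start=1.0, tol=1e-6, max_rounds=1000):
    """Repeat the PR formula until no value changes by more than tol.

    links -- dict mapping each page to the list of pages it links to
    Returns (pr, rounds) where pr maps each page to its PR.
    """
    pr = {page: start for page in links}   # starting guess for every page
    rounds = 0
    while rounds < max_rounds:
        rounds += 1
        biggest_change = 0.0
        for page in links:
            # Sum PR(T)/C(T) over every page T that links to this page.
            incoming = sum(
                pr[source] / len(targets)
                for source, targets in links.items()
                if page in targets
            )
            new_value = (1 - d) + d * incoming
            biggest_change = max(biggest_change, abs(new_value - pr[page]))
            pr[page] = new_value           # remember the value straight away
        if biggest_change < tol:           # the numbers stopped changing much
            break
    return pr, rounds

# Two pages that link only to each other, starting from a guess of 0.
two_pages = {"A": ["B"], "B": ["A"]}
ranks, rounds_used = iterate_page_rank(two_pages, start=0.0)
print(ranks, rounds_used)
```

Each new value is stored as soon as it is calculated, so the next page in the same round already uses it.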

Page 22: How Google Works

• Let us assume that PR = 1.0 and d = 0.85.
PR(A) = (1-d) + d(PR(B)/1)
PR(B) = (1-d) + d(PR(A)/1)
On calculation..
PR(A) = 0.15 + 0.85*1 = 1
PR(B) = 0.15 + 0.85*1 = 1

[Figure: Page A and Page B, each linking to the other]

Page 23: How Google Works

OK, BUT WHY SHOULD I ASSUME PR = 1? WHAT IF NOT…

Page 24: How Google Works

So, let’s start with PR = 0
• PR(A) = 0.15 + 0.85*0 = 0.15
• PR(B) = 0.15 + 0.85*0.15 = 0.2775

Again
• PR(A) = 0.15 + 0.85*0.2775 = 0.385875
• PR(B) = 0.15 + 0.85*0.385875 = 0.47799375

And again
• PR(A) = 0.15 + 0.85*0.47799375 = 0.5562946875
• PR(B) = 0.15 + 0.85*0.5562946875 = 0.622850484375

Inference: PR approaches 1.
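
The three rounds above can be reproduced with a short sketch. The function name `trace_two_pages` is made up for illustration, and `Decimal` is used only so that the printed digits match the hand calculation exactly instead of showing floating-point rounding.

```python
from decimal import Decimal

def trace_two_pages(start, rounds, d=Decimal("0.85")):
    """Two pages linking only to each other; returns (PR(A), PR(B)) per round."""
    pr_a = pr_b = Decimal(start)
    history = []
    for _ in range(rounds):
        pr_a = (1 - d) + d * pr_b   # PR(A) uses the latest PR(B)
        pr_b = (1 - d) + d * pr_a   # PR(B) uses the PR(A) just calculated
        history.append((pr_a, pr_b))
    return history

for pr_a, pr_b in trace_two_pages("0", 3):
    print(pr_a, pr_b)
# 0.15 0.2775
# 0.385875 0.47799375
# 0.5562946875 0.622850484375
```

Calling `trace_two_pages("0", 50)` and looking at the last pair shows both values very close to 1.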

Page 25: How Google Works

Example…
• Let us assume: PR(A) = 40, PR(B) = 40

• First calculation:
• PR(A) = 0.15 + 0.85*40 = 34.15
• PR(B) = 0.15 + 0.85*34.15 = 29.1775

• And again:
• PR(A) = 0.15 + 0.85*29.1775 = 24.950875
• PR(B) = 0.15 + 0.85*24.950875 = 21.35824375
• …PR will approach and settle down at 1.
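
Why the starting value does not matter can be seen with one line of algebra. The sketch below assumes the two-page case used in these examples, where each page has exactly one outgoing link; the symbol x_k (the value after k calculations) is introduced here only for illustration.

```latex
% Each calculation has the form x_{k+1} = (1-d) + d\,x_k. Subtracting 1 from both sides:
\begin{align*}
x_{k+1} - 1 &= (1-d) + d\,x_k - 1 = d\,(x_k - 1) \\
\Rightarrow\quad x_k - 1 &= d^{\,k}\,(x_0 - 1)
\end{align*}
% With d = 0.85 the distance from 1 shrinks by 15% at every calculation,
% so x_k \to 1 whether x_0 = 0, x_0 = 1 or x_0 = 40.
```

For instance, starting at 40 the distance from 1 is 39, and after the first calculation it is 0.85*39 = 33.15, which gives 34.15.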

Page 26: How Google Works

Another Example…

Page 27: How Google Works
Page 28: How Google Works

• The home page has the highest PR. After all, it is the one getting the most incoming links.

• But what has happened to the average? It’s only 0.378!

Page 29: How Google Works

Let’s take a look at the “external site” pages. What’s happening to their Page Rank?

Page 30: How Google Works

• That’s better. It does work after all!
• And look at the PR of our home page!
• All those incoming links sure make a difference.

Page 31: How Google Works

Regardless of the number of pages, the average PR will always be 1.0 at best.

And that’s how searching happens on Google.
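
The "1.0 at best" claim follows from adding up the PR formula over all pages. The sketch below assumes a closed set of N pages and writes S for the sum of their PR values once the numbers have settled; N and S are symbols introduced here for illustration.

```latex
% Sum PR(A) = (1-d) + d \sum_{T \to A} PR(T)/C(T) over all N pages.
% A page T with C(T) outgoing links appears C(T) times on the right,
% each time contributing PR(T)/C(T), so it passes on PR(T) in total.
\begin{align*}
S = \sum_{A} PR(A) &= N(1-d) + d \sum_{T:\,C(T)>0} PR(T) \;\le\; N(1-d) + d\,S \\
\Rightarrow\quad (1-d)\,S &\le N(1-d) \quad\Rightarrow\quad \frac{S}{N} \le 1
\end{align*}
% Equality (average exactly 1.0) holds when every page has at least one
% outgoing link inside the set; pages with no outgoing links pass nothing on,
% which pulls the average below 1.0.
```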

Page 32: How Google Works
Page 33: How Google Works

References…
• Sergey Brin & Larry Page, “The Anatomy of a Large-Scale Hypertextual Web Search Engine”
• http://www.cs.princeton.edu/~chazelle/courses/BIB/pagerank.htm
• http://en.wikipedia.org/wiki/PageRank
• http://www.whitelines.nl/html/google-page-rank.html
• http://www.google.co.in/insidesearch/howsearchworks/thestory/

Page 34: How Google Works
