+ All Categories
Home > Data & Analytics > Recommendation system

Recommendation system

Date post: 10-Jan-2017
Category:
Upload: akshat-thakar
View: 371 times
Download: 1 times
Share this document with a friend
11
Recommendation Engine Akshat Thakar
Transcript
Page 1: Recommendation system

Recommendation Engine

Akshat Thakar

Page 2: Recommendation system

Precursor

Awareness about Analytics • Jargon Buster• Recommendation System for Web/Digital

Analytics• Technology

Page 3: Recommendation system

• Sentiment Analysis

Clustering

• Collaborative-based filtering• Item based• User Based

Recommendation

Similarity Measurement–Pearson, Tanimoto

Algorithm - K-means

Similarity Measurement - Euclidean

Classification NLP

• Content-based filtering

• Regression• Decision Tree• SVM• NN

• Voice Recognition

• Video Analytics

Page 4: Recommendation system

Content Based, Collaborative Filtering[CF] and Hybrid Recommendation System

• Content Based systems focus on properties of items. Similarity of items is determined by measuring the similarity in their properties.

Needs History Data.

• Collaborative-Filtering systems focus on the relationship between users and items. Similarity of items is determined by the similarity of the ratings of those items by the users who have rated both items.

Source-http://infolab.stanford.edu/~ullman/mmds/ch9.pdf

Page 5: Recommendation system

How users are similar?

CF - User Similarity

Similarity Notion

User Neighborhood

User BasedRecommender

#1 #2 #3User Id Item Id Rating

Data Model

Page 6: Recommendation system

CF - Item Similarity

How items are similar?

Similarity Notion

Item BasedRecommender#1 #2 #3

User Id Item Id Rating

Data ModelItem-neighborhood

Source-http://www.theregister.co.uk/2006/08/15/beer_diapers/

Page 7: Recommendation system

Similarity Notion

• Pearson Correlation - measures the tendency of the numbers[User Preferences] to move together proportionally. When this tendency is high, the correlation is close to 1

• Spearman Correlation – Rank based on user preference

• Euclidean Distance - based on the distance between users. Smaller the distance, more similarity in users.

• Tanimoto Coefficient – based on number of items in common

• LogLikelihood Similarity

How to code?

Page 8: Recommendation system

How Similarity Definition affects Neighborhood formation?

Source: http://www.slideshare.net/Cataldo/apache-mahout-tutorial-recommendation-20132014Mahout In Action

Threshold based neighborhood

Page 9: Recommendation system

Evaluation• Evaluate Top n Recommendations• Precision and Recall

Relevant Non Relevant

Search Result ShownTrue Positive False Positive

Search result Not Shown False Negative True Negative

Source-https://en.wikipedia.org/wiki/Precision_and_recall

Page 10: Recommendation system

System Solutioning - More than Algorithm Accuracy

• Business Goal Injection• Novelty – avoiding repeated recommendations• Diversity – How diverse are recommended items?

Does it include all sub topics?• Positive Feedback• Negative Feedback

source: http://www.slideshare.net/Zhenv5/diversity-and-novelty-for-recommendation-system

Page 11: Recommendation system

Technology

• Mahout – Hadoop(optional), Java.Lot of stable algorithms.

• RRhadoopLot of Statistics packages.

• SparkEmerging TechnologyAlgorithms are getting added


Recommended