+ All Categories
Home > Data & Analytics > DE Presentation v2

DE Presentation v2

Date post: 12-Apr-2017
Category:
Upload: scstark
View: 93 times
Download: 0 times
Share this document with a friend
17
SceneFindr Stephanie Stark
Transcript
Page 1: DE Presentation v2

SceneFindrStephanie Stark

Page 2: DE Presentation v2

Motivation● Interested in hearing live music, but don’t

know where to go?

Page 4: DE Presentation v2

Pipeline

Page 5: DE Presentation v2

Data Sources

Page 6: DE Presentation v2

Data Sources

Page 7: DE Presentation v2

Data Sources

Page 8: DE Presentation v2

Data Sources

Page 9: DE Presentation v2

Data Sources

Page 10: DE Presentation v2

Pipeline

Page 11: DE Presentation v2

ETL

Artists

Events

Feature Extraction

K-Means Clusterin

g

Recommendations

Page 12: DE Presentation v2

Database

Page 13: DE Presentation v2

Pipeline

Page 14: DE Presentation v2

Scaling

500gb Artist Data

9 Hours

500gb Event Data

Page 15: DE Presentation v2

Lessons Learned (the hard way!)● Scala● Parallelized ML algorithms

Page 16: DE Presentation v2

About Me

B.A., Mount Holyoke CollegeMajor: MathematicsMinor: Computer Science

Education

Interests ReadingArt HistoryHiking

Stephanie Stark

Page 17: DE Presentation v2

Future WorkImplement TF/IDF compatibility for projectUse PCAImplement cosine similarity for feature clusteringCluster within metro areaUse Redis as a cache for feature vectors


Recommended