DE Presentation v2
![Page 1: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/1.jpg)
SceneFindr
Stephanie Stark
![Page 2: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/2.jpg)
Motivation
● Interested in hearing live music, but don’t know where to go?
![Page 4: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/4.jpg)
Pipeline
![Page 5: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/5.jpg)
Data Sources
![Page 6: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/6.jpg)
Data Sources
![Page 7: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/7.jpg)
Data Sources
![Page 8: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/8.jpg)
Data Sources
![Page 9: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/9.jpg)
Data Sources
![Page 10: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/10.jpg)
Pipeline
![Page 11: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/11.jpg)
ETL
Artists
Events
Feature Extraction
K-Means Clustering
Recommendations
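The clustering and recommendation stages listed above can be sketched roughly as follows. This is a minimal, single-machine illustration in plain Python, not the implementation used in the project: the artist names, the two-dimensional feature vectors, and the helper names (`kmeans`, `recommend`) are all hypothetical, and the real pipeline presumably runs a parallelized version on far larger data.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(column) / n for column in zip(*points)]


def kmeans(points, k, iterations=20, seed=0):
    """Plain Lloyd's algorithm: returns (labels, centroids)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # pick k distinct points as starting centroids
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # leave an empty cluster's centroid where it was
                centroids[c] = mean(members)
    return labels, centroids


def recommend(artist, artists, labels):
    """Other artists that share a cluster with the given artist."""
    cluster = labels[artists.index(artist)]
    return [a for a, label in zip(artists, labels) if label == cluster and a != artist]


# Hypothetical artists and feature vectors, for illustration only.
artists = ["artist_a", "artist_b", "artist_c", "artist_d"]
features = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]

labels, _ = kmeans(features, k=2)
print(recommend("artist_a", artists, labels))  # ['artist_b']
```

In this sketch a recommendation is simply "artists in the same cluster"; joining those artists against the Events data would then yield shows to suggest.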
![Page 12: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/12.jpg)
Database
![Page 13: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/13.jpg)
Pipeline
![Page 14: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/14.jpg)
Scaling
500 GB Artist Data
9 Hours
500 GB Event Data
![Page 15: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/15.jpg)
Lessons Learned (the hard way!)
● Scala
● Parallelized ML algorithms
![Page 16: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/16.jpg)
About Me
Stephanie Stark
Education
B.A., Mount Holyoke College
Major: Mathematics
Minor: Computer Science
Interests
Reading
Art History
Hiking
![Page 17: DE Presentation v2](https://reader036.vdocuments.mx/reader036/viewer/2022062902/58edd8661a28abcc498b464d/html5/thumbnails/17.jpg)
Future Work
● Implement TF/IDF compatibility for project
● Use PCA
● Implement cosine similarity for feature clustering
● Cluster within metro area
● Use Redis as a cache for feature vectors
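As a rough idea of what the TF/IDF and cosine similarity items might look like, here is a small plain-Python sketch. It assumes each artist is described by a list of text tags (hypothetical data), and it uses one common TF-IDF variant (term frequency times log(N / document frequency)); the project may well choose a different weighting or a library implementation.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return (vocabulary, vectors) using tf * log(N / df) weighting."""
    n_docs = len(documents)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in documents for term in set(doc))
    vocab = sorted(df)
    vectors = []
    for doc in documents:
        counts = Counter(doc)
        length = len(doc) or 1  # avoid dividing by zero for an empty document
        vectors.append(
            [(counts[t] / length) * math.log(n_docs / df[t]) for t in vocab]
        )
    return vocab, vectors


def cosine_similarity(u, v):
    """Cosine of the angle between u and v; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical tag lists for three artists, for illustration only.
tags = [
    ["indie", "rock"],
    ["indie", "folk"],
    ["techno", "house"],
]

vocab, vectors = tf_idf(tags)
same_scene = cosine_similarity(vectors[0], vectors[1])
different_scene = cosine_similarity(vectors[0], vectors[2])
print(same_scene > different_scene)  # True
```

Cosine similarity compares the direction of two feature vectors rather than their magnitude, which is why it is often preferred over Euclidean distance for sparse text features like these.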