pears systems

25
PEARS

Upload: amit-kumar-gupta

Post on 18-Jul-2015

71 views

Category:

Documents


1 download

TRANSCRIPT

PEARS

About PEARS

We are a team of Engineers, researchers and product managers.

PEARS “Personalization and Recommendation System”

40%Snapdeal orders

5%Sales boost after

launching Personalized Products

25 MillionUsers are recommended products

50K+Sellers

3M+Daily users

We generate

100TB+Data per day

PEARS 360 degree analysis

360°

USER PRODUCT

Purchase Data Search Data Click/View Data Wish-list Data Cart Data

Social SignalsCustomer Care

dataUser DemographicReviewsGeolocation

Recommended Products

Category Affinity Brand Affinity Filter Affinity

Real-time Purchase Probability

User categorizationReal-time promos/

discounts

USER

Product Explicit data

Price history Seller details Sale history Product visits data Offer/Promo

FeedbackAdwords dataSocial dataOutside dataTags

User AffinityGeo-location based

TargetingSale prediction

Product QualitySeller QualityProduct

categorization

PRODUCT

PEARS Architecture

PEARS Architecture

PEARS Data Ingestion Layer

Consumer layerQueuing layer

WEB

LOGS

DATABASE

NOSQL

EXTERNAL

Data Ingestion Layer

gulpR

Data Sources Persistence layer

CAMUS

PEARS Data Science Engine

Data Science Engine

• Machine Learning• Data Mining• Text Mining• Statistical Analysis

Technologies

PEARS Data Products

Recommended Product Feed

Trending Now

People who viewed this item also viewed

Similar products

Frequently bought together

Data Analytics & Business Intelligence

Data

Sources

Analytics

Cloud

Data

Aggregation

Data

Insights

Transaction Data

Orders

Supply Chain

Financial

Customer Care

Etc

Data Cluster Scalable to

Peta Bytes

Non Txn Data

Web Logs

MongoDB

User Profile

Catalog Data

Browsing Data

External Data

Omniture

Social

Columnar Database

Data Warehouse

Snapdeal API

Business Intelligence

Reports

Adhoc & Periodic

API Client

Internal & External

Data Analytics & Business Intelligence

Online Data Processing

Online Data Processing

• High Speed Product Updates 2 million per hour

• High Speed Search Indexing 1 million per hour

• Seller Ranking 1 million per hour

• Optimized Courier Allocation 26 million rule combinations

• Flash Sale 100K orders per minute

• Rich Product Listing 200K per second

Challenges

Challenges

• Real-time Analytics– Personalization– Business Intelligence

• Large Scale Data Processing within minimal time with least resources

• On-line Data Processing– Growing Catalog – Growing Seller Base– Extremely High read-write Systems

Thank You!