minneapolis solr meetup - may 28, 2014: ecommerce search with apache solr
DESCRIPTION
"eCommerce Search with Apache Solr", Grant IngersollTRANSCRIPT
![Page 1: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/1.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
eCommerce Search with Apache Solr
Grant IngersollCTO, LucidWorks
Twitter: @gsingers
![Page 2: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/2.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Tales from the trenches
• The case of the missing data
• The power of suggestion
![Page 3: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/3.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Topics
• Solr powered commerce– Companies– Features
• Relevance, relevance, relevance
• Demo
![Page 4: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/4.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Solr Powers Leading eCommerce and Consumer Sites
![Page 5: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/5.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Basic Features for eCommerce
• High quality OOTB relevance• Facets
– Range, Term/Category, Hierarchical, Pivot• Highlighting• Did you mean?• Boosting/Blocking/Landing Pages• Easy scale
![Page 6: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/6.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Advanced Features
• Spatial– Local– Route finding– Open Hours, etc.
• Function Queries– Inventory, Margin
• Stats Component– Missing data– Bounds, etc.
![Page 7: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/7.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Tips and Tricks
![Page 8: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/8.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014
Look Before You Leap
• Before undertaking any relevance tuning, you need to define what “better search” means to you
• Once determined, many ways to test/measure
• Once tested, many ways to fix
http://www.betternetworker.com/files/useruploads/16675/leap.jpg
![Page 9: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/9.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014
Understand your…
• Domain– Types of documents– Languages present– Document structures,
metadata and other features– Lexical resources: jargon,
synonyms, abbreviations...– Relationships between
documents
• Users– Sophistication/Expertise– Search and Discovery needs– Known Item vs. Keyword
• Tolerance for Pain– Managers– Business Interests– Release cycles– Obsession in finding the one
true relevance model (hint, it doesn’t exist)
– “explain() blindness”
![Page 10: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/10.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014
Known Item vs. Keyword
eCommerce search often has a split between known item and keyword search
You probably have more “wiggle” room for relevancy on keyword search
E.g. What should be the top result for a search on “women’s shoes”?
Known Item should have best matches at the top More in a moment
![Page 11: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/11.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014
Debugging
• Check the analysis (more in the next slide)• Check for data quality issues• Check your query constructs (slop, boosts, etc.)
• Try alternate query representations• (exact match)^100 OR (sloppy phrase match)^50 OR (OR query)
• Use Lucene’s explain() or Solr’s &debugQuery
![Page 12: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/12.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Signal Processing for Search and Discovery
• Signals power modern relevance– Clicks, conversions, sharing, history, signatures
• LucidWorks 5 makes it easy to capture and leverage signals– Recommendations, analytics, discovery
• Simplifies your data workflow• Simplify your operational footprint
![Page 13: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/13.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Solr Powered Signal Processing
• Use Case: eCommerce
• Data: – Product catalog (~1.2m items)– Click data (~3.9M clicks)
![Page 14: Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr](https://reader036.vdocuments.mx/reader036/viewer/2022062511/54c673164a7959e37d8b4613/html5/thumbnails/14.jpg)
10010
10010
10010 10010
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Metadata
• http://www.lucidworks.com
• [email protected]• @gsingers
• Lucene/Solr Revolution – Washington DC, Nov 11-14– http://www.lucenerevolution.org