open relevance project

13
Lucid Imagination, Inc. Open Relevance Project In the beginning… Lucene Connector Framework

Upload: landon

Post on 23-Feb-2016

34 views

Category:

Documents


0 download

DESCRIPTION

In the beginning…. Open Relevance Project. Lucene Connector Framework. SF Lucene Meetup. Agenda. Announcements and Introductions Solr 1.4 and Data Discovery – Hoss Tips and Tricks with Analyzers – Tom Hill Optimizing Solr for performance and User Experience – Brian Doll - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Open Relevance Project

Lucid Imagination, Inc.

Open Relevance Project

In the beginning…

Lucene Connector Framework

Page 2: Open Relevance Project

SF Lucene Meetup

Page 3: Open Relevance Project

Lucid Imagination, Inc.

Agenda

Announcements and Introductions

Solr 1.4 and Data Discovery – Hoss

Tips and Tricks with Analyzers – Tom Hill

Optimizing Solr for performance and User Experience – Brian Doll

Spatial Search with Lucene and Solr - Grant

Page 4: Open Relevance Project

Lucid Imagination, Inc.

Introductions and Announcements

Intro

Haiti Fundraiser/Rafflehttp://forms.lucidimagination.com/go/lucidimagination/donate2haiti

Lucene Connector Framework in Incubatorhttp://incubator.apache.org/connectors

Page 5: Open Relevance Project

Spatial Search

Page 6: Open Relevance Project

Lucid Imagination, Inc.

Introduction

What is it?

Use Cases

Background Knowledge

Implementing

Status

Resources

Page 7: Open Relevance Project

Lucid Imagination, Inc.

What is It?

Spatial search is the merging of unstructured content with spatial information to produce search results specific to one or more locations

In other words“Find me all the Starbucks selling double espressos within 5 miles of my current location”

Lucene/Solr solve specific spatial problems, but doesn’t provide a general purpose GIS solution

Page 8: Open Relevance Project

Lucid Imagination, Inc.

Use Cases

Asset Management“Dude, where’s my stuff?”

Social NetworkingFind all friends near me

Targeted, local search results and ads“restaurants in Austin Texas”

“Starbucks, 55313”

Business IntelligenceRestrict doc set for analysis by location

Page 9: Open Relevance Project

Lucid Imagination, Inc.

Spatial Search Concepts

Spatial Data TypesPoints (latitude/longitude)

Lines

Shapes

Maps and overlaysStreets, POI

Integration with unstructured textMetadata, descriptions, user reviews, etc.

http://www.openstreetmap.org/?lat=44.9744&lon=-93.2484&zoom=14&layers=B000FTFT

Page 10: Open Relevance Project

Lucid Imagination, Inc.

Implementation Needs

Indexing support for data types

Query Parsing

Efficient distance calculationsEuclidean, Great Circle (Haversine), Vincenty’s

FilteringBounding Box

Sort by Distance

Relevance Enhancement

Faceting

Advanced: shape intersections, routes

Page 11: Open Relevance Project

Lucid Imagination, Inc.

Things you can do now in Lucene/Solr

Distance FunctionsSolr has: haversine (great circle), Euclidean, Manahattan (others)

FilterFrange, Cartesian Tier/Tile coming soon, Lat/Long Range

Boost Scorehttp://localhost:8983/solr/select/?q=_val_:"recip(dist(2, store, point(34.0232,-81.0664)),1,1,0)"&fl=*,score

SortSOLR-1297 added support for sorting by Function (any function)

Contrib/Spatial has similar support

Page 12: Open Relevance Project

Lucid Imagination, Inc.

What’s next/needed

Faceting

“Generic” Filtering support

Function Query results as fields in output

Query Parsing SupportGeocoding, etc.

Shapes, lines, intersections?

Page 13: Open Relevance Project

Lucid Imagination, Inc.

Resources

https://issues.apache.org/jira/browse/SOLR-773Several issues in Lucene for improvements

http://wiki.apache.org/solr/SpatialSearch

http://www.nsshutdown.com/projects/lucene/whitepaper/locallucene_v2.html

http://www.ibm.com/developerworks/opensource/library/j-spatial/index.html

http://www.lucidimagination.com/search/?q=spatial