introduction to apache solr

Download Introduction to Apache Solr

If you can't read please download the document

Upload: shalin-shekhar-mangar

Post on 16-Apr-2017

1.770 views

Category:

Software


2 download

TRANSCRIPT

Introduction to Apache Lucene/Solr

Shalin Shekhar Mangar

Who am I?

Apache Lucene/Solr Committer and PMC member

Contributor since January 2008

Currently: Engineer at LucidWorks

Formerly with AOL

Email: [email protected]

Twitter: shalinmangar

Blog: http://shal.in

Apache Lucene

http://lucene.apache.org/java

Java based API for adding search and indexing to your applications

High performance indexing over 150GB/hour on modern hardware

Fast and efficient scoring and indexing algorithms

Support for multiple query types, hit highlighting, faceting, joins, grouping, typo-tolerant suggestions and multiple languages

Most widely deployed search library on the planet

Apache Lucene Work Pipeline

Inverted Index

Lucene Query Syntax

+red +shoes = red AND shoes

+shoes -red = shoes NOT red

android phone

android phone -samsung = android phone NOT samsung

android samsung~4

merced*

createDate:[201301 TO 201401]

author:shalin

author:shalin mangar

author:shalin mangar AND project:(lucene OR solr)

title:samsung^5 category:phone

Apache Solr

http://lucene.apache.org/solr

Lucene based search server + other features

Access Lucene over HTTP:Java, Ruby, Python, .NET, PHP over XML/JSON and other formats

Most programming tasks in Lucene are configuration tasks in Solr

Faceting (guided navigation, filters etc)

Replication and distributed search

Lucene best practices

Other features

Data Import HandlerIndex Databases, Mails, RSS, XMLs etc.

Rich document supportPDF, MS Office, Images etc

Replication for high query volume

Distributed search for large indexesProduction systems with 1B+ documents

Very extensible and customizableEmbedded in commercial search products from LucidWorks, DataStax, Cloudera, Hortonworks, Amazon CloudSearch and Riak

Apache Solr

Where does Solr fit?

Solr block diagram

/select?q=video&sort=price desc&fl=name,id,price&wt=json&indent=on

/select?q=video+card&fl=name,id&hl=true&hl.fl=name,features

/select?wt=json&indent=on&q=*:*&fl=name&facet=true&facet.field=cat

Indexing data using SolrJ The official Solr Java client

Searching using SolrJ The official Solr Java client

Bangalore Baby Apache Solr Meetup Group

http://www.meetup.com/Bangalore-Baby-Apache-Solr-Group/

Already had one successful meetup

Great tutorial + hands-on workshop

Must join for all new comers

Planning to have another meetup next month

Thank you

Shalin Shekhar MangarLucidWorks

Confidential and Proprietary 2012 LucidWorks

Click to edit the title text formatTitle

4th Bangalore Lucene/Solr Meetup19th April 2014

Click to edit the title text formatClick to edit Master title style

Confidential and Proprietary 2012 LucidWorks

Click to edit the title text formatClick to edit Master title style

Click to edit the outline text formatSecond Outline LevelThird Outline LevelFourth Outline LevelFifth Outline LevelSixth Outline Level

Seventh Outline LevelClick to edit Master text styles

Second level

Third level

Fourth level

Fifth level