introduction to apache solr
Post on 16-Apr-2017
1.770 Views
Preview:
TRANSCRIPT
Introduction to Apache Lucene/Solr
Shalin Shekhar Mangar
Who am I?
Apache Lucene/Solr Committer and PMC member
Contributor since January 2008
Currently: Engineer at LucidWorks
Formerly with AOL
Email: shalin@apache.org
Twitter: shalinmangar
Blog: http://shal.in
Apache Lucene
http://lucene.apache.org/java
Java based API for adding search and indexing to your applications
High performance indexing over 150GB/hour on modern hardware
Fast and efficient scoring and indexing algorithms
Support for multiple query types, hit highlighting, faceting, joins, grouping, typo-tolerant suggestions and multiple languages
Most widely deployed search library on the planet
Apache Lucene Work Pipeline
Inverted Index
Lucene Query Syntax
+red +shoes = red AND shoes
+shoes -red = shoes NOT red
android phone
android phone -samsung = android phone NOT samsung
android samsung~4
merced*
createDate:[201301 TO 201401]
author:shalin
author:shalin mangar
author:shalin mangar AND project:(lucene OR solr)
title:samsung^5 category:phone
Apache Solr
http://lucene.apache.org/solr
Lucene based search server + other features
Access Lucene over HTTP:Java, Ruby, Python, .NET, PHP over XML/JSON and other formats
Most programming tasks in Lucene are configuration tasks in Solr
Faceting (guided navigation, filters etc)
Replication and distributed search
Lucene best practices
Other features
Data Import HandlerIndex Databases, Mails, RSS, XMLs etc.
Rich document supportPDF, MS Office, Images etc
Replication for high query volume
Distributed search for large indexesProduction systems with 1B+ documents
Very extensible and customizableEmbedded in commercial search products from LucidWorks, DataStax, Cloudera, Hortonworks, Amazon CloudSearch and Riak
Apache Solr
Where does Solr fit?
Solr block diagram
/select?q=video&sort=price desc&fl=name,id,price&wt=json&indent=on
/select?q=video+card&fl=name,id&hl=true&hl.fl=name,features
/select?wt=json&indent=on&q=*:*&fl=name&facet=true&facet.field=cat
Indexing data using SolrJ The official Solr Java client
Searching using SolrJ The official Solr Java client
Bangalore Baby Apache Solr Meetup Group
http://www.meetup.com/Bangalore-Baby-Apache-Solr-Group/
Already had one successful meetup
Great tutorial + hands-on workshop
Must join for all new comers
Planning to have another meetup next month
Thank you
Shalin Shekhar MangarLucidWorks
Confidential and Proprietary 2012 LucidWorks
Click to edit the title text formatTitle
4th Bangalore Lucene/Solr Meetup19th April 2014
Click to edit the title text formatClick to edit Master title style
Confidential and Proprietary 2012 LucidWorks
Click to edit the title text formatClick to edit Master title style
Click to edit the outline text formatSecond Outline LevelThird Outline LevelFourth Outline LevelFifth Outline LevelSixth Outline Level
Seventh Outline LevelClick to edit Master text styles
Second level
Third level
Fourth level
Fifth level
top related