Apache Solr for TYPO3 at TYPO3 Usergroup Day Netherlands

Post on 25-May-2015

Category:

Technology

DESCRIPTION

Presentation of an extension that integrates Apache Solr into TYPO3. Apache Solr is an enterprise search server; TYPO3 is a content management system for mid-size to large enterprises. Combining the two gives users a great search experience.

TRANSCRIPT

Apache Solr & TYPO3

Ingo Renner

TYPO3 Core Developer, Release Manager TYPO3 4.2

mail: ingo@typo3.org

twitter: @ingorenner

Indexed Search

• Indexing Frontend / Crawler
• Respects access rights
• Respects languages
• Index in Database
• Totally OK for smaller websites

Indexed Search

Slooooooooooooowww

Apache Solr

• Enterprise Search Server
• Based on Lucene Index
• Apache Software Foundation Project
• Many powerful features

• CNet, Netflix, ilocal.nl, Zappos.com

So what is Apache Solr?

• Index = Collection of Documents
• Document = Data stored in Fields
• Field Type defines processing through Analyzers, Tokenizers, Filters
• Dynamic Fields
• Copy Fields

Solr Concepts

Flexibility
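
The concepts on this slide can be illustrated with a toy model. The sketch below is not Solr code and does not reflect how Lucene stores data; it only shows, under simplified assumptions, how a field's text might pass through a tokenizer and a chain of filters before its terms land in an index. The stopword list, the filter functions and the document fields are all made up for illustration.

```python
import re

# Hypothetical stopword list for illustration only.
STOPWORDS = {"a", "an", "the", "is", "for"}


def tokenize(text):
    """Tokenizer: split raw field text into word tokens."""
    return re.findall(r"\w+", text)


def lowercase_filter(tokens):
    """Filter: normalize case so 'Solr' and 'solr' match."""
    return [token.lower() for token in tokens]


def stopword_filter(tokens):
    """Filter: drop very common words that carry little meaning."""
    return [token for token in tokens if token not in STOPWORDS]


def analyze(text):
    """Analyzer: one tokenizer followed by a chain of filters."""
    tokens = tokenize(text)
    for token_filter in (lowercase_filter, stopword_filter):
        tokens = token_filter(tokens)
    return tokens


def index_documents(documents, field):
    """Index: map each analyzed term to the ids of documents containing it."""
    index = {}
    for doc in documents:
        for term in analyze(doc[field]):
            index.setdefault(term, set()).add(doc["id"])
    return index


# Documents are data stored in named fields (field names are assumptions).
documents = [
    {"id": "page-1", "title": "Apache Solr for TYPO3"},
    {"id": "page-2", "title": "Solr is an Enterprise Search Server"},
]
title_index = index_documents(documents, "title")
```

Looking up `title_index["solr"]` returns the ids of both documents, because the lowercase filter made the term case-insensitive at index time.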

• Speed: Many times faster than Indexed Search (IS)
• Better search results
• Faceted search
• Spellchecker: Did you mean ... ?
• Similarity search: More like this ...
• Editorial Content / paid search results
• Synonyms, Stopwords
• Boosting of specific index fields
• Replication, distributed search

Why Apache Solr?

Speed & Power
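
Faceted search, one of the features listed above, can be sketched in a few lines. Solr computes facets on the server; the toy example below only illustrates the idea of counting field values across the hits and then narrowing the result set by one of them. The hit list and the field names `type` and `language` are assumptions for illustration.

```python
from collections import Counter

# Hypothetical search hits; ids and field values are made up.
hits = [
    {"id": 1, "type": "pages", "language": "en"},
    {"id": 2, "type": "tt_news", "language": "en"},
    {"id": 3, "type": "pages", "language": "nl"},
]


def facet_counts(documents, field):
    """Count how many hits carry each value of the given field."""
    return Counter(doc[field] for doc in documents if field in doc)


def apply_facet(documents, field, value):
    """Narrow the hits to those matching the selected facet value."""
    return [doc for doc in documents if doc.get(field) == value]


type_facet = facet_counts(hits, "type")
dutch_hits = apply_facet(hits, "language", "nl")
```

A search page would typically render `type_facet` as a list of clickable filters with counts, and call something like `apply_facet` when the user selects one.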

• REST-like interface
• Indexing of XML Documents through HTTP POST
• Querying through HTTP GET
• Results as XML, JSON, PHP

How it works

Easy API
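
The interface described above can be sketched as follows. This is a minimal illustration, assuming a Solr instance at `http://localhost:8983/solr` with the classic `/update` and `/select` handlers; the host, port, path and the field names `id` and `title` are assumptions and depend on your setup and schema. The code only builds the request objects and does not send anything.

```python
from urllib.parse import urlencode
from urllib.request import Request
from xml.etree import ElementTree as ET

# Assumption: a local Solr instance; adjust to your installation.
SOLR_URL = "http://localhost:8983/solr"


def build_add_xml(fields):
    """Build a Solr <add><doc>...</doc></add> update message."""
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in fields.items():
        field = ET.SubElement(doc, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")


def build_update_request(fields):
    """Indexing: an XML document sent through HTTP POST."""
    body = build_add_xml(fields).encode("utf-8")
    return Request(
        SOLR_URL + "/update?commit=true",
        data=body,
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


def build_select_url(query, rows=10):
    """Querying: a plain HTTP GET URL; wt selects the response format."""
    params = {"q": query, "rows": rows, "wt": "json"}
    return SOLR_URL + "/select?" + urlencode(params)


update_request = build_update_request({"id": "page-1", "title": "Apache Solr"})
select_url = build_select_url("typo3 solr")
```

Passing `update_request` or `select_url` to `urllib.request.urlopen` would perform the actual call against a running server.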

• Needs Java
• We don't want to deal with Java
• Solr shields us from Java once set up

Disadvantages

Developers stay with PHP

• Multiple times faster than Indexed Search
• NO database queries
• Easy installation / Configuration
• Respects access rights
• Respects languages
• Customizability

Advantages

Fast

Easy to use

Powerful

EXT:solr+

• „Acts like Indexed Search“
• Indexing through Frontend / Crawler
• Search
• Search Word Highlighting
• Sorting
• Spellchecker: Did you mean ... ?
• Similarity Search: More like this ...
• Faceted Search
• Suggest / Autocompletion

Current Status

• Backend Module
• API, indexing through BE
• Related Searches
• Last Searches
• Smart Reranking through user usage
• Editorial Search Results
• Editing of Stopwords, Synonyms

Outlook

• Private financing of new features
• Financing partners get Early Access and Support
• Minimum stake of 5 man days
• v2.0 end of Q2 next year
• Development as Community Project in parallel

Development Model

• Released v1.0 on TER
• Project on TYPO3 Forge
• Open Development
• Only a few differences compared to „our“ version

Community Edition

Showcases

Making the sun shine on your search

• Requires any J2EE container: Tomcat, Jetty, Resin, ...
• Run setup scripts provided with EXT:solr
• Copy provided configuration files to Solr
• config.index_enable = 1

Requirements, Setup

• Indexing of additional data through hooks, interfaces, TS configuration
• Individual index schema
• Enable/disable features through TS
• Individual, flexible rendering of results

Customization

More than Solr

• Lucene - Search Index Library

• Tika - Content Extraction from Files

• Nutch - Crawl External Sites

Projects around Solr

Thanks for listening.
