building smart indexes for drupal sites

Post on 11-Feb-2017

52 Views

Category:

Technology

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Anant Corporation

Research & DevelopmentSearch – Building Smarter Search Indexes

What do we do?

Streamline, Organize and Unify Business InformationPortals | Integration | Search

Agenda

•Overview – What is Search•Define – Dumb vs. Smart Indexes•Patterns: Ingestion & Retrieval•Technologies: What’s Available?•Questions & Answers

Search – Information RetrievalDocument Retrieval

• Google Search• Amazon Search• LinkedIN Search• *CMS Search• *Portal Search• *CRM Search• * Search

Document Routing

• Google Alerts• Amazon’s

Recommendations• Netflix

Recommendations• LinkedIN

Recommendations

Usual “Search” / Consumer Apps• Interface - Frontend Layer (UI) is

Deployed as Static Files from CDN• Software - Business Logic (API) is

Deployed as Stateless Services • Database - Persistent information (Data)

is any of SQL/NoSQL/Graph/Index/*• Systems - Different Applications

(Systems) are hosted in private/public/* clouds

Define – Smart vs. Dumb Index?“Dumb” Index

• No “Index” (SQL/NoSQL)

• Keyword matching• Term / phrase

matching• Basic highlighting• No Annotation

“Smart” Index

• Meta Data • Named Entity

Extraction • Concepts /

Keywords • “Likeness” /

Clustering

Post

• Title• Content• Author• Date• Tags• Categories• Link

SmartPost

• Title• Content_Raw• Content_HTML• Content_Readable• Author_Name• Author_Email• Date• Tags_User• Tags_Alchemy• Tags_OpenCalais• Categories• Link• Link_Thumbnail_Image

Example – Smart vs. Dumb Index Item ?

Patterns : Ingestion and Retrieval

Stage

Index

API

Technologies : What’s Available Now?

Elastic Search Connector Apache Solr Connector

Anant - D.C. Office

ContactRahul Singh

• Web: http://anant.us• Email: rahul@anant.us• Phone: 1.855.ANANTCO• 1010 Wisconsin Ave. NW,

Suite 250Washington, D.C. 20007

top related