Search, APIs, Capability Management and The Sensis Journey
Craig Rees
Today's menu
Project background
Platform selection
Search capability
Relevance
Architecture
Quality management
Hurdles
What's next
Sensis
Sensis helps Australians find, buy and sell
From print directories to a cross-platform lead generator
Sensis publishes over 1.8 million business listings
Two of the top 10 visited online sites in Australia (WhitePages.com.au and YellowPages.com.au)
Project background
Business objectives
Drive presence in the local search marketplace
Open up the largest database of business listings in Australia
Reduce the effort required from local search developers
Free to use: we are after the reporting
Technology objectives
Develop a total search platform
Relevancy testing as part of the development lifecycle
A framework to identify problem spaces
Manageable platform
Continuous deployments
Developer portal
Platform selection
Support for the search capability team
Structured vs unstructured data
Deterministic vs black box
Non-proprietary code base
Community backing
Lvl 5: A/B testing, Machine learning, External collaboration, Multiple contexts
Lvl 4: Online dashboards, Test environments, Dynamic search refinements, Targets and metrics
Lvl 3: Defined team, Regular monitoring, Static autosuggest, Basic linguistics
Lvl 2: Ad hoc processes, Part-time team, Static dictionaries, Individual-led innovation
Lvl 1: No resources, No reporting, Out-of-the-box features
Context is key
Location
Chronology
Social Graph
Device
Individual
Our architecture
[Architecture diagram. Components: Business Data, Geo Service, Solr, MongoDB, Index, Type Query Handler, Historical search data, Reporting Events, Ontologies, Name Query Handler, API, Mashery, Search Service, Publisher, Reporting Service]
Data staging
Search
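The search step of the architecture pairs Solr with a geo service. As a rough illustration only, a location-restricted Solr request might be assembled as follows. The field names (`name`, `category`, `location`) and the helper `build_search_params` are hypothetical and do not come from the Sensis platform; `defType=edismax`, `qf`, and the `{!geofilt}` filter with `sfield`, `pt` and `d` are standard Solr request parameters.

```python
from urllib.parse import urlencode


def build_search_params(terms, lat, lon, radius_km, rows=10):
    """Build request parameters for a Solr /select call.

    Assumption: the index has text fields "name" and "category" and a
    spatial field "location". These names are invented for this sketch.
    """
    return {
        "q": terms,                 # user query text
        "defType": "edismax",       # extended DisMax query parser
        "qf": "name category",      # fields searched (hypothetical)
        "fq": "{!geofilt}",         # keep only documents inside the circle
        "sfield": "location",       # spatial field (hypothetical)
        "pt": f"{lat},{lon}",       # centre point as "lat,lon"
        "d": str(radius_km),        # radius in kilometres
        "rows": str(rows),
        "wt": "json",
    }


params = build_search_params("plumber", -37.8136, 144.9631, 10)
query_string = urlencode(params)    # ready to append to ".../select?"
```

Keeping the location restriction in a filter query (`fq`) rather than in `q` means it narrows the result set without affecting the relevance score.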
API
API proxy
Moved from a black box solution to a manageable platform
Deliver search improvements without major code changes
Understand how results were calculated
Identify problems scientifically
Continuously tune and test relevance
Yesterday
Today
Tomorrow
Features are signed off only when they have a positive impact on the quality score
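A sign-off rule like this can be expressed as a small regression gate. The sketch below is illustrative only: the slides do not say how the quality score is calculated, so it assumes mean precision at k over a set of human-judged queries. The function names and the sample data are hypothetical.

```python
def precision_at_k(results, relevant, k):
    """Fraction of the top-k results that are judged relevant."""
    top = results[:k]
    return sum(1 for doc_id in top if doc_id in relevant) / k


def quality_score(run, judgments, k=5):
    """Mean precision@k across all judged queries (assumed metric)."""
    scores = [
        precision_at_k(run.get(query, []), relevant, k)
        for query, relevant in judgments.items()
    ]
    return sum(scores) / len(scores)


def sign_off(baseline_run, candidate_run, judgments, k=5):
    """Approve the candidate only if it raises the quality score."""
    return quality_score(candidate_run, judgments, k) > quality_score(
        baseline_run, judgments, k
    )


# Hypothetical judged queries and ranked result lists.
judgments = {"plumber melbourne": {"b1", "b2"}, "pizza": {"b7"}}
baseline = {"plumber melbourne": ["b9", "b1", "b3"], "pizza": ["b7", "b8"]}
candidate = {"plumber melbourne": ["b1", "b2", "b3"], "pizza": ["b7", "b8"]}

approved = sign_off(baseline, candidate, judgments, k=2)
```

Running the same judged query set against every build is what lets relevance testing sit inside the development lifecycle rather than after it.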
Results examiner
Score analysis
Tuning
Data redundancy and homogeneity
Solr ranking of rare terms
Intent differentiation
Contextual synonyms
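To see why rare terms can be a hurdle, consider the inverse document frequency used by Lucene's classic TF-IDF similarity, idf = 1 + ln(numDocs / (docFreq + 1)). This is a sketch under an assumption: the slides do not state which similarity the Sensis index used. The document frequencies below are invented, and the corpus size reuses the 1.8 million listings figure quoted earlier in the deck.

```python
import math


def classic_idf(doc_freq, num_docs):
    """IDF as defined by Lucene's classic TF-IDF similarity."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


NUM_DOCS = 1_800_000                          # listings figure from the deck

rare = classic_idf(5, NUM_DOCS)               # term found in 5 listings
common = classic_idf(200_000, NUM_DOCS)       # term found in 200,000 listings

# The rare term carries several times the weight of the common one, so a
# single unusual word (or a misspelling) can dominate the ranking.
ratio = rare / common
```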
Where next?
Query engine
Facets / autosuggest
Real-time tuning
Machine learning
Multi-term queries
Scoring thresholds
Content value
Questions?