Topic:
LinkedIn Search is built on a distributed realtime indexing system, with strong business requirements on SLAs for both search latency as well as how soon a newly updated document is to appear in the search results. Furthermore, with LinkedIn's rich structured information, guided navigation, or faceted search is an important functionality for enhanced user search experience. Combining faceted search and realtime search while maintaining SLA poses challenges in result and indexing cache. We will also discuss other novel solutions for problems such as: section search, e.g. zoning; balanced segment merge for incremental updating indexes; query segmentation and classificaiton etc.
SPEAKER: John Wang is the Seach Architect behind LinkedIn’s search infrastructure and is primarily focusing on realtime distributed faceted search. John is a frequent contributor to the open source community, e.g. Lucene, Solr etc. Previously, John has led development backing both internet and enterprise search systems at Yahoo!, SimplyHired, Verity/Autonomy etc. John’s LinkedIn profile can be reached at: http://www.linkedin.com/in/javasoze.
Jake Mannix has been building web applications for the past decade, much of this time focussed on search. He is a committer on the Apache Mahout project (a Lucene subproject for scalable machine-learning), the open source Zoie real-time search and Bobo-browse faceting libraries, built on top of Apache Lucene, and currently works at LinkedIn as a Principal Software Engineer responsible for general purpose recommender systems.
Location
Cubberley Community Center
4000 Middlefield Road, Room H-1
Palo Alto, CA Directions