Introduction
This guide covers search in the AI era: how modern retrieval and ranking systems combine classical infrastructure (indexing, serving, scaling) with new modelling techniques (dense retrieval, late interaction, LLM-augmented ranking, and evaluation under real traffic).
Chapters will expand on production patterns, trade-offs, and how teams ship reliable search when models and traffic both grow quickly.