
Unifying the AV ML Stack: From Raw Data to Trained Model with LanceDB
A complete walkthrough of building a training pipeline for autonomous vehicle perception models on top of LanceDB and the Multimodal Lakehouse.
Applications
Autonomous Vehicles
unifying-the-av-ml-stack-lancedb

Volcano Engine LAS's Lance-Based PB-Scale Autonomous Driving Data Lake Solution
How ByteDance's Volcano Engine LAS (Lake for AI Service) uses Lance as its core storage format to rapidly build a next-generation AI data lake that efficiently stores, manages, and processes multimodal data (text, images, audio, and video).
Case Study
Autonomous Vehicles
volcano-engine-autonomous-driving-data-lake-solution
All Posts

Columnar File Readers in Depth: Backpressure
Streaming data applications can be tricky. When you can read data faster than you can process it, unprocessed data piles up and bad things tend to happen. The various solutions to this problem are broadly classified as backpressure.
Engineering
columnar-file-readers-in-depth-backpressure

Multimodal Myntra Fashion Search Engine Using LanceDB
Build a multimodal fashion search engine with LanceDB and CLIP embeddings. Follow a step‑by‑step workflow to register embeddings, create the table, query by text or image, and ship a Streamlit UI.
Engineering
multimodal-myntra-fashion-search-engine-using-lancedb

Hybrid Search and Custom Reranking with LanceDB
Combine keyword and vector search for higher‑quality results with LanceDB. This post shows how to run hybrid search and compare rerankers (linear combination, Cohere, ColBERT) with code and benchmarks.
Engineering
hybrid-search-and-custom-reranking-with-lancedb-4c10a6a3447e

Inverted File Product Quantization (IVF_PQ): Accelerate Vector Search by Creating Indices
Compress vectors with PQ and accelerate retrieval with IVF_PQ in LanceDB. The tutorial covers the concepts, the memory savings, and a minimal implementation with search tuning knobs.
Engineering
benchmarking-lancedb-92b01032874a-2








