Blog - LanceDB Blog (Page 2)

GTA5 Multimodal RAG application

Artificial Intelligence (AI) has been actively working with text for quite some time, but the world isn't solely centered around words. If you take a moment to look around, you'll find a mix of text, images, videos, audio, and their combinations.

Blog 11 min read

Implementing Corrective RAG in the Easiest Way

Even though text-generation models are good at generating content, they sometimes need to improve in returning facts. This happens because of the way they are

Blog 5 min read

Hybrid search and custom reranking with LanceDB

Search has historically been a complex problem in computer science. This is partly due to limitations in natural language or contextual understanding, and partly due

Blog 12 min read

Hybrid search: RAG for real-life production-grade applications

by Mahesh Deshwal What is Hybrid Search, and what’s the need for it? With the increasing usage of LLMs in RAG setting, there’s

Blog 6 min read

Benchmarking New OpenAI Embedding Models with LanceDB

A couple of weeks ago, OpenAI launched their new and most performant embedding models with higher multilingual performance and new parameters to control the overall

Blog 3 min read

Substrait Powered Filter Pushdown in Lance

by Weston Pace Filter pushdown is one of the more fundamental optimizations in any data engineering pipeline. The premise is simple: the earlier you filter

Blog 5 min read

LanceDB + Polars

A (near) perfect match A spiritual successor to pandas, Polars is a new blazing fast DataFrame library for Python written in Rust. At LanceDB, we

Blog 6 min read

Efficient RAG with Compression and Filtering

by Kaushal Choudhary Why Contextual Compressors and Filters? RAG (Retrieval Augmented Generation) is a technique that helps add additional data sources to our existing LLM

Blog 3 min read

Langroid: Multi-Agent Programming framework for LLMs

In this era of Large Language Models (LLMs), there is unprecedented demand to create intelligent applications powered by this transformative technology. What is the best

Blog 5 min read

Using Prediction Guard and LanceDB to Prevent Medical Hallucinations

This is a collective work of the following authors: Sharan Shirodkar (Prediction Guard) Daniel Whitenack (Prediction Guard) Bingyang (Icy) Wang (Emory University) Guangming (Dola) Qiu

Blog 7 min read

Using column statistics to make Lance scans 30x faster

by Will Jones In Lance v0.8.21, we introduced column statistics and statistics-based page pruning. This enhancement reduces the number of IO calls needed

Blog 6 min read

Benchmarking LanceDB

I came upon a blog post yesterday benchmarking LanceDB. The numbers looked very surprising to me, so I decided to do a quick investigation on

Blog 7 min read