Better RAG with Active Retrieval Augmented Generation FLARE by Akash A. Desai Welcome to our deep dive into Forward-Looking Active Retrieval Augmented Generation (FLARE), an innovative approach enhancing the accuracy and reliability of
Multitask Embedding with LanceDB By Kaushal Choudhary In this blog, we’ll cover the InstructOR embedding model and how it can be used with LanceDB. LanceDB embedding API already supports
NER-powered Semantic Search using LanceDB NER stands for Named Entity Recognition, a form of natural language processing (NLP) that involves extracting and identifying essential information from text. The information that
GPU-accelerated Indexing in LanceDB Vector databases are extremely useful for RAG, RecSys, computer vision, and a whole host of other ML/AI applications. Because of the rise of LLMs,
A Primer on Text Chunking and its Types Text chunking is a technique in natural language processing that divides text into smaller segments, usually based on the parts of speech and grammatical meanings
Optimizing LLMs: A Step-by-Step Guide to Fine-Tuning with PEFT and QLoRA A Practical Guide to Fine-Tuning LLM using QLoRA Conducting inference with large language models (LLMs) demands significant GPU power and memory resources, which can be
Serverless Multi-Modal search engine application by Ayush Chaurasia In this writeup, you’ll learn the process of building a multi-modal search engine using Roboflow’s CLIP inference API and LanceDB,
Context-aware chatbot using Llama 2 & LanceDB serverless vector database Building a real chatbot using the RAG method – by Akash A. Desai Introduction: Many people know about OpenAI’s cool AI models like GPT-3.5 and
LLMs, RAG, & the missing storage layer for AI by Ayush Chaurasia In the rapidly evolving landscape of artificial intelligence, Generative AI, especially Large Language Models (LLMs), have emerged as the veritable backbone of
Improving LLM-based web applications with easy-to-use and free serverless vector database LanceDB by Tevin Wang I believe that there are currently two major challenges using LLMs. (1) outdated information (ChatGPT has limited knowledge of the world and
S3 Backed Full-Text Search with Tantivy (Part 1) by Rob Meng Tantivy is a highly performant full-text search library written in Rust. The Python version of Tantivy powers the full-text search (FTS)
Maximizing Developer Workflows using Lance and Claude 2 by Leon Yee Introduction Just a few days ago, I went to Anthropic’s Hackathon, where hackers utilized their Claude 2 APIs to build some