🔥LanceDB as the Local Vector Database of Choice of GraphRAG🔥
Microsoft Research open sourced their GraphRAG project last week, and we are excited that the GraphRAG team has decided to launch with LanceDB as the local vectordb of choice!
🔬Experimental Native Full Text Search Released in Lance🔬
- As implemented from scratch in Rust, it works with S3 / GCS directly.
- FTS will be available in Python, Typescript and Java SDKs in August.
- Experimental encoding for struct column for faster random access.
Community contributions
💡
FSST: Thank you @broccoliSpicy for making FSST available. With FSST, we see sizable compression over large string columns, which is especially useful for AI datasets that store text prompts.
💡
A heartfelt thank you to our community contributors of lance and lancedb this month: @jiachengdb @broccoliSpicy @BitPhinix @niyue @gagan-bhullar-tech @inn-0 @MagnusS0 @nuvic @JoanFM @thomasjpfan @sidharthrajaram @forrestmckee
Good reads
We appreciate the feedback from our community and have been diligently working on several initiatives over the past month! We have embarked on a journey to rewrite our SDKs with the goal of achieving better cross-language consistency. Additionally, we are excited to announce the release of Lance v0.15.0, which includes significant performance enhancements and new capabilities. Be sure to check out our two engineering blogs for more details!
Event recap
Latest releases
- More advanced encodings in Lance V2 format: FSST, Bitpacking, Dictionary and packed struct encodings
- New rerankers available for customization: Reciprocal Rank Fusion reranker and Jina reranker
- Huggingface transformers in Typescript SDK