GraphRAG, Full-Text Search, LanceDB+Character.AI

GraphRAG, Full-Text Search, LanceDB+Character.AI

1 min read

🔥LanceDB as the Local Vector Database of Choice of GraphRAG🔥

Microsoft Research open sourced their GraphRAG project last week, and we are excited that the GraphRAG team has decided to launch with LanceDB as the local vectordb of choice! 

🔬Experimental Native Full Text Search Released in Lance🔬

  • As implemented from scratch in Rust, it works with S3 / GCS directly.  
  • FTS will be available in Python, Typescript and Java SDKs in August.
  • Experimental encoding for struct column for faster random access.

Community contributions

💡
FSST: Thank you @broccoliSpicy for making FSST available. With FSST, we see sizable compression over large string columns, which is especially useful for AI datasets that store text prompts.
💡
A heartfelt thank you to our community contributors of lance and lancedb this month: @jiachengdb @broccoliSpicy @BitPhinix @niyue @gagan-bhullar-tech @inn-0 @MagnusS0 @nuvic @JoanFM @thomasjpfan @sidharthrajaram @forrestmckee

Good reads

We appreciate the feedback from our community and have been diligently working on several initiatives over the past month! We have embarked on a journey to rewrite our SDKs with the goal of achieving better cross-language consistency. Additionally, we are excited to announce the release of Lance v0.15.0, which includes significant performance enhancements and new capabilities. Be sure to check out our two engineering blogs for more details!

Event recap

LancedB + Character.AI - The Hierarchy of Needs for Training Dataset Development

The Data Exchange with Ben Lorica: Unlocking the Power of Unstructured Data

Latest releases