LanceDB Enterprise Architecture, Lance Community Office Hour, Petabyte Scale Multimodal AI

🔥LanceDB Enterprise Documentation

LanceDB Enterprise transforms your data lake into a high performance vector database that can operate at extreme scale. Serve millions of tables and tens of billions of rows in a single index, improve retrieval quality using hybrid search with blazing fast metadata filters, and up to 200x cheaper with object storage.

LanceDB Enterprise Architecture

👩‍💻Lance Community Office Hour

We kicked off our first monthly community office hour in Oct, in which the LanceDb team talked about what we're currently working on, discussed rough plans and future roadmap, and answered questions from anyone in the Lance and LanceDB community. The next one is coming up soon, come say hi!

November Office Hour

Community contributions

💡

A heartfelt thank you to our community contributors of lance and lancedb this month: @niyue @SaintBacchus @dsgibbons @erikml-db @alexwilcoxson-rel @o-alexandrov @do-me @rithikJha @jameswu1991 @gagan-bhullar-tech @akashsara @sayandipdutta @rjrobben @PrashantDixit0 @akashsara

Good reads

Another deep dive into the Lance file format and explain how Lance uses backpressure to balance parallelism, speedy I/O, streaming computation, and limited RAM usage.

Event recap

Lance Format Deep Dive Podcast with Weston Pace

Multimodal AI in Production with LanceDB and Ray, by Chang She & Lei Xu

Upcoming events

Latest releases

Lance releases:

LanceDB releases:

v0.11.0 (2024-10-09)
- Python v0.14.0
- From Lance:
  - SQL now supports list element access.
  - Filter pushdown enabled for scans of Lance V2 files.
  - Zstd block compression supported, if opted in for particular columns.
v0.12.0 (2024-10-29)
- Python v0.15.0
- LanceDB’s native tokenizer is now configurable in Python sync API, allowing you to choose whether to enable stemming and stop word removal, as well as other settings. The default tokenizer no longer performs stemming or stop word removal. This will come soon to Nodejs and async Python.