🧠Lance Format Deep Dive, Samsara❤️LanceDB, 🏖️Summer Events Circuit


🎤 Catch Us on Stage This June!

We’re thrilled to be speaking at several top-tier events this month — alongside our customers — sharing real-world insights from scaling enterprise AI systems in production.

If you’re attending the AI Engineering World Fair (June 3–5), Data + AI Summit (June 9–12), or the Toronto Machine Learning Summit (June 13–18), don’t miss our sessions across multiple tracks. Come say hi and learn what we’ve been building!


⚙️ Lance Format Deep Dives

In addition to our highly requested deep dives into the Lance format, we also shared our perspective on the future of open source table formats — inspired by feedback and questions from the Iceberg community.

Curious where things are headed? Dig in 👇

Columnar File Readers in Depth: Column Shredding
Record shredding is a classic method used to transpose rows of potentially nested data into a flattened tree of buffers that can be written to the file. A similar technique, cascaded encoding, has recently emerged, which converts those arrays into a flattened tree of compressed buffers. In this article we…
Columnar File Readers in Depth: Repetition & Definition Levels
Repetition and definition levels are a method of converting structural arrays into a set of buffers. The approach was made popular in Parquet and is one of the key ways Parquet, ORC, and Arrow differ. In this blog I will explain how they work by contrasting them with validity & offsets…
The Future of Open Source Table Formats: Apache Iceberg and Lance
As the scale of data continues to grow, open-source table formats have become essential for efficient data lake management. Apache Iceberg has emerged as a leader in this space, while new formats like Lance are introducing optimizations for specific workloads. In this post, we’ll explore how Iceberg and Lance…

🎥 Event Recap: AI at Scale with Samsara

The Samsara team is harnessing LanceDB to simplify and streamline AI data infrastructure for massive, real-world datasets.

In May, our cofounder Chang She joined Samsara’s AI Speaker Series, where he shared cutting-edge insights on multimodal AI and the evolving landscape of AI agents.

Missed it? Catch the recording below 👇

Scaling AI Data Infrastructure: A Multimodal Approach


🔐 LanceDB Enterprise Product News

  • Smoother concurrent upserts: Upsert operations are now conflict-free in typical workloads, so you can write without worrying about collisions.
  • Significantly reduced storage costs: Cut object store operations by up to 95% by loading small files in a single I/O request instead of many — ideal for small-table workloads.
  • Filter binary data with ease: Now query large binary columns directly in your filters – no workarounds needed.
  • Optimized GCP deployment tuning: Fine-tune weak consistency and concurrency limits to better balance performance, cost, and flexibility.
  • Intuitive embedding visualization: New UMAP visualizations help you explore and understand vector data at a glance.
Learn more

Embedding Visualization shown in LanceDB Cloud (Beta)


👥 Community Contributions

A heartfelt thank you to our community contributors of lance and lancedb this month: @yanghua @frankliee @leaves12138 @Jay-ju @KazuhitoT @majin1102 @upczsh @renato2099 @HaochengLIU @omahs @xaptronic @acoliver

🛠️ Open Source Releases Spotlight 

  • Boolean logic for full-text search: Combine filters with AND/OR or &/| — full-text search now works the way you think.
fts_query = MatchQuery("puppy", "text") & MatchQuery("happy", "text")
  • Faster, smarter full-text indexing: Compression and optimized search algorithms speed up index builds and boost performance at scale.
  • No more stalled upserts: A new timeout setting ensures merge_insert operations won’t hang forever.
(
    table.merge_insert("id")
        .when_matched_update_all()
        .when_not_matched_insert_all()
        .execute(new_data, timeout=timedelta(seconds=10))
)
  • Flexible phrase matching: Control how loose or tight your matches are with the slop parameter.
fts_query = PhraseQuery("frodo was happy", "text", slop=2)
  • Spark compatibility built in: Works with multiple Spark versions out of the box — just drop in the bundled JAR and go. Quick start