Benjamin Trent

About the author

Benjamin Trent is a Lucene committer and member of the project management committee at The Apache Software Foundation and a software engineer at Elastic, where he works on improving Elasticsearch.

Author’s articles

Search Relevance Vector Database+1

March 20, 2025

Scaling late interaction models in Elasticsearch - part 2

This article explores techniques for making late interaction vectors ready for large-scale production workloads, such as reducing disk space usage and improving computation efficiency.

By: Peter Straßer and Benjamin Trent

Search Relevance Vector Database+1

March 18, 2025

Searching complex documents with ColPali - part 1

The article introduces the ColPali model, a late-interaction model that simplifies the process of searching complex documents with images and tables, and discusses its implementation in Elasticsearch.

By: Peter Straßer and Benjamin Trent

Lucene Vector Database

February 27, 2025

Filtered HNSW search, fast mode

Explore the improvements we have made for HNSW vector search in Apache Lucene through our ACORN-1 algorithm implementation.

By: Benjamin Trent

Lucene

February 7, 2025

Concurrency bugs in Lucene: How to fix optimistic concurrency failures

Thanks to Fray, a deterministic concurrency testing framework from CMU’s PASTA Lab, we tracked down a tricky Lucene bug and squashed it.

By: Benjamin Trent and Ao Li

Lucene Vector Database

January 6, 2025

Optimized Scalar Quantization: Even Better Binary Quantization

Here we explain optimized scalar quantization in Elasticsearch and how we used it to improve Better Binary Quantization (BBQ).

By: Benjamin Trent

Lucene

December 27, 2024

Lucene bug adventures: Fixing a corrupted index exception

Sometimes, a single line of code takes days to write. Here, we get a glimpse of an engineer's pain and debugging over multiple days to fix a potential Apache Lucene index corruption.

By: Benjamin Trent

Vector Database Lucene+1

November 18, 2024

Better Binary Quantization (BBQ) vs. Product Quantization

Why we chose to spend time working on Better Binary Quantization (BBQ) instead of product quantization in Lucene and Elasticsearch.

By: Benjamin Trent

Lucene Vector Database

November 11, 2024

Better Binary Quantization (BBQ) in Lucene and Elasticsearch

How Better Binary Quantization (BBQ) works in Lucene and Elasticsearch.

By: Benjamin Trent

Vector Database Search Relevance

August 27, 2024

Looking back: Elastic's vector search improvements in Elasticsearch & Lucene

Looking back at Elastic's vector search innovations in Elasticsearch and Lucene.

By: Kathleen DeRusso and Benjamin Trent

Vector Database

July 17, 2024

Bit vectors in Elasticsearch

Discover what bit vectors are, their practical implications, and how to use them in Elasticsearch.

By: Benjamin Trent

Vector Database Generative AI

April 26, 2024

Making Elasticsearch and Lucene the best vector database: up to 8x faster and 32x more efficient

Discover the recent enhancements and optimizations that notably improve vector search performance in Elasticsearch and Lucene.

By: Mayya Sharipova, Benjamin Trent and Jim Ferenczi

Lucene ML Research

April 25, 2024

Understanding Int4 scalar quantization in Lucene

This blog explains how int4 quantization works in Lucene, how it lines up, and the benefits of using int4 quantization.

By: Benjamin Trent and Thomas Veasey

ML Research

April 25, 2024

Scalar quantization optimized for vector databases

Optimizing scalar quantization for the vector database use case allows us to achieve significantly better performance for the same retrieval quality at high compression ratios.

By: Thomas Veasey and Benjamin Trent

Vector Database How To

December 7, 2023

Introducing kNN Query: An expert way to do kNN search

Explore how the kNN query in Elasticsearch can be used and how it differs from top-level kNN search, including examples.

By: Mayya Sharipova and Benjamin Trent

Lucene ML Research

November 11, 2023

Understanding scalar quantization in Lucene

Explore how Elastic introduced scalar quantization into Lucene, including automatic byte quantization, per-segment quantization, and performance insights.

By: Benjamin Trent

Lucene ML Research

October 25, 2023

Scalar quantization 101

Understand what scalar quantization is, how it works, and its benefits. This guide also covers the math behind quantization, with examples.

By: Benjamin Trent

Lucene

September 1, 2023

Bringing maximum-inner-product into Lucene

Explore how we brought maximum-inner-product into Lucene and the investigations undertaken to ensure its support.

By: Benjamin Trent

Vector Database Lucene

August 24, 2023

Adding passage vector search to Lucene

Here's how to add passage vectors to Lucene, the benefits of doing so and how existing Lucene structures can be used to create an efficient retrieval experience.

By: Benjamin Trent

Generative AI

January 23, 2023

Save space with byte-sized vectors

Elasticsearch is introducing a new type of vector that has 8-bit integer dimensions. This is 4x smaller than the current vector with 32-bit float dimensions, which can result in substantial space savings.

By: Jack Conradson and Benjamin Trent

Generative AI

April 20, 2022

Aggregate data faster with the new random_sampler aggregation

Aggregate billions of documents in milliseconds instead of minutes with Elastic. Learn more about how the new random_sampler aggregation gives you statistically robust results at a lower cost.

By: Benjamin Trent and Thomas Veasey
