MB-66395: [v17] Add Request Batcher Module (for GPU indexes) by CascadingRadium · Pull Request #389 · blevesearch/zapx

CascadingRadium · 2026-03-31T10:08:30Z

Add latency-aware batching for vector search requests using a coalescing queue
to combine compatible requests and execute them together to reduce number of
Faiss calls.
Ensure adaptive batching by using Nagle's algorithm for maximum efficiency.

- Add latency-aware batching for vector search requests using a timer-driven coalescing queue. - Merge compatible requests (same k) and execute them together to reduce number of `Faiss` calls. - Ensure bounded latency via per-request deadlines using a package-level variable called `LatencyBudget`.

This reverts commit 02cddd2.

- Add latency-aware batching for vector search requests using a timer-driven coalescing queue. - Merge compatible requests (same k) and execute them together to reduce number of `Faiss` calls. - Ensure bounded latency via per-request deadlines using a package-level variable called `LatencyBudget`.

faiss_vector_request_batcher.go

Copilot

Pull request overview

Adds a new latency-aware batching module intended to coalesce compatible Faiss vector search requests (same k) into fewer Faiss calls, while also introducing supporting vectorSet helpers and a small reset fix in vector index opaque state.

Changes:

Introduces requestBatcher / coalesceQueue for timer-driven request coalescing and batched execution.
Adds vectorSet.clone() and vectorSet.mergeWith() helpers to support request merging.
Extends vector index interfaces with a faissIndexBatch abstraction and clears vectorIndexOpaque.config on reset.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
section_faiss_vector_index.go	Clears `vectorIndexOpaque.config` during `Reset()` to avoid retaining prior config state.
faiss_vector_wrapper.go	Adds `vectorSet` cloning/merging helpers used by batching.
faiss_vector_request_batcher.go	New batching/coalescing implementation for Faiss searches.
faiss_vector_request_batcher_test.go	Adds a benchmark harness and fake index for the batcher.
faiss_vector_index.go	Adds `faissIndexBatch` interface for batched search operations.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

faiss_vector_request_batcher.go

faiss_vector_request_batcher_test.go

faiss_vector_wrapper.go

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 7 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

faiss_vector_request_batcher.go

faiss_vector_request_batcher_test.go

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

faiss_vector_request_batcher.go

faiss_vector_wrapper.go

faiss_vector_request_batcher.go

CascadingRadium · 2026-04-09T13:12:52Z

exploring a self-tuning algorithm for higher QPS while batching. Converting to draft.

abhinavdangeti · 2026-04-10T20:00:52Z

faiss_vector_request_batcher.go

+type requestBatcher struct {
+	// the coalesce queue that manages the batching of incoming search requests.
+	cq *coalesceQueue
+}


Let's add some unit testing and/or benchmarks for how many requests we can execute in a given window of time (assuming a constant runtime for each batch).

Likith101 · 2026-04-13T08:49:57Z

faiss_vector_index.go

 }
+
+// Interface for batched search operations on Faiss vector indices.
+type faissIndexBatch interface {


nit: faissQueryBatch

CascadingRadium added 11 commits March 18, 2026 15:44

interface defn

ea5d90b

interfaces

3f8ac66

Add impls

34b1f82

add constructor

cc07c8d

small refactor

c6294fc

few cleanups

525cec1

Merge branch 'master' into pre_gpu

b262d75

Merge branch 'master' into merge_fastmerge

9efebea

Revert "[v17] Add Request Batcher Module (#384)"

43faa98

This reverts commit 02cddd2.

CascadingRadium changed the base branch from master to merge_fastmerge March 31, 2026 10:08

CascadingRadium requested review from Likith101, Thejas-bhat, abhinavdangeti, capemox and maneuvertomars March 31, 2026 12:17

Base automatically changed from merge_fastmerge to master April 1, 2026 09:28

CascadingRadium and others added 4 commits April 1, 2026 15:08

Merge branch 'master' into batcher

670cc20

fix merge conflict

dd6c8a7

fix data race

10c244f

stop() now synchronously waits for monitor to finish

1299fd2

CascadingRadium commented Apr 1, 2026

View reviewed changes

faiss_vector_request_batcher.go Outdated Show resolved Hide resolved

add missed reset

98a3bea

CascadingRadium requested a review from Copilot April 1, 2026 20:58

Copilot started reviewing on behalf of CascadingRadium April 1, 2026 20:59 View session

Copilot AI reviewed Apr 1, 2026

View reviewed changes

code review

546daed

CascadingRadium requested a review from Copilot April 1, 2026 21:52

Copilot started reviewing on behalf of CascadingRadium April 1, 2026 21:52 View session

Copilot AI reviewed Apr 1, 2026

View reviewed changes

CascadingRadium and others added 3 commits April 2, 2026 03:29

small fix

0fd6285

Update faiss_vector_request_batcher.go

91dbca9

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update faiss_vector_request_batcher.go

562a4e7

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

CascadingRadium requested a review from Copilot April 3, 2026 16:43

Copilot started reviewing on behalf of CascadingRadium April 3, 2026 16:43 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

CascadingRadium added 3 commits April 3, 2026 22:32

Merge branch 'master' into batcher

97bf7cc

unexport and const

29cf538

reduce budgets for better QPS

ebb6b01

CascadingRadium marked this pull request as draft April 9, 2026 13:12

CascadingRadium added 3 commits April 10, 2026 17:51

nagle algorithm

27c1a69

remove bench file

aaaed9a

remove comment

09fe4a2

CascadingRadium marked this pull request as ready for review April 10, 2026 12:26

Merge branch 'master' into batcher

3fc4440

abhinavdangeti changed the title ~~[v17] Add Request Batcher Module~~ MB-66395: [v17] Add Request Batcher Module Apr 10, 2026

abhinavdangeti changed the title ~~MB-66395: [v17] Add Request Batcher Module~~ MB-66395: [v17] Add Request Batcher Module (for GPU indexes) Apr 10, 2026

abhinavdangeti added this to GPU-Accelerated Vector Search Apr 10, 2026

github-project-automation bot moved this to Todo in GPU-Accelerated Vector Search Apr 10, 2026

abhinavdangeti requested changes Apr 10, 2026

View reviewed changes

CascadingRadium self-assigned this Apr 10, 2026

Likith101 reviewed Apr 13, 2026

View reviewed changes

Likith101 requested changes Apr 13, 2026

View reviewed changes

CascadingRadium moved this from Todo to In Progress in GPU-Accelerated Vector Search Apr 14, 2026

Conversation

CascadingRadium commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CascadingRadium commented Apr 9, 2026

Uh oh!

abhinavdangeti Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

Likith101 Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

CascadingRadium commented Mar 31, 2026 •

edited

Loading