feat: relax sentence-transformers and transformers version pins for 5.x compat#210

Open
j-sperling wants to merge 5 commits intolightonai:mainfrom
j-sperling:fix/sentence-transformers-5x-compat
Conversation

@j-sperling j-sperling commented Mar 24, 2026

Summary

  • Bump sentence-transformers from == 5.1.1 to == 5.3.0
  • Cap transformers at < 5.0.0 (was <= 4.56.2) to allow 4.57.x while deferring v5 until numerical equivalence is confirmed
  • Remove deprecated overwrite_output_dir from 3 test files (no-op on v4, required for eventual v5 support)
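
In `pyproject.toml` terms, the constraints described above would look roughly like this (surrounding fields elided; the exact bounds are the ones under discussion in this thread):

```toml
[project]
dependencies = [
    "sentence-transformers == 5.3.0",
    "transformers >= 4.41.0, < 5.0.0",
]
```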

Numerical verification

ColBERT embeddings are bit-identical between transformers 4.56.2 and 5.4.0 (both with ST 5.3.0, torch 2.9.0):

| Model | Max abs diff | Min cosine sim | Verdict |
|---|---|---|---|
| lightonai/GTE-ModernColBERT-v1 | 0.00 | 1.000000 | Bit-identical |
| lightonai/colbertv2.0 | 0.00 | 1.000000 | Bit-identical |

The known transformers v5 divergences (huggingface/transformers#42889, huggingface/transformers#43697) affect T5/encoder-decoder and vision models due to weight tying refactoring. BERT-based architectures used by pylate's ColBERT models are unaffected.
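For reference, the two comparison metrics in the table can be computed from a pair of embedding matrices like this (a standalone sketch in plain Python; `emb_v4`/`emb_v5` are toy stand-ins for the per-token embeddings produced under each transformers version):

```python
import math

def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two embedding matrices."""
    return max(abs(x - y) for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))

def min_cosine_sim(a, b):
    """Smallest per-row cosine similarity between corresponding token embeddings."""
    def cos(u, v):
        dot = sum(x * y for x, y in zip(u, v))
        norm_u = math.sqrt(sum(x * x for x in u))
        norm_v = math.sqrt(sum(x * x for x in v))
        return dot / (norm_u * norm_v)
    return min(cos(u, v) for u, v in zip(a, b))

# Toy per-token embeddings from the two transformers versions:
emb_v4 = [[1.0, 0.0], [0.6, 0.8]]
emb_v5 = [[1.0, 0.0], [0.6, 0.8]]
print(max_abs_diff(emb_v4, emb_v5))   # 0.0 for bit-identical outputs
print(min_cosine_sim(emb_v4, emb_v5)) # 1.0 for bit-identical outputs
```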

Test plan

Tested with ST 5.3.0 + transformers 4.57.6 on Python 3.12 (Apple Silicon):

  • Model loading works: GTE-ModernColBERT-v1, ColBERT-Zero, colbertv2.0
  • Encoding produces correct shapes (per-token 128d embeddings)
  • 13/16 tests pass (3 training tests fail on MPS -- pre-existing, unrelated)
  • 1 model-loading test fails (jina-colbert-v2 HF config issue -- pre-existing)

Motivation

The current exact pin on sentence-transformers==5.1.1 prevents PyLate from being installed alongside other packages that depend on newer ST versions. This is a common integration pain point (see #144, #190).

The transformers<=4.56.2 upper bound blocks 4.57.x which includes bug fixes and performance improvements.
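To make the bound change concrete, here is a small sketch (plain Python, not a real specifier matcher; ignores pre-releases and the like) of which transformers releases each ceiling admits:

```python
def admits(version, floor=(4, 41, 0), ceiling=(5, 0, 0)):
    """True if a release tuple satisfies floor <= version < ceiling.
    A simplified stand-in for real version-specifier matching."""
    return floor <= version < ceiling

# Old spec: >= 4.41.0, <= 4.56.2 (approximated here as < 4.56.3)
old = lambda v: admits(v, ceiling=(4, 56, 3))
# New spec: >= 4.41.0, < 5.0.0
new = lambda v: admits(v, ceiling=(5, 0, 0))

print(old((4, 57, 1)), new((4, 57, 1)))  # False True  -> 4.57.x newly allowed
print(old((5, 4, 0)), new((5, 4, 0)))    # False False -> v5 still deferred
```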

Related

….x compat

Relax sentence-transformers from ==5.1.1 to >=5.1.1 and remove the
transformers upper bound (was <=4.56.2, now >=4.48.0).

Tested with ST 5.3.0 + transformers 5.3.0:
- Model loading works (GTE-ModernColBERT-v1, ColBERT-Zero)
- Encoding produces correct shapes
- 13/16 tests pass (3 training tests fail on MPS -- unrelated)
- Only non-jina model loading tests pass (jina-colbert-v2 has a
  separate HF config issue)

Also removes deprecated overwrite_output_dir from 3 test files
(removed in transformers 5.x TrainingArguments).
NohTow (Collaborator) commented Mar 24, 2026

Thanks for the PR!
Let me run the test and try to have a look ASAP

Edit:
I might take a bit of time to review because:

  1. I am quite drowning lately but most importantly
  2. This is a major update for transformers, so I need to check with @tomaarsen if there isn't any pitfall

In any case, we'll have to pin the exact version of ST to the latest one: we are not using >= because it can break when Tom releases a new version. I know it's a pain and we also have to make updates manually each time, but this is the only way to keep things stable. This is the main argument in favor of merging PyLate upstream into ST.

pyproject.toml Outdated
]
dependencies = [
"sentence-transformers == 5.1.1",
"sentence-transformers >= 5.1.1",
Suggested change
"sentence-transformers >= 5.1.1",
"sentence-transformers >= 5.1.1, <6.0.0",

matospiso commented

if you're using uv, you can temporarily avoid this inconvenience by adding

```toml
[tool.uv]
override-dependencies = [
    "transformers>=some_version",
    "sentence-transformers>=another_version",
]
```

to your pyproject.toml (until this PR is merged)

NohTow (Collaborator) commented Mar 25, 2026

Hey,
I chatted quickly with @tomaarsen and the new ST version indeed allows using Transformers v5, but I have a few things to consider before making this update:

  1. Transformers v5 seems to yield different values than v4.x for some models. Until we know why (Tom is currently working on the multimodal update and did not have time to dig further), I would be a bit worried about lifting the version lock, as any new install would use v5 by default.
  2. Although all tests are passing, I need to take a bit of time to check the release notes, as Tom made some internal modifications and I want to make sure everything is fine.

I am sorry to be slow with some updates lately, I've been quite busy, but as raised above, you can work around those issues locally. Is there any hard-blocking issue in your setup? I'll try to fix everything ASAP.

- sentence-transformers: >= 5.1.1 -> == 5.3.0 (exact pin per maintainer policy)
- transformers: >= 4.48.0 -> >= 4.41.0 (restore original floor, no justification for bump)
- Keep overwrite_output_dir removal (needed for transformers 5.x compat)
- Keep transformers ceiling removed (ColBERT models produce bit-identical
  embeddings between transformers 4.56.2 and 5.4.0)
j-sperling (Author) commented

Thanks for the detailed feedback @NohTow -- totally understand both concerns. I dug into the transformers v5 numerical issue before responding.

ColBERT numerical comparison: transformers 4.56.2 vs 5.4.0

I ran pylate's encode() on identical inputs with transformers 4.56.2 and 5.4.0 (both with ST 5.3.0, torch 2.9.0):

| Model | Max abs diff | Min cosine sim | Verdict |
|---|---|---|---|
| lightonai/GTE-ModernColBERT-v1 | 0.00 | 1.000000 | Bit-identical |
| lightonai/colbertv2.0 | 0.00 | 1.000000 | Bit-identical |

The known transformers v5 divergences (huggingface/transformers#42889 for T5 weight tying, huggingface/transformers#43697 for RTDetrV2) affect encoder-decoder and vision model families. BERT-based architectures -- which all pylate ColBERT models use -- produce identical outputs.

Happy to test additional models if you'd like.

Updated PR

Based on your feedback, I've updated the branch:

  1. ST pin: == 5.3.0 (exact pin per your policy)
  2. Transformers: removed the <= 4.56.2 ceiling, kept the original >= 4.41.0 floor unchanged
  3. overwrite_output_dir removal: kept (needed for transformers 5.x TrainingArguments)

If you'd prefer to keep the transformers ceiling until Tom confirms independently, I can revert that part and just ship the ST bump. Your call.

"datasets >= 2.20.0",
"accelerate >= 0.31.0",
"pandas >= 2.2.1",
"transformers >= 4.41.0, <= 4.56.2",

The transition to 5.*+ is a breaking change and feels too big right now.

In this PR, perhaps we can compromise and make this change to:
"transformers >= 4.41.0, <= 4.57.1",

TheAdamEvans commented Apr 5, 2026

Should be no breaking changes this way --> transformers <= 4.57.1

.gitignore Outdated
Comment on lines +169 to +173

# Agent config (generated/local)
.memsearch/
AGENTS.md
CLAUDE.md

Don't need this addition in this PR.

AGENTS.md is another whole discussion and task.

- Remove out-of-scope .gitignore additions (AGENTS.md, CLAUDE.md, .memsearch)
- Add transformers < 5.0.0 ceiling to keep v4 boundary while
  allowing 4.57.x patches

j-sperling commented Apr 6, 2026

Thanks @TheAdamEvans. Reverted the .gitignore additions -- clearly out of scope.

For the transformers ceiling: updated to <= 4.57.1 as you suggested. I'd initially gone with < 5.0.0 to avoid the pin going stale on each patch, but a tight ceiling is the safer call while the v5 numerical story is still being sorted out (see discussion above with @NohTow).

Kept the overwrite_output_dir removal -- it's a no-op on v4 (default was already False) and will be needed whenever v5 support lands.

NohTow (Collaborator) commented Apr 7, 2026

Hey,
Sorry for the delay, Raphael and I were at ECIR last week and are on vacation this week.
Will focus on the various PRs when I get back
Apologies for the delay again

tomaarsen (Collaborator) commented

Hello!

I've run some more tests between transformers v4 and v5, and the discrepancies I spotted earlier were likely caused by differences in default options for dtype and attn_implementation. With those fixed, I'm only getting discrepancies for ~3 rerankers and a handful of Sparse Encoder models, and no regressions for dense embedding models, across a total of 169 model tests.
In short, moving to transformers v5 seems pretty safe.

  • Tom Aarsen
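One way to control for those defaults when comparing versions is to pin them explicitly at load time. A sketch (sentence-transformers does accept a `model_kwargs` pass-through to transformers; the specific keys and values below are illustrative and version-dependent, so treat them as assumptions):

```python
# Hypothetical load-time pinning of the defaults that changed between
# transformers v4 and v5, so outputs are compared like-for-like.
load_kwargs = {
    "model_kwargs": {
        "dtype": "float32",             # don't let a new default dtype skew the diff
        "attn_implementation": "eager", # avoid backend-dependent SDPA/flash paths
    }
}

# Usage (not executed here; requires sentence-transformers and a model download):
# from sentence_transformers import SentenceTransformer
# model = SentenceTransformer("lightonai/GTE-ModernColBERT-v1", **load_kwargs)

print(sorted(load_kwargs["model_kwargs"]))  # ['attn_implementation', 'dtype']
```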

TheAdamEvans commented

No worries team, I know you have a lot on and these are small things; I'm patient! 😄 Looking to get PyLate included in Canva's monorepo. It can be tricky because the entire company has to share the exact same version specs. Thanks @j-sperling for updating 🙏

Re: sentence-transformers, I'd selfishly prefer <=5.1.2 because that's what we have here today and will simplify things for me later on the next PyLate release.

@j-sperling could I convince you to also widen ninja to >=1.11.1? That == pin conflicts with other packages that currently resolve to 1.11.1.1 (a more commonly available version from what I can see).

Happy to open separate issues for these if we'd prefer.

"ujson == 5.10.0",
"ninja == 1.11.1.4",
"fastkmeans == 0.5.0",
"fast-plaid>=1.2.4.260,<=1.3.0.290",

Might I also suggest

Suggested change
"fast-plaid>=1.2.4.260,<=1.4.6.280",
