Make FlaxLMSDiscreteScheduler jittable (#2180) by srlynch1 · Pull Request #8 · srlynch1/diffusers

srlynch1 · 2026-06-21T11:49:20Z

Summary

Replace scipy integration in get_lms_coefficient with JAX-native jnp.trapezoid and vectorized coefficient product
Pre-allocate fixed-shape derivatives buffer via max_order in set_timesteps
Use jax.lax.fori_loop and step-index sigma lookup in step for jit compatibility
Add three @require_flax parity tests (step, full loop, coefficient)

Test plan

pytest tests/schedulers/test_scheduler_lms_flax.py -q (3 passed)
python utils/check_copies.py
ruff check on changed files

Made with Cursor

Three require_flax tests verify step, full-loop, and coefficient parity between eager and jax.jit execution after the jittable scheduler refactor. Co-authored-by: Cursor <cursoragent@cursor.com>

cursor

Cursor Bugbot has reviewed your changes using default effort and found 2 potential issues.

Bugbot Autofix prepared fixes for both issues found in the latest run.

✅ Fixed: Step sigma epsilon mismatch
- Removed the erroneous + 1e-5 offset from step() so sigma matches scale_model_input() and the PyTorch LMSDiscreteScheduler reference.
✅ Fixed: Order exceeds buffer size
- Capped the multistep fori_loop upper bound with jnp.minimum(order, state.derivatives.shape[0]) to prevent out-of-bounds derivative indexing when order exceeds the max_order buffer from set_timesteps.

Or push these changes by commenting:

@cursor push 5495fb409d

Preview (5495fb409d)

diff --git a/src/diffusers/schedulers/scheduling_lms_discrete_flax.py b/src/diffusers/schedulers/scheduling_lms_discrete_flax.py
--- a/src/diffusers/schedulers/scheduling_lms_discrete_flax.py
+++ b/src/diffusers/schedulers/scheduling_lms_discrete_flax.py
@@ -271,7 +271,7 @@
             )
 
         step_index = jnp.where(state.timesteps == timestep, jnp.arange(state.timesteps.shape[0]), 0).sum()
-        sigma = state.sigmas[step_index] + 1e-5
+        sigma = state.sigmas[step_index]
 
         # 1. compute predicted original sample (x_0) from sigma-scaled predicted noise
         if self.config.prediction_type == "epsilon":
@@ -291,9 +291,10 @@
 
         # 3. Compute linear multistep coefficients and the previous sample based on the derivatives path
         effective_order = jnp.minimum(step_index + 1, order)
+        loop_order = jnp.minimum(order, state.derivatives.shape[0])
         prev_sample = jax.lax.fori_loop(
             0,
-            order,
+            loop_order,
             lambda i, val: jnp.where(
                 i < effective_order,
                 val + self.get_lms_coefficient(state, effective_order, step_index, i) * state.derivatives[-(i + 1)],

_{You can send follow-ups to the cloud agent here.}

^{Reviewed by Cursor Bugbot for commit 4eeb570. Configure here.}

cursor · 2026-06-21T11:51:06Z


-        sigma = state.sigmas[timestep]
+        step_index = jnp.where(state.timesteps == timestep, jnp.arange(state.timesteps.shape[0]), 0).sum()
+        sigma = state.sigmas[step_index] + 1e-5


Step sigma epsilon mismatch

Medium Severity

step uses state.sigmas[step_index] + 1e-5 for denoising and the ODE derivative, while scale_model_input uses the same index without the offset. Pipelines scale inputs then call step with the same timestep, so the two paths disagree on noise level versus the PyTorch LMS reference.

Additional Locations (1)

src/diffusers/schedulers/scheduling_lms_discrete_flax.py#L153-L155

^{Reviewed by Cursor Bugbot for commit 4eeb570. Configure here.}

cursor · 2026-06-21T11:51:06Z

+            order,
+            lambda i, val: jnp.where(
+                i < effective_order,
+                val + self.get_lms_coefficient(state, effective_order, step_index, i) * state.derivatives[-(i + 1)],


Order exceeds buffer size

Medium Severity

step loops up to its order argument and indexes state.derivatives[-(i + 1)], but set_timesteps only allocates max_order rows (default 4). Calling step with order greater than max_order without matching set_timesteps causes out-of-bounds indexing; previously the derivatives list could grow with order.

Additional Locations (1)

src/diffusers/schedulers/scheduling_lms_discrete_flax.py#L231-L232

^{Reviewed by Cursor Bugbot for commit 4eeb570. Configure here.}

srlynch1 and others added 2 commits June 21, 2026 21:43

Fix huggingface#2180: make FlaxLMSDiscreteScheduler jittable

5eb4950

Add jit parity tests for FlaxLMSDiscreteScheduler (huggingface#2180)

4eeb570

Three require_flax tests verify step, full-loop, and coefficient parity between eager and jax.jit execution after the jittable scheduler refactor. Co-authored-by: Cursor <cursoragent@cursor.com>

cursor Bot reviewed Jun 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make FlaxLMSDiscreteScheduler jittable (#2180)#8

Make FlaxLMSDiscreteScheduler jittable (#2180)#8
srlynch1 wants to merge 2 commits into
mainfrom
e2e/diffusers-2180

srlynch1 commented Jun 21, 2026

Uh oh!

cursor Bot left a comment •

edited

Loading

Uh oh!

cursor Bot Jun 21, 2026

Uh oh!

cursor Bot Jun 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

srlynch1 commented Jun 21, 2026

Summary

Test plan

Uh oh!

cursor Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cursor Bot Jun 21, 2026

Choose a reason for hiding this comment

Step sigma epsilon mismatch

Uh oh!

cursor Bot Jun 21, 2026

Choose a reason for hiding this comment

Order exceeds buffer size

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

cursor Bot left a comment •

edited

Loading