Add FLUX-only TeaCache inference cache hook by srlynch1 · Pull Request #12 · agentdevsl/diffusers

srlynch1 · 2026-06-24T02:32:34Z

Summary

Add FLUX-only TeaCache inference cache via block hooks (closes #12589)
TeaCacheConfig + apply_teacache wired through CacheMixin.enable_cache()
Polynomial-rescaled modulated-input L1 skip metric from the TeaCache paper
FLUX_TEACACHE_COEFFICIENTS vendored from TeaCache4FLUX

Test plan

8 fast hook unit tests (tests/hooks/test_teacache.py)
TestFluxTransformerTeaCache mixin tests (CI)
check_copies.py
check_dummies.py

Note

Medium Risk
Hooks alter the denoising forward path for FLUX inference; wrong skips could affect image quality, though boundary steps always compute and unsupported models raise explicitly.

Overview
Adds TeaCache as a new inference cache for FLUX (FluxTransformer2DModel only in v1), wired like other caches via TeaCacheConfig, apply_teacache, and transformer.enable_cache().

At the first transformer block, the hook compares consecutive steps using a polynomial-rescaled relative L1 distance on the modulated input (FLUX coeffs in FLUX_TEACACHE_COEFFICIENTS). When the accumulated score stays below rel_l1_thresh, middle blocks are bypassed and the last step’s full-stack residual is replayed; first and last denoising steps always run.

Docs cover usage and a comparison with FirstBlockCache / MagCache. Tests include hook unit tests and TeaCacheTesterMixin on the Flux transformer.

^{Reviewed by Cursor Bugbot for commit 111bcf4. Bugbot is set up for automated code reviews on this repo. Configure here.}

Co-authored-by: Cursor <cursoragent@cursor.com>

Completes public registration for TeaCache constants and keeps dummy_pt_objects in sync. Co-authored-by: Cursor <cursoragent@cursor.com>

cursor

Cursor Bugbot has reviewed your changes using default effort and found 2 potential issues.

Bugbot Autofix prepared fixes for both issues found in the latest run.

✅ Fixed: Mixin threshold blocks inference skip
- Changed TeaCacheConfigMixin rel_l1_thresh to float('inf') so polynomial-rescaled L1 from randn perturbations cannot block the required second-pass skip in _test_cache_inference.
✅ Fixed: Single block skip stalls step
- TeaCacheHeadHook now advances step_index on skip when advance_step_on_skip=True for the single-block apply_teacache path where the tail block hook is bypassed.

Or push these changes by commenting:

@cursor push 84b4512cc4

Preview (84b4512cc4)

diff --git a/src/diffusers/hooks/teacache.py b/src/diffusers/hooks/teacache.py
--- a/src/diffusers/hooks/teacache.py
+++ b/src/diffusers/hooks/teacache.py
@@ -132,11 +132,13 @@
         config: TeaCacheConfig,
         extract_modulated_input: Callable,
         coefficients: List[float],
+        advance_step_on_skip: bool = False,
     ):
         self.state_manager = state_manager
         self.config = config
         self.extract_modulated_input = extract_modulated_input
         self.coefficients = coefficients
+        self.advance_step_on_skip = advance_step_on_skip
         self._metadata = None
 
     def initialize_hook(self, module):
@@ -180,6 +182,9 @@
         if not should_compute:
             logger.debug(f"TeaCache: Skipping step {state.step_index}")
 
+            if self.advance_step_on_skip:
+                self._advance_step(state)
+
             output = hidden_states
             res = state.previous_residual
 
@@ -230,7 +235,15 @@
         self.state_manager.reset()
         return module
 
+    def _advance_step(self, state: TeaCacheState):
+        state.step_index += 1
+        if state.step_index >= self.config.num_inference_steps:
+            state.step_index = 0
+            state.accumulated_distance = 0.0
+            state.previous_residual = None
+            state.previous_modulated_input = None
 
+
 class TeaCacheBlockHook(ModelHook):
     def __init__(self, state_manager: StateManager, is_tail: bool = False, config: TeaCacheConfig = None):
         super().__init__()
@@ -350,7 +363,9 @@
         name, block = remaining_blocks[0]
         logger.info(f"TeaCache: Applying Head+Tail Hooks to single block '{name}'")
         _apply_teacache_block_hook(block, state_manager, config, is_tail=True)
-        _apply_teacache_head_hook(block, state_manager, config, extract_modulated_input, coefficients)
+        _apply_teacache_head_hook(
+            block, state_manager, config, extract_modulated_input, coefficients, advance_step_on_skip=True
+        )
         return
 
     head_block_name, head_block = remaining_blocks.pop(0)
@@ -372,13 +387,16 @@
     config: TeaCacheConfig,
     extract_modulated_input: Callable,
     coefficients: List[float],
+    advance_step_on_skip: bool = False,
 ) -> None:
     registry = HookRegistry.check_if_exists_or_initialize(block)
 
     if registry.get_hook(_TEACACHE_LEADER_BLOCK_HOOK) is not None:
         registry.remove_hook(_TEACACHE_LEADER_BLOCK_HOOK)
 
-    hook = TeaCacheHeadHook(state_manager, config, extract_modulated_input, coefficients)
+    hook = TeaCacheHeadHook(
+        state_manager, config, extract_modulated_input, coefficients, advance_step_on_skip=advance_step_on_skip
+    )
     registry.register_hook(hook, _TEACACHE_LEADER_BLOCK_HOOK)
 
 

diff --git a/tests/models/testing_utils/cache.py b/tests/models/testing_utils/cache.py
--- a/tests/models/testing_utils/cache.py
+++ b/tests/models/testing_utils/cache.py
@@ -643,10 +643,11 @@
     """
 
     # Default TeaCache config - can be overridden by subclasses.
-    # Uses num_inference_steps=4 so interior steps can be skipped during _test_cache_inference.
+    # Uses num_inference_steps=4 and an infinite rel_l1_thresh so the second
+    # inference step is always skipped, which is required by _test_cache_inference.
     TEA_CACHE_CONFIG = {
         "num_inference_steps": 4,
-        "rel_l1_thresh": 100.0,
+        "rel_l1_thresh": float("inf"),
     }
 
     def _get_cache_config(self):

_{You can send follow-ups to the cloud agent here.}

^{Reviewed by Cursor Bugbot for commit 111bcf4. Configure here.}

cursor · 2026-06-24T02:34:53Z

+    TEA_CACHE_CONFIG = {
+        "num_inference_steps": 4,
+        "rel_l1_thresh": 100.0,
+    }


Mixin threshold blocks inference skip

Medium Severity

TeaCacheConfigMixin uses rel_l1_thresh=100.0 assuming the second _test_cache_inference pass will skip, but _test_cache_inference perturbs hidden_states with randn_like, which often pushes the polynomial-rescaled L1 above 100 so step 1 fully recomputes. Cached and uncached outputs then match and the test fails.

^{Reviewed by Cursor Bugbot for commit 111bcf4. Configure here.}

cursor · 2026-06-24T02:34:53Z

+        logger.info(f"TeaCache: Applying Head+Tail Hooks to single block '{name}'")
+        _apply_teacache_block_hook(block, state_manager, config, is_tail=True)
+        _apply_teacache_head_hook(block, state_manager, config, extract_modulated_input, coefficients)
+        return


Single block skip stalls step

Low Severity

When apply_teacache finds only one transformer block, the head hook is outermost and returns early on skip without invoking the co-located tail block hook, so _advance_step never runs and step_index stays stuck across forwards.

Additional Locations (1)

src/diffusers/hooks/teacache.py#L179-L224

^{Reviewed by Cursor Bugbot for commit 111bcf4. Configure here.}

srlynch1 and others added 2 commits June 24, 2026 07:39

Add FLUX-only TeaCache inference cache hook

0800602

Co-authored-by: Cursor <cursoragent@cursor.com>

Export FLUX_TEACACHE_COEFFICIENTS from top-level diffusers API.

111bcf4

Completes public registration for TeaCache constants and keeps dummy_pt_objects in sync. Co-authored-by: Cursor <cursoragent@cursor.com>

cursor Bot reviewed Jun 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add FLUX-only TeaCache inference cache hook#12

Add FLUX-only TeaCache inference cache hook#12
srlynch1 wants to merge 2 commits into
mainfrom
feat/teacache

srlynch1 commented Jun 24, 2026 •

edited by cursor Bot

Loading

Uh oh!

cursor Bot left a comment •

edited

Loading

Uh oh!

cursor Bot Jun 24, 2026

Uh oh!

cursor Bot Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

srlynch1 commented Jun 24, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

cursor Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cursor Bot Jun 24, 2026

Choose a reason for hiding this comment

Mixin threshold blocks inference skip

Uh oh!

cursor Bot Jun 24, 2026

Choose a reason for hiding this comment

Single block skip stalls step

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

srlynch1 commented Jun 24, 2026 •

edited by cursor Bot

Loading

cursor Bot left a comment •

edited

Loading