refactor unet_1d tests by akshan-main · Pull Request #13898 · huggingface/diffusers

akshan-main · 2026-06-09T16:28:53Z

What does this PR do?

Part of the ongoing modeling-test migration (following #13369 and #13153). Migrates the UNet1DModel test suites (the standard UNet and the RL value-function variant) to the mixin-based structure (Config + ModelTesterMixin).

UNet1DModel has no attention processors and doesn't support gradient checkpointing, so only ModelTesterMixin applies. The hub/pretrained integration tests are kept.
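For readers unfamiliar with the pattern, here is a minimal sketch of what "Config + mixin" means in general. It is not the diffusers implementation: `ToyTesterConfig`, `ToyTesterMixin`, `build_model`, and the shape-only toy model are hypothetical stand-ins, and only the names `get_dummy_inputs`, `output_shape`, and `test_output` come from this thread.

```python
class ToyTesterConfig:
    """Hypothetical config class: describes the model and its dummy inputs."""

    batch_size = 4

    @property
    def output_shape(self):
        # Per-sample shape, without the batch dimension.
        return (1,)

    def get_dummy_inputs(self):
        # A shape tuple stands in for a real tensor in this toy example.
        return {"sample_shape": (self.batch_size, 14, 16)}

    def build_model(self):
        # Toy "model": maps an input shape to an output shape of (batch, 1).
        return lambda sample_shape: (sample_shape[0], 1)


class ToyTesterMixin:
    """Hypothetical mixin: generic tests written only against the config interface."""

    def test_output(self):
        model = self.build_model()
        out_shape = model(**self.get_dummy_inputs())
        # Drop the batch dimension before comparing with the declared shape.
        assert out_shape[1:] == self.output_shape


class TestToyModel(ToyTesterConfig, ToyTesterMixin):
    """Concrete test class: config plus mixin, no test bodies of its own."""


TestToyModel().test_output()
```

The point of the split is that the shared tests live in one place, while each model only supplies its configuration.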

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? (Discussed on Slack with @sayakpaul @DN6)
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@sayakpaul @DN6

sayakpaul · 2026-06-10T03:03:16Z

@askserge could you please review the PR?

(testing)

sergereview

🤗 Serge says:

The migration to the mixin-based test structure is mostly correct, but there are issues with the output_shape property values.

Correctness

UNetRLModelTesterConfig.output_shape is set to (4, 14, 1) but the RL value-function model actually outputs (batch_size, 1) — i.e. (4, 1) for batch_size=4. The old test explicitly checked torch.Size((inputs_dict["sample"].shape[0], 1)). This is wrong regardless of whether the convention is per-sample or full-batch shape. It doesn't cause failures today because the base test_output has an operator-precedence bug (== … or self.output_shape is always truthy), and the custom test_output override doesn't use self.output_shape. But it will break when the base class is fixed, and it's misleading to readers.
UNet1DModelTesterConfig.output_shape is (4, 14, 16) (includes batch dim), while the repo convention in other migrated tests (e.g. AutoencoderKLTesterConfig) is to use per-sample shape without the batch dimension — (3, 32, 32) not (4, 3, 32, 32). Should be (14, 16) to match the convention.
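The precedence problem described above can be reproduced in isolation. The sketch below is not the actual base-class code. It assumes an assertion of the general form `x == a or b`, where `b` is a non-empty tuple, and uses the hypothetical names `buggy_check` and `fixed_check`. The input shape is illustrative.

```python
def buggy_check(actual_shape, input_shape, output_shape):
    # Parsed as (actual_shape == input_shape) or output_shape.
    # A non-empty tuple is truthy, so the expression is truthy
    # whenever output_shape is non-empty, whatever actual_shape is.
    return bool(actual_shape == input_shape or output_shape)


def fixed_check(actual_shape, input_shape, output_shape):
    # Compare actual_shape against each candidate explicitly.
    return actual_shape == input_shape or actual_shape == output_shape


actual = (4, 1)        # RL value-function output for batch_size=4
inputs = (4, 14, 16)   # illustrative input shape (assumed)
declared = (4, 14, 1)  # the incorrect output_shape quoted in the review

print(buggy_check(actual, inputs, declared))  # True: the wrong value goes unnoticed
print(fixed_check(actual, inputs, declared))  # False: the mismatch is caught
```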

Minor

In TestUNetRLModel.test_output (line 173), self.get_dummy_inputs() is called a second time just to read shape[0] (which is the hardcoded batch_size = 4). This allocates a full set of tensors unnecessarily.

model: claude-opus-4-6 · 28 LLM turns · 30 tool calls · 143.1s · 627103 in / 6681 out tokens

sergereview · 2026-06-10T03:06:00Z

    @property
-    def output_shape(self):
+    def output_shape(self) -> tuple:
        return (4, 14, 16)


output_shape includes the batch dimension (4, 14, 16), but the repo convention (see e.g. AutoencoderKLTesterConfig) is to use the per-sample shape without the batch dim. Should be (14, 16) to stay consistent.

Suggested change

-        return (4, 14, 16)
+        return (14, 16)

sergereview · 2026-06-10T03:06:00Z

    @property
-    def output_shape(self):
+    def output_shape(self) -> tuple:
        return (4, 14, 1)


output_shape is (4, 14, 1) but the RL value-function model outputs shape (batch_size, 1). The old test explicitly asserted torch.Size((batch_size, 1)). This value is wrong regardless of convention (per-sample it should be (1,), full-batch it should be (4, 1)).

The base test_output currently has an operator-precedence bug that makes the assertion vacuously true, and the custom test_output override doesn't reference self.output_shape, so this doesn't fail today — but it will break when the base class is fixed.

Suggested change

-        return (4, 14, 1)
+        return (1,)
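To make the two conventions concrete, the sketch below compares a full output shape against a per-sample `output_shape` by dropping the leading batch dimension. The helper name `matches_per_sample_shape` is hypothetical and not part of the diffusers test suite. The shapes are the ones quoted in this review.

```python
def matches_per_sample_shape(full_output_shape, per_sample_shape):
    # Drop the leading batch dimension, then compare with the declared shape.
    return tuple(full_output_shape[1:]) == tuple(per_sample_shape)


batch_size = 4
rl_output = (batch_size, 1)         # RL value-function model output
unet_output = (batch_size, 14, 16)  # standard UNet1DModel output

# Per-sample convention: (1,) and (14, 16) match.
assert matches_per_sample_shape(rl_output, (1,))
assert matches_per_sample_shape(unet_output, (14, 16))

# The originally declared values do not match under this convention.
assert not matches_per_sample_shape(rl_output, (4, 14, 1))
assert not matches_per_sample_shape(unet_output, (4, 14, 16))
```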

sergereview · 2026-06-10T03:06:00Z

+        with torch.no_grad():
+            output = model(**self.get_dummy_inputs()).sample
+
+        assert output.shape == (self.get_dummy_inputs()["sample"].shape[0], 1), "Input and output shapes do not match"


Nit: self.get_dummy_inputs() is called a second time just to read shape[0], which is always the hardcoded batch_size = 4. This unnecessarily allocates a full set of tensors. Consider storing the inputs in a local variable:

Suggested change

-        assert output.shape == (self.get_dummy_inputs()["sample"].shape[0], 1), "Input and output shapes do not match"
+        inputs = self.get_dummy_inputs()
+        assert output.shape == (inputs["sample"].shape[0], 1), "Input and output shapes do not match"

(and pass inputs to the model call above as well)
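The effect of reusing the inputs can be shown with a stand-in. In the sketch below, `DummyConfig`, its call counter, and `fake_model` are hypothetical and only mimic a `get_dummy_inputs()` that builds fresh inputs on every call. Nested lists stand in for tensors.

```python
class DummyConfig:
    """Hypothetical stand-in for a tester config; counts input constructions."""

    batch_size = 4

    def __init__(self):
        self.calls = 0

    def get_dummy_inputs(self):
        # Each call builds a fresh set of inputs.
        self.calls += 1
        return {"sample": [[0.0] * 16 for _ in range(self.batch_size)]}


def fake_model(sample):
    # Stand-in for the value-function model: one value per batch element.
    return [[0.0] for _ in sample]


config = DummyConfig()
inputs = config.get_dummy_inputs()            # build the inputs once
output = fake_model(**inputs)                 # reuse them for the forward pass
assert len(output) == len(inputs["sample"])   # and again for the shape check
print(config.calls)  # 1
```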

* refactor unet_1d tests
* use per-sample output_shape for unet_1d tests
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update
* update
* update
* update
* [CI] Refactor SD3 Transformer Test (#13340)
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* refactor unet tests (3d_condition, motion, controlnetxs) (#13897)
* refactor unet_3d_condition tests
* refactor unet_motion tests
* refactor unet_controlnetxs tests
* refactor unet_1d tests (#13898)
* refactor unet_1d tests
* use per-sample output_shape for unet_1d tests
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* refactor unet_2d tests (#13901)
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* [chore] log quant config to the user_agent (#13850)
log quant config to the user_agent
* Integrate AutoRound into Diffusers (#13552)
* support auto_round
Signed-off-by: Xin He <xin3.he@intel.com>
* add document and unit tests
Signed-off-by: Xin He <xin3.he@intel.com>
* fix CI
Signed-off-by: Xin He <xin3.he@intel.com>
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update document and overwrite the default quantization_config with specified backend.
Signed-off-by: Xin He <xin3.he@intel.com>
* add UT and fix bug
Signed-off-by: Xin He <xin3.he@intel.com>
* update per comments
Signed-off-by: Xin He <xin3.he@intel.com>
* update per comments
Signed-off-by: Xin He <xin3.he@intel.com>
* fix compile error in doc
Signed-off-by: Xin He <xin3.he@intel.com>
* Apply style fixes
* small nits
* Add auto_round dependency to the versions table
Signed-off-by: Xin He <xin3.he@intel.com>
* fix make deps_table_check_updated
Signed-off-by: Xin He <xin3.he@intel.com>
* fix CI
Signed-off-by: Xin He <xin3.he@intel.com>
---------
Signed-off-by: Xin He <xin3.he@intel.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* [tests] refactor UNet model tests to align with the new pattern (#13153)
* refactor unet2d condition model tests.
* fix tests
* up
* fix
* Revert "fix"
This reverts commit 46d44b7.
* up
* recompile limit
* [tests] refactor test_models_unet_1d.py to use modular testing mixins
Refactor UNet1D model tests to follow the modern testing pattern using BaseModelTesterConfig and focused mixin classes (ModelTesterMixin, MemoryTesterMixin, TrainingTesterMixin, LoraTesterMixin). Both UNet1D standard and RL variants now have separate config classes and dedicated test classes organized by concern (core, memory, training, LoRA, hub loading).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* [tests] refactor test_models_unet_2d.py to use modular testing mixins
Refactor UNet2D model tests (standard, LDM, NCSN++) to follow the modern testing pattern. Each variant gets its own config class and dedicated test classes organized by concern (core, memory, training, LoRA, hub loading).
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* [tests] refactor test_models_unet_3d_condition.py to use modular testing mixins
Refactor UNet3DConditionModel tests to follow the modern testing pattern with separate classes for core, attention, memory, training, and LoRA.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* [tests] refactor test_models_unet_controlnetxs.py to use modular testing mixins
Refactor UNetControlNetXSModel tests to follow the modern testing pattern with separate classes for core, memory, training, and LoRA. Specialized tests (from_unet, freeze_unet, forward_no_control, time_embedding_mixing) remain in the core test class.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* [tests] refactor test_models_unet_spatiotemporal.py to use modular testing mixins
Refactored the spatiotemporal UNet test file to follow the modern modular testing pattern with BaseModelTesterConfig and focused test classes:
- UNetSpatioTemporalTesterConfig: Base configuration with model setup
- TestUNetSpatioTemporal: Core model tests (ModelTesterMixin, UNetTesterMixin)
- TestUNetSpatioTemporalAttention: Attention-related tests (AttentionTesterMixin)
- TestUNetSpatioTemporalMemory: Memory/offloading tests (MemoryTesterMixin)
- TestUNetSpatioTemporalTraining: Training tests (TrainingTesterMixin)
- TestUNetSpatioTemporalLoRA: LoRA adapter tests (LoraTesterMixin)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* remove test suites that are passed.
* fix consistencydecodervae tests
* Revert "fix consistencydecodervae tests"
This reverts commit 41b036b.
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* [tests] fix vidtok tests (#13894)
* fix vidtok tests
* style
* Update tests/models/autoencoders/test_models_autoencoder_vidtok.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
* Apply style fixes
---------
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
* clean up
---------
Signed-off-by: Xin He <xin3.he@intel.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Akshan Krithick <97239696+akshan-main@users.noreply.github.com>
Co-authored-by: Xin He <xin3.he@intel.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>

refactor unet_1d tests

6aff79a

github-actions Bot added the tests and size/L (PR with diff > 200 LOC) labels Jun 9, 2026

sayakpaul approved these changes Jun 10, 2026


sergereview Bot requested changes Jun 10, 2026


akshan-main and others added 2 commits June 9, 2026 20:16

use per-sample output_shape for unet_1d tests

39f8abb

Merge branch 'main' into tests-refactor-unet-1d

77ef64a

sayakpaul merged commit 0d56193 into huggingface:main Jun 10, 2026
12 of 13 checks passed

DN6 pushed a commit that referenced this pull request Jun 10, 2026

refactor unet_1d tests (#13898)

1ead334

* refactor unet_1d tests
* use per-sample output_shape for unet_1d tests
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

sayakpaul merged 3 commits into huggingface:main from akshan-main:tests-refactor-unet-1d
