[Controller] POC: overall speedup #573
base: master
Conversation
alxbilger left a comment
Is the cache system also relevant for other trampoline classes?
To my meager knowledge, I would say yes. But in this case it is especially useful because there were (and are) many lookups needed to implicitly call all the *event() methods at every step.
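For context, here is a rough reconstruction of the pre-cache hot path, based on the description in this PR rather than the actual old code (the names, signatures and event-to-Python conversion shown are assumptions): every event repeated the cast and the attribute lookups across the C++/Python boundary, for every Controller, on every simulation step.

```cpp
// Rough reconstruction (sketch, not the exact code): each incoming event pays
// for a py::cast and up to two Python attribute lookups.
namespace py = pybind11;

void Controller_Trampoline::handleEvent(sofa::core::objectmodel::Event* event)
{
    py::gil_scoped_acquire gil;

    py::object self = py::cast(this);                        // repeated on every event
    const std::string methodName = std::string("on") + event->getClassName();

    if (py::hasattr(self, methodName.c_str()))               // attribute lookup #1
        self.attr(methodName.c_str())(py::cast(event));      // attribute lookup #2 + call
    else if (py::hasattr(self, "onEvent"))                   // generic fallback
        self.attr("onEvent")(py::cast(event));               // event conversion simplified here
}
```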
Summary of Modifications
The changes in this branch (speedup_controller) optimize the Controller_Trampoline class in the Python bindings by adding a caching mechanism for Python method lookups:
Key Changes:
1. New caching infrastructure (in Binding_Controller.h):
- Added member variables to cache:
- m_pySelf - cached Python self reference (avoids repeated py::cast(this))
- m_methodCache - unordered_map storing Python method objects by name
- m_onEventMethod - cached fallback "onEvent" method
- m_hasOnEvent / m_cacheInitialized - state flags
2. New methods (in Binding_Controller.cpp):
- initializePythonCache() - initializes the cache on first use
- getCachedMethod() - retrieves methods from cache (or looks them up once and caches)
- callCachedMethod() - calls a cached Python method with an event
- A constructor and destructor to properly manage the cached Python objects while holding the GIL
3. Optimized handleEvent():
- Previously: every event caused py::cast(this), py::hasattr(), and attr() lookups
- Now: uses cached method references, avoiding repeated Python attribute lookups (see the sketch after the Purpose section below)
4. Optimized getClassName():
- Uses the cached m_pySelf when available instead of casting each time
Purpose:
This is a performance optimization that reduces overhead when handling frequent events (like AnimateBeginEvent and AnimateEndEvent), which are dispatched many times per simulation step. The caching eliminates repeated Python/C++ boundary crossings for method lookups.
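To make the summary concrete, here is a condensed sketch of how these pieces might fit together. It is reconstructed from the bullet points above, not copied from the diff: the member and method names follow the PR description, but the exact signatures, base-class details and the event-to-Python conversion are assumptions.

```cpp
#include <pybind11/pybind11.h>
#include <string>
#include <unordered_map>

namespace py = pybind11;

// Sketch only: the real trampoline derives from the bound Controller class and
// contains more than what is shown here.
class Controller_Trampoline : public Controller
{
public:
    void handleEvent(sofa::core::objectmodel::Event* event) override
    {
        py::gil_scoped_acquire gil;

        if (!m_cacheInitialized)
            initializePythonCache();

        // e.g. "onAnimateBeginEvent" for an AnimateBeginEvent
        const std::string methodName = std::string("on") + event->getClassName();

        if (py::object method = getCachedMethod(methodName))
            callCachedMethod(method, event);
        else if (m_hasOnEvent)
            callCachedMethod(m_onEventMethod, event);   // generic fallback
    }

private:
    void initializePythonCache()
    {
        m_pySelf = py::cast(this);                      // done once, not per event
        m_hasOnEvent = py::hasattr(m_pySelf, "onEvent");
        if (m_hasOnEvent)
            m_onEventMethod = m_pySelf.attr("onEvent");
        m_cacheInitialized = true;
    }

    py::object getCachedMethod(const std::string& name)
    {
        const auto it = m_methodCache.find(name);
        if (it != m_methodCache.end())
            return it->second;                          // cache hit: no Python lookup

        py::object method;                              // stays null if the method is absent
        if (py::hasattr(m_pySelf, name.c_str()))
            method = m_pySelf.attr(name.c_str());
        m_methodCache[name] = method;                   // cache hits and misses alike
        return method;
    }

    void callCachedMethod(const py::object& method, sofa::core::objectmodel::Event* event)
    {
        method(py::cast(event));                        // event conversion simplified here
    }

    py::object m_pySelf;                                           // cached py::cast(this)
    std::unordered_map<std::string, py::object> m_methodCache;     // per-name method cache
    py::object m_onEventMethod;                                    // cached "onEvent" fallback
    bool m_hasOnEvent = false;
    bool m_cacheInitialized = false;
};
```

A side effect worth noting: the trampoline now holds strong references to Python objects (m_pySelf and the cached methods), which is why the PR also adds a constructor and destructor that manage them under the GIL, and it is what makes the dynamic-rebinding question raised below relevant.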
Nice cache mechanism. Now I wonder what would happen if someone dynamically changed a method after it has already been cached:
MyController.onAnimateBeginEvent()  # Cache is now active for this method
MyController.onAnimateBeginEvent = randomMethod  # Cache is now invalid
MyController.onAnimateBeginEvent()  # now, what gets called?
Maybe we could add a setattr overload invalidating the cache when it is called on a method that is already cached?
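For illustration, one possible way to implement that suggestion on the pybind11 side (a sketch only, not part of this PR): define a __setattr__ on the Controller binding that drops the corresponding cache entry before delegating to the default attribute assignment. Here `cls` stands for the existing py::class_ object for Controller, and `invalidateCachedMethod()` is a hypothetical helper that would erase the entry from m_methodCache.

```cpp
// Hypothetical sketch (not in this PR): invalidate the method cache whenever an
// attribute is rebound on a Controller instance.
cls.def("__setattr__",
    [](py::object self, const std::string& name, py::object value)
    {
        // If the underlying C++ object is the caching trampoline, drop the
        // (possibly stale) cache entry for this attribute name.
        if (auto* trampoline = dynamic_cast<Controller_Trampoline*>(self.cast<Controller*>()))
            trampoline->invalidateCachedMethod(name);   // hypothetical: m_methodCache.erase(name)

        // Delegate to the default attribute assignment.
        py::module::import("builtins").attr("object").attr("__setattr__")(self, name, value);
    });
```

Note that this would only catch assignments made on instances; rebinding the method on the class itself would need a similar hook on the metaclass, or an explicit cache-invalidation API on the controller.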
I did detect this issue some time ago: having a controller in a scene was slowing down the simulation so much, especially on macOS, even if the controller does nothing. And it gets slower and slower the more Controllers there are.
DISCLAIMER: this was mostly the work of Claude, which detected/suggested the issue (the lookups were slowing down the simulation) and generated the solution.
I just did the benches/tests to make sure it works, but as with everything involving sofapython3 and/or pybind11, I cannot prove that everything is okay/well done. So a deep review from experts would be appreciated 🫠
In any case, the modifications lead to a dramatic speed-up
(refer to the scene included with this PR, which creates an empty scene with a certain number of Controllers doing nothing):
Ubuntu 22.04 (gcc12, i7 13700k)
--> with 10 controllers, 150x faster... 😮
Windows (MSVC2026, i7 11800h)
--> with 10 controllers, 200x faster... 😲
macOS (xcode26, M3 max)
--> with 10 controllers, 837x faster... 🤪