Async Control Flow Refactor by MichealReed · Pull Request #1 · MichealReed/gpu.cpp

MichealReed · 2025-02-19T22:38:21Z

• Refactored key GPU operations (createContext, toCPU, dispatchKernel, and createKernel) to return std::future instead of blocking with busy-wait loops.
• Replaced inline lambda callbacks with free‑standing functions that use promise/future for asynchronous completion.
• Updated createKernel (and its templated overload) to report shader module compilation asynchronously (instead of synchronously busy‑waiting), and dispatchKernel now returns a future that is fulfilled when the kernel dispatch is complete.
• Replaced wait with WaitForFuture(ctx.instance, future) as a template to return future objects after webgpu processing is complete.

Example usage:

  // Create kernel asynchronously
  std::future<Kernel> kernelFuture = createKernel(ctx, code, bindings, totalWorkgroups);
  Kernel kernel = waitForFuture(ctx.instance, kernelFuture);

  // Dispatch the kernel and wait for completion asynchronously
  std::future<void> dispatchFuture = dispatchKernel(ctx, kernel);
  waitForFuture(ctx.instance, dispatchFuture);

Why This Is Better:
• No blocking loops or synchronous waits – all async work is chained via futures, letting the event loop run naturally.
• Reduced risk of reentrancy issues and resource errors due to improper lifetime management.
• Cleaner separation between asynchronous callbacks and higher‐level logic, making the code easier to maintain and integrate (especially in WebAssembly/Emscripten contexts).

MichealReed · 2025-02-20T20:18:04Z

Only breaking change with latest commit is dispatchKernel and toCPU no longer take a promise, toCPUAsync and dispatchKernelAsync should be used instead to get futures. Manually waiting should no longer be necessary with the sync call though, even in a emscripten context.

…f on native

… test/test_gpu.cpp

MichealReed · 2025-02-22T04:19:36Z

Added an optional param with this refactor to set a offset for the toCPU buffer read. This is tested in the new test/test_gpu.cpp. I think the toCPU with copydata is now redundant. Will leave it alone if you prefer to keep it.

refactors async

9ac780b

MichealReed changed the title ~~refactors async~~ Async Control Flow Refactor Feb 19, 2025

MichealReed added 2 commits February 19, 2025 18:06

use async context waitForContext()

14e7ab5

adds sync wrappers

9a08f8a

MichealReed added 4 commits February 20, 2025 16:34

refactors the byIdx context function and sets USE_DAWN_API compile de…

95e587d

…f on native

tests toCPU, adds offset, adds gpuflow doc, default cmakelists builds…

70d9802

… test/test_gpu.cpp

remove path

16feb9e

format

e61e809

MichealReed added 10 commits February 21, 2025 22:22

doc formatting

ad8698d

doc nits

025af2a

set project root on root cmakelists

3776dcd

fix linux issue with callback info

d58e191

should not release readback buffer

498ba74

clean up callback syntax

2db9be1

add stress test

752a53a

linux has a segfault if wait for events after.

5f82ff4

EOF newline

28dabf2

added sleeptime optional arg

39c816c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Async Control Flow Refactor#1

Async Control Flow Refactor#1
MichealReed wants to merge 17 commits intoorigin/devfrom
improve-async

MichealReed commented Feb 19, 2025

Uh oh!

MichealReed commented Feb 20, 2025 •

edited

Loading

Uh oh!

MichealReed commented Feb 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

MichealReed commented Feb 19, 2025

Uh oh!

MichealReed commented Feb 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MichealReed commented Feb 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

MichealReed commented Feb 20, 2025 •

edited

Loading