GGML direct conv2d support

Since GGML now has direct conv2d support for [CPU](https://github.com/ggml-org/llama.cpp/pull/14388) and [Vulkan](https://github.com/ggml-org/llama.cpp/pull/14316) we might want to try it out here and see if it helps. Compared to im2col this uses less memory and *should* run faster on GPUs that don't have matrix cores.

As a quick test I naively switched all instances of `ggml_conv_2d` in the code with `ggml_conv_2d_direct` and replaced the ggml directory with the one from llama.cpp. Right now it generates images fine on CPU (it's a bit slower than im2col) but it fails with a segfault on Vulkan.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GGML direct conv2d support #739

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

GGML direct conv2d support #739

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions