[Metaschedule] Add test case for multi-anchor subgraph by masahi · Pull Request #10856 · apache/tvm

masahi · 2022-04-01T03:16:51Z

This adds a demonstration of extracting, scheduling, and e2e-compiling relay subgraphs with multiple anchor ops. Since task extraction is not associated with TE scheduling anymore, extracting a subgraph with multiple anchor TE compute just works.

The test case manually creates a simple fused mod with two relay.dense. But in the future, an effort like #9628 should make it easier to construct multi-anchor subgraphs.

The extracted TensorIR block corresponding to two TE dense compute looks like this:

@tvm.script.ir_module
class Module:
    @T.prim_func
    def main(placeholder: T.Buffer[(128, 128), "float32"], placeholder_1: T.Buffer[(128, 128), "float32"], placeholder_2: T.Buffer[(128, 128), "float32"], T_matmul_NT: T.Buffer[(128, 128), "float32"]) -> None:
        # function attr dict
        T.func_attr({"global_symbol": "main", "tir.noalias": True})
        # body
        # with T.block("root")
        T_matmul_NT_1 = T.alloc_buffer([128, 128], dtype="float32")
        for i0, i1, i2 in T.grid(128, 128, 128):
            with T.block("T_matmul_NT"):
                i, j, k = T.axis.remap("SSR", [i0, i1, i2])
                T.reads(placeholder[i, k], placeholder_1[j, k])
                T.writes(T_matmul_NT_1[i, j])
                T.block_attr({"layout_free_placeholders":[placeholder_1]})
                with T.init():
                    T_matmul_NT_1[i, j] = T.float32(0)
                T_matmul_NT_1[i, j] = T_matmul_NT_1[i, j] + placeholder[i, k] * placeholder_1[j, k]
        for i0, i1, i2 in T.grid(128, 128, 128):
            with T.block("T_matmul_NT_1"):
                i, j, k = T.axis.remap("SSR", [i0, i1, i2])
                T.reads(T_matmul_NT_1[i, k], placeholder_2[j, k])
                T.writes(T_matmul_NT[i, j])
                T.block_attr({"layout_free_placeholders":[placeholder_2]})
                with T.init():
                    T_matmul_NT[i, j] = T.float32(0)
                T_matmul_NT[i, j] = T_matmul_NT[i, j] + T_matmul_NT_1[i, k] * placeholder_2[j, k]

@junrushao1994 @csullivan @comaniac @mbs-octoml @mikepapadim

comaniac

LGTM

Co-authored-by: Junru Shao <junrushao1994@gmail.com>

masahi · 2022-04-01T07:48:04Z

+
+        tune_rec = TuningRecord(sch.trace, [0.0], workload, tvm.target.Target(target), [])
+
+        database.commit_tuning_record(tune_rec)


@junrushao1994 @zxybazh

I keep writing this database boilerplate for manual scheduling. I'm thinking about a clean API so that users don't have to go through the explicit task extraction -> database creation steps. Right now it looks like

relay_mod = tvm.IRModule.from_expr(...) target = "llvm" params = {"weight1": weight1_np, "weight2": weight2_np} def schedule_fn(task, sch): if "nn_dense_nn_dense" in task.task_name: schedule_dense_dense(sch) return True return False database = apply_manual_schedules(relay_mod, target, params, schedule_fn) with ApplyHistoryBest(database): ...

If this looks ok, I can PR it after this one.

There is a DummyDatabase class in python/tvm/meta_schedule/testing/utils.py where we don't need to create json files for intermediate results, and I was wondering if we could further reduce boilerplate by enhancing that class. What do you think?

Yeah using the Dummy classes can be very helpful in tuning and it's acutally a good idea to provide new use interface as you mentioned, a schedule function to work on on the relay level. Let me know when you got the PR ready : )

tmoreau89

LGTM!

junrushao

Niiiiiice work! Thanks @masahi!

@junrushao1994

As discussed in #10856 (comment), add a utility under `meta_schedule/testing/utils.py` to clean up the database boilerplate. Also using `DummyDatabase` instead of `JsonDatabase` for further clean up, as suggested by @junrushao1994 .

This adds a demonstration of extracting, scheduling, and e2e-compiling relay subgraphs with multiple anchor ops. Since task extraction is not associated with TE scheduling anymore, extracting a subgraph with multiple anchor TE compute just works. The test case manually creates a simple fused mod with two `relay.dense`. But in the future, an effort like apache#9628 should make it easier to construct multi-anchor subgraphs. The extracted TensorIR block corresponding to two TE `dense` compute looks like this: ``` @tvm.script.ir_module class Module: @T.prim_func def main(placeholder: T.Buffer[(128, 128), "float32"], placeholder_1: T.Buffer[(128, 128), "float32"], placeholder_2: T.Buffer[(128, 128), "float32"], T_matmul_NT: T.Buffer[(128, 128), "float32"]) -> None: # function attr dict T.func_attr({"global_symbol": "main", "tir.noalias": True}) # body # with T.block("root") T_matmul_NT_1 = T.alloc_buffer([128, 128], dtype="float32") for i0, i1, i2 in T.grid(128, 128, 128): with T.block("T_matmul_NT"): i, j, k = T.axis.remap("SSR", [i0, i1, i2]) T.reads(placeholder[i, k], placeholder_1[j, k]) T.writes(T_matmul_NT_1[i, j]) T.block_attr({"layout_free_placeholders":[placeholder_1]}) with T.init(): T_matmul_NT_1[i, j] = T.float32(0) T_matmul_NT_1[i, j] = T_matmul_NT_1[i, j] + placeholder[i, k] * placeholder_1[j, k] for i0, i1, i2 in T.grid(128, 128, 128): with T.block("T_matmul_NT_1"): i, j, k = T.axis.remap("SSR", [i0, i1, i2]) T.reads(T_matmul_NT_1[i, k], placeholder_2[j, k]) T.writes(T_matmul_NT[i, j]) T.block_attr({"layout_free_placeholders":[placeholder_2]}) with T.init(): T_matmul_NT[i, j] = T.float32(0) T_matmul_NT[i, j] = T_matmul_NT[i, j] + T_matmul_NT_1[i, k] * placeholder_2[j, k] ```

@junrushao1994

…#10876) As discussed in apache#10856 (comment), add a utility under `meta_schedule/testing/utils.py` to clean up the database boilerplate. Also using `DummyDatabase` instead of `JsonDatabase` for further clean up, as suggested by @junrushao1994 .

This adds a demonstration of extracting, scheduling, and e2e-compiling relay subgraphs with multiple anchor ops. Since task extraction is not associated with TE scheduling anymore, extracting a subgraph with multiple anchor TE compute just works. The test case manually creates a simple fused mod with two `relay.dense`. But in the future, an effort like apache#9628 should make it easier to construct multi-anchor subgraphs. The extracted TensorIR block corresponding to two TE `dense` compute looks like this: ``` @tvm.script.ir_module class Module: @T.prim_func def main(placeholder: T.Buffer[(128, 128), "float32"], placeholder_1: T.Buffer[(128, 128), "float32"], placeholder_2: T.Buffer[(128, 128), "float32"], T_matmul_NT: T.Buffer[(128, 128), "float32"]) -> None: # function attr dict T.func_attr({"global_symbol": "main", "tir.noalias": True}) # body # with T.block("root") T_matmul_NT_1 = T.alloc_buffer([128, 128], dtype="float32") for i0, i1, i2 in T.grid(128, 128, 128): with T.block("T_matmul_NT"): i, j, k = T.axis.remap("SSR", [i0, i1, i2]) T.reads(placeholder[i, k], placeholder_1[j, k]) T.writes(T_matmul_NT_1[i, j]) T.block_attr({"layout_free_placeholders":[placeholder_1]}) with T.init(): T_matmul_NT_1[i, j] = T.float32(0) T_matmul_NT_1[i, j] = T_matmul_NT_1[i, j] + placeholder[i, k] * placeholder_1[j, k] for i0, i1, i2 in T.grid(128, 128, 128): with T.block("T_matmul_NT_1"): i, j, k = T.axis.remap("SSR", [i0, i1, i2]) T.reads(T_matmul_NT_1[i, k], placeholder_2[j, k]) T.writes(T_matmul_NT[i, j]) T.block_attr({"layout_free_placeholders":[placeholder_2]}) with T.init(): T_matmul_NT[i, j] = T.float32(0) T_matmul_NT[i, j] = T_matmul_NT[i, j] + T_matmul_NT_1[i, k] * placeholder_2[j, k] ```

@junrushao1994

…#10876) As discussed in apache#10856 (comment), add a utility under `meta_schedule/testing/utils.py` to clean up the database boilerplate. Also using `DummyDatabase` instead of `JsonDatabase` for further clean up, as suggested by @junrushao1994 .

masahi added 6 commits April 1, 2022 10:04

add test mod

dd3b3de

task extraction works

cec5da1

trying relay.build

9b4ea12

test runs but result not correct

5f8dffd

test worked

c27a481

update te_compiler_cache

b3a3a7c

masahi force-pushed the e2e-multi-anchor branch from d895854 to b3a3a7c Compare April 1, 2022 03:31

use temp dir for database json

cbea61e

comaniac approved these changes Apr 1, 2022

View reviewed changes

Comment thread tests/python/unittest/test_meta_schedule_multi_anchor.py Outdated

comment out schedule dump

a8ac4b0

junrushao reviewed Apr 1, 2022

View reviewed changes

Comment thread src/relay/backend/te_compiler_cache.cc Outdated

Update src/relay/backend/te_compiler_cache.cc

b9b5e7c

Co-authored-by: Junru Shao <junrushao1994@gmail.com>

masahi commented Apr 1, 2022

View reviewed changes

tmoreau89 approved these changes Apr 1, 2022

View reviewed changes

junrushao approved these changes Apr 1, 2022

View reviewed changes

junrushao merged commit 93b255c into apache:main Apr 1, 2022

masahi mentioned this pull request Apr 1, 2022

[Metaschedule] Add utility API to ease using manual schedules #10876

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Metaschedule] Add test case for multi-anchor subgraph#10856

[Metaschedule] Add test case for multi-anchor subgraph#10856
junrushao merged 9 commits into
apache:mainfrom
masahi:e2e-multi-anchor

masahi commented Apr 1, 2022 •

edited

Loading

Uh oh!

comaniac left a comment

Uh oh!

Uh oh!

Uh oh!

masahi Apr 1, 2022

Uh oh!

junrushao Apr 1, 2022

Uh oh!

zxybazh Apr 1, 2022

Uh oh!

tmoreau89 left a comment

Uh oh!

junrushao left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants


		tune_rec = TuningRecord(sch.trace, [0.0], workload, tvm.target.Target(target), [])

		database.commit_tuning_record(tune_rec)

Uh oh!

Conversation

masahi commented Apr 1, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

comaniac left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

masahi Apr 1, 2022

Choose a reason for hiding this comment

Uh oh!

junrushao Apr 1, 2022

Choose a reason for hiding this comment

Uh oh!

zxybazh Apr 1, 2022

Choose a reason for hiding this comment

Uh oh!

tmoreau89 left a comment

Choose a reason for hiding this comment

Uh oh!

junrushao left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

masahi commented Apr 1, 2022 •

edited

Loading