[ADRENO] Revive and consolidate Adreno features #18867
Summary of Changes (Gemini Code Assist)
This pull request significantly enhances Adreno GPU support by updating the CLML SDK version, introducing a dedicated Docker-based CI environment, and extending the CLML backend with new capabilities for Large Language Model (LLM) inference. The changes streamline Adreno development and testing, improve CLML runtime efficiency, and expand the range of supported operations, particularly for quantized matrix multiplications.
Code Review
This pull request changes the CLML SDK version in CLML.cmake, adds a new Dockerfile.ci_adreno for Adreno GPU CI, and introduces ubuntu_install_androidsdk.sh to install the Android SDK. It also adds a compiler pass, OpenCLMLOffLoadForLLM, that partitions the graph so that dequant matmul patterns are offloaded to the CLML backend. The changes in clml_runtime.cc and clml_utils.cc set dynamic tensor dimensions and update tensor memory descriptors.
Review comments suggest addressing command injection vulnerabilities in ubuntu_install_androidsdk.sh by quoting the http_proxy variable, using mkdir -p for directory creation, and sanitizing the command line arguments appended to /etc/profile. The reviewer also recommends consolidating the apt-get update calls in Dockerfile.ci_adreno, removing a redundant COPY command, correcting a duplicate package listing, and updating the attributes format in the expected codegen for test_dequant_matmul and test_dequant_vec_matmul.
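The shell-related suggestions above could look roughly like the sketch below. It is not taken from ubuntu_install_androidsdk.sh: the paths, the `count_args` and `append_export` helpers, the proxy value, and the allow-list pattern are all assumptions made for illustration, and a temporary file stands in for /etc/profile so the snippet is safe to run.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical stand-ins: the real script's variables and paths are not shown in this thread.
work_dir="$(mktemp -d)"
sdk_dir="${work_dir}/android sdk/cmdline-tools"   # contains a space on purpose
profile_file="${work_dir}/profile"                # stands in for /etc/profile

# mkdir -p creates missing parent directories and does not fail
# if the directory already exists, so it is safe to call twice.
mkdir -p "${sdk_dir}"
mkdir -p "${sdk_dir}"

# Helper that reports how many arguments it received.
count_args() { echo "$#"; }

# An assumed proxy value containing spaces and shell metacharacters.
http_proxy='http://proxy.example:3128 --extra *'

# Quoted, the expansion stays a single argument: no word splitting, no globbing.
quoted_count="$(count_args "--proxy=${http_proxy}")"

# Append an export line only if the value matches a conservative allow-list.
append_export() {
  local name="$1" value="$2"
  if [[ "${value}" =~ ^[A-Za-z0-9_./:-]+$ ]]; then
    printf 'export %s=%q\n' "${name}" "${value}" >> "${profile_file}"
  else
    echo "refusing unsafe value for ${name}" >&2
    return 1
  fi
}

append_export ANDROID_HOME "/opt/android-sdk"
# A value carrying a command substitution is rejected rather than written.
append_export BAD_VALUE '$(rm -rf /)' || true
```

Rejecting unexpected values outright is a stricter choice than escaping them; either approach may suit the actual script, depending on which arguments it needs to accept.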
the consolidation looks good, maybe we can land those, then ci separately?
Enable opencl target for gpu tests.
Consolidates all Adreno tests under tests/python/relax/backend/adreno
Changes to CLML corresponding to recent changes on json codegen/runtime.
Docker specification for Adreno (ci_gpu + Android SDK).
0e3704a to 4a3d027
Enable opencl target for gpu tests.
Consolidates all Adreno tests under tests/python/relax/backend/adreno
Changes to CLML corresponding to recent changes on json codegen/runtime.
Docker specification for Adreno (ci_gpu + Android SDK, Gradle).