Fix OOMs happening in case of accelerate >= 0.16.0 by borzunov · Pull Request #310 · bigscience-workshop/petals

borzunov · 2023-04-25T11:31:21Z

- After #285 ("Speed up loading blocks using init with meta weights"), `load_pretrained_block()` uses `accelerate.utils.set_module_tensor_to_device()`.
- In accelerate>=0.16.0, that function stores the tensor in the dtype the model already uses instead of the dtype of the loaded weights (see "Honor model dtype in load_checkpoint", huggingface/accelerate#920).
- As a result, blocks and attention caches ended up in float32, which caused OOMs.
- This PR makes `load_pretrained_block()` respect `torch_dtype` (default: `"auto"`, which means reading `torch_dtype` from `config.json`).
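To illustrate why an unintended float32 cast matters, here is a minimal sketch in plain Python. It is not the code from this PR: `resolve_torch_dtype` and `weight_bytes` are hypothetical helpers, the config string and parameter count are made-up illustrative values, and the float32 fallback for a config without `torch_dtype` is an assumption of this sketch.

```python
import json

# Bytes per element for a few common floating-point dtypes.
BYTES_PER_ELEMENT = {"float32": 4, "float16": 2, "bfloat16": 2}


def resolve_torch_dtype(torch_dtype: str, config_json: str) -> str:
    """Hypothetical helper: "auto" defers to `torch_dtype` in config.json."""
    if torch_dtype != "auto":
        return torch_dtype
    config = json.loads(config_json)
    # Assumption of this sketch: use float32 if the config declares no dtype.
    return config.get("torch_dtype") or "float32"


def weight_bytes(num_params: int, dtype: str) -> int:
    """Memory needed to hold `num_params` weights in the given dtype."""
    return num_params * BYTES_PER_ELEMENT[dtype]


config_json = '{"torch_dtype": "bfloat16"}'  # illustrative, not a real model config
num_params = 1_000_000_000  # illustrative parameter count

intended = resolve_torch_dtype("auto", config_json)
print(intended, weight_bytes(num_params, intended))
print("float32", weight_bytes(num_params, "float32"))
```

Under these assumptions, holding the same weights in float32 takes twice the memory of the 16-bit dtype named in the config, which is consistent with the OOMs described above.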

mryab

Thanks for a quick investigation!

borzunov · 2023-04-25T11:32:54Z

    torch>=1.12
    bitsandbytes==0.38.0.post2
-    accelerate>=0.15.0,<1.0.0
+    accelerate>=0.16.0,<1.0.0


set_module_tensor_to_device's dtype arg didn't exist before 0.16.0
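The raised lower bound could also be checked at runtime. The following is only a sketch under stated assumptions: `parse_release` and `supports_dtype_arg` are hypothetical helpers that do not exist in petals or accelerate, and the parser ignores pre-release suffixes, so a `.dev0` build is treated like the final release.

```python
import re

# Per the comment above, the dtype argument is available from this version on.
MIN_ACCELERATE = (0, 16, 0)


def parse_release(version: str) -> tuple:
    """Hypothetical helper: '0.16.0.dev0' -> (0, 16, 0), '0.15' -> (0, 15, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"Unrecognized version string: {version!r}")
    parts = tuple(int(p) for p in match.group().split("."))
    # Pad short versions with zeros so tuples compare element by element.
    return parts + (0,) * (3 - len(parts))


def supports_dtype_arg(accelerate_version: str) -> bool:
    """True if this accelerate version is at or above the required minimum."""
    return parse_release(accelerate_version) >= MIN_ACCELERATE


for candidate in ("0.15.0", "0.16.0", "0.18.0"):
    print(candidate, supports_dtype_arg(candidate))
```

Pinning the minimum version in the requirements, as this diff does, avoids the need for such a runtime branch.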


Fix dtype in load_pretrained_block()

3c913a6

borzunov requested a review from mryab April 25, 2023 11:31

mryab approved these changes Apr 25, 2023



borzunov changed the title ~~Fix OOMs caused by dtype in load_pretrained_block()~~ Fix OOMs happened with accelerate >= 0.16.0 Apr 25, 2023

borzunov changed the title ~~Fix OOMs happened with accelerate >= 0.16.0~~ Fix OOMs happened in case of accelerate >= 0.16.0 Apr 25, 2023

borzunov changed the title ~~Fix OOMs happened in case of accelerate >= 0.16.0~~ Fix OOMs happening in case of accelerate >= 0.16.0 Apr 25, 2023

borzunov merged commit 454c193 into main Apr 25, 2023

borzunov deleted the from-pretrained-dtype branch April 25, 2023 13:20
