Commit 114647f
Handle single-device models without
* When `hf_device_map` does not exist, infer the `device_map`
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* cleanup
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* cleanup
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* Update optimum/gptq/quantizer.py
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
* cleanup
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* cleanup
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
* Fix device_map value to use param.device
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
---------
Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>hf_device_map after Transformers optimization (#2401)1 parent 6cfe1ee commit 114647f
1 file changed
Lines changed: 7 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
691 | 691 | | |
692 | 692 | | |
693 | 693 | | |
694 | | - | |
| 694 | + | |
| 695 | + | |
| 696 | + | |
| 697 | + | |
| 698 | + | |
| 699 | + | |
| 700 | + | |
695 | 701 | | |
696 | 702 | | |
697 | 703 | | |
| |||
0 commit comments