Skip to content

Commit 6668c8b

Browse files
Revert exllamav1 (#692)
* Revert "remove exllama in gptqmodel_ext folder (#521)" This reverts commit 7faaa9f. * Revert "remove deprecated exllama v1 code (#473)" This reverts commit eecdeb3 * Revert "remove exllama v1 (#453)" This reverts commit 5adae36 * fix quant linear * revert default value * revert adding parameter * rearrange import * update model path * fix import * remove use fp16 * remove qbits * import Enum * remove optimum * rename to exllamav1 * rename backend to exllamav1 * rename test name * update model id * replace test with lm-eval * use a correct model
1 parent d55769a commit 6668c8b

33 files changed

+3135
-2551
lines changed

gptqmodel/__init__.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,5 @@
11
from .models import GPTQModel, get_best_device
22
from .quantization import BaseQuantizeConfig, QuantizeConfig
33
from .utils import BACKEND, get_backend
4+
from .utils.exllama import exllama_set_max_input_length
45
from .version import __version__

gptqmodel/integration/__init__.py

Lines changed: 0 additions & 1 deletion
This file was deleted.

gptqmodel/integration/optimum/__init__.py

Lines changed: 0 additions & 2 deletions
This file was deleted.

gptqmodel/integration/optimum/constants.py

Lines changed: 0 additions & 23 deletions
This file was deleted.

gptqmodel/integration/optimum/data.py

Lines changed: 0 additions & 257 deletions
This file was deleted.

gptqmodel/integration/optimum/hf_quantizer_gptq.py

Lines changed: 0 additions & 99 deletions
This file was deleted.

0 commit comments

Comments
 (0)