-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
class BitPrecision(Enum):
none = 'none'
int8 = 'int8'
int4 = 'int4'
float16 = 'float16'
float32 = 'float32'
class Method(Enum):
None_ = 'None'
dynamic = 'dynamic'
static = 'static'
class Quantization(BaseModel):
bit_precision: BitPrecision = Field(..., description='Quantization bit precision')
method: Method = Field(..., description='Quantization method')
We should add type field to Quantization class because the model results depends on the quant type (for example qptq, awq and so on).
Metadata
Metadata
Assignees
Labels
No labels