4️⃣ Quantization
Quantization: Basic Theory





PyTorch: Data Types and Sizes (Hands-on)
Integers
Floating Points
Downcasting
ML Models with Different Data Types
Model Casting: float16
Model Casting: bfloat16
LLM Models in Different Data Types
Model Performance: float32 vs bfloat16
Default Data Type
Main LLM Quantization Methods:
Types of Fine-tune Quantization
QLoRA Method

Quantization Theory (Hands-on)
Without Quantization
Quantize the model (8-bit precision)
Freeze the model
Try running inference on the quantized model