Quantization Fundamentals (Coursera)
Categories
Effort
Languages
Generative AI models, like large language models, often exceed the capabilities of consumer-grade hardware and are expensive to run. Compressing models through methods such as quantization makes them more efficient, faster, and accessible. This allows them to run on a wide variety of devices, including smartphones, personal computers, and [...]
Jul 21st 2025