Writing
How to run large language models on consumer hardware using quantization, GGUF, and the right tooling choices.