Accessible large language models via k-bit quantization for PyTorch.
LLM quantisation and optimisation library: 8-bit and 4-bit inference, LoRA training, memory-efficient CUDA kernels.