Right-sizes LLM models to your system's RAM, CPU, and GPU github.com 19 points by bilsbie 4 hours ago