Rotary GPU: Exploring Local Execution for Large MoE Models Under Limited VRAM (arxiv.org)
9 points by dryarzeg 3 hours ago