Stars: 5.4K
Forks: 508
Issues: 11
Watchers: 39
Last Commit: 1 week
About shimmy
⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.