Home
Softono
NanoLLM

NanoLLM

Open source MIT Python
373
Stars
65
Forks
64
Issues
9
Watchers
1 year
Last Commit

About NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

Platforms

Web Self-hosted

Languages

Python