Home
Softono
s

speaches-ai

Professional software vendor delivering innovative solutions on the Softono platform. Specialized in both open-source and proprietary software development.

Total Products
1

Software by speaches-ai

speaches
Open Source

speaches

Speaches is an OpenAI API-compatible server for streaming transcription, translation, and speech generation. Speech-to-Text is powered by faster-whisper, while Text-to-Speech uses piper and Kokoro models. The project aims to be the Ollama equivalent for TTS and STT models. Key features include full compatibility with the OpenAI API, allowing existing tools and SDKs to work seamlessly. It supports audio generation through a chat completions endpoint for tasks like spoken audio summaries and sentiment analysis. Streaming transcription delivers results via server-sent events as audio is processed, eliminating the need to wait for completion. Models are dynamically loaded on demand and unloaded after periods of inactivity. Speaches provides Text-to-Speech via Kokoro, ranked first in the TTS Arena, and piper. It supports both GPU and CPU hardware acceleration and can be deployed using Docker Compose or standard Docker images. A Realtime API is available for interactive use cases. The server is highly configurable

AI & Machine Learning LLM Tools & Chat UIs
3.4K Github Stars