A fast, local neural text to speech system
A deep learning toolkit for Text-to-Speech, battle-tested in research
State-of-the-art TTS model under 25MB
Speech to Text to Speech, sends text as OSC messages
Multilingual Text-to-Speech (TTS)
MARS5 is a fully open-source, hyper-realistic text-to-speech (TTS).
Comprehensive Gradio WebUI for audio processing
LLM Frontend for Power Users
A gradio web UI for running Large Language Models like LLaMA
1 min voice data can also be used to train a good TTS model
SoTA open-source TTS
MyTTS is a free text-to-speech (TTS) software, 100+ languages
Browser extension and cross-platform desktop app based on ChatGPT API
A Wordpress plugin for read articles with text-to-speech (tts)
Toolkit for conversational AI
Examples and guides for using the Gemini API
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
The official .NET library for the OpenAI API
Easy-to-use Speech Toolkit including Self-Supervised Learning model
High-quality multi-lingual text-to-speech library by MyShell.ai
Conversational voice AI agents
Implementation of AudioLM audio generation model in Pytorch
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)