ai-audio-tools

Open source: 13 stars, 2 forks, 0 issues, 2 watchers, last commit 3 months ago

About ai-audio-tools

Community list of AI tools for audio and music

Platforms

Web, Self-hosted

Links

AI Audio Tools

I want to...

I want to... | Go here
Make a full song from scratch | Creation & Production
Clone or transform a voice | AI Voice & Cover Generation
Remove vocals from a track | Source Separation
Make a podcast | Radio & Podcast
Use audio for health/medicine | Health & Wellbeing
Build an audio AI app | Development

Quick Navigation

Creation & Production | Lyric Writing | Voice Covers | Separation
Mastering | Plugins | Analysis | Health
Podcast | Hearing | Detection | Speech
Transcription | TTS | Enhancement | Development

Badge Key

free, freemium, paid, api, open-source, enterprise, vst, hardware, acquired


Creation & Production

  • VRS/A freemium - AI-powered lyric writing and music production workstation with multi-model Ghostwriter, Suno integration via browser extension, audio analysis, album art generation, and VRSA Studio (studio.vrsa.app) for a dedicated production environment.
  • Suno freemium - Generative AI music creation platform that allows users to create full songs, including vocals and instrumentation, from text prompts.
  • Soundry AI freemium - AI for Musicians, by Musicians.
  • Sonauto freemium - Create hit songs with AI.
  • Microphone Studio freemium - Multi-track recording without expensive studio equipment.
  • TuneFlow free open-source - Generate lyrics, melody, drum beats and more, while editing and mixing like any professional DAW.
  • CassetteAI freemium - AI powered music production platform: make lyrics, beats & vocals with AI then mix & publish straight from Cassette.
  • AIVA freemium - The Artificial Intelligence composing emotional soundtrack music.
  • beatoven.ai freemium - A simplified music creation tool that helps you create music for your videos and podcasts.
  • Infinite Album freemium - Adaptive AI music for gamers who livestream.
  • Epidemic Sound paid - High quality music and sound effects for all your content, all rights included.
  • Wonder paid - Dynascore: The world's first Dynamic Music Engine.
  • Amper acquired (Acquired by Shutterstock) - AI Music Composition Tools for Content Creators.
  • AudioStack paid api - AI-first platform for producing audio at scale.
  • mayk.it freemium - Your virtual music studio.
  • boomy freemium - Make instant music, share it with the world.
  • enote paid - Intelligent Sheet Music.
  • Qosmo - Qosmo is a group of artists, researchers, designers, and programmers.
  • AI Music acquired (Acquired by Apple) - Our music helps brands enable deeper connections with their audiences.
  • Splash HQ - The next generation of music producers.
  • musico - AI-driven software engine that generates music. It can react to gesture, movement, code or other sound.
  • Yousician freemium - The largest music educator on the planet.
  • Tape It free - App for songwriting & audio recording.
  • sessionwire paid - All-in-one online collaboration platform that delivers a seamless studio experience.
  • Aflorithmic paid api - Professional audio, voice, sound and music to scale.
  • Audio Design Desk paid - The Audio Solution for Video Editors.
  • Never Before Heard Sounds freemium - A music studio powered by AI.
  • NeuralDSP paid vst - Empowers music players by democratizing the access to world-class sound, through an intuitive software/hardware ecosystem.
  • Neutone free vst - AI audio plugin & community bridging the gap between AI research and creativity.
  • Udio freemium - AI music generator with full song creation from text prompts, integrated lyric editor, and granular line-by-line vocal control.
  • Mureka freemium - AI music generation with style-reference input, vocal timbre selection, and voice cloning for demos and song prototyping.
  • Soundverse freemium - Full-suite AI music studio with text-to-song, beat generation, stem separation, and SAAR — a voice-controlled music production assistant.
  • ACE Studio freemium - All-in-one AI music studio with expressive AI vocals, natural-sounding AI instruments, and a DAW bridge for Logic, Ableton, and FL Studio.
  • Stable Audio freemium - Text-to-audio and audio-to-audio generation for music and sound effects from Stability AI, trained on licensed datasets.
  • Riffusion free open-source - Diffusion model-based real-time music generation from text prompts, operating directly on audio spectrograms.
  • LoudMe freemium - Text-to-music generator for royalty-free songs and instrumentals with style and mood controls.
  • Ecrett Music freemium - Scene and mood-based AI background music generator aimed at video and content creators requiring instant scoring.
  • Soundful freemium - AI platform for generating royalty-free, high-quality soundtracks customizable by mood, tempo, and brand identity for commercial use.
  • SongGPT freemium - AI song generator for producing full tracks from short text prompts with genre selection.
  • Tunee freemium - AI music and lyric generation platform with access to multiple underlying generative models for varied output styles.
  • LOVO freemium api - Advanced text-to-speech and voice cloning platform for content creators, supporting emotional range control and voice actor-style production.

↑ Back to top


Lyric Writing & Songwriting

  • VRS/A freemium - AI-powered lyric writing and music production workstation with multi-model Ghostwriter, Suno integration via browser extension, audio analysis, album art generation, and VRSA Studio.
  • Lyric Studio freemium - Mobile-first AI songwriting ecosystem with a lyric editor, AI-generated verse/chorus drafts, rhyme suggestions, and song organization tools.

↑ Back to top

AI Voice & Cover Generation

  • Jammable freemium - (formerly Voicify AI) AI song cover generator with 22,000+ community-uploaded voice models and custom voice cloning from 10 minutes of audio.
  • Musicfy freemium - AI voice covers and voice cloning platform with text-to-music generation, voice-to-instrument conversion, and a large copyright-free vocal library.
  • Lalals freemium - AI voice swapping tool suite with 1,000+ voice options, stem splitting, and real-time conversion for remixes and vocal experimentation.

↑ Back to top

Source Separation

  • Music AI paid api - Professional AI stem separation and audio analysis platform for broadcasters and remixers, partnered with SourceAudio's 140+ broadcaster network.
  • TuneFlow free - A free DAW offering high-quality vocal, drum, melody, and bass stem separation, all-in-one audio separation, editing, and vocal/instrument-to-MIDI transcription.
  • Spliter.ai freemium - AI Audio Processing.
  • Gaudio enterprise api - Redefine your audio experience in music/video streaming and virtual/augmented reality.
  • AudioShake paid api - An On-Demand Stem Creation Platform for the Music Industry.
  • Audionamix enterprise - Audio separation solutions for the entertainment industry.
  • vocali.se freemium - Separate vocals and music from any song, in seconds.
  • lalal.ai freemium - High-quality stem splitting based on the world's #1 AI-powered technology.
  • VocalRemover free - Separate the voice from the music in a song for free with powerful AI algorithms.
  • PhonicMind freemium - Separate vocals, drums, bass and other instruments out of your songs with HiFi AI.
  • EasySplitter freemium - AI-based online vocal remover for DJs and singers.
  • Remover.studio free - Vocal Remover & Online Karaoke.
  • MVSep free - Free separation of songs with many different algorithms (Demucs, MDX, UVR, etc.).
  • MuzLab freemium - Remove vocals from songs and split drums, bass and other instruments out of music.
  • Fadr freemium - Remove stems, convert to MIDI, and create high-quality remixes and mashups using AI tools.

↑ Back to top

Mastering, Mixing & Production Analysis

  • SoundBoost AI freemium - AI music mastering platform with goal-based controls — specify targets like loudness, warmth, or punch and the engine applies processing automatically.
  • VerifAI Audio freemium - Instant AI-driven feedback on track quality covering mixdown balance, loudness levels, bitrate, and other release-readiness metrics.

↑ Back to top

Plugins & Sample Tools

  • Samplab paid vst - AI VST plugin for granular audio sample editing, enabling note-level pitch manipulation of polyphonic audio with automatic chord progression detection.
  • Slooply freemium - AI-powered sample discovery platform with similarity search, mood/key/BPM filtering, MIDI export, and direct drag-and-drop DAW integration.
  • Atlas paid - AI sample library organizer with auto-tagging, similar-sound search, and a smart drum map interface for large sample collections.
  • Playbeat paid vst - AI generative groove sequencer for instant beat creation with MIDI export and real-time DAW sync.

↑ Back to top

Analysis & Recommendation

  • SONOTELLER freemium - AI music analysis tool for song lyric summarization, theme extraction, and musical feature identification.
  • Musicful freemium - AI-powered music recommendation and discovery engine focused on contextual and emotional matching.
  • Harmix api - AI music search with natural language, videos, similar audio and lyrics. Auto-tagging for audio and video.
  • AIMS paid api - AI-powered music similarity search & auto-tagging for anyone who makes music discovery their business.
  • FeedForward enterprise api - The intuitive audio search engine for audio & sound catalogues.
  • Aimi free - Discover the artists who freed their music from the shackles of songs and playlists.
  • Utopia Music enterprise - Fair Pay for Every Play.
  • Musiio acquired (Acquired by SoundCloud) - Use Artificial Intelligence to help automate your workflows.
  • niland acquired (Acquired by Spotify) - Build AI Powered Music Apps.
  • cyanite freemium api - AI for Music tagging and similarity search.
  • musicube acquired (Acquired by SongTradr) - B2B AI music metadata services like auto-tagging, metadata enrichment and semantic search.
  • Musixmatch freemium api - Algorithms and tools for music discovery, recommendation, and search based on lyrics.
  • hoopr paid - Find the best music, tell better stories, grow your audience.
  • Pex enterprise api - Music identification and copyright compliance. Audio fingerprinting and cover song identification at large scale.

↑ Back to top

Health & Wellbeing

  • Endel freemium - Personalized soundscapes to help you focus, relax, and sleep.
  • Lucid - Transforming music into medicine, using AI to compose and curate a personalized therapeutic music experience.
  • Wavepaths paid - Music for Psychedelic Therapy.
  • Suki enterprise - AI-powered voice solutions for healthcare.
  • audEERING enterprise api - Technology that can detect emotions and health information from the voice.
  • brain.fm freemium - Music to Focus Better.
  • SPOKE freemium - Lo-fi & Lyricism-led Mindfulness music episodes.
  • sona - Music as medicine. Research-based music for anxiety made by Grammy-winning producers.
  • Novoic enterprise - Using speech to detect neurological diseases.
  • Ubenwa enterprise - Infant health analysis based on cry signals.

↑ Back to top

Radio & Podcast

  • faidr free - Your favorite radio, interruption free.
  • fathom - The search engine for podcasts.
  • Nomono paid hardware - A self-contained recording kit for capturing interviews in the field.
  • Descript freemium - All-in-one audio & video editing, as easy as a doc.
  • auphonic freemium - Automatic audio post production web service for podcasts, broadcasters, radio shows, movies, screencasts and more.
  • SimonSays paid - Edit Video 5x Faster, Built For Teams.
  • Podcastle freemium - Studio-quality recording, AI-powered editing, and seamless exporting.
  • cleanvoice freemium - Removes filler sounds, stuttering and mouth sounds from your podcast or audio recording.
  • Super Hi-Fi enterprise - Artificial Intelligence Powered Music Experiences.

↑ Back to top

Hearing

  • Whisper.ai paid hardware - Smarter than your average hearing aid.
  • Eargo paid hardware - A Revolutionary New Hearing Aid.
  • Concha Labs hardware - Helping you hear more clearly.

↑ Back to top

Sound Detection

  • Audio Analytic enterprise api - Creating exceptional human experiences through a greater sense of hearing.
  • SoundEye enterprise - Advanced sound recognition solutions capable of classifying sounds such as screaming, gunshot, coughing, and crying.
  • cochl api enterprise - A next-generation sound AI platform that understands any sounds like a human.
  • Josh.ai paid - A voice-controlled home automation system.
  • SEE SOUND paid - The world's first smart home hearing system.
  • Epigos.ai api - AI models that can be used to extract hidden data from audio sources.
  • HyperSurfaces enterprise - Seamlessly merging the physical and data worlds without the need for keyboards, buttons or touch screens.
  • HyperSentience enterprise - Delivers context awareness to phones, VR/AR headsets, smart watches, speakers and laptops.
  • Circulr Sound hardware - Smart audio wearables.
  • Securaxis enterprise - We turn sounds into information.
  • Deeply enterprise api - We add meaning to every sound in the world using advanced deep learning technology for sound event detection and context recognition.
  • Reef Pulse - Coral reef monitoring using bioacoustics and AI: sound event detection (boats, divers, waves, marine mammals, fishes, invertebrates) for impactful management of marine ecosystems.

↑ Back to top

Speech

Transcription

  • Ava freemium - Professional and AI-Based Captions for Deaf and HoH (Transcription & Diarization).
  • verbit enterprise - Professional AI-Based Transcription & Captioning.
  • otter freemium - Everything hybrid teams need for productive, collaborative meetings.
  • Trint paid - Audio Transcription Software — Speech to Text to Magic.
  • Rev paid - 99% accurate captions, transcripts, and subtitles.
  • voiceitt - An app for people with non-standard speech.
  • deepgram.com freemium api - Better voice applications with faster, more accurate transcription through AI Speech Recognition.
  • fireflies.ai freemium - AI assistant for your meetings.
  • SoapBox api enterprise - Speech technology that makes kids heard.
  • Amberscript freemium - SaaS solutions that automatically transform audio and video into text and subtitles using speech recognition.
  • Speaksee - Live captions of what's being said during in-person group meetings.
  • Speechmatics api enterprise - Autonomous Speech Recognition technology that understands every voice.
  • sonix freemium - Automated transcription in 35+ languages.
  • Picovoice freemium api open-source - End-to-end Edge Voice AI, on-device voice recognition.
  • BoldVoice paid - Speak English clearly and confidently.
  • Gladia freemium api - Power your product with cutting-edge AI transcription, translation and audio intelligence using a single API.
  • Podsqueeze freemium - Re-purpose your audio or video podcast into transcript, show notes, blog post, video clips and other assets to publish and promote your show.

↑ Back to top

Synthesis (TTS)

  • adauris.ai freemium - Transforming written content into engaging audio with seamless distribution.
  • Aflorithmic paid api - Professional audio, voice, sound and music to scale.
  • Sonantic acquired (Acquired by Spotify) - Deliver compelling, lifelike performances with fully expressive AI-generated voices.
  • kroop AI - Harness synthetic media generation and detection with endless possibilities.
  • dubverse freemium - Make your content multilingual at a click of a button and reach more people.
  • Resemble.ai freemium api - Generate AI Voices that sound real.
  • Replica freemium - AI voice actors for games, film & the metaverse.
  • Respeecher paid - Voice Cloning for Content Creators.
  • amai - Ultra realistic text to speech voice engines.
  • AssemblyAI freemium api - Transcribe and understand audio with a single AI-powered API.
  • DAISYS - New voices that sound like real people.
  • WellSaid paid - Text-to-speech technology that creates life-like synthetic voices, from the voices of real people.
  • Deepsync - Generate audio content that exactly sounds like you.
  • coqui.ai open-source - Providing open speech tech for everyone.
  • Voiseed - AI-based Voice Engine able to mimic the emotions and prosody of human speech.
  • Speechki freemium - NLP-based text and audio editing platform with hundreds of AI voices inside.
  • Jellypod freemium - The AI podcast studio. Create customizable AI podcasts in minutes.
  • MiSynth - A brain-controlled instrument that uses synaptic technology and BCIs to turn imagined sounds into a synthesized MIDI instrument.
  • ElevenLabs freemium api - Developing the most compelling AI speech software for publishers and creators.
  • Wondercraft freemium - Wondercraft enables users to generate podcasts using Text-to-Speech technology.
  • play.ht freemium api - Building the future of content creation based on generative machine learning models.
  • Revocalize.ai freemium - Generate studio-quality AI Voices and train AI voice models from the web dashboard or the VST plugin.
  • morpheme.ai - Actor-First, Digital-Double Voices powered by the latest AI technology, ensuring they are efficient, authentic, and ethical.

↑ Back to top

Enhancement & Manipulation

  • Meaning - Streaming real-time voice and accent conversion.
  • VideoDubber freemium - Translating video/audio through voice cloning and accent conversion in 150+ languages.
  • krisp freemium - An AI-powered software solution for effective online meetings.
  • voicemod freemium - Free real-time voice changer.
  • audo freemium api - Noise cancellation products for creators, developers, and virtual meetings.
  • AudioTelligence enterprise api - Software that transforms the clarity and intelligibility of speech in challenging acoustic environments.
  • immersitech.io enterprise - We don't make audio. We make audio better.
  • utterly freemium - Noise removal for meetings and audio.
  • claerity.ai freemium - Cutting-edge AI to eliminate all background noise on video conference calls.
  • Neural Love freemium - Set of AI-powered tools to enhance audio quality.
  • HeardThat freemium - A smartphone app that turns your smartphone into a sophisticated speech-enhancement device.
  • Chatable freemium - A smartphone app that removes disruptive background noise.
  • BdSound enterprise - Intelligent Audio Solution for audio and voice-enabled products.
  • echosonic - Revolutionizing the microphone by bringing machine learning capabilities into it.
  • Insoundz freemium - Generative AI Audio Enhancement.
  • Xound freemium - AI-powered audio enhancements in just one click. Grammarly for audio.

Development

Tools & SDKs

  • Quilio api - We maintain tools to help developers build real-time audio AI applications with ease.

Contributing

Fork the repo, edit the README, and open a PR.
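
Before opening a PR, it can help to check that a new entry matches the shape used throughout this list. The sketch below is one possible check in Python, assuming each entry reads `Name badges - Description.` and that badges come only from the Badge Key. The helper name `parse_entry` is hypothetical and not part of the repository.

```python
# Badge names copied from the Badge Key section of this list.
BADGES = {"free", "freemium", "paid", "api", "open-source",
          "enterprise", "vst", "hardware", "acquired"}


def parse_entry(line):
    """Split 'Name badge ... - Description.' into (name, badges, description).

    Hypothetical helper for illustration only. Returns None when the line
    has no ' - ' separator or the description is empty.
    """
    # Drop surrounding whitespace and an optional leading bullet.
    text = line.strip().lstrip("•").strip()
    head, sep, description = text.partition(" - ")
    if not sep or not description.strip():
        return None
    words = head.split()
    badges = []
    # Badges trail the name, so peel them off from the right,
    # always leaving at least one word for the name itself.
    while len(words) > 1 and words[-1] in BADGES:
        badges.insert(0, words.pop())
    return " ".join(words), badges, description.strip()


if __name__ == "__main__":
    print(parse_entry("  • Tape It free - App for songwriting & audio recording."))
```

Entries with a parenthetical after the badge, such as `(Acquired by ...)`, are returned with the whole head as the name and no badges; a stricter check would need to handle that case explicitly.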

↑ Back to top