Voice & Audio AI Tools
Speech recognition, text-to-speech, and audio processing AI tools
Showing 51 tools
nugget
Achieve 85% support automation with AI agents: chatbot, voice and email bots, social media replies, agent co-pilot, and AI ticketing automation.
producer.ai
Create the music you imagine. Producer.ai is a generative AI instrument for creating, remixing, and sharing studio-quality songs from simple prompts. Swap stems, extend tracks, and personalize your sound effortlessly.
AssemblyAI
With AssemblyAI's industry-leading Speech AI models, transcribe speech to text and extract insights from your voice data.
ElevenLabs
ElevenLabs offers an AI voice generator and voice agents platform, featuring over 5,000 voices in 70+ languages. It enables users to create lifelike speech and high-quality visuals within a complete creative workflow.
Krisp
Krisp is an AI Meeting Assistant that offers noise cancellation, transcription, meeting notes, summaries, and accent conversion, all designed to enhance focus and communication during meetings.
Suno AI
Create stunning original music for free in seconds using AI. Make your own masterpieces, share with friends, and discover music from artists worldwide.
Udio
Udio is an AI music generator that allows users to create personalized music by simply describing it in text. It serves both professional musicians and amateurs, offering state-of-the-art AI-editing tools for music production.
Fathom
Free AI meeting assistant for video calls.
Descript Overdub
Easily create a realistic voice clone or choose from stock AI voices with Descript Overdub, enabling seamless audio editing and enhancing your creative projects with advanced voice technology.
Adobe Podcast
Adobe Podcast offers next-generation audio solutions for recording, transcribing, editing, and sharing. It features tools like Studio (beta) for browser-based enhancements and options to improve voice recordings by removing noise and echo.
Speechify
Speechify is a text-to-speech and voice typing AI assistant that reads aloud books, PDFs, and web pages using natural voices. It helps users listen and comprehend content faster, integrating seamlessly with various platforms.
Fireflies.ai
Fireflies.ai is the industry leader in transcription accuracy, offering automatic note-taking, transcription, summarization, and analysis of team conversations across various video conferencing platforms in over 100 languages.
Resemble AI
Resemble AI provides enterprises with ultra-realistic voice creation and deepfake detection solutions. Trusted by Fortune 500s, it offers scalable, secure platforms for generative AI, including the Chatterbox voice model and DETECT-3B deepfake detection.
Dolby.io
Dolby OptiView enhances live streaming with real-time engagement, seamless ad integration, and cross-platform consistency. It delivers immersive, interactive experiences that boost viewer engagement and drive revenue, optimizing video playback for all devices.
Auphonic
Auphonic is an automatic audio post production web service that balances levels between speakers, music, and speech, while also offering noise and reverb reduction, filtering, and auto-equalization to enhance sound quality.
Papercup
Papercup offers natural-sounding AI dubbing and professional voice-over services, combining advanced synthetic voices with human expertise to deliver emotionally compelling content across various formats. Their technology ensures cultural sensitivity and nuanced expression for global audiences.
Otter.ai
Otter.ai is an AI Notetaker that enhances productivity through real-time transcription, automated summaries, and insights. It supports various integrations and features like collaborative note editing and unlimited transcription minutes.
Avoma
AI meeting assistant for revenue teams.
Murf AI
Murf AI offers an AI Voice Generator and Text to Speech APIs & SDKs, allowing users to create ultra-realistic voiceovers in over 20 languages using 200+ AI voices in seconds.
Supernormal
Supernormal is an AI-powered meeting platform that automates meeting notes, agendas, and insights, allowing users to focus on building connections and enhancing productivity. It integrates with Google Meet, Zoom, and Microsoft Teams.
Altered Studio
Professional AI voice editor and voice changer.
WellSaid Labs
WellSaid Labs offers professional-quality text-to-speech voiceovers using AI voices that sound natural and expressive. With over 120 licensed-actor voices, it simplifies production workflows and enhances communication for modern teams.
Grain
Grain is an AI-powered notetaker that provides accurate meeting summaries, account insights, and coaching suggestions. It is designed for growing teams, offering features like automatic recordings, tailored notes, and effortless highlight sharing.
Deepdub
Deepdub is an end-to-end localization platform that offers scalable dubbing and voice-over solutions, combining proprietary technology with expert production to deliver premium, AI-driven voice experiences for global storytelling.
Cleanvoice AI
Cleanvoice AI is an AI-powered podcast editing tool that automatically removes background noise, filler words, and mouth sounds, allowing users to edit their podcasts in just 10 minutes instead of hours. It also offers features like transcription, summarization, and multitrack editing.
Play.ht
Play.ht is an AI voice generator featuring over 200 realistic voices and multi-speaker capabilities in 40+ languages, designed for creators and enterprises producing AI voiceovers and text-to-speech content.
Airgram
Airgram, in collaboration with Notta, offers an integrated solution for AI meeting notes and audio transcription. It records, transcribes, and summarizes voice conversations into actionable text, enhancing productivity during meetings.
MeetGeek
AI meeting automation for recording and insights.
Magenta Studio
Magenta Studio is an open source research project that explores the role of machine learning as a tool in the creative process of making art and music.
LOVO AI
AI voice generator and text-to-speech platform.
Loopin
AI meeting workspace connecting calendar and notes.
Sembly AI
Sembly AI automatically generates accurate meeting notes and transcripts, capturing meetings from platforms like Google Meet and Zoom. It provides summaries, identifies speakers, and supports multilingual chats to enhance team collaboration.
Veritone Voice
Veritone Voice is a leading AI voice solution that enables the creation of lifelike text-to-speech and speech-to-speech synthetic voices at unmatched speed and scale. It allows for content creation on demand, localization in over 150 languages, and cloning of voices with consent.
Descript
Video editor with AI voice + overdub.
Curious Thing
Voice AI for recruitment and screening calls.
Listnr
Listnr is a professional AI voice generator that offers over 1000 realistic voices in 142+ languages. Trusted by over 3 million users, it enables the creation of multilingual content, voice cloning, and engaging voiceovers for various applications.
Unveil
ZYLIA Beamformer is a spatial audio processing plugin designed for Ambisonics recordings, supporting up to 7th order microphones. It enhances audio workflows with precise spatial filtering, virtual microphone setups, and sound source separation in your DAW.
Mubert
Mubert is an AI music generator that creates royalty-free music from text prompts, utilizing millions of samples from various artists. It combines human creativity with AI technology to deliver customized audio for videos and projects.
Audo Studio
Audo Studio offers one-click audio cleaning that automatically removes background noise, reduces echoes, and adjusts volume levels, enhancing speech quality for YouTubers and podcasters in seconds.
Ecrett Music
Ecrett Music offers an easy way to create royalty-free music using an intuitive interface, allowing users to customize music for games, videos, podcasts, and ads without needing prior music knowledge.
Voicemod
Voicemod is a free real-time voice changer app that transforms your voice with over 200 effects, allowing you to sound like a girl, a robot, or even AI anime waifus. It enhances gaming, streaming, and group chats with hilarious sound effects.
Utterly
Utterly is a noise cancellation app designed to enhance audio quality during meetings and recordings by removing background noise. It processes audio locally on your device, ensuring privacy and improved sound clarity.
MuseNet
MuseNet is a deep neural network that generates 4-minute musical compositions using 10 different instruments, blending styles from various genres, including country and classical. It learns patterns of harmony and rhythm from extensive MIDI files without explicit programming.
Soundful
Soundful is an AI Music Studio that allows creators to effortlessly generate unique, royalty-free background music for videos, streams, and podcasts at the click of a button, ensuring copyright compliance and affordability.
Voicegain
Voicegain provides developers with a highly accurate and affordable Speech-to-Text platform, enabling the creation of voice-enabled applications and AI voice agents. Their deep-learning-based models can be deployed on-premise or in the cloud, offering flexibility and integration with existing systems.
Uberduck
Uberduck offers realistic, expressive synthetic vocals for music, voiceovers, and videos, enabling users to generate speech, singing, and rapping from text. It supports voice cloning and provides tools for creating custom voices across various languages and musical styles.
Splash Pro
Splash Pro is an interactive online music creation platform that allows users to collaborate and create music with other artists in seconds. It transforms music into a social and immersive experience, enabling unique tracks that can be shared globally.
Resound
Resound is an AI Podcast Editor designed to automate podcast editing, allowing creators to edit in minutes rather than hours. It identifies unwanted mistakes and long silences, enabling efficient content refinement.
Brusfri
Noise Eraser
Noise Eraser uses AI to identify and extract background noise, enhancing human voice clarity in audio files. It allows users to isolate specific sounds, creating high-quality, noise-free audio easily on any device without professional editing skills.
About Voice & Audio AI tools
Speech recognition, text-to-speech, and audio processing AI tools. Compare features, pricing, and reviews to find the best Voice & Audio AI tools. Explore the wider Smart Assistants category for related solutions.
Popular Voice & Audio AI tools
- nugget
- producer.ai
- AssemblyAI
- ElevenLabs
- Krisp
- Suno AI