Voice & Audio AI Tools

Speech recognition, text-to-speech, and audio processing AI tools

In Smart Assistants

Showing 51 tools

nugget

nugget

Achieve 85% support automation with AI agents: chatbot, voice and email bots, social media replies, agent co-pilot, and AI ticketing automation.

producer.ai

producer.ai

Create the music you imagine. Producer.ai is a generative AI instrument for creating, remixing, and sharing studio-quality songs from simple prompts. Swap stems, extend tracks, and personalize your sound effortlessly.

Assemblyai

Assemblyai

With AssemblyAI's industry-leading Speech AI models, transcribe speech to text and extract insights from your voice data.

Free + Paid
ElevenLabs

ElevenLabs

ElevenLabs offers an AI voice generator and voice agents platform, featuring over 5,000 voices in 70+ languages. It enables users to create lifelike speech and high-quality visuals within a complete creative workflow.

Free + Paid
Krisp

Krisp

Krisp is an AI Meeting Assistant that offers noise cancellation, transcription, meeting notes, summaries, and accent conversion, all designed to enhance focus and communication during meetings.

Free + Paid
Suno AI

Suno AI

Create stunning original music for free in seconds using AI. Make your own masterpieces, share with friends, and discover music from artists worldwide.

Free + Paid
Udio

Udio

Udio is an AI music generator that allows users to create personalized music by simply describing it in text. It serves both professional musicians and amateurs, offering state-of-the-art AI-editing tools for music production.

Free + Paid
Fathom

Fathom

Free AI meeting assistant for video calls.

Free
Descript Overdub

Descript Overdub

Easily create a realistic voice clone or choose from stock AI voices with Descript Overdub, enabling seamless audio editing and enhancing your creative projects with advanced voice technology.

Free + Paid
Krisp

Krisp

Krisp is an AI Meeting Assistant that offers noise cancellation, transcription, meeting notes, summaries, and accent conversion, all designed to enhance focus and communication during meetings.

Free + Paid
Adobe Podcast

Adobe Podcast

Adobe Podcast offers next-generation audio solutions for recording, transcribing, editing, and sharing. It features tools like StudioBETA for browser-based enhancements and options to improve voice recordings by removing noise and echo.

Free + Paid
Speechify

Speechify

Speechify is a text-to-speech and voice typing AI assistant that reads aloud books, PDFs, and web pages using natural voices. It helps users listen and comprehend content faster, integrating seamlessly with various platforms.

Free + Paid
Fireflies.ai

Fireflies.ai

Fireflies.ai is the industry leader in transcription accuracy, offering automatic note-taking, transcription, summarization, and analysis of team conversations across various video conferencing platforms in over 100 languages.

Free + Paid
Resemble AI

Resemble AI

Resemble AI provides enterprises with ultra-realistic voice creation and deepfake detection solutions. Trusted by Fortune 500s, it offers scalable, secure platforms for generative AI, including the Chatterbox voice model and DETECT-3B deepfake detection.

Subscription - $0.006 per second
Dolby.io

Dolby.io

Dolby OptiView enhances live streaming with real-time engagement, seamless ad integration, and cross-platform consistency. It delivers immersive, interactive experiences that boost viewer engagement and drive revenue, optimizing video playback for all devices.

Free + Paid
Auphonic

Auphonic

Auphonic is an automatic audio post production web service that balances levels between speakers, music, and speech, while also offering noise and reverb reduction, filtering, and auto-equalization to enhance sound quality.

Free + Paid
Papercup

Papercup

Papercup offers natural-sounding AI dubbing and professional voice-over services, combining advanced synthetic voices with human expertise to deliver emotionally compelling content across various formats. Their technology ensures cultural sensitivity and nuanced expression for global audiences.

Subscription - custom pricing
Otter.ai

Otter.ai

Otter.ai is an AI Notetaker that enhances productivity through real-time transcription, automated summaries, and insights. It supports various integrations and features like collaborative note editing and unlimited transcription minutes.

Free + Paid
Avoma

Avoma

AI meeting assistant for revenue teams.

Free + Paid
Murf AI

Murf AI

Murf AI offers an AI Voice Generator and Text to Speech APIs & SDKs, allowing users to create ultra-realistic voiceovers in over 20 languages using 200+ AI voices in seconds.

Free + Paid
Supernormal

Supernormal

Supernormal is an AI-powered meeting platform that automates meeting notes, agendas, and insights, allowing users to focus on building connections and enhancing productivity. It integrates with Google Meet, Zoom, and Microsoft Teams.

Free + Paid
Altered Studio

Altered Studio

Professional AI voice editor and voice changer.

Free + Paid
WellSaid Labs

WellSaid Labs

WellSaid Labs offers professional-quality text-to-speech voiceovers using AI voices that sound natural and expressive. With over 120 licensed-actor voices, it simplifies production workflows and enhances communication for modern teams.

Subscription - $49/month (creator)
Grain

Grain

Grain is an AI-powered notetaker that provides accurate meeting summaries, account insights, and coaching suggestions. It is designed for growing teams, offering features like automatic recordings, tailored notes, and effortless highlight sharing.

Free + Paid
Deepdub

Deepdub

Deepdub is an end-to-end localization platform that offers scalable dubbing and voice-over solutions, combining proprietary technology with expert production to deliver premium, AI-driven voice experiences for global storytelling.

Subscription - custom pricing
Cleanvoice AI

Cleanvoice AI

Cleanvoice AI is an AI-powered podcast editing tool that automatically removes background noise, filler words, and mouth sounds, allowing users to edit their podcasts in just 10 minutes instead of hours. It also offers features like transcription, summarization, and multitrack editing.

Subscription - $10/month (starter)
Play.ht

Play.ht

Play.ht is the best AI voice generator featuring over 200 realistic voices and multi-speaker capabilities in 40+ languages, designed for creators and enterprises to produce indistinguishable AI voiceovers and text-to-speech content.

Free + Paid
Airgram

Airgram

Airgram, in collaboration with Notta, offers an integrated solution for AI meeting notes and audio transcription. It records, transcribes, and summarizes voice conversations into actionable text, enhancing productivity during meetings.

Free + Paid
MeetGeek

MeetGeek

AI meeting automation for recording and insights.

Free + Paid
Magenta Studio

Magenta Studio

Magenta Studio is an open source research project that explores the role of machine learning as a tool in the creative process of making art and music.

Free + Paid
LOVO AI

LOVO AI

AI voice generator and text-to-speech platform.

Free + Paid
Loopin

Loopin

AI meeting workspace connecting calendar and notes.

Free + Paid
Sembly AI

Sembly AI

Sembly AI automatically generates accurate meeting notes and transcripts, capturing meetings from platforms like Google Meet and Zoom. It provides summaries, identifies speakers, and supports multilingual chats to enhance team collaboration.

Free + Paid
Veritone Voice

Veritone Voice

Veritone Voice is a leading AI voice solution that enables the creation of lifelike text-to-speech and speech-to-speech synthetic voices at unmatched speed and scale. It allows for content creation on demand, localization in over 150 languages, and cloning of voices with consent.

Subscription - custom pricing
Descript

Descript

Video editor with AI voice + overdub.

Subscription-$12-$24/month
Curious Thing

Curious Thing

Voice AI for recruitment and screening calls.

Subscription - custom pricing
Listnr

Listnr

Listnr is a professional AI voice generator that offers over 1000 realistic voices in 142+ languages. Trusted by over 3 million users, it enables the creation of multilingual content, voice cloning, and engaging voiceovers for various applications.

Free + Paid
Unveil

Unveil

ZYLIA Beamformer is a spatial audio processing plugin designed for Ambisonics recordings, supporting up to 7th order microphones. It enhances audio workflows with precise spatial filtering, virtual microphone setups, and sound source separation in your DAW.

One-time purchase - $399
Mubert

Mubert

Mubert is an AI music generator that creates royalty-free music from text prompts, utilizing millions of samples from various artists. It combines human creativity with AI technology to deliver customized audio for videos and projects.

Free + Paid
Audo Studio

Audo Studio

Audo Studio offers one-click audio cleaning that automatically removes background noise, reduces echoes, and adjusts volume levels, enhancing speech quality for YouTubers and podcasters in seconds.

Free + Paid
Ecrett Music

Ecrett Music

Ecrett Music offers an easy way to create royalty-free music using an intuitive interface, allowing users to customize music for games, videos, podcasts, and ads without needing prior music knowledge.

Freemium
Voicemod

Voicemod

Voicemod is a free real-time voice changer app that transforms your voice with over 200 effects, allowing you to sound like a girl, a robot, or even AI anime waifus. It enhances gaming, streaming, and group chats with hilarious sound effects.

Free + Paid
Utterly

Utterly

Utterly is a noise cancellation app designed to enhance audio quality during meetings and recordings by removing background noise. It processes audio locally on your device, ensuring privacy and improved sound clarity.

Subscription - $8/month (starter)
MuseNet

MuseNet

MuseNet is a deep neural network that generates 4-minute musical compositions using 10 different instruments, blending styles from various genres, including country and classical. It learns patterns of harmony and rhythm from extensive MIDI files without explicit programming.

Free
Soundful

Soundful

Soundful is an AI Music Studio that allows creators to effortlessly generate unique, royalty-free background music for videos, streams, and podcasts at the click of a button, ensuring copyright compliance and affordability.

Free + Paid
Voicegain

Voicegain

Voicegain provides developers with a highly accurate and affordable Speech-to-Text platform, enabling the creation of voice-enabled applications and AI voice agents. Their deep-learning-based models can be deployed on-premise or in the cloud, offering flexibility and integration with existing systems.

Free + Paid
Uberduck

Uberduck

Uberduck offers realistic, expressive synthetic vocals for music, voiceovers, and videos, enabling users to generate speech, singing, and rapping from text. It supports voice cloning and provides tools for creating custom voices across various languages and musical styles.

Free + Paid
Splash Pro

Splash Pro

Splash Pro is an interactive online music creation platform that allows users to collaborate and create music with other artists in seconds. It transforms music into a social and immersive experience, enabling unique tracks that can be shared globally.

Free + Paid
Resound

Resound

Resound is an AI Podcast Editor designed to automate podcast editing, allowing creators to edit in minutes rather than hours. It identifies unwanted mistakes and long silences, enabling efficient content refinement.

Subscription - $10/month (pro)
Brusfri

Brusfri

null

Free + Paid
Noise Eraser

Noise Eraser

Noise Eraser uses AI to identify and extract background noise, enhancing human voice clarity in audio files. It allows users to isolate specific sounds, creating high-quality, noise-free audio easily on any device without professional editing skills.

Free + Paid

About Voice & Audio AI tools

Speech recognition, text-to-speech, and audio processing AI tools Compare features, pricing, and reviews to find the best voice & audio ai tools. Explore the wider Smart Assistants category for related solutions.

Popular Voice & Audio AI tools

  • nugget
  • producer.ai
  • Assemblyai
  • ElevenLabs
  • Krisp
  • Suno AI