Microsoft MAI Models is a suite of three in-house AI models for speech transcription, voice generation, and image generation, available via Microsoft Foundry.
Google Cloud Speech to Text is a freemium AI transcription API supporting 125+ languages and real-time streaming recognition, built on the Chirp foundation model for enterprise accuracy.
Gladia is an AI-powered speech recognition API that provides real-time and async audio transcription with speaker diarization and multilingual support.