Overview
What is Google Cloud Speech-to-Text?
Speech-to-Text on Google Cloud is a tool used to convert speech into text using an API powered by Google’s AI technologies. The vendor states users can transcribe content in real time or from stored files; deliver a better user experience…
Pricing
Speech-to-Text V2 API
$0.016 per minute
Speech-to-Text V1 API
$0.024 per minute
Entry-level set up fee?
- No setup fee
Offerings
- Free Trial
- Free/Freemium Version
- Premium Consulting/Integration Services
Product Details
What is Google Cloud Speech-to-Text?
The service includes up to 60 free minutes of audio transcription and analysis per month. (This applies only to audio processed with the Speech-to-Text V1 API.)
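To make the free allowance concrete, here is a minimal cost sketch. It assumes the prices listed on this page are charged per minute of audio and that the 60 free minutes are simply subtracted from V1 usage each month; the function name and the rounding-free arithmetic are illustrative, and actual billing rules (increments, tiers, model surcharges) may differ.

```python
from decimal import Decimal

# Prices as listed on this page, assumed here to be per minute of audio.
PRICE_PER_MINUTE = {"v1": Decimal("0.024"), "v2": Decimal("0.016")}
# Free monthly allowance; the page states it applies to the V1 API only.
FREE_MINUTES = {"v1": Decimal("60"), "v2": Decimal("0")}


def estimate_monthly_cost(minutes, api_version="v1"):
    """Estimate a monthly charge in USD (hypothetical helper, not an official calculator)."""
    minutes = Decimal(str(minutes))
    if minutes < 0:
        raise ValueError("minutes must be non-negative")
    # Only usage beyond the free allowance is treated as billable.
    billable = max(minutes - FREE_MINUTES[api_version], Decimal("0"))
    return billable * PRICE_PER_MINUTE[api_version]


if __name__ == "__main__":
    for version in ("v1", "v2"):
        print(version, estimate_monthly_cost(1000, version))
```

Under these assumptions, the lower V2 rate outweighs the V1 free allowance once monthly usage is more than a few hours of audio.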
Advanced speech AI
Speech-to-Text can use Chirp, Google Cloud’s foundation model for speech, trained on millions of hours of audio data and billions of text sentences. This contrasts with traditional speech recognition techniques, which rely on large amounts of language-specific supervised data. Chirp’s approach gives users improved recognition and transcription for more spoken languages and accents.
Support for 125 languages and variants
Build for a global user base with extensive language support. The service transcribes short, long, and even streaming audio data. Speech-to-Text also offers users more accurate and globe-spanning translation and recognition with Chirp, the next generation of universal speech models. Chirp was built using self-supervised training on millions of hours of audio and 28 billion sentences of text spanning 100+ languages.
Pretrained or customizable models for transcription
Offers a selection of trained models for voice control, phone call, and video transcription optimized for domain-specific quality requirements. Users can customize, experiment with, create, and manage custom resources with the Speech-to-Text UI.
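As a rough illustration of choosing a pretrained model, the sketch below maps the three use cases named above to the `model` field of a V1 recognition config. The model identifiers (`command_and_search`, `phone_call`, `video`, `default`) are believed to match the V1 API, but the mapping keys and the helper name are hypothetical and only the config dictionary is built; nothing is sent to the service.

```python
# Mapping from the use cases named on this page to V1 `model` values.
# The keys on the left are labels invented for this example.
MODEL_FOR_USE_CASE = {
    "voice_control": "command_and_search",
    "phone_call": "phone_call",
    "video": "video",
}


def build_config(use_case, language_code="en-US"):
    """Return a minimal recognition config dict for the given use case."""
    # Fall back to the general-purpose model for anything unrecognized.
    model = MODEL_FOR_USE_CASE.get(use_case, "default")
    return {"languageCode": language_code, "model": model}


if __name__ == "__main__":
    print(build_config("phone_call"))
    print(build_config("podcast"))
```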
Out-of-the-box regulatory and security compliance
Speech-to-Text API v2 helps enterprise and business customers meet added security and regulatory requirements out of the box. Data residency lets users invoke transcription models through a fully regionalized service hosted in Google Cloud regions such as Singapore and Belgium. Recognizer resourcefulness eliminates the need for dedicated service accounts for authentication and authorization. Logs for resource generation and transcription are readily available in the Google Cloud console. Speech-to-Text API v2 also offers enterprise-grade encryption with customer-managed encryption keys for all resources as well as batch transcription.
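The data residency point can be sketched as follows. This assumes the V2 API's regional endpoint pattern `<location>-speech.googleapis.com` and the resource path `projects/.../locations/.../recognizers/...`, with `asia-southeast1` and `europe-west1` as the location IDs for Singapore and Belgium. The project and recognizer IDs are placeholders, and the helpers only build strings; they do not contact the service.

```python
# Location IDs assumed for the two regions named on this page.
REGION_LOCATIONS = {"Singapore": "asia-southeast1", "Belgium": "europe-west1"}


def regional_endpoint(location):
    """Build the assumed regional API hostname for a location ID."""
    return f"{location}-speech.googleapis.com"


def recognizer_name(project_id, location, recognizer_id):
    """Build a V2-style recognizer resource path (IDs are placeholders)."""
    return f"projects/{project_id}/locations/{location}/recognizers/{recognizer_id}"


if __name__ == "__main__":
    loc = REGION_LOCATIONS["Belgium"]
    print(regional_endpoint(loc))
    print(recognizer_name("example-project", loc, "example-recognizer"))
```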
AI-powered speech recognition and transcription
Speech-to-Text uses model adaptation to improve the accuracy of frequently used words, expand the vocabulary available for transcription, and improve transcription from noisy audio. Model adaptation lets users customize Speech-to-Text to recognize specific words or phrases more frequently than other options that might otherwise be suggested. For example, you could bias Speech-to-Text towards transcribing "weather" over "whether."
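The "weather" versus "whether" example might look like this as a V1 REST request body. It is a sketch that assumes the `speechContexts` field with `phrases` and `boost`; the bucket path, the boost value of 10, and the audio encoding settings are placeholders chosen for illustration. The code only builds and prints the JSON payload.

```python
import json


def build_recognize_request(gcs_uri, phrases, boost=10.0):
    """Build a request body that biases recognition toward `phrases`."""
    return {
        "config": {
            "encoding": "LINEAR16",       # assumed audio format
            "sampleRateHertz": 16000,     # assumed sample rate
            "languageCode": "en-US",
            # Phrase hints: a higher boost makes these phrases more likely
            # to be chosen over similar-sounding alternatives.
            "speechContexts": [{"phrases": list(phrases), "boost": boost}],
        },
        "audio": {"uri": gcs_uri},
    }


if __name__ == "__main__":
    body = build_recognize_request("gs://example-bucket/forecast.wav", ["weather"])
    print(json.dumps(body, indent=2))
```

A suitable boost value depends on the audio and vocabulary, so it is usually tuned by comparing transcripts rather than fixed in advance.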
Streaming speech recognition
Sends real-time speech recognition results as the API processes audio input streamed from a connected application’s microphone or sent from a prerecorded audio file (inline or through Cloud Storage).
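On the client side, streaming generally means sending audio in small pieces rather than as one file. The generator below is a minimal sketch of that chunking step for raw 16-bit mono PCM; the 100 ms chunk length, the sample rate, and the function name are assumptions for illustration, and the code does not call the API.

```python
def audio_chunks(pcm_bytes, sample_rate_hz=16000, bytes_per_sample=2, chunk_ms=100):
    """Yield successive fixed-duration chunks of raw mono PCM audio."""
    chunk_size = sample_rate_hz * bytes_per_sample * chunk_ms // 1000
    if chunk_size <= 0:
        raise ValueError("chunk parameters must yield a positive chunk size")
    for start in range(0, len(pcm_bytes), chunk_size):
        # The final chunk may be shorter than chunk_size.
        yield pcm_bytes[start:start + chunk_size]


if __name__ == "__main__":
    one_second_of_silence = bytes(16000 * 2)
    sizes = [len(chunk) for chunk in audio_chunks(one_second_of_silence)]
    print(len(sizes), sizes[0])
```

In a real application each chunk would be wrapped in a streaming request and sent as it is captured, with interim results read back concurrently.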
Google Cloud Speech-to-Text Features
- Supported: Global vocabulary
- Supported: Streaming speech recognition
- Supported: Speech adaptation
- Supported: Speech-to-Text On-Prem
- Supported: Multichannel recognition
- Supported: Noise robustness
- Supported: Domain-specific models
- Supported: Content filtering
- Supported: Transcription evaluation
Google Cloud Speech-to-Text Technical Details
Detail | Value |
---|---|
Deployment Types | On-premise, Software as a Service (SaaS), Cloud, or Web-Based |
Operating Systems | Windows, Mac |
Mobile Application | No |