AssemblyAI
Open siteMultilingual Speech-to-Text API with near-human accuracy
AssemblyAI Information
What is AssemblyAI?
AssemblyAI is a Speech AI platform that offers a suite of models for speech-to-text, streaming speech-to-text, and speech understanding. It enables users to build powerful products by leveraging voice data. AssemblyAI provides accurate and reliable speech recognition and audio intelligence, allowing users to extract insights, generate summaries, and understand spoken content, offering features like summarization, sentiment analysis, and topic detection. The platform is designed for various use cases including conversational intelligence, voice agents, contact centers, medical applications, and captioning.
AssemblyAI Core Features
- Speech-to-Text
- Streaming Speech-to-Text
- Audio Intelligence
- Speaker Diarization
- Sentiment Analysis
- Topic Detection
- PII Redaction
AssemblyAI Pricing
free
$0/month
- Access to industry-leading Speech-to-Text and Audio Intelligence models
- Speech recognition
- Speaker diarization
- Custom spelling and vocabulary
- Profanity filtering, auto punctuation and casing
- Transcribe up to 416 hours of prerecorded audio for free
- Get tips and support as you build from developer docs and community resources
pay as you go
Start as low as $0.12/hr for Speech-to-Text/month
- Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
- Streaming Speech-to-Text
- Concurrency starting at 200 files and 100 streams
- Technical support via live chat and email
custom
Custom/month
- Flexible, zero-obligation pricing that scales to millions of hours
- Dedicated technical support with response time under one hour
- Customize rate limits - scale to any workload
- Customized SLAs and SLOs
- BAA for HIPAA Compliance
- Compliance with EU Data Residency standards
- Self-hosted deployments (On-prem, VPC)
- Early access to new models and model improvements
- Available through AWS Marketplace