Submit an audio file for transcription by providing an HTTPS URL. The API pre-charges synapses based on audio duration, submits the job for processing, and returns an initial response with a transcription ID for polling.
Creates a new transcription job. The audio file must be accessible via a public HTTPS URL. The service validates the URL, calculates the synapse cost based on audio duration and any applicable multipliers, pre-charges the organization's balance, and submits the job for processing. The transcription starts in queued status and transitions through processing to completed or failed.
Billing uses consumption-based pricing with the FVT-1 model. Priority levels (normal, high, urgent) apply multipliers to the base cost.
organization_id string required
The organization ID (org_ prefixed string) to bill for this transcription.
transcription_options object required
Core transcription configuration.
Show properties
audio_urlstring (required) -- HTTPS URL of the audio file to transcribe.languagestring (required) -- Language code (e.g.,en,es,fr,de). If the language is unknown, useenand auto-detection will be applied.
speaker_analysis object optional
Speaker diarization configuration.
Show properties
diarization_enabledboolean -- Enable speaker identification. Defaults tofalse.speakers_expected_countinteger -- Expected number of speakers (1-10).
intelligence_features object optional
Advanced AI analysis features.
Show properties
sentiment_analysis_enabledboolean -- Enable sentiment analysis per sentence. Defaults tofalse.entity_detection_enabledboolean -- Enable named entity detection. Defaults tofalse.auto_highlights_enabledboolean -- Enable automatic key phrase highlights. Defaults tofalse.summarization_enabledboolean -- Enable automatic summarization. Defaults tofalse.
privacy_settings object optional
Privacy and PII redaction settings.
Show properties
pii_redaction_enabledboolean -- Enable PII redaction in transcripts. Defaults tofalse.pii_policiesstring[] -- List of PII policies to apply (e.g.,["email", "phone_number", "ssn"]).
webhook object optional
Webhook notification configuration.
Show properties
urlstring -- HTTPS URL to receive webhook notifications.eventsstring[] -- List of events to trigger the webhook (e.g.,["transcription.completed", "transcription.failed"]).
Returns a success envelope with the following top-level fields:
success--trueon successful creation.data-- Transcription data includingtranscription_id(withtrans_prefix), initialstatusofqueued,model_key,audio_metadata, andtimestamps.billing-- Billing details includingsynapses_consumed,synapses_refunded,transaction_id,currency, and appliedmultipliers(base, organization, priority).metadata-- Request metadata includingrequest_id,timestamp, andprocessing_time_ms.actions-- HATEOAS-style links to related endpoints such ascheck_status.
- Bash
- Python
- Python
- JavaScript
curl -X POST https://api.aitronos.com/v1/audio/transcribe \
-H "Authorization: Bearer $ACCESS_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"organization_id": "org_abc123def456",
"transcription_options": {
"language": "en",
"audio_url": "https://storage.example.com/meeting-recording.mp3"
},
"speaker_analysis": {
"diarization_enabled": true,
"speakers_expected_count": 2
},
"intelligence_features": {
"sentiment_analysis_enabled": true,
"entity_detection_enabled": true,
"summarization_enabled": true
},
"privacy_settings": {
"pii_redaction_enabled": true,
"pii_policies": ["email", "phone_number", "ssn"]
}
}'