Skill Detail

AssemblyAI Audio Intelligence Agent

Extract structured intelligence from audio using the AssemblyAI API with sentiment analysis, entity detection, topic modeling, and auto-chapter generation. Uses the assemblyai Python SDK for transcript processing pipelines.

Media & TranscriptionMCP

Media & Transcription MCP Security Reviewed

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill assemblyai-audio-intelligence-agent Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source

At a glance

Author

AssemblyAI

Last updated

Mar 24, 2026

Quick brief

Analyze audio content using AssemblyAI’s Audio Intelligence models to extract structured data beyond basic transcription. This skill leverages the AssemblyAI API v2 for advanced audio analysis workflows.

How it works

What this skill actually does

The transcription pipeline uploads audio via POST /v2/upload, creates transcript jobs with audio_intelligence features enabled, and polls for completion. Supported intelligence models include sentiment analysis per utterance, entity detection (PII, names, locations, monetary amounts), topic detection using the IAB taxonomy, and auto-chapter generation with summary headlines.

Speaker diarization identifies individual speakers and labels utterances for meeting transcript formatting. Content safety detection flags sensitive topics across configurable severity thresholds. PII redaction can be applied to both text output and audio files with configurable entity types.

The assemblyai Python SDK (v0.20+) provides high-level wrappers for the REST API with automatic polling and webhook support. Real-time transcription uses WebSocket streaming for live audio feeds with interim result callbacks.

Batch processing handles multiple files with parallel job submission and result aggregation into structured JSON or SRT output formats. Rate limit management includes automatic throttling and retry logic for the 200 concurrent transcript limit.