ElevenLabs Voice Cloning Agent

Clone and synthesize custom voices with the ElevenLabs API using instant voice cloning. The skill manages voice library operations, text-to-speech generation with SSML markup, and streamed audio output via the elevenlabs-python SDK.

Media & Transcription OpenClaw Security Reviewed

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill elevenlabs-voice-cloning-agent

Works best when you want a reusable capability, not another fragile one-off prompt.

At a glance

Last updated Mar 24, 2026

Quick brief

Build custom voice profiles and generate natural speech using the ElevenLabs API (v1 endpoints). This skill handles the full voice cloning workflow, from sample upload through synthesis output.

How it works

What this skill actually does

The agent manages the voice library: it creates instant voice clones from audio samples via POST /v1/voices/add, lists available voices, and edits voice settings (stability, similarity_boost, style, use_speaker_boost). Text-to-speech requests go to POST /v1/text-to-speech/{voice_id}, with the model selectable among eleven_monolingual_v1, eleven_multilingual_v2, and eleven_turbo_v2.
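A minimal sketch of how a text-to-speech request to the endpoint above might be assembled, using only the Python standard library. The base URL `https://api.elevenlabs.io`, the `xi-api-key` header, and the JSON field names `text`, `model_id`, and `voice_settings` are assumptions based on the public ElevenLabs API and are not confirmed by this listing; `build_tts_request` and its default values are hypothetical. The request is built but never sent.

```python
import json
import urllib.request

# Assumed base URL; the listing only gives the endpoint paths.
BASE_URL = "https://api.elevenlabs.io"

# Model IDs named in the listing.
MODELS = {"eleven_monolingual_v1", "eleven_multilingual_v2", "eleven_turbo_v2"}


def build_tts_request(api_key, voice_id, text,
                      model_id="eleven_multilingual_v2",
                      stability=0.5, similarity_boost=0.75,
                      style=0.0, use_speaker_boost=True):
    """Build (but do not send) a POST request for /v1/text-to-speech/{voice_id}."""
    if model_id not in MODELS:
        raise ValueError(f"unknown model_id: {model_id}")
    payload = {
        "text": text,
        "model_id": model_id,
        # The four voice settings listed in the description above.
        "voice_settings": {
            "stability": stability,
            "similarity_boost": similarity_boost,
            "style": style,
            "use_speaker_boost": use_speaker_boost,
        },
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/text-to-speech/{voice_id}",
        data=json.dumps(payload).encode("utf-8"),
        # Header name is an assumption, not taken from the listing.
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_tts_request("YOUR_API_KEY", "VOICE_ID", "Hello from a cloned voice.")
    print(req.get_method(), req.full_url)
    # Sending is left to the caller, e.g. urllib.request.urlopen(req).read()
```

Keeping request construction separate from sending makes the payload easy to inspect or log before any quota is spent.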

Advanced features include SSML markup for controlling prosody, emphasis, and breaks in generated speech; which tags take effect may depend on the selected model. The skill streams audio output in real time using chunked transfer encoding for low-latency playback.
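The two ideas above can be sketched in a few lines: adding a break tag to the input text, and writing streamed audio chunks as they arrive. The `<break time="..." />` syntax is standard SSML; whether a given ElevenLabs model honors it is an assumption here. `with_break` and `write_stream` are hypothetical helpers, and the chunk iterable stands in for whatever an HTTP client or the SDK yields for a chunked response.

```python
import io
from typing import BinaryIO, Iterable


def with_break(before: str, after: str, seconds: float = 0.5) -> str:
    """Join two text fragments with an SSML break tag of the given length."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return f'{before} <break time="{seconds:.1f}s" /> {after}'


def write_stream(chunks: Iterable[bytes], sink: BinaryIO) -> int:
    """Write audio chunks to sink as they arrive; return total bytes written."""
    total = 0
    for chunk in chunks:
        if not chunk:  # skip empty chunks
            continue
        sink.write(chunk)
        total += len(chunk)
    return total


if __name__ == "__main__":
    text = with_break("Welcome back.", "Here is today's summary.", seconds=1.0)
    print(text)
    # Fake chunks stand in for a chunked HTTP response body.
    buffer = io.BytesIO()
    print(write_stream([b"ID3", b"", b"\x00\x01"], buffer))
```

Writing each chunk as soon as it arrives, instead of collecting the full response first, is what allows playback to start before synthesis has finished.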

The skill integrates with the elevenlabs-python SDK for programmatic control and supports output in mp3, pcm, ulaw, and opus formats. Rate limiting and quota tracking are built in to manage API usage across generation requests.
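Quota tracking of the kind described above could look like the following sketch. The listing does not say how the skill implements it; `QuotaTracker`, its character-based accounting, and the limit value are assumptions for illustration only.

```python
from dataclasses import dataclass


@dataclass
class QuotaTracker:
    """Track characters sent for synthesis against a fixed budget (hypothetical)."""
    character_limit: int
    characters_used: int = 0

    @property
    def remaining(self) -> int:
        """Characters still available, never negative."""
        return max(self.character_limit - self.characters_used, 0)

    def can_send(self, text: str) -> bool:
        """Return True if the text fits within the remaining budget."""
        return len(text) <= self.remaining

    def record(self, text: str) -> None:
        """Count the text against the budget, refusing requests that exceed it."""
        if not self.can_send(text):
            raise RuntimeError(
                f"quota exceeded: need {len(text)}, have {self.remaining}"
            )
        self.characters_used += len(text)


if __name__ == "__main__":
    quota = QuotaTracker(character_limit=20)
    quota.record("Hello there.")  # 12 characters
    print(quota.remaining)  # 8
    print(quota.can_send("Too long for the rest."))  # False
```

Checking the budget before each generation request lets the caller fail early instead of discovering an exhausted quota from an API error.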

Best fit

When to reach for it

Best when the job is a Media & Transcription task, such as cloning a voice from audio samples or generating speech from text.
Works naturally with OpenClaw setups.

Trust & provenance

Why this listing is credible

Trust status: Security Reviewed.
Last updated Mar 24, 2026.
