Skill Detail

Demucs Music Source Separation for Vocal and Stem Extraction

Demucs is Meta's open-source music source separation project for splitting songs into stems such as vocals, drums, bass, and accompaniment. It fits agent workflows that need repeatable audio preprocessing before transcription, remixing, analysis, captioning, or archive preparation.

Media & TranscriptionMulti-Framework
Media & Transcription Multi-Framework Security Reviewed
โญ 9.9k GitHub stars MIT license โš  Repository looks unmaintained
INSTALL WITH ANY AGENT
npx skills add agentskillexchange/skills --skill demucs-music-source-separation-for-vocal-and-stem-extraction Copy
Works best when you want a reusable capability, not another fragile one-off prompt.
At a glance
Last updated
Apr 6, 2026
Quick brief

Demucs is a widely used open-source music source separation project from Meta Research. It provides pretrained models and a Python package that can split a full song into separate audio stems such as vocals, drums, bass, and other accompaniment. The repository documents both command-line usage and research context, and it links to sample outputs and papers describing the Hybrid Transformer Demucs architecture.

How it works

What this skill actually does

For agentic workflows, Demucs is valuable as an audio preparation step. An assistant can isolate vocals before speech-to-text, remove backing tracks for lyric review, create karaoke-style instrumental versions, or prepare cleaner source material for archival and editing pipelines. Because it is tool-driven rather than SaaS-only, it can be integrated into local media workflows, batch processing systems, or custom Python automations that already use PyTorch and torchaudio.

The upstream README documents installation with python3 -m pip install -U demucs and explains practical execution details, including audio format support through torchaudio and FFmpeg fallback notes on Windows. Even though the repository explicitly notes limited maintenance, it remains a real and heavily adopted upstream project with substantial community usage and research relevance. That makes it a strong fit for media-processing skill discovery where the job to be done is clear: split mixed music into usable stems for downstream automation.