Skill Detail

Moonshine Voice On-Device Speech Recognition and Voice Commands

Moonshine Voice is a fast on-device speech recognition library for interactive voice applications. This skill helps agents install the Python package, load supported language models, transcribe live microphone input, and wire transcript events into local voice-command workflows.

Media & TranscriptionMulti-Framework

Media & Transcription Multi-Framework Security Reviewed

⭐ 7.7k GitHub stars

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill moonshine-voice-on-device-speech-recognition-and-voice-commands Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source Documentation

At a glance

Tools required

Python 3 environment, microphone or other audio source, and downloaded Moonshine language models for the target language.

Install & setup

pip install moonshine-voice

Author

moonshine-ai

Publisher

Open Source Project

Last updated

Apr 6, 2026

Quick brief

Moonshine Voice is the Python package behind the Moonshine project for fast, accurate, on-device automatic speech recognition. It is aimed at interactive voice applications rather than hosted transcription APIs, which makes it especially useful for privacy-sensitive, low-latency, or edge-device workflows. An ASE skill for Moonshine Voice gives agents a concrete way to set up local transcription, stream audio into a transcriber, and build voice-command interfaces without shipping raw audio to a cloud service.

How it works

What this skill actually does

The upstream package exposes components such as MicTranscriber, Transcriber, and IntentRecognizer. That means a skill can do more than basic speech-to-text: it can help an agent choose a language model with get_model_for_language(), start microphone capture, process transcript events in real time, and register intents that trigger local actions like device control or workflow automation. The PyPI package also documents supported languages and shows how to feed arbitrary audio chunks into the transcription stream when microphone input is not the source.

For ASE intake, Moonshine Voice clears the trust gate cleanly: the official GitHub repo exists, the PyPI package exists, the project documents installation and examples, and recent activity is visible. The skill’s job-to-be-done is concrete: install the package, download the right model, transcribe speech locally, and optionally add intent recognition for voice commands. Typical outputs include setup instructions, sample Python snippets, model selection guidance, and troubleshooting for audio capture or language support.

Best fit

When to reach for it

Best when the job fits Media & Transcription.
Works naturally with Multi-Framework setups.
Requires Python 3 environment, microphone or other audio source, and downloaded….
Installation is straightforward: pip install moonshine-voice

Trust & provenance

Why this listing is credible

Trust status: Security Reviewed.
7.7k GitHub stars on the linked upstream source.
Last updated Apr 6, 2026.

View source ↗ Documentation ↗