
audio-extract
Extracts the audio track from a video file. Use it when you need to get audio out of a video, prepare audio for transcription, or separate audio from video content. Runs locally with no API key required.
Audio Extract
Extracts the audio track from a video file. This is a local operation that uses the bundled ffmpeg binary, so no API keys or external services are required.
Command
agent-media audio extract --in <path> [options]
Inputs
| Option | Required | Description |
|---|---|---|
| --in | Yes | Input video file path or URL (supports mp4, webm, mkv, avi, mov) |
| --format | No | Output audio format: mp3 (default) or wav |
| --out | No | Output path, filename or directory (default: ./) |
Output
Returns a JSON object with the extracted audio file:
{
  "ok": true,
  "media_type": "audio",
  "action": "extract",
  "provider": "local",
  "output_path": "extracted_123_abc.mp3",
  "mime": "audio/mpeg",
  "bytes": 24779
}
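When the command is called from a script, the result can be parsed to pick up the output path. The following is a minimal Python sketch, assuming the JSON arrives as text in the shape shown above. `parse_extract_result` is a hypothetical helper, not part of agent-media, and the failure shape (`"ok": false`) is an assumption rather than documented behavior.

```python
import json


def parse_extract_result(raw: str) -> str:
    """Return output_path from the JSON result, or raise if the run did not succeed."""
    result = json.loads(raw)
    # Assumption: a failed run reports "ok": false; its error fields are not documented here.
    if not result.get("ok"):
        raise RuntimeError(f"audio extract failed: {result}")
    return result["output_path"]


# Sample payload copied from the Output section.
sample = """{
  "ok": true,
  "media_type": "audio",
  "action": "extract",
  "provider": "local",
  "output_path": "extracted_123_abc.mp3",
  "mime": "audio/mpeg",
  "bytes": 24779
}"""

print(parse_extract_result(sample))
```

Checking `ok` before reading `output_path` keeps a failed run from surfacing later as a confusing missing-file error.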
Examples
Extract audio as MP3 (default):
agent-media audio extract --in video.mp4
Extract audio as WAV:
agent-media audio extract --in video.mp4 --format wav
Custom output directory:
agent-media audio extract --in video.mp4 --out ./audio-files
Use Case: Video Transcription Workflow
Since transcription services work best with audio files (smaller uploads, faster processing), use this workflow:
# Step 1: Extract audio from video (runs locally)
agent-media audio extract --in interview.mp4 --format mp3
# Output: extracted_xxx.mp3
# Step 2: Transcribe the audio (cloud API)
agent-media audio transcribe --in extracted_xxx.mp3 --provider fal
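The two steps can also be chained so that the extracted file name does not have to be copied by hand. This is a sketch in Python, assuming each command prints its JSON result to standard output (the source does not say where the JSON is written). `extract_then_transcribe` and its injectable `run` parameter are hypothetical names, and only flags that appear in this document are used.

```python
import json
import subprocess


def extract_then_transcribe(video: str, run=subprocess.run) -> str:
    """Extract audio from `video`, then transcribe it; returns the raw transcribe output."""
    extract = run(
        ["agent-media", "audio", "extract", "--in", video, "--format", "mp3"],
        capture_output=True, text=True, check=True,
    )
    # Assumption: stdout carries the JSON object described under Output.
    audio_path = json.loads(extract.stdout)["output_path"]

    transcribe = run(
        ["agent-media", "audio", "transcribe", "--in", audio_path, "--provider", "fal"],
        capture_output=True, text=True, check=True,
    )
    return transcribe.stdout
```

Calling `extract_then_transcribe("interview.mp4")` requires `agent-media` on the PATH plus whatever credentials the cloud provider expects. The `run` parameter exists only so the function can be exercised with a stand-in instead of the real CLI.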
Provider
This action uses the local provider with bundled ffmpeg (via ffmpeg-static). No API keys required.