Skip to main content

Get Started in 5 Minutes

Quick Start →

Create account, get API key, make your first call.

Install SDK

pip install audiopod

Hello World

from audiopod import AudioPod

client = AudioPod()  # Uses AUDIOPOD_API_KEY env var

# Separate a song into vocals, drums, bass, guitar, piano, other
result = client.stems.separate(
    url="https://youtube.com/watch?v=VIDEO_ID",
    mode="six"
)

print(result["download_urls"])

What Can You Build?

🎵 Stem Separation

Isolate vocals, drums, bass, guitar from any song. 2-16 stem modes.

🎤 Voice Cloning

Clone any voice from 10 seconds of audio. Generate speech in that voice.

📝 Transcription

Convert audio/video to text. Speaker diarization, 50+ languages.

🎹 Music Generation

Generate music from text prompts. Any genre, any duration.

🔇 Noise Reduction

Remove background noise from audio. Multiple quality modes.

👥 Speaker Separation

Identify and separate speakers from recordings.

🗣️ Text-to-Speech

150+ voices, 64+ languages. SSML support.

🎙️ Voice Changer

Transform voices with AI-powered style transfer.

Pricing

Pay-as-you-go. No subscriptions required. Add funds to your wallet and use.
ServiceRate
Stem Separation$0.10/min
Transcription$0.01/min
Voice Cloning/TTS$0.04/min
Voice Conversion$0.13/min
Speech Translation$0.40/min
Music Generation$0.02/min
Noise Reduction0.020.02-0.08/min
Speaker Separation$0.20/min
Karaoke Generation$0.25/min

View Full Pricing →

Complete pricing for all 14+ services

Resources

Quick Start

Zero to API call in 5 minutes

Python SDK

Full Python SDK reference

Node.js SDK

Full Node.js SDK reference

API Reference

Complete endpoint documentation

API Status

Check service health

Support

Get help from our team