Overview
AudioPod AI’s Voice Changer API uses OpenVoice v2 technology to convert source audio to match target voice characteristics. Transform any speech recording to sound like a different voice while preserving the original speech content, timing, and emotional expression.
Key Features
- Voice Conversion: Transform source audio to match target voice characteristics
- Content Preservation: Maintains original speech content, timing, and emotional expression
- Multiple Voice Sources: Use any completed voice profile as a target voice
- High Quality Processing: Advanced OpenVoice v2 technology for natural-sounding results
- Public Voice Support: Access to both user-owned and public voice profiles
- Flexible Input: Support for various audio formats (WAV, MP3, M4A, etc.)
- Real-time Processing: Fast conversion for production workflows
Authentication
All endpoints require authentication, using either of the following headers:
- API Key:
  Authorization: Bearer your_api_key
- JWT Token:
  Authorization: Bearer your_jwt_token
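Both credential types use the same header shape, so a single helper can cover them. The following is a minimal sketch; the function name `build_auth_headers` is illustrative and not part of the API.

```python
def build_auth_headers(token: str) -> dict[str, str]:
    """Return the Authorization header for an API key or a JWT token."""
    token = token.strip()
    if not token:
        raise ValueError("token must be a non-empty string")
    # API keys and JWT tokens are both sent as Bearer credentials.
    return {"Authorization": f"Bearer {token}"}


headers = build_auth_headers("your_api_key")
```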
Voice Conversion
Convert Audio to Target Voice
Transform the voice characteristics in a source audio file to match a target voice profile. Send a POST request with the following parameters:
- file (required): Source audio file containing speech to convert
- voice_uuid (required): UUID of the target voice profile to match

Supported target voices:
- User’s own custom voice profiles (from voice cloning)
- Public voice profiles available in the voice library
- The voice must have status “COMPLETED” and an available audio file
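A conversion request might be assembled as in the sketch below, using only the Python standard library. The endpoint URL is a placeholder, and sending the two parameters as `multipart/form-data` is an assumption based on the file upload; neither is confirmed by this section. Only the `file` and `voice_uuid` field names and the POST method come from the text above.

```python
import urllib.request
import uuid

# Hypothetical endpoint: the real URL is not given in this section.
CONVERT_URL = "https://api.example.com/voice-convert"


def build_convert_request(
    audio_bytes: bytes, filename: str, voice_uuid: str, token: str
) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying file and voice_uuid."""
    boundary = uuid.uuid4().hex
    # Text field holding the target voice UUID.
    voice_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="voice_uuid"\r\n\r\n'
        f"{voice_uuid}\r\n"
    ).encode()
    # Header of the file field; the raw audio bytes follow it.
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    closing = f"\r\n--{boundary}--\r\n".encode()
    body = voice_part + file_header + audio_bytes + closing
    return urllib.request.Request(
        CONVERT_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


req = build_convert_request(b"RIFF...", "speech.wav", "target-voice-uuid", "your_api_key")
# To send it: urllib.request.urlopen(req)
```

Building the request separately from sending it makes the payload easy to inspect before any credits are spent.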
Job Management
Get Conversion Status
Monitor the progress of voice conversion jobs with a GET request.
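Monitoring a job usually means polling its status until it reaches a final state. The sketch below keeps the HTTP call out of the loop by taking a caller-supplied `fetch_status` function, since the status endpoint and response schema are not shown in this section. The `status` field and the `FAILED` value are assumptions; only “COMPLETED” appears in this document, and there it describes voice profiles.

```python
import time
from typing import Callable

# Assumed terminal status names; adjust to the values the API actually returns.
TERMINAL_STATUSES = {"COMPLETED", "FAILED"}


def wait_for_job(
    fetch_status: Callable[[], dict],
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll fetch_status() until the job reaches a terminal status."""
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_status()
        if str(job.get("status", "")).upper() in TERMINAL_STATUSES:
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError("voice conversion job did not finish in time")
        # Wait between polls to avoid hammering the API.
        sleep(interval_s)
```

Here `fetch_status` would wrap the GET request for a single job and return its parsed JSON body.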
List Conversion Jobs
Get all voice conversion jobs for the authenticated user with a GET request.
Download Converted Audio
Voice Profile Management
Find Available Voices
Before converting, you can browse available voice profiles to use as targets.
Use Cases & Examples
Podcast Voice Consistency
Content Localization
Voice Anonymization
Error Handling
404 Not Found - Voice Not Available
Causes:
- Target voice UUID doesn’t exist
- Voice is not accessible to the user
- Voice is not completed or ready
Solutions:
- Verify the voice UUID is correct
- Ensure the voice is completed and has an audio file available
- Use public voices or your own custom voices
400 Bad Request - Invalid Audio
Causes:
- Source audio file has no speech content
- Unsupported audio format
- Audio file is corrupted
Solutions:
- Ensure the audio contains clear speech
- Use supported formats (WAV, MP3, M4A)
- Verify file integrity
422 Processing Error - Conversion Failed
Causes:
- Source audio quality too poor
- Incompatible voice characteristics
- Processing timeout
Solutions:
- Improve source audio quality
- Try a different target voice
- Use shorter audio segments
402 Payment Required - Insufficient Credits
Causes:
- Not enough credits for the conversion duration
Solutions:
- Purchase additional credits
- Check credit requirements for the audio duration
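The status codes above can be mapped to actionable hints in client code. This is a minimal sketch: the codes and suggested fixes come from this section, while the `ERROR_HINTS` table and `explain_error` function are illustrative names, and the shape of the API's error response body is not assumed.

```python
# Status codes and remedies taken from the error descriptions above.
ERROR_HINTS = {
    400: "Invalid audio: use clear speech in a supported format (WAV, MP3, M4A).",
    402: "Insufficient credits: purchase credits or shorten the audio.",
    404: "Voice not available: check the voice UUID and that the voice is completed.",
    422: "Conversion failed: improve audio quality, try another voice, or split the audio.",
}


def explain_error(status_code: int) -> str:
    """Return a short remediation hint for a failed conversion request."""
    return ERROR_HINTS.get(status_code, f"Unexpected HTTP status {status_code}")
```

With `urllib`, for example, the code would come from the `code` attribute of a caught `urllib.error.HTTPError`.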
Best Practices
Audio Quality Guidelines
For optimal voice conversion results:
Target Voice Selection
Pricing
Voice conversion pricing is based on audio duration:

| Service | Cost | Description |
|---|---|---|
| Voice Conversion | 990 credits/minute | Transform voice characteristics using OpenVoice v2 |
Cost Examples
| Duration | Credits | USD Cost |
|---|---|---|
| 30 seconds | 495 | $0.0659 |
| 2 minutes | 1980 | $0.2634 |
| 5 minutes | 4950 | $0.6584 |
| 10 minutes | 9900 | $1.3167 |
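
The credit figures in the table follow directly from the 990 credits/minute rate, pro-rated by duration. The sketch below reproduces them; the USD rate of roughly $0.000133 per credit is inferred from the table rows rather than stated anywhere in this section, so treat the dollar estimate as approximate.

```python
CREDITS_PER_MINUTE = 990
# Inferred from the cost table (e.g. 9900 credits ~ $1.3167); not an official rate.
APPROX_USD_PER_CREDIT = 0.000133


def conversion_credits(duration_seconds: float) -> float:
    """Credits needed to convert audio of the given duration."""
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return duration_seconds * CREDITS_PER_MINUTE / 60


def approx_usd_cost(duration_seconds: float) -> float:
    """Approximate USD cost, using the inferred per-credit rate."""
    return conversion_credits(duration_seconds) * APPROX_USD_PER_CREDIT


print(conversion_credits(30))   # 495.0
print(conversion_credits(120))  # 1980.0
```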
Cost Optimization Tips
- Pre-process audio to remove silence and optimize duration
- Batch similar conversions using the same target voice
- Test with shorter clips before converting long content
- Use high-quality source audio to avoid re-processing
