Process an audio file to extract specified stems (vocals, instrumental, or other components) with optional preprocessing. Supports file upload or URL with webhook callback.
Parameter | Type | Required | Description |
---|---|---|---|
audio_url | string | Optional | URL of the audio file to extract from. Either audio_url or audio_file must be provided. |
audio_file | UploadFile | Optional | Audio file to upload and process. Either audio_url or audio_file must be provided. |
stems | string | Optional | JSON string list of required stems. e.g., ["vocals", "drums"] . See all options below. |
preprocessing_options | string | Optional | JSON string list of preprocessing options. e.g., ["Denoise"] . See all options below. |
webhook_url | string | Optional | Callback URL to receive async processing results. |
vocals
— All vocal content (combined)male_vocal
— Male vocals onlyfemale_vocal
— Female vocals onlylead_vocal
— Lead singer’s voiceback_vocal
— Backing vocalsguitar
— All guitar soundsacoustic_guitar
— Acoustic guitarelectric_guitar
— Electric guitarrhythm_guitar
— Rhythm guitar partssolo_guitar
— Solo guitar partsdrums
— All drum elementskick_drum
— Kick or bass drumsnare_drum
— Snare drumtoms
— Tom drumshi_hat
— Hi-hat cymbalsride
— Ride cymbalcrash
— Crash cymbalpiano
— Piano onlykeys
— Keyboard instruments (includes synths, organs, etc.)strings
— String instruments (e.g., violin, cello)winds
— Wind instruments (e.g., saxophone, flute)bass
— Bassline instrumentsinstrumental
— All non-vocal content (used to get instrumental version)drums
, bass
, piano
, guitar
— Common full-band instrumentationNote: If thestems
parameter is omitted or an empty list is supplied, this full band composite (["drums", "bass", "piano", "guitar"]
) is used as the default.
Denoise
— Reduces ambient or background noiseDeecho
— Removes echo and early reflectionsDereverb
— Reduces late reverberation from recordings🔐 Replacepath_to_audio.mp3
,api_key
, andwebhook_url
with actual values.
audio_url
nor audio_file
is provided.Successfully initiated audio extraction
The response is of type object
.