Generate a song from an image by analyzing it and creating music based on visual content. The process can optionally include custom lyrics, voice conversion, and various musical parameters.
Image file to upload and analyze. Supported formats: JPEG, PNG, GIF, BMP, WEBP.
URL of the image to analyze. Either this or image_file must be provided.
"https://mybucket.s3.amazonaws.com/image.png"
Additional prompt to guide the song generation from the image.
300"Generate a relaxing acoustic track inspired by this scene."
Custom lyrics to include in the generated audio.
3000"Let the colors of the sunset fill your heart."
Tags or themes to avoid in the song.
"no heavy metal, avoid loud drums"
Generate instrumental output only. Lyrics will be ignored.
Generate vocal-only output.
Musical key for the song.
"C major"
Beats per minute for the song tempo. Defaults to 0 (auto-selected).
Optional callback URL for async processing results.
"https://example.com/webhook"
Voice ID for converting the generated audio. Cannot be used with vocal_only mode.