Skip to main content
Loading models…

Model Categories

Song Generation: Create full songs with optional lyrics and vocal support
  • ACE-Step 1.5, ElevenLabs Music, MiniMax Music 2.0
Music & Sound Effects: Generate instrumental music or sound effects from text prompts
  • Stable Audio 2.5
Sound Effects: Synthesize audio effects and ambient sounds from text prompts
  • ElevenLabs Sound Effects, MMAudio V2
ElevenLabs Music is the only model that supports force_instrumental to generate music without vocals.
Audio generation uses an async queue system. See the Audio Queue API to start generation and Audio Retrieve API to fetch results.

Pricing

Pricing varies by model:
  • Per-generation: Fixed price per audio clip (MiniMax Music 2.0, Stable Audio 2.5)
  • Duration-tiered: Price scales with duration tier (ElevenLabs Music, ACE-Step 1.5)
  • Per-second: Price based on output duration (ElevenLabs Sound Effects, MMAudio V2)
For exact quotes before generation, use the Audio Quote API.

Duration-Tiered Pricing

Models with duration-tiered pricing accept any duration_seconds within the model’s min_durationmax_duration range. The price is determined by which tier the requested duration falls into. Tier ranges are returned in the /models response under pricing.durations, with min_seconds and max_seconds for each tier. For example, ElevenLabs Music accepts 3–600 seconds (up to 10 minutes) at $0.75 per minute, rounded up to the nearest minute:
Duration RangeTier KeyBase Price
3–60s60$0.75
61–120s120$1.50
121–180s180$2.25
181–240s240$3.00
241–300s300$3.75
301–360s360$4.50
361–420s420$5.25
421–480s480$6.00
481–540s540$6.75
541–600s600$7.50
These are base prices before markup. Use the Audio Quote API to get the exact price you will be charged.

Key Parameters

ParameterDescription
promptText description of the audio to generate
lyrics_promptSong lyrics for vocal models (required when model has lyrics_required=true)
duration_secondsOutput length in seconds
force_instrumentalGenerate without vocals (where supported)