Image to Video
Motion Control

Kling 2.6 Motion Control

Kuaishou's motion control model that transfers motion from reference videos to character images while maintaining identity and adapting environments.

Text to Video
Image to Video
Video to Video

Wan 2.6

Alibaba's Wan 2.6 video generation family for text-to-video, image-to-video, and video-to-video with multi-shot 1080p output.

Text to Video
Image to Video

Hailuo 02

MiniMax's video model, ranked #2 globally, with NCR architecture, ultra-realistic physics, and 1080p cinematic output.

Text to Image
Image to Image

Seedream 4.5

ByteDance's unified 4K image generation and editing model with professional-grade text rendering and commercial photography quality.

Text to Image
Image to Image

Nano Banana Pro

Google's advanced Gemini 3 Pro image model featuring 4K resolution, enhanced reasoning, real-time data integration, and multi-image composition capabilities.

Text to Video
Image to Video

Sora 2

OpenAI's advanced video model with realistic physics simulation, synchronized audio generation, and innovative Cameo feature for personalized content.

Text to Video
Image to Video

Sora 2 Pro

Premium Sora 2 variant delivering professional-grade 1024p video with enhanced fidelity, extended duration, and sophisticated audio-visual coherence.

Music

Suno Music

AI music generator with customizable styles, vocals, and full creative control over musical characteristics and quality.

All Models

Kling 2.6 Motion Control

Per Request: $0.04
Save 30%

Kuaishou's motion control model that transfers motion from reference videos to character images while maintaining identity and adapting environments.

Seedance 1.5 Pro

Per Request: $0.04
Save 24%

ByteDance's latest video model with synchronized audio generation, flexible aspect ratios, and enhanced motion control.

Wan 2.6

Per Request: $0.40
Save 20%

Alibaba's Wan 2.6 video generation family for text-to-video, image-to-video, and video-to-video with multi-shot 1080p output.

GPT Image 1.5

Per Request: $0.01
Save 23%

OpenAI's latest image model with up to 4x faster generation, precision editing, and superior text rendering.

Grok Imagine

Per Request: $0.03
Save 57%

xAI's Aurora-powered visual AI for image generation and video creation with Fun, Normal, and Spicy creative modes.

Hailuo 02

Per Request: $0.04
Save 22%

MiniMax's video model, ranked #2 globally, with NCR architecture, ultra-realistic physics, and 1080p cinematic output.

Seedance 1.0 Pro

Per Request: $0.10
Save 83%

ByteDance's #1 ranked video model with multi-shot storytelling, cinema-grade motion, and bilingual text-to-video generation.

Z-Image

Per Request: $0.01

Alibaba's efficient 6B-parameter image model with sub-second generation and exceptional Chinese-English bilingual text rendering capabilities.

Kling 2.6

Per Request: $0.33

Kuaishou's revolutionary video model that simultaneously generates visuals with synchronized dialogue, sound effects, and ambient audio in one pass.

Seedream 4.5

Per Request: $0.03
Save 37.5%

ByteDance's unified 4K image generation and editing model with professional-grade text rendering and commercial photography quality.

FLUX.2

Per Request: $0.03

Black Forest Labs' production-grade model combining 4MP image generation and editing with multi-reference support, precise typography, and hex color control.

Wan Animate

Per Request: $0.04

Alibaba's 14B-parameter character animation model that transfers motion from reference videos to static characters with exceptional identity preservation.

Nano Banana Pro

Per Request: $0.03
Save 90%

Google's advanced Gemini 3 Pro image model featuring 4K resolution, enhanced reasoning, real-time data integration, and multi-image composition capabilities.

GPT-4o Image

Per Request: $0.02
Save 80%

OpenAI's native multimodal image generator with exceptional text rendering, precise prompt following, and conversational editing capabilities.

Nano Banana

Per Request: $0.03
Save 36%

Google's leaderboard-topping image model (Gemini 2.5 Flash Image), excelling in natural language editing, character consistency, and multi-image blending.

Sora 2

Per Request: $0.05
Save 95%

OpenAI's advanced video model with realistic physics simulation, synchronized audio generation, and innovative Cameo feature for personalized content.

Sora 2 Pro

Per Request: $0.50
Save 95%

Premium Sora 2 variant delivering professional-grade 1024p video with enhanced fidelity, extended duration, and sophisticated audio-visual coherence.

Veo 3.1

Per Request: $0.25

Google DeepMind's 1080p video model with native audio generation, scene extension to 60+ seconds, and advanced creative controls for cinematic storytelling.

Suno Music

Per Request: $0.10
Save 90%

AI music generator with customizable styles, vocals, and full creative control over musical characteristics and quality.

Extend Music

Per Request: $0.10

Extend or modify existing music tracks by generating continuations from the source audio. Supports a custom mode with full parameter control and a simple mode that inherits the original parameters. Specify the continuation point and keep the style consistent across extensions of up to 8 minutes.

Upload and Cover Audio

Per Request: $0.10

Transform audio tracks into new styles while preserving original melodies. Upload your audio files (up to 2 minutes) and convert them with AI-powered style transfer. Supports custom and simplified modes with vocal/instrumental options and audio weight controls.

Upload and Extend Audio

Per Request: $0.10

Upload audio files and extend them while maintaining the original style and characteristics. AI generates seamless continuations from specified time points. Supports multiple model versions with style weight and creative controls for natural extensions.

Add Instrumental

Per Request: $0.10

Generate musical accompaniment for uploaded audio files containing vocals or melodies. AI creates matching instrumental backing tracks with customizable style tags, genre preferences, and quality controls. Perfect for adding professional-quality backing to vocal recordings.

Add Vocals

Per Request: $0.10

Layer AI-generated vocals onto existing instrumental tracks. Provide lyrics or descriptions and the API generates matching vocal performances with customizable gender, style, and expression. Transform instrumental music into complete songs with professional AI singing.

Get Timestamped Lyrics

Per Request: $0.01

Retrieve lyrics synchronized with precise timestamps from generated music. Returns word-by-word timing data, waveform visualization, and alignment accuracy scores. Essential for karaoke applications, lyric videos, and music synchronization projects.

Boost Music Style

Per Request: $0.01

AI-enhanced music style description generator. Transform simple style inputs like 'pop, mysterious' into detailed, comprehensive musical descriptions. Optimize your prompts for better music generation results with enriched genre, mood, and instrumentation details.

Generate Music Cover

Per Request: $0.01

Generate alternative cover versions of existing music tracks. Create variations with automatic style changes while maintaining the essence of the original composition. Perfect for producing multiple versions or exploring different interpretations of your generated music.

Replace Section

Per Request: $0.05

Replace specific sections of generated music tracks with precision timing control. Modify choruses, verses, or any segment by specifying start and end times. Maintains overall coherence while allowing targeted changes to lyrics, style, or musical elements.

Generate Persona

Per Request: $0.01

Create reusable music personas from existing audio tracks. Save distinctive vocal characteristics, musical styles, and personality traits for consistent use across multiple generations. Build your own AI artist profiles for brand consistency and style continuity.

Generate Lyrics

Per Request: $0.01

AI-powered lyrics generation based on themes, moods, and descriptions. Create original song lyrics from simple prompts of up to 200 characters. Generate creative, coherent lyrics for any genre or emotional tone with professional songwriting quality.

Convert to WAV

Per Request: $0.01

Obtain high-quality WAV format files from your generated music. Convert any PoYo-generated audio to lossless WAV format for professional use, further editing, or high-fidelity playback. Essential for production workflows requiring uncompressed audio.

Vocal Remover

Per Request: $0.07

Separate vocals from instrumentals or split audio into multiple stem tracks. Two modes available: vocal separation for isolating vocals and backing tracks, or stem splitting for extracting drums, bass, vocals, and other instruments individually. Professional-grade audio source separation.

AI Music Video

Per Request: $0.02

Generate visualized music videos from audio tracks. Create engaging visual content automatically synchronized to your music with optional author attribution and brand watermarks. Perfect for social media content, promotional materials, and music distribution.