Complete catalog of all supported models on the Skytells platform with pricing, capabilities, and input schemas. Last updated March 17, 2026.
Every model on Skytells has its own namespace, pricing structure, capabilities, and input/output schema. This page is a complete reference for all currently supported models, grouped by type.
Use the namespace value when making API calls. Text models use the Inference API endpoints (/v1/chat/completions, /v1/responses). All other models use the Predictions API endpoint (/v1/predictions).
# Text models — Inference API
curl https://api.skytells.ai/v1/chat/completions \
-H "x-api-key: YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model": "deepbrain-router", "messages": [{"role": "user", "content": "Hello!"}]}'
# Image / Video / Audio models — Predictions API
curl -X POST https://api.skytells.ai/v1/predictions \
-H "x-api-key: YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model": "truefusion-pro", "input": {"prompt": "A sunset over mountains"}}'
Text models are accessed via the Inference API, which is fully OpenAI-compatible. You can use the OpenAI SDK by pointing base_url to https://api.skytells.ai/v1.
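As a rough illustration of the chat completions call shown in the curl example, the sketch below builds the same request with Python's standard library only, so it has no dependency on any SDK. The helper name `build_chat_request` is hypothetical; the URL, header names, and body fields are taken from the curl example above. The request is built but not sent.

```python
import json
import urllib.request

BASE_URL = "https://api.skytells.ai/v1"  # base URL from the curl examples


def build_chat_request(api_key: str, model: str, messages: list) -> urllib.request.Request:
    """Build (but do not send) a POST request to /v1/chat/completions."""
    body = json.dumps({"model": model, "messages": messages}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=body,
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


request = build_chat_request(
    "YOUR_API_KEY",
    "deepbrain-router",
    [{"role": "user", "content": "Hello!"}],
)
# To send it, pass `request` to urllib.request.urlopen() and parse the JSON body.
```

Keeping request construction separate from sending makes it easy to inspect or log the payload before spending tokens.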
| Property | Value |
| --- | --- |
| Namespace | gpt-5 |
| Vendor | OpenAI |
| Pricing | $0.50 / 1M input tokens · $1.25 / 1M output tokens |
| Capabilities | text-to-text, coding, writing, reasoning, chat, analysis, summarization, instruction-following, problem-solving |
| Status | Operational |
| OpenAI Compatible | ✅ |
OpenAI's flagship fifth-generation model, offering state-of-the-art reasoning, coding, and instruction-following capabilities.
Input Schema Output Schema Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| model | string | ✅ | — | Must be gpt-5 |
| messages | array | ✅ | — | Array of {role, content} objects. Roles: system, user, assistant |
| stream | boolean | — | false | Enable server-sent events streaming |
| max_tokens | integer | — | 8192 | Maximum tokens to generate |
| temperature | number | — | 0.7 | Sampling temperature (0–2) |
| top_p | number | — | 0.95 | Nucleus sampling probability (0–1) |
| frequency_penalty | number | — | 0.0 | Penalise token frequency (-2.0–2.0) |
| presence_penalty | number | — | 0.0 | Penalise tokens that have already appeared, encouraging new topics (-2.0–2.0) |
| stop | string \| array | — | — | Stop sequences (up to 4) |
{
"id" : "chatcmpl-..." ,
"object" : "chat.completion" ,
"model" : "gpt-5" ,
"choices" : [
{
"index" : 0 ,
"message" : { "role" : "assistant" , "content" : "..." },
"finish_reason" : "stop"
}
],
"usage" : {
"prompt_tokens" : 10 ,
"completion_tokens" : 25 ,
"total_tokens" : 35
}
}

| Token Type | Price |
| --- | --- |
| Input tokens | $0.50 / 1M tokens |
| Output tokens | $1.25 / 1M tokens |
Billed separately for input and output. See Pricing for details.
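To make the per-token billing concrete, here is a minimal sketch that estimates the cost of one response from its `usage` block, using the GPT-5 prices listed above. The function name `estimate_token_cost` is hypothetical, and the calculation assumes simple linear billing with no rounding or minimum charge, which the source does not specify.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def estimate_token_cost(usage: dict, input_price: str, output_price: str) -> Decimal:
    """Return the estimated USD cost for one response's `usage` block."""
    input_cost = Decimal(usage["prompt_tokens"]) * Decimal(input_price) / TOKENS_PER_UNIT
    output_cost = Decimal(usage["completion_tokens"]) * Decimal(output_price) / TOKENS_PER_UNIT
    return input_cost + output_cost


# The usage block from the example response above.
usage = {"prompt_tokens": 10, "completion_tokens": 25, "total_tokens": 35}
cost = estimate_token_cost(usage, input_price="0.50", output_price="1.25")
print(cost)
```

`Decimal` is used instead of floats so that very small per-request amounts add up without binary rounding drift.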
| Property | Value |
| --- | --- |
| Namespace | gpt-5.4 |
| Vendor | OpenAI |
| Pricing | $0.50 / 1M input tokens · $1.25 / 1M output tokens |
| Capabilities | text-to-text, coding, writing, reasoning, chat, analysis, summarization, instruction-following, problem-solving |
| Status | Operational |
| OpenAI Compatible | ✅ |
An incremental update to GPT-5 with improved accuracy, stronger instruction adherence, and refined reasoning in complex multi-turn conversations.
Input Schema Output Schema Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| model | string | ✅ | — | Must be gpt-5.4 |
| messages | array | ✅ | — | Array of {role, content} objects. Roles: system, user, assistant |
| stream | boolean | — | false | Enable server-sent events streaming |
| max_tokens | integer | — | 8192 | Maximum tokens to generate |
| temperature | number | — | 0.7 | Sampling temperature (0–2) |
| top_p | number | — | 0.95 | Nucleus sampling probability (0–1) |
| frequency_penalty | number | — | 0.0 | Penalise token frequency (-2.0–2.0) |
| presence_penalty | number | — | 0.0 | Penalise tokens that have already appeared, encouraging new topics (-2.0–2.0) |
| stop | string \| array | — | — | Stop sequences (up to 4) |
{
"id" : "chatcmpl-..." ,
"object" : "chat.completion" ,
"model" : "gpt-5.4" ,
"choices" : [
{
"index" : 0 ,
"message" : { "role" : "assistant" , "content" : "..." },
"finish_reason" : "stop"
}
],
"usage" : {
"prompt_tokens" : 10 ,
"completion_tokens" : 25 ,
"total_tokens" : 35
}
}

| Token Type | Price |
| --- | --- |
| Input tokens | $0.50 / 1M tokens |
| Output tokens | $1.25 / 1M tokens |
Billed separately for input and output. See Pricing for details.
| Property | Value |
| --- | --- |
| Namespace | gpt-5.3-codex |
| Vendor | OpenAI |
| Pricing | $1.75 / 1M input tokens · $0.175 / 1M cached input tokens · $14 / 1M output tokens |
| Capabilities | text-to-text, coding, writing, reasoning, chat, analysis, problem-solving, quality |
| Status | Operational |
| OpenAI Compatible | ✅ |
| Edge Compatible | ✅ |

Use this model with the Responses API: POST /v1/responses. Set "model": "gpt-5.3-codex" in the request body along with input (and optional instructions, previous_response_id, stream, and other Responses request fields).
GPT-5.3 Codex achieves state-of-the-art performance on SWE-Bench Pro, a rigorous evaluation of real-world software engineering. SWE-Bench Pro spans four languages and is more contamination-resistant, challenging, diverse, and industry-relevant than SWE-bench Verified (which only tests Python). It also far exceeds the previous state-of-the-art on Terminal-Bench 2.0, which measures terminal skills for coding agents. GPT-5.3 Codex does so with fewer tokens than prior models, so you can build more per dollar.
Responses request Example response Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| model | string | ✅ | — | Must be gpt-5.3-codex |
| input | string \| array | ✅ | — | User prompt as a string, or an array of chat-style message objects (same idea as /v1/chat/completions messages) |
| instructions | string | — | — | System-level instructions (equivalent to a system message) |
| previous_response_id | string | — | — | Continue a prior turn without resending full history |
| stream | boolean | — | false | Stream the response as SSE |
| max_output_tokens | integer | — | 8192 | Maximum tokens to generate |
| temperature | number | — | 0.7 | Sampling temperature (0–2) |
| top_p | number | — | 0.95 | Nucleus sampling (0–1) |
See the Responses API reference for the full schema.
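The request fields described above can be assembled as in the following sketch. The helper `build_responses_payload` and the id `resp_123` are hypothetical and only illustrate how a first turn differs from a follow-up turn that uses previous_response_id; the resulting dictionary would be sent as the JSON body of POST /v1/responses.

```python
def build_responses_payload(user_input, instructions=None,
                            previous_response_id=None, max_output_tokens=None):
    """Return a JSON-serialisable request body for POST /v1/responses."""
    payload = {"model": "gpt-5.3-codex", "input": user_input}
    optional = {
        "instructions": instructions,
        "previous_response_id": previous_response_id,
        "max_output_tokens": max_output_tokens,
    }
    # Send only the optional fields that were set, so server defaults apply.
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


first_turn = build_responses_payload(
    "Write a function that reverses a string.",
    instructions="Answer with code only.",
)
# "resp_123" is a placeholder; use the id returned by the previous response.
follow_up = build_responses_payload("Now add type hints.", previous_response_id="resp_123")
```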
{
"id" : "resp_..." ,
"object" : "response" ,
"model" : "gpt-5.3-codex" ,
"status" : "completed" ,
"output_text" : "..."
}

Shape follows the OpenAI Responses object; see Responses — Response object for all fields.

| Token Type | Price |
| --- | --- |
| Input tokens | $1.75 / 1M tokens |
| Cached input tokens | $0.175 / 1M tokens |
| Output tokens | $14 / 1M tokens |
Billed separately for input, cached input, and output. See Pricing for details.
| Property | Value |
| --- | --- |
| Namespace | deepbrain-router |
| Vendor | Skytells ✓ |
| Pricing | $0.50 / 1M input tokens · $1.25 / 1M output tokens |
| Capabilities | text-to-text, coding, writing, reasoning, chat, analysis, summarization, instruction-following, problem-solving, quality, fast, router |
| Status | Operational |
| OpenAI Compatible | ✅ |
| Edge Compatible | ✅ |
| Cold Boot | No |
DeepBrain Router is Skytells' advanced model orchestration layer, built to intelligently choose the right model for the right task. Optimized for coding, writing, reasoning, and complex multi-domain workloads, it dynamically routes requests across a curated set of flagship models from leading providers. The result is stronger output quality, improved cost-performance balance, and a more reliable AI experience at scale.
DeepBrain Router is the recommended default for most use cases. It automatically selects the best underlying model for each request, so you get consistently high quality without managing model selection yourself.
Input Schema Output Schema Orchestration Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| model | string | ✅ | — | Must be deepbrain-router |
| messages | array | ✅ | — | Array of {role, content} objects. Roles: system, user, assistant |
| stream | boolean | — | false | Enable server-sent events streaming |
| max_tokens | integer | — | 8192 | Maximum tokens to generate |
| temperature | number | — | 0.7 | Sampling temperature (0–2). Lower = more deterministic |
| top_p | number | — | 0.95 | Nucleus sampling probability (0–1) |
| frequency_penalty | number | — | 0.0 | Penalise token frequency (-2.0–2.0) |
| presence_penalty | number | — | 0.0 | Penalise tokens that have already appeared, encouraging new topics (-2.0–2.0) |
| stop | string \| array | — | — | Stop sequences (up to 4) |

Unlike conventional LLMs and reasoning models, DeepBrain Router does not answer with a single fixed model. It dynamically selects the best underlying model for each request, so you get consistently high quality without managing model selection yourself. This supports agentic workflows with less overhead and more flexibility.
{
"id" : "chatcmpl-..." ,
"object" : "chat.completion" ,
"model" : "deepbrain-router" ,
"choices" : [
{
"index" : 0 ,
"message" : { "role" : "assistant" , "content" : "..." },
"finish_reason" : "stop"
}
],
"usage" : {
"prompt_tokens" : 10 ,
"completion_tokens" : 25 ,
"total_tokens" : 35
}
}

DeepBrain Router dynamically selects from the following models based on task complexity, domain, and quality requirements:

| Model | Version | Provider |
| --- | --- | --- |
| DeepBrain Mini | — | Skytells |
| DeepBrain 2.0 | — | Skytells |
| GPT-4o | 2024-11-20 | OpenAI |
| GPT-4o Mini | 2024-07-18 | OpenAI |
| GPT-4.1 Nano | 2025-04-14 | OpenAI |
| GPT-4.1 Mini | 2025-04-14 | OpenAI |
| GPT-4.1 | 2025-04-14 | OpenAI |
| o4-mini | 2025-04-16 | OpenAI |
| GPT-5 Nano | 2025-08-07 | OpenAI |
| GPT-5 Mini | 2025-08-07 | OpenAI |
| GPT-5 Chat | 2025-08-07 | OpenAI |
| GPT-5 | 2025-08-07 | OpenAI |
| GPT-5.2 Chat | 2025-12-11 | OpenAI |
| GPT-5.2 | 2025-12-11 | OpenAI |
| GPT-OSS 120B | — | OpenAI |
| Llama-4 Maverick 17B-128E (FP8) | — | Meta |
| DeepSeek-V3.1 | — | DeepSeek |
| DeepSeek-V3.2 | — | DeepSeek |
| Grok-4 | — | xAI |
| Grok-4 Fast Reasoning | — | xAI |
| Claude Haiku 4.5 | 20251001 | Anthropic |
| Claude Sonnet 4.5 | 20250929 | Anthropic |
| Claude Opus 4.1 | 20250805 | Anthropic |
| Claude Opus 4.6 | — | Anthropic |
You cannot control which model is selected. The router optimises for quality and task fit. All routed requests are billed at the DeepBrain Router rate regardless of the underlying model chosen.
| Token Type | Price |
| --- | --- |
| Input tokens | $0.50 / 1M tokens |
| Output tokens | $1.25 / 1M tokens |
Billed separately for input and output. See Pricing for details.
| Property | Value |
| --- | --- |
| Namespace | truefusion |
| Vendor | Skytells |
| Pricing | $0.03 / image |
| Capabilities | text-to-image |
| Status | Operational |
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt for generation |
| aspect_ratio | enum | — | 1:1 | 1:1, 16:9, 4:3, 3:2, 2:3, 3:4, 9:16, 21:9 |
| number_of_images | integer | — | 1 | Number of images (1–9) |
| prompt_optimizer | boolean | — | true | Use prompt optimizer |
{
"type" : "array" ,
"items" : { "type" : "string" , "format" : "uri" }
}

Returns an array of image URLs.

| Property | Value |
| --- | --- |
| Namespace | truefusion-pro |
| Vendor | Skytells |
| Pricing | $0.05 / image |
| Capabilities | text-to-image, image-to-image |
| Status | Operational |
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Prompt for generated image |
| aspect_ratio | enum | — | 1:1 | 1:1, 16:9, 21:9, 3:2, 2:3, 4:5, 5:4, 3:4, 4:3, 9:16, 9:21 |
| image | string (URI) | — | — | Input image for img2img mode |
| prompt_strength | number | — | 0.8 | Prompt strength for img2img (0–1) |
| num_outputs | integer | — | 1 | Number of outputs (1–4) |
| num_inference_steps | integer | — | 28 | Denoising steps (1–50). Recommended: 28–50 |
| guidance | number | — | 3 | Guidance scale (0–10) |
| seed | integer | — | — | Random seed for reproducibility |
| output_format | enum | — | webp | webp, jpg, png |
| output_quality | integer | — | 80 | Output quality (0–100) |
| go_fast | boolean | — | true | Use fp8 quantized model for speed |
| megapixels | enum | — | 1 | 1, 0.25 |
| disable_safety_checker | boolean | — | false | Disable safety checker |
{
"type" : "array" ,
"items" : { "type" : "string" , "format" : "uri" }
}

Returns an array of image URLs.

| Property | Value |
| --- | --- |
| Namespace | truefusion-max |
| Vendor | Skytells |
| Pricing | $0.12 / image |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt for image generation |
| aspect_ratio | enum | — | 1:1 | 11 ratios including 21:9, 9:21 |
| image_prompt | string (URI) | — | — | Image reference to guide composition |
| image_prompt_strength | number | — | 0.1 | Blend between prompt and image (0–1) |
| safety_tolerance | integer | — | 2 | Safety level (1=strict, 6=permissive) |
| seed | integer | — | — | Random seed |
| raw | boolean | — | false | Generate less processed, more natural images |
| output_format | enum | — | jpg | jpg, png |
{
"type" : "string" ,
"format" : "uri"
}

Returns a single image URL.

| Property | Value |
| --- | --- |
| Namespace | truefusion-ultra |
| Vendor | Skytells |
| Pricing | $0.15 / image |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
Flagship model with stunning photorealism, artistic creativity, and unmatched consistency across styles. Supports inpainting, style references, and magic prompt optimization.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt for image generation |
| aspect_ratio | enum | — | 1:1 | 15 ratios including 1:3, 3:1 |
| resolution | enum | — | None | Specific resolution override (e.g., 1024x1024, 1536x640) |
| magic_prompt_option | enum | — | Auto | Auto, On, Off. Optimizes the prompt for quality |
| image | string (URI) | — | — | Image for inpainting (requires mask) |
| mask | string (URI) | — | — | Black/white mask for inpainting |
| style_type | enum | — | None | Auto, General, Realistic, Design |
| style_reference_images | array (URI) | — | — | Style reference images |
| seed | integer | — | — | Random seed (max 2,147,483,647) |
{
"type" : "string" ,
"format" : "uri"
}

Returns a single image URL.

| Property | Value |
| --- | --- |
| Namespace | truefusion-2 |
| Vendor | Skytells |
| Pricing | $0.07–$0.10 / image (resolution-based) |
| Capabilities | text-to-image, image-to-image, reference, quality |
| Status | Operational |
Attach up to 3 reference images as ground truth and tag them in your prompt using @tag_name. Preserves identity, style, and materials while giving control over angle, composition, and lighting.
Input Schema Output Schema Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt for image generation |
| aspect_ratio | enum | — | 16:9 | 16:9, 9:16, 4:3, 3:4, 1:1, 21:9 |
| resolution | enum | — | 1080p | 720p, 1080p |
| reference_images | array (URI) | — | [] | Up to 3 reference images (0.5–2 aspect ratio) |
| reference_tags | array (string) | — | [] | Tags for references (use @tag_name in prompt) |
| seed | integer | — | — | Random seed |
| remix_id | string | — | — | Remix from another Skytells prediction |
{
"type" : "string" ,
"format" : "uri"
}

Returns a single image URL.

| Resolution | Price |
| --- | --- |
| 720p | $0.07 / image |
| 1080p | $0.10 / image |
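The reference-image workflow for truefusion-2 can be sketched as a small input builder. This is an illustration under assumptions: the helper `build_truefusion2_input` is hypothetical, the image URLs are placeholders, and it assumes reference_tags are matched to reference_images by position and listed without the leading @, neither of which the schema above spells out.

```python
def build_truefusion2_input(prompt: str, references: dict, resolution: str = "1080p") -> dict:
    """Build the `input` object for truefusion-2 from a {tag: image_url} mapping."""
    if len(references) > 3:
        raise ValueError("truefusion-2 accepts at most 3 reference images")
    # Every tag should be invoked in the prompt as @tag_name.
    missing = [tag for tag in references if f"@{tag}" not in prompt]
    if missing:
        raise ValueError(f"tags not used in prompt: {missing}")
    return {
        "prompt": prompt,
        "resolution": resolution,
        "reference_images": list(references.values()),
        "reference_tags": list(references.keys()),  # assumed positional pairing
    }


payload = {
    "model": "truefusion-2",
    "input": build_truefusion2_input(
        "@hero standing beside @car at dusk",
        {
            "hero": "https://example.com/hero.png",  # placeholder URL
            "car": "https://example.com/car.png",    # placeholder URL
        },
    ),
}
```

The outer `{"model": ..., "input": ...}` shape follows the Predictions API curl example at the top of this page.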
| Property | Value |
| --- | --- |
| Namespace | truefusion-2-optima |
| Vendor | Skytells |
| Pricing | $0.008 / computing second |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
| Cold Boot | Yes (CPU-based deployment) |
Next-generation MoE architecture delivering unmatched realism, lifelike lighting, and film-grade image precision. Billed by compute time rather than per-image.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt for image generation |
| aspect_ratio | enum | — | 1:1 | 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, 1:3, 3:1 |
{
"type" : "string" ,
"format" : "uri"
}

Returns a single image URL.

| Property | Value |
| --- | --- |
| Namespace | truefusion-x |
| Vendor | Skytells |
| Pricing | $0.10 / image |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
Ultra-fast, ultra-high-resolution with inpainting support and quality control.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| n | number | — | 1 | Number of images |
| image | array (URI) | — | — | Reference images |
| mask | string (URI) | — | — | Inpainting mask |
| quality | enum | — | medium | low, medium, high |
| size | enum | — | 1024x1024 | 768x1152, 1152x768, 1792x1024, 1920x822, etc. |
| output_format | enum | — | jpg | jpg, png |
| output_compression | number | — | 100 | Compression ratio (1–100) |
{
"type" : "string" ,
"format" : "uri"
}

Returns a single image URL.

| Property | Value |
| --- | --- |
| Namespace | truefusion-edge |
| Vendor | Skytells |
| Pricing | $0.01 / image |
| Capabilities | text-to-image, image-to-image, fast |
| Status | Operational |
Ultra-fast, lightweight model optimized for speed. Only 4 denoising steps needed.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Prompt for generated image |
| aspect_ratio | enum | — | 1:1 | 11 ratios |
| num_outputs | integer | — | 1 | Number of outputs (1–4) |
| num_inference_steps | integer | — | 4 | Denoising steps (1–4) |
| seed | integer | — | — | Random seed |
| output_format | enum | — | webp | webp, jpg, png |
| output_quality | integer | — | 80 | Quality (0–100) |
| go_fast | boolean | — | true | Use fp8 quantized model |
| megapixels | enum | — | 1 | 1, 0.25 |
| disable_safety_checker | boolean | — | false | Disable safety checker |
{
"type" : "array" ,
"items" : { "type" : "string" , "format" : "uri" }
}

Returns an array of image URLs.

| Property | Value |
| --- | --- |
| Namespace | truefusion-pano |
| Vendor | Skytells |
| Pricing | $0.02 / GPU second |
| Capabilities | text-to-image, image-to-image |
| Status | Operational |
Multiple model variants available including multilingual (English + Chinese).
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Input prompt |
| negative_prompt | string | — | "" | Things to exclude |
| model_variant | enum | — | 1600M-1024px | 1600M-1024px, 1600M-1024px-multilang, 1600M-512px, 600M-1024px-multilang, 600M-512px-multilang |
| width | integer | — | 1024 | Output width |
| height | integer | — | 1024 | Output height |
| num_inference_steps | integer | — | 18 | Denoising steps |
| guidance_scale | number | — | 5 | CFG scale (1–20) |
| pag_guidance_scale | number | — | 2 | PAG guidance (1–20) |
| seed | integer | — | — | Random seed |
{
"type" : "string" ,
"format" : "uri"
}

Returns a single image URL.

| Property | Value |
| --- | --- |
| Namespace | truefusion-standard |
| Vendor | Skytells |
| Pricing | $0.05 / image |
| Capabilities | text-to-image |
| Status | Operational |
Full-featured model with LoRA support. Load custom LoRA weights from Skytells, HuggingFace, or CivitAI.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Prompt for generated image |
| aspect_ratio | enum | — | 1:1 | 11 ratios |
| image | string (URI) | — | — | Input image for img2img |
| prompt_strength | number | — | 0.8 | Prompt strength for img2img (0–1) |
| num_outputs | integer | — | 1 | Number of outputs (1–4) |
| num_inference_steps | integer | — | 28 | Denoising steps (1–50) |
| guidance | number | — | 3 | Guidance (0–10) |
| lora_weights | string | — | — | LoRA weights URL or identifier (e.g., skytells/truefusion-base) |
| lora_scale | number | — | 1 | LoRA strength (-1 to 3) |
| seed | integer | — | — | Random seed |
| output_format | enum | — | webp | webp, jpg, png |
| output_quality | integer | — | 80 | Quality (0–100) |
| go_fast | boolean | — | true | Use fp8 quantized model |
| megapixels | enum | — | 1 | 1, 0.25 |
| disable_safety_checker | boolean | — | false | Disable safety checker |
{
"type" : "array" ,
"items" : { "type" : "string" , "format" : "uri" }
}

Returns an array of image URLs.

| Property | Value |
| --- | --- |
| Namespace | truefusion-variant |
| Vendor | Skytells |
| Pricing | $0.05 / image |
| Capabilities | text-to-image, image-to-image |
| Status | Operational |
Schema not yet published — contact support for input details.
| Property | Value |
| --- | --- |
| Namespace | flux-fast |
| Vendor | Skytells |
| Pricing | $0.01 / image |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
Super-fast Flux model optimized by Skytells for instant generation, with configurable speed modes.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| speed_mode | enum | — | Extra Juiced 🔥 | Speed optimization level |
| guidance | number | — | 3.5 | Guidance scale |
| image_size | integer | — | 1024 | Base image size (longest side) |
| aspect_ratio | enum | — | 1:1 | 11 ratios |
| output_format | enum | — | jpg | png, jpg, webp |
| output_quality | integer | — | 80 | Quality (1–100) |
| num_inference_steps | integer | — | 28 | Inference steps |
| seed | integer | — | -1 | Random seed |
{
"type" : "string" ,
"format" : "uri"
}

Returns a single image URL.

| Property | Value |
| --- | --- |
| Namespace | google-imagen-3 |
| Vendor | Google |
| Pricing | $0.08 / image |
| Capabilities | text-to-image, image-to-image |
| Status | Operational |
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| negative_prompt | string | — | — | What to discourage |
| aspect_ratio | enum | — | 1:1 | 1:1, 9:16, 16:9, 3:4, 4:3 |
| safety_filter_level | enum | — | block_medium_and_above | block_low_and_above, block_medium_and_above, block_only_high |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | google-imagen-4 |
| Vendor | Google |
| Pricing | $0.08 / image |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
Google's flagship text-to-image model.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| aspect_ratio | enum | — | 1:1 | 1:1, 9:16, 16:9, 3:4, 4:3 |
| safety_filter_level | enum | — | block_medium_and_above | block_low_and_above, block_medium_and_above, block_only_high |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | nano-banana |
| Vendor | Google |
| Pricing | $0.06 / image |
| Capabilities | text-to-image, image-to-image, fast |
| Status | Operational |
Google's Gemini 2.5-based image editing model with multi-image input support.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| image_input | array (URI) | — | [] | Input images for transformation (multiple supported) |
| aspect_ratio | enum | — | match_input_image | match_input_image, 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9 |
| output_format | enum | — | jpg | jpg, png |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | nvidia-sana |
| Vendor | Nvidia |
| Pricing | $0.05 / image |
| Capabilities | text-to-image |
| Status | Operational |
Fast image model with wide artistic range and resolutions up to 4096×4096. Multiple model variants including multilingual.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Input prompt |
| negative_prompt | string | — | "" | Things to exclude |
| model_variant | enum | — | 1600M-1024px | 1600M-1024px, 1600M-1024px-multilang, 1600M-512px, 600M-1024px-multilang, 600M-512px-multilang |
| width | integer | — | 1024 | Output width |
| height | integer | — | 1024 | Output height |
| num_inference_steps | integer | — | 18 | Denoising steps |
| guidance_scale | number | — | 5 | CFG scale (1–20) |
| pag_guidance_scale | number | — | 2 | PAG guidance (1–20) |
| seed | integer | — | — | Random seed |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | gpt-image-1 |
| Vendor | OpenAI |
| Pricing | $0.002 / image |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
| Inference | Partner-served |
Requires your own OpenAI API key.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| openai_api_key | string (password) | ✅ | — | Your OpenAI API key |
| prompt | string | ✅ | — | Text description |
| aspect_ratio | enum | — | 1:1 | 1:1, 3:2, 2:3 |
| input_images | array (URI) | — | — | Input images for editing |
| number_of_images | integer | — | 1 | Number of images (1–10) |
| quality | enum | — | auto | low, medium, high, auto |
| background | enum | — | auto | auto, transparent, opaque |
| output_format | enum | — | webp | png, jpeg, webp |
| output_compression | integer | — | 90 | Compression (0–100%) |
| moderation | enum | — | auto | auto, low |
| user_id | string | — | — | End-user identifier for abuse monitoring |
{
"type" : "array" ,
"items" : { "type" : "string" , "format" : "uri" }
}

Returns an array of image URLs.

| Property | Value |
| --- | --- |
| Namespace | FLUX-1.1-pro |
| Vendor | Black Forest Labs |
| Pricing | $0.04 / image |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
Input Schema Output Schema
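| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| n | number | ✅ | 1 | Number of images |
| size | enum | ✅ | 1024x1024 | 1024x1024, 768x1152, 1152x768, 1792x1024, etc. |
| output_format | enum | ✅ | jpg | jpg, png |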
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | FLUX.2-pro |
| Vendor | Black Forest Labs |
| Pricing | $0.02 / image megapixel |
| Capabilities | text-to-image, image-to-image, quality |
| Status | Operational |
Megapixel-based pricing. Supports up to 8 input reference images.
Input Schema Output Schema Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| input_images | array (URI) | — | [] | Up to 8 reference images |
| aspect_ratio | enum | — | 1:1 | match_input_image, custom, plus standard ratios |
| resolution | enum | — | 1 MP | match_input_image, 0.5 MP, 1 MP, 2 MP, 4 MP |
| width | integer | — | — | Custom width (256–2048, multiples of 32) |
| height | integer | — | — | Custom height (256–2048, multiples of 32) |
| safety_tolerance | integer | — | 2 | Safety level (1–5) |
| seed | integer | — | — | Random seed |
| output_format | enum | — | png | webp, jpg, png |
| output_quality | integer | — | 100 | Quality (0–100) |
{
"type" : "string" ,
"format" : "uri"
}

| Resolution | Price per image |
| --- | --- |
| 0.5 MP | $0.01 |
| 1 MP | $0.02 |
| 2 MP | $0.04 |
| 4 MP | $0.08 |

| Property | Value |
| --- | --- |
| Namespace | flux-2-pro-legacy |
| Vendor | Black Forest Labs |
| Pricing | $0.02 / image megapixel |
| Capabilities | text-to-image, image-to-image, editing, quality |
| Status | Operational |
Supports up to 8 reference images. Same schema as FLUX.2 Pro with additional safety_tolerance control.
| Property | Value |
| --- | --- |
| Namespace | flux-2-flex |
| Vendor | Black Forest Labs |
| Pricing | $0.08 / image megapixel |
| Capabilities | text-to-image, image-to-image, editing, quality |
| Status | Operational |

Max-quality generation with up to 10 reference images, configurable inference steps, guidance, and prompt upsampling.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| input_images | array (URI) | — | [] | Up to 10 reference images |
| aspect_ratio | enum | — | 1:1 | match_input_image, custom, plus standard ratios |
| resolution | enum | — | 1 MP | 0.5 MP, 1 MP, 2 MP, 4 MP |
| width / height | integer | — | — | Custom dimensions (256–2048) |
| steps | integer | — | 30 | Inference steps (1–50) |
| guidance | number | — | 4.5 | Guidance scale (1.5–10) |
| prompt_upsampling | boolean | — | true | Auto-modify prompt for creativity |
| safety_tolerance | integer | — | 2 | Safety (1–5) |
| seed | integer | — | — | Random seed |
| output_format | enum | — | png | webp, jpg, png |
| output_quality | integer | — | 100 | Quality (0–100) |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | truefusion-video-pro |
| Vendor | Skytells |
| Pricing | $0.196 / second |
| Capabilities | text-to-video, image-to-video |
| Status | Operational |
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt for video generation |
| negative_prompt | string | — | "" | Things to exclude |
| aspect_ratio | enum | — | 16:9 | 16:9, 9:16, 1:1 |
| start_image | string (URI) | — | — | First frame of the video |
| end_image | string (URI) | — | — | Last frame of the video |
| cfg_scale | number | — | 0.5 | Flexibility scale (0–1). Higher = more constrained |
| duration | enum | — | 5 | 5, 10 seconds |
{
"type" : "string" ,
"format" : "uri"
}

Returns a video URL.

| Property | Value |
| --- | --- |
| Namespace | truefusion-video |
| Vendor | Skytells |
| Pricing | $0.112 / second |
| Capabilities | text-to-video, image-to-video |
| Status | Operational |
Same schema as TrueFusion Video Pro but without end_image support. Lower cost option.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| negative_prompt | string | — | "" | Things to exclude |
| aspect_ratio | enum | — | 16:9 | 16:9, 9:16, 1:1 |
| start_image | string (URI) | — | — | First frame |
| cfg_scale | number | — | 0.5 | Flexibility scale (0–1) |
| duration | enum | — | 5 | 5, 10 seconds |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | mera |
| Vendor | Skytells |
| Pricing | $3.42 / prediction |
| Capabilities | text-to-video, image-to-video, audio, quality |
| Status | Operational |
Skytells's latest video generation model — physically accurate, super realistic, and controllable. Supports reference images for subject consistency.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| seconds | enum | ✅ | 8 | 4, 6, 8, 12 seconds |
| size | enum | ✅ | 720x1280 | 720x1280, 1280x720 |
| input_reference | array (URI) | — | [] | 1–3 reference images for R2V |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | lumo |
| Vendor | Skytells |
| Pricing | $1.12 / prediction |
| Capabilities | image-to-video |
| Status | Operational |
Specialized for motion, animations, and general use cases. Schema not yet published.
| Property | Value |
| --- | --- |
| Namespace | lipfusion |
| Vendor | Skytells |
| Pricing | $0.04 / second |
| Capabilities | video-to-video, audio-to-video |
| Status | Operational |
Ultra-realistic lip-syncing for videos, animations, avatars, and live streams. Supports audio file input or text-to-speech with 45+ voice presets.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| video_url | string (URI) | ✅ | — | Target video (.mp4/.mov, under 100MB, 2–10s, 720p–1080p) |
| audio_file | string (URI) | ✅ | — | Audio for lip sync (.mp3/.wav/.m4a/.aac, under 5MB) |
| text | string | — | — | Text for TTS lip sync (Enterprise only) |
| voice_id | enum | — | en_AOT | 45+ voice presets (English + Chinese) |
| motion_aware | boolean | — | true | Adjust sync based on subject movements |
| voice_speed | number | — | 1 | Speech rate for TTS (0.8–2) |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | veo-3.1 |
| Vendor | Google |
| Pricing | $0.43 / second |
| Capabilities | text-to-video, image-to-video, quality |
| Status | Operational |
Google's next-gen video model with context-aware audio, reference images, and last-frame interpolation.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| aspect_ratio | enum | — | 16:9 | 16:9, 9:16 |
| duration | enum | — | 8 | 4, 6, 8 seconds |
| image | string (URI) | — | — | Start image |
| last_frame | string (URI) | — | — | End image for interpolation |
| reference_images | array (URI) | — | [] | 1–3 reference images for R2V (16:9, 8s only) |
| negative_prompt | string | — | — | What to exclude |
| resolution | enum | — | 1080p | 720p, 1080p |
| generate_audio | boolean | — | true | Generate audio with video |
| seed | integer | — | — | Random seed |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | veo-3.1-fast |
| Vendor | Google |
| Pricing | $0.13–$0.17 / second (audio-dependent) |
| Capabilities | text-to-video, image-to-video, fast |
| Status | Operational |
Faster variant of Veo 3.1 with dynamic pricing based on audio generation.
Input Schema Output Schema Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Text prompt |
| aspect_ratio | enum | — | 16:9 | 16:9, 9:16 |
| duration | enum | — | 8 | 4, 6, 8 seconds |
| image | string (URI) | — | — | Start image |
| last_frame | string (URI) | — | — | End image for interpolation |
| negative_prompt | string | — | — | What to exclude |
| resolution | enum | — | 1080p | 720p, 1080p |
| generate_audio | boolean | — | true | Generate audio with video |
| seed | integer | — | — | Random seed |
{
"type" : "string" ,
"format" : "uri"
}

| Audio Generation | Price per second |
| --- | --- |
| Enabled | $0.17 |
| Disabled | $0.13 |

| Property | Value |
| --- | --- |
| Namespace | veo-3.1-preview |
| Vendor | Google |
| Pricing | $0.43 / second |
| Capabilities | text-to-video, image-to-video, quality, sound |
| Status | Operational |
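As a worked example of the audio-dependent pricing for veo-3.1-fast, the sketch below multiplies the per-second rate by the clip duration. The function name `veo_fast_cost` is hypothetical, and it assumes the bill is exactly rate × duration with no rounding, which the source does not state.

```python
from decimal import Decimal

# Per-second prices for veo-3.1-fast, keyed by the generate_audio flag.
PRICE_PER_SECOND = {True: Decimal("0.17"), False: Decimal("0.13")}
ALLOWED_DURATIONS = (4, 6, 8)  # seconds, from the duration enum


def veo_fast_cost(duration_seconds: int, generate_audio: bool = True) -> Decimal:
    """Estimate the USD cost of one veo-3.1-fast clip."""
    if duration_seconds not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}")
    return PRICE_PER_SECOND[generate_audio] * duration_seconds


with_audio = veo_fast_cost(8, generate_audio=True)      # 8 s at $0.17 per second
without_audio = veo_fast_cost(8, generate_audio=False)  # 8 s at $0.13 per second
```

For the default 8-second clip, the difference between the two settings is 8 × $0.04 = $0.32.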
Preview version with person_generation safety control. Same schema as Veo 3.1 with additional person generation options (allow_adult, dont_allow).
| Property | Value |
| --- | --- |
| Namespace | sora-2 |
| Vendor | OpenAI |
| Pricing | $0.002 / video |
| Capabilities | text-to-video, image-to-video |
| Status | Operational |
| Inference | Partner-served |
Requires your own OpenAI API key.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| openai_api_key | string (password) | ✅ | — | Your OpenAI API key |
| prompt | string | ✅ | — | Text description of the video |
| seconds | enum | — | 4 | 4, 8, 12 |
| aspect_ratio | enum | — | portrait | portrait (720×1280), landscape (1280×720) |
| input_reference | string (URI) | — | — | Reference image or video |
{
"type" : "string" ,
"format" : "uri"
}
| Property | Value |
| --- | --- |
| Namespace | sora-2-pro |
| Vendor | OpenAI |
| Pricing | $0.002 / video |
| Capabilities | text-to-video, image-to-video |
| Status | Operational |
| Inference | Partner-served |
Same as Sora 2 with additional resolution control (standard = 720p, high = 1024p). Requires your own OpenAI API key.
| Property | Value |
| --- | --- |
| Namespace | wan-2.5-i2v |
| Vendor | Alibaba |
| Pricing | $0.06–$0.16 / second (resolution-based) |
| Capabilities | image-to-video, reference, quality, audio, video |
| Status | Operational |
Alibaba's image-to-video model with background audio, prompt expansion, and multi-resolution support.
Input Schema Output Schema Pricing Details
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| image | string (URI) | ✅ | — | Input image |
| prompt | string | ✅ | — | Text prompt |
| negative_prompt | string | — | "" | What to exclude |
| resolution | enum | — | 720p | 480p, 720p, 1080p |
| duration | enum | — | 5 | 5, 10 seconds |
| audio | string (URI) | — | — | Audio file for sync (wav/mp3, 3–30s, ≤15MB) |
| enable_prompt_expansion | boolean | — | true | Enable prompt optimizer |
| seed | integer | — | — | Random seed |
{
"type" : "string" ,
"format" : "uri"
}

| Resolution | Price per second |
| --- | --- |
| 480p | $0.06 |
| 720p | $0.11 |
| 1080p | $0.16 |

| Property | Value |
| --- | --- |
| Namespace | video-upscale |
| Vendor | Topaz Labs |
| Pricing | $0.10 / 5 seconds |
| Capabilities | video-to-video |
| Status | Operational |
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| video | string (URI) | ✅ | — | Video file to upscale |
| target_resolution | enum | — | 1080p | 720p, 1080p, 4k |
| target_fps | integer | — | 30 | Target FPS (15–60) |
{
"type" : "string" ,
"format" : "uri"
}
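The $0.10 / 5 seconds pricing for video-upscale can be sketched as chunked billing. This example assumes a partial chunk is rounded up to a full 5-second chunk; the source only says the model is billed per 5-second chunk, so treat the rounding rule and the function name `upscale_cost` as assumptions.

```python
import math
from decimal import Decimal

CHUNK_SECONDS = 5
PRICE_PER_CHUNK = Decimal("0.10")  # USD per 5-second chunk


def upscale_cost(video_seconds: float) -> Decimal:
    """Estimate the video-upscale cost, rounding partial chunks up (assumed)."""
    if video_seconds <= 0:
        raise ValueError("video_seconds must be positive")
    chunks = math.ceil(video_seconds / CHUNK_SECONDS)
    return PRICE_PER_CHUNK * chunks


# A 12-second clip spans three 5-second chunks under the round-up assumption.
cost_12s = upscale_cost(12)
```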
| Property | Value |
| --- | --- |
| Namespace | beatfusion-2.0 |
| Vendor | Skytells |
| Pricing | $0.75 / prediction |
| Capabilities | text-to-audio, music, quality, audio |
| Status | Operational |
Skytells's flagship music generation model. Generate full-length songs with vocals, lyrics, and rich instrumentation. Supports structure tags: [Intro], [Verse], [Pre Chorus], [Chorus], [Bridge], [Outro], [Hook], [Solo], and more.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| lyrics | string | ✅ | — | Lyrics with structure tags (1–3500 chars). Use \n for line breaks |
| prompt | string | — | "" | Music style description (0–2000 chars) |
| sample_rate | enum | — | 44100 | 16000, 24000, 32000, 44100 |
| bitrate | enum | — | 256000 | 32000, 64000, 128000, 256000 |
| audio_format | enum | — | mp3 | mp3, wav, pcm |
{
"type" : "string" ,
"format" : "uri"
}

Returns an audio file URL.

| Property | Value |
| --- | --- |
| Namespace | beatfusion-1.0 |
| Vendor | Skytells |
| Pricing | $0.45 / prediction |
| Capabilities | text-to-audio, music, quality, audio |
| Status | Operational |
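One way to assemble the lyrics field for beatfusion-2.0 is sketched below. The helper `build_beatfusion_input` is hypothetical, and placing each structure tag on its own line above its section is an assumed layout; the schema only requires structure tags, \n line breaks, and the stated length limits.

```python
def build_beatfusion_input(sections, style: str = "") -> dict:
    """Build the `input` object for beatfusion-2.0 from (tag, text) pairs."""
    # Each section becomes "[Tag]" on one line followed by its lyrics.
    lyrics = "\n".join(f"[{tag}]\n{text}" for tag, text in sections)
    if not 1 <= len(lyrics) <= 3500:
        raise ValueError("lyrics must be 1-3500 characters")
    if len(style) > 2000:
        raise ValueError("prompt must be at most 2000 characters")
    return {"lyrics": lyrics, "prompt": style, "audio_format": "mp3"}


song = build_beatfusion_input(
    [("Verse", "City lights are fading slow"), ("Chorus", "We run, we run")],
    style="upbeat synth-pop",
)
```

Validating lengths before submitting avoids paying for a prediction that would be rejected by the schema.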
First-generation music model. Supports [intro], [verse], [chorus], [bridge], [outro] tags.
Input Schema Output Schema
| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | ✅ | — | Music style description (10–300 chars) |
| lyrics | string | ✅ | — | Lyrics with structure tags (10–600 chars) |
| sample_rate | enum | — | 44100 | 16000, 24000, 32000, 44100 |
| bitrate | enum | — | 256000 | 32000, 64000, 128000, 256000 |
| audio_format | enum | — | mp3 | mp3, wav, pcm |
{
"type" : "string" ,
"format" : "uri"
}
Every model on Skytells defines its own input_schema and output_schema. These schemas follow JSON Schema conventions and describe exactly what parameters a model accepts and what it returns.
Different models are built for different tasks, so their inputs vary significantly:
Simple models like TrueFusion only need a prompt and optional aspect_ratio
Advanced models like TrueFusion Pro add controls for guidance, num_inference_steps, seed, and output format
Flagship models like TrueFusion Ultra support inpainting (image + mask), style references, and resolution presets
Reference models like TrueFusion 2.0 accept tagged reference images you can invoke by name in your prompt
Video models add duration, start_image/end_image, and cfg_scale for temporal control
Audio models use lyrics and prompt for compositional control with structure tags
Partner models (GPT-Image-1, Sora 2) require your own API key via openai_api_key
Most image models share these parameters:
| Parameter | Description |
| --- | --- |
| prompt | Always required. The text description of what to generate |
| aspect_ratio | Controls image dimensions. Available ratios vary by model |
| seed | For reproducible generations. Set the same seed to get identical outputs |
| output_format | Usually webp, jpg, or png |
Models return one of two patterns:
Single output — returns one URL:
{ "type" : "string" , "format" : "uri" }
Multiple outputs — returns an array of URLs:
{ "type" : "array" , "items" : { "type" : "string" , "format" : "uri" } }
Models that support num_outputs or number_of_images typically use the array pattern.
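Because a model may return either pattern, client code can normalize both into a list. The sketch below assumes you already have the parsed JSON value of a prediction's output; the helper name `output_urls` is hypothetical, and how the output is exposed in the prediction response is not covered on this page.

```python
def output_urls(output) -> list:
    """Normalize a model output (single URL or list of URLs) into a list of URLs."""
    if isinstance(output, str):
        return [output]  # single-output pattern
    if isinstance(output, list) and all(isinstance(url, str) for url in output):
        return list(output)  # multiple-output pattern
    raise TypeError(f"unexpected output shape: {type(output).__name__}")


# Placeholder URLs for illustration only.
single = output_urls("https://example.com/image.png")
multiple = output_urls(["https://example.com/a.webp", "https://example.com/b.webp"])
```

Downstream code can then loop over the result without checking which model produced it.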
| Unit | Description |
| --- | --- |
| image | Flat rate per generated image |
| second | Billed per second of generated video/audio |
| prediction | Flat rate per API call |
| gpu | Billed per GPU second used |
| computing_second | Billed per compute second |
| image_megapixel | Billed by output resolution in megapixels |
| 5 seconds | Billed per 5-second chunk |