Models & Billing

Navigate Skytells' full model catalog — image, video, audio, and text/LLM models — and accurately estimate costs before you build.

What you'll be able to do after this module

Pick the right model for any job — image, video, audio, or LLM text. Estimate what your product will cost before writing a line of code. Avoid over-spending on quality you don't need.

How pricing works

Skytells uses flexible, unit-based pricing — each model is billed in the unit that best reflects how it consumes resources. There are no flat subscriptions or hidden compute surcharges; you pay only for what you use.

Billing units

Unit	Description	Typical models
`image`	Per image generated	Image generation models
`video`	Per video clip generated	Video models
`second`	Per second of output media	Variable-length audio/video
`audio_second`	Per second of audio output	Speech and music models
`video_second`	Per second of video output	Long-form video models
`5 seconds`	Per 5-second block of output	Short video segment models
`minute`	Per minute of output	Long-form audio/video
`prediction`	Per prediction call (flat rate)	Fixed-output models
`gpu`	Per GPU-second of compute	Fine-tuning and training jobs
`computing_second`	Per second of compute time	Specialized compute models
`image_megapixel`	Per megapixel generated	Upscaling and high-res models
`token`	Per token (input + output)	LLM text models
`million_token`	Per million tokens	High-volume LLM usage

Always fetch live pricing from the API instead of hardcoding values. Use GET /v1/models/:namespace and inspect the pricing field — it includes both the unit type and the per-unit rate so your cost estimates stay accurate as prices change.

Cost estimation examples

# Image model (unit: image)
500 images/day × $0.04/image × 30 days = $600/month

# Video model (unit: video_second)
10 clips/day × 8 seconds/clip × $0.12/second × 30 days = $288/month

# LLM text model (unit: million_token)
1,000,000 tokens/day × $2.00/million × 30 days = $60/month

Image models

Skytells' flagship image models are the TrueFusion family — purpose-built for quality, speed, and flexibility.

TrueFusion family

Model	Cost/prediction	Speed	Best for
`truefusion`	$0.02	Fast	Quick drafts, previews
`truefusion-pro`	$0.04	Medium	Default choice. High quality
`truefusion-max`	$0.06	Slow	Maximum detail
`truefusion-ultra`	$0.08	Slow	Photorealism at its best
`truefusion-2.0`	$0.06	Medium	Next-gen quality, improved coherence
`truefusion-edge`	$0.01	Very fast	Real-time previews (Edge-only)
`truefusion-pano`	$0.05	Medium	Wide panoramic outputs

Third-party image models

Available on the same API with the same x-api-key header — no extra accounts needed:

Model	Provider	Best for
`google-imagen-3`	Google	Photorealistic renders
`google-imagen-4`	Google	Latest Google quality
`flux-1.1-pro`	Black Forest Labs	Artistic / creative styles
`flux-2-pro`	Black Forest Labs	Highest BFL quality
`gpt-image-1`	OpenAI	Instruction-following
`nvidia-sana`	NVIDIA	Fast diffusion

Video models

Video predictions always take time (30s–5min). Use webhooks rather than polling. Output is always an MP4 URL.

Model	Cost/prediction	Description
`truefusion-video-pro`	$1.50	Skytells flagship video, best quality
`truefusion-video`	$0.80	Standard video generation
`mera`	$2.00	Cinematic quality
`lumo`	$1.20	Stylized/artistic video
`lipfusion`	$0.60	Lip-sync — animate a face with audio
`veo-3.1`	$3.00	Google Veo 3.1
`veo-3.1-fast`	$1.50	Faster Google Veo variant
`sora-2`	$2.50	OpenAI Sora 2
`sora-2-pro`	$5.00	OpenAI Sora 2 Pro

Video generation takes significantly longer than images. Always use webhooks for video — don't hold HTTP connections open for minutes.

Audio models

Model	Cost/prediction	Quality	Best for
`beatfusion-2.0`	$0.75	High	Production music, commercial use
`beatfusion-1.0`	$0.45	Standard	Prototyping, ambient tracks

Text & LLM models

Text models power the Inference API (/v1/chat/completions, /v1/responses, /v1/embeddings). They are billed per token, not per prediction.

Model	Billing unit	Description
`deepbrain-router`	`million_token`	Recommended default. Skytells' smart router — automatically selects the best available model for your prompt and budget. Fastest path to production.
`gpt-5`	`million_token`	OpenAI GPT-5, accessed through Skytells. Frontier-level reasoning and instruction-following.
`gpt-5.4`	`million_token`	Latest GPT-5 variant with extended context and improved factuality.

The model catalog changes frequently. Skytells continuously adds new models — from partner LLMs to new TrueFusion variants — sometimes daily. The table above reflects models available at the time this module was written, but always treat GET /v1/models as the source of truth. Hardcoding model IDs in production code can cause MODEL_NOT_FOUND errors when a model is renamed, deprecated, or replaced by a newer version. Fetch the catalog at startup or cache it with a short TTL.

Choosing a text model

Situation	Recommendation
Building something new, any use case	`deepbrain-router` — let Skytells pick the best model
Need the most capable reasoning available	`gpt-5` or `gpt-5.4`
Want cost predictability	`deepbrain-router` (optimises for quality-per-token automatically)
Semantic search / RAG embeddings	`deepbrain-router` with `/v1/embeddings`

Fetching live model data

Always fetch pricing from the API rather than hardcoding it:

# List all models with pricing
curl https://api.skytells.ai/v1/models \
  -H "x-api-key: $SKYTELLS_API_KEY"

# Get a specific model
curl https://api.skytells.ai/v1/models/truefusion-pro \
  -H "x-api-key: $SKYTELLS_API_KEY"

Choosing the right model

Use this decision tree before picking a model:

Quick decision guide

Situation	Model recommendation
First prototype / learning	`truefusion` (cheapest)
Real product, default	`truefusion-pro`
Real-time user-facing preview	`truefusion-edge`
Highest quality image	`truefusion-ultra` or `truefusion-2.0`
Artistic / creative style	`flux-1.1-pro`
Cinematic video	`mera` or `veo-3.1`
Animate a face	`lipfusion`
Background music	`beatfusion-2.0`

Cost estimation worksheet

Fill this in before you build. Use the pricing object from GET /v1/models/:namespace to get unit and rate values programmatically:

def estimate_monthly_cost(
    units_per_day: float,
    cost_per_unit: float,
    days: int = 30,
) -> float:
    """Works for any billing unit — image, token, second, etc."""
    return units_per_day * cost_per_unit * days

# Image model (unit: image) — truefusion-pro at $0.04/image
print(estimate_monthly_cost(100,  0.04))      # $120/mo — 100 images/day
print(estimate_monthly_cost(500,  0.04))      # $600/mo — 500 images/day
print(estimate_monthly_cost(1000, 0.01))      # $300/mo — 1000 fast previews/day

# Video model (unit: video_second, 8 sec clips)
print(estimate_monthly_cost(10 * 8, 0.12))   # $288/mo — 10 clips/day × 8s × $0.12/s

# LLM text model (unit: million_token)
print(estimate_monthly_cost(1.0, 2.00))       # $60/mo — 1M tokens/day at $2/M

Cost optimization tip: Use truefusion-edge at $0.01 for real-time user previews, then truefusion-pro at $0.04 for the final high-quality render. This dual approach gives great UX without paying premium prices for every interaction.

Rate limits by plan

Plan	Standard API	Edge API
Free	60 req/min	Not available
Pro	600 req/min	Not available
Business	Custom	✓ Edge-supported models only
Enterprise	Custom	✓ Edge-supported models only

When you hit a rate limit, the API returns 429 Too Many Requests with a Retry-After header. Implement exponential backoff in your retry logic.

Summary

You've completed the AI Foundations path. You can navigate the platform, make predictions, and estimate costs for any product you want to build.

You now know:

The TrueFusion family and when to use each tier
How to fetch live model data so your app stays accurate
How to estimate monthly costs before you write a line of code
truefusion-edge for real-time previews, truefusion-pro for final renders

Next steps:

Image Generation path → — master prompts and integrate images in apps
SDK Mastery → — use the Python or TypeScript SDK for cleaner code
API Reference → — complete technical reference

PreviousPlatform Tour NextThe Two APIs — Prediction & Inference Schemas

On this page