Beginner20 minModule 2 of 6

Models & Billing

Navigate Skytells' full model catalog — image, video, audio, and text/LLM models — and accurately estimate costs before you build.

What you'll be able to do after this module

Pick the right model for any job — image, video, audio, or LLM text. Estimate what your product will cost before writing a line of code. Avoid over-spending on quality you don't need.


How pricing works

Skytells uses flexible, unit-based pricing — each model is billed in the unit that best reflects how it consumes resources. There are no flat subscriptions or hidden compute surcharges; you pay only for what you use.

Billing units

UnitDescriptionTypical models
imagePer image generatedImage generation models
videoPer video clip generatedVideo models
secondPer second of output mediaVariable-length audio/video
audio_secondPer second of audio outputSpeech and music models
video_secondPer second of video outputLong-form video models
5 secondsPer 5-second block of outputShort video segment models
minutePer minute of outputLong-form audio/video
predictionPer prediction call (flat rate)Fixed-output models
gpuPer GPU-second of computeFine-tuning and training jobs
computing_secondPer second of compute timeSpecialized compute models
image_megapixelPer megapixel generatedUpscaling and high-res models
tokenPer token (input + output)LLM text models
million_tokenPer million tokensHigh-volume LLM usage

Cost estimation examples

# Image model (unit: image)
500 images/day × $0.04/image × 30 days = $600/month

# Video model (unit: video_second)
10 clips/day × 8 seconds/clip × $0.12/second × 30 days = $288/month

# LLM text model (unit: million_token)
1,000,000 tokens/day × $2.00/million × 30 days = $60/month

Image models

Skytells' flagship image models are the TrueFusion family — purpose-built for quality, speed, and flexibility.

TrueFusion family

ModelCost/predictionSpeedBest for
truefusion$0.02FastQuick drafts, previews
truefusion-pro$0.04MediumDefault choice. High quality
truefusion-max$0.06SlowMaximum detail
truefusion-ultra$0.08SlowPhotorealism at its best
truefusion-2.0$0.06MediumNext-gen quality, improved coherence
truefusion-edge$0.01Very fastReal-time previews (Edge-only)
truefusion-pano$0.05MediumWide panoramic outputs

Third-party image models

Available on the same API with the same x-api-key header — no extra accounts needed:

ModelProviderBest for
google-imagen-3GooglePhotorealistic renders
google-imagen-4GoogleLatest Google quality
flux-1.1-proBlack Forest LabsArtistic / creative styles
flux-2-proBlack Forest LabsHighest BFL quality
gpt-image-1OpenAIInstruction-following
nvidia-sanaNVIDIAFast diffusion

Video models

Video predictions always take time (30s–5min). Use webhooks rather than polling. Output is always an MP4 URL.

ModelCost/predictionDescription
truefusion-video-pro$1.50Skytells flagship video, best quality
truefusion-video$0.80Standard video generation
mera$2.00Cinematic quality
lumo$1.20Stylized/artistic video
lipfusion$0.60Lip-sync — animate a face with audio
veo-3.1$3.00Google Veo 3.1
veo-3.1-fast$1.50Faster Google Veo variant
sora-2$2.50OpenAI Sora 2
sora-2-pro$5.00OpenAI Sora 2 Pro

Audio models

ModelCost/predictionQualityBest for
beatfusion-2.0$0.75HighProduction music, commercial use
beatfusion-1.0$0.45StandardPrototyping, ambient tracks

Text & LLM models

Text models power the Inference API (/v1/chat/completions, /v1/responses, /v1/embeddings). They are billed per token, not per prediction.

ModelBilling unitDescription
deepbrain-routermillion_tokenRecommended default. Skytells' smart router — automatically selects the best available model for your prompt and budget. Fastest path to production.
gpt-5million_tokenOpenAI GPT-5, accessed through Skytells. Frontier-level reasoning and instruction-following.
gpt-5.4million_tokenLatest GPT-5 variant with extended context and improved factuality.

Choosing a text model

SituationRecommendation
Building something new, any use casedeepbrain-router — let Skytells pick the best model
Need the most capable reasoning availablegpt-5 or gpt-5.4
Want cost predictabilitydeepbrain-router (optimises for quality-per-token automatically)
Semantic search / RAG embeddingsdeepbrain-router with /v1/embeddings

Fetching live model data

Always fetch pricing from the API rather than hardcoding it:

# List all models with pricing
curl https://api.skytells.ai/v1/models \
  -H "x-api-key: $SKYTELLS_API_KEY"

# Get a specific model
curl https://api.skytells.ai/v1/models/truefusion-pro \
  -H "x-api-key: $SKYTELLS_API_KEY"

Choosing the right model

Use this decision tree before picking a model:

Image Video Audio Speed] fastest preview Balanced Max quality Skytells native Google quality OpenAI style High quality Lower cost What do you need? Media type Priority G H truefusion-edge truefusion-pro truefusion-ultra or truefusion-2.0 truefusion-video-pro veo-3.1 sora-2 beatfusion-2.0 beatfusion-1.0

Quick decision guide

SituationModel recommendation
First prototype / learningtruefusion (cheapest)
Real product, defaulttruefusion-pro
Real-time user-facing previewtruefusion-edge
Highest quality imagetruefusion-ultra or truefusion-2.0
Artistic / creative styleflux-1.1-pro
Cinematic videomera or veo-3.1
Animate a facelipfusion
Background musicbeatfusion-2.0

Cost estimation worksheet

Fill this in before you build. Use the pricing object from GET /v1/models/:namespace to get unit and rate values programmatically:

def estimate_monthly_cost(
    units_per_day: float,
    cost_per_unit: float,
    days: int = 30,
) -> float:
    """Works for any billing unit — image, token, second, etc."""
    return units_per_day * cost_per_unit * days

# Image model (unit: image) — truefusion-pro at $0.04/image
print(estimate_monthly_cost(100,  0.04))      # $120/mo — 100 images/day
print(estimate_monthly_cost(500,  0.04))      # $600/mo — 500 images/day
print(estimate_monthly_cost(1000, 0.01))      # $300/mo — 1000 fast previews/day

# Video model (unit: video_second, 8 sec clips)
print(estimate_monthly_cost(10 * 8, 0.12))   # $288/mo — 10 clips/day × 8s × $0.12/s

# LLM text model (unit: million_token)
print(estimate_monthly_cost(1.0, 2.00))       # $60/mo — 1M tokens/day at $2/M

Rate limits by plan

PlanStandard APIEdge API
Free60 req/minNot available
Pro600 req/minNot available
BusinessCustom✓ Edge-supported models only
EnterpriseCustom✓ Edge-supported models only

When you hit a rate limit, the API returns 429 Too Many Requests with a Retry-After header. Implement exponential backoff in your retry logic.


Summary

You now know:

  • The TrueFusion family and when to use each tier
  • How to fetch live model data so your app stays accurate
  • How to estimate monthly costs before you write a line of code
  • truefusion-edge for real-time previews, truefusion-pro for final renders

Next steps:

On this page