Supported Models

25 models across 10 providers. Last updated March 26, 2026.

Video Generation

9 models

Veo 3.1

Google AI Studio

State-of-the-art video generation with audio. Text-to-video and image-to-video.

1080pPer-second pricing

Veo 3.1 Fast

Google AI Studio

Faster inference variant of Veo 3.1 for rapid iteration.

1080pPer-second pricing

Sora 2 Standard

OpenAI

High-quality video generation from text and image inputs.

1080pPer-second pricing

Sora 2 Pro

OpenAI

Premium video generation with higher resolution and longer durations.

1080pPer-second pricing

Kling 3.0

fal.ai

High-quality video generation with strong motion consistency.

1080pPer-second pricing

Seedance 2.0

fal.ai

Dance and motion-focused video generation with natural movement.

1080pPer-second pricing

Wan 2.2 (A14B)

fal.ai

Text-to-video with strong character consistency. 14B parameter model.

1080pPer-second pricing

Wan 2.6

fal.ai

Latest Wan model with improved quality and longer durations.

1080pPer-second pricing

HunyuanVideo (13B)

fal.ai

Tencent's 13B parameter video generation model via fal.ai.

1080pPer-second pricing

Image Generation

4 models

Nano Banana Pro

Google AI Studio

High-quality image generation with excellent prompt adherence.

1024x1024Per-image pricing

Nano Banana 2

Google AI Studio

Next-gen image generation with improved detail and consistency.

1024x1024Per-image pricing

FLUX.2 Pro

Black Forest Labs

High-quality image generation with excellent prompt adherence and detail.

2048x2048Per-image pricing

FLUX.2 klein 4B

Local (Sidecar)

Lightweight FLUX model for local image generation on Mac (Metal/MPS).

1024x1024Free (runs locally)

LLMs

5 models

Gemini Pro (latest)

Google AI Studio

Advanced reasoning and multimodal understanding with 1M token context.

1000K contextPer-token pricing

Gemini Flash (latest)

Google AI Studio

Fast, cost-efficient model for everyday tasks. 1M context.

1000K contextPer-token pricing

GPT-5

OpenAI

Advanced LLM with strong reasoning and 128K context window.

128K contextPer-token pricing

Claude Sonnet 4.6

Anthropic

High-capability model with excellent instruction following and 200K context.

200K contextPer-token pricing

Qwen3 4B

Local (Ollama)

Compact LLM for script writing and AI assistant. Runs fully on-device.

32K contextFree (runs locally)

Audio / TTS

2 models

Multilingual v2

ElevenLabs

High-quality multilingual voice cloning and text-to-speech.

Per-character pricing

Turbo v2.5

ElevenLabs

Low-latency voice synthesis for fast iteration and previews.

Per-character pricing

Transcription

5 models

Gemini Transcription

Google AI Studio

Cloud-based transcription via Gemini with high accuracy.

Per-minute pricing

GPT-5 Transcription

OpenAI

Cloud-based transcription via OpenAI with word-level timestamps.

Per-minute pricing

Scribe v1

ElevenLabs

ElevenLabs transcription with speaker detection.

Per-minute pricing

Whisper Tiny

Local (bundled)

Lightweight on-device transcription. Bundled with Skia, no download required.

Free (runs locally)

Whisper Small

Local (download)

Higher-accuracy on-device transcription. One-time download (~500 MB).

Free (runs locally)

Start creating with these models.

Join the waitlist for early access and founding member pricing.

We'll email you when Skia is ready. No spam. Unsubscribe anytime.