Back to Getting Started

Model Selection Guide

Choosing the Right AI Models for Your Assistant

Not all AI models are created equal. Learn how to balance cost, speed, and quality to get the best results for your needs.

Why Model Selection Matters

Your AI assistant's "brain" is the language model it uses. Different models have different strengths:

πŸ’° Cost

Prices range from $0.0001 to $0.06 per 1,000 tokens. Using an expensive model for simple tasks wastes money.

⚑ Speed

Faster models respond in seconds. Slower (but smarter) models can take 10-30 seconds for complex tasks.

🧠 Quality

Premium models reason better, understand nuance, and make fewer mistakes on complex tasks.

πŸ“š Context Window

How much text the model can "remember" at once. Larger windows handle long documents or conversations.

Major Provider Comparison

ProviderPopular ModelsCost RangeBest For
OpenAIGPT 5.2, GPT 5.3 Codex$0.00175 - $0.014 / 1k tokensGeneral purpose, coding, complex reasoning, multimodal
AnthropicOpus 4.6, Sonnet 4.6$0.003 - $0.025 / 1k tokensLong documents, careful analysis, creative writing, extended context
GoogleGemini 3.1 Pro$0.002 - $0.012 / 1k tokensMultimodal (text + images + video), speed, cost-effectiveness
Qwen (Alibaba)Qwen 2.5, Qwen Max$0.00001 - $0.0016 / 1k tokensMultilingual (especially Chinese), cost-effective, coding
Kimi (Moonshot AI)Kimi K1$0.0006 - $0.0025 / 1k tokensUltra-long context (200k+ tokens), document analysis
OpenRouterAll of the above + 300+ more$0.00001 - $0.025 / 1k tokensFlexibility, cost optimization, model experimentation (no markup)

πŸ’‘ Pro Tip: OpenRouter gives you access to 300+ models from all major providers through a single API key with no markup on inference pricing (pass-through rates).

Pricing updated March 2026. Token costs shown are input rates; output tokens typically cost 4-10x more. Visit provider pricing pages for current rates and detailed breakdowns.

Direct Provider vs OpenRouter: Which to Choose?

Go Direct When:

  • βœ“You know exactly which model you want (e.g., only GPT-4)
  • βœ“You need provider-specific features (like OpenAI's DALL-E integration)
  • βœ“You qualify for enterprise pricing or volume discounts
  • βœ“You want the absolute latest model releases (direct gets them first)
  • βœ“You're building on a single provider's ecosystem long-term

Best for: Committed users who know their needs and value direct support.

Use OpenRouter When:

  • βœ“You want flexibility to switch models without changing code
  • βœ“You want to use cheap models for simple tasks, premium for complex ones
  • βœ“You're experimenting and don't know which model fits best
  • βœ“You want automatic failover if one provider has an outage
  • βœ“You want unified billing across all providers (one bill, one API key)

Best for: Most beginners and anyone optimizing for cost and flexibility. ⭐ Recommended

Model Recommendations by Use Case

πŸ“§ Simple Tasks (Email summaries, weather, reminders)

Recommended: Qwen 2.5, Gemini 3.1 Pro, Kimi K1

Cost: $0.00023-0.002/1k input β€’ Speed: 1-3 seconds β€’ Monthly: $5-20

🧠 Complex Reasoning (Strategy, analysis, decisions)

Recommended: GPT 5.2, Opus 4.6, Sonnet 4.6

Cost: $0.00175-0.005/1k input, $0.014-0.025/1k output β€’ Speed: 3-10 seconds β€’ Monthly: $50-250

πŸ“„ Long Documents (Research papers, contracts, books)

Recommended: Kimi K1 (200k+ context), Opus 4.6, Gemini 3.1 Pro

Cost: $0.0006-0.005/1k input β€’ Context: 200k-1M+ tokens β€’ Monthly: $30-120

πŸ’» Coding & Technical Tasks

Recommended: GPT 5.3 Codex, Sonnet 4.6, Qwen 2.5

Cost: $0.00023-0.003/1k input, $0.01-0.015/1k output β€’ Speed: 2-8 seconds β€’ Monthly: $40-180

πŸ’° Budget-Conscious (High volume, low complexity)

Recommended: Qwen 2.5 (~$0.23/M), Kimi K1, Gemini 3.1 via OpenRouter

Cost: $0.00023-0.002/1k input β€’ Speed: 1-5 seconds β€’ Monthly: $2-15

How to Get Started

1

Choose Your Provider

Start with OpenRouter if you're unsure. You can always switch to direct providers later.

2

Create an Account & Add Credits

Sign up and add $5-20 to start. Most providers accept credit cards and have pay-as-you-go pricing.

3

Generate an API Key

Look for "API Keys" in your provider dashboard. Generate a new key and copy it immediatelyβ€”you won't see it again.

4

Store It Securely

Never share your API key or commit it to code. Store it in an encrypted vault (1Password, Bitwarden) or environment variable.

5

Configure Your Assistant

Add the API key to your assistant's configuration and choose which model to use as the default.

Common Questions

Can I switch providers or models later?

Yes! If you use OpenRouter, switching models is as simple as changing a config setting. With direct providers, you'll need to swap API keys, but it's still straightforward.

What if I run out of credits?

Your assistant will stop working until you add more. Most providers send email warnings when your balance is low. Set up billing alerts to avoid surprises.

How do I track my usage?

All providers have usage dashboards showing tokens consumed and costs. Check it weekly for the first month to understand your spending patterns.

Is my data safe with these providers?

Major providers (OpenAI, Anthropic, Google) don't train on API data by default. Always check the provider's data policy. For maximum privacy, avoid sending sensitive information or use local models.

Can I use multiple models for different tasks?

Yes! Advanced setups can route tasks to different models based on complexity. Use cheap models for simple tasks, premium for hard ones. OpenRouter makes this especially easy.

Still Not Sure Which to Choose?

We can help you select the right models for your use case and budget, set up your API keys securely, and optimize costs over time.