Token Pricing
Transparent pricing for all AI models and services across input and output tokens. Input-token pricing is listed first, followed by output-token pricing.
Language models
Chat completion and text generation models.
| Model | Description | Price / 1M input tokens |
|---|---|---|
| GreenL | Lightweight, efficient language model. | €0.25 |
| GreenR | Reasoning-optimized language model. | €0.35 |
| gemma4 | Long-context multimodal model (256k). Public Preview. | €0.50 |
Chat models
Direct provider chat models available through the GreenPT API proxy (per 1M input tokens).
| Model | Price / 1M input tokens |
|---|---|
| deepseek-r1-distill-llama-70b | €0.90 |
| llama-3.3-70b-instruct | €0.90 |
| mistral-nemo-instruct-2407 | €0.20 |
| llama-3.1-8b-instruct | €0.20 |
| gemma-3-27b-it | €0.25 |
| devstral-2-123b-instruct-2512 | €0.60 |
| qwen3-235b-a22b-instruct-2507 | €0.75 |
| mistral-small-3.2-24b-instruct-2506 | €0.15 |
| qwen3-coder-30b-a3b-instruct | €0.20 |
| qwen3.5-397b-a17b | €0.66 |
| gpt-oss-120b | €0.15 |
| voxtral-small-24b-2507 | €0.15 |
Vector & search models
Embedding and reranking services.
Note: embedding and reranking models only charge for input tokens. There are no output token costs for these services.
| Model | Description | Price / 1M tokens |
|---|---|---|
| Green Embedding | Text vectorization and semantic search. | €0.20 |
| Green Rerank | Document reranking and relevance scoring. | €0.12 |
Speech-to-text models
Audio transcription services, billed per second of audio. Rates are shown per hour.
| Model | Mode | Rate |
|---|---|---|
| GreenS | Pre-recorded audio (batch) | €0.52 / hour |
| GreenS | Live audio (real-time) | €0.65 / hour |
| GreenS Pro | Monolingual: pre-recorded | €0.52 / hour |
| GreenS Pro | Monolingual: live audio | €0.78 / hour |
| GreenS Pro | Multilingual: pre-recorded | €0.60 / hour |
| GreenS Pro | Multilingual: live audio | €1.04 / hour |
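Because the rates above are quoted per hour while billing is per second, it can help to see the conversion worked out. The following is a minimal sketch that assumes per-second billing is a simple linear proration of the hourly rate, with no rounding increment or minimum charge (the page does not state either). The `HOURLY_RATE` keys and the `transcription_cost` function are hypothetical names used for illustration, not part of any GreenPT API.

```python
from decimal import Decimal

# Hourly rates in EUR, copied from the table above.
# The (model, mode) key labels are illustrative, not official identifiers.
HOURLY_RATE = {
    ("GreenS", "pre-recorded"): Decimal("0.52"),
    ("GreenS", "live"): Decimal("0.65"),
    ("GreenS Pro", "monolingual pre-recorded"): Decimal("0.52"),
    ("GreenS Pro", "monolingual live"): Decimal("0.78"),
    ("GreenS Pro", "multilingual pre-recorded"): Decimal("0.60"),
    ("GreenS Pro", "multilingual live"): Decimal("1.04"),
}

SECONDS_PER_HOUR = Decimal(3600)


def transcription_cost(model: str, mode: str, seconds: int) -> Decimal:
    """Estimate the EUR cost of transcribing `seconds` of audio.

    Assumption: the hourly rate is prorated linearly per second.
    """
    return HOURLY_RATE[(model, mode)] * seconds / SECONDS_PER_HOUR


# Example: a 30-minute live call transcribed with GreenS.
print(transcription_cost("GreenS", "live", 30 * 60))
```

`Decimal` is used instead of `float` so that small per-second amounts do not pick up binary rounding noise.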
Language model completions
Generated text and chat responses.
| Model | Description | Price / 1M output tokens |
|---|---|---|
| GreenL | Fast, efficient completions. | €0.80 |
| GreenR | Advanced reasoning completions. | €0.95 |
| gemma4 | Long-context multimodal completions. Public Preview. | €1.50 |
Chat models
Direct provider chat models available through the GreenPT API proxy (per 1M output tokens).
| Model | Price / 1M output tokens |
|---|---|
| deepseek-r1-distill-llama-70b | €0.90 |
| llama-3.3-70b-instruct | €0.90 |
| mistral-nemo-instruct-2407 | €0.20 |
| llama-3.1-8b-instruct | €0.20 |
| gemma-3-27b-it | €0.50 |
| devstral-2-123b-instruct-2512 | €2.35 |
| qwen3-235b-a22b-instruct-2507 | €2.25 |
| mistral-small-3.2-24b-instruct-2506 | €0.35 |
| qwen3-coder-30b-a3b-instruct | €0.80 |
| qwen3.5-397b-a17b | €3.96 |
| gpt-oss-120b | €0.60 |
| voxtral-small-24b-2507 | €0.35 |
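A chat request is billed on both sides: input tokens at the input price and output tokens at the output price. The sketch below combines the two chat model tables for a few models. It assumes the total is a plain sum of the two components with no extra fees. `CHAT_PRICES` and `request_cost` are hypothetical names for illustration, and the token counts are made-up inputs.

```python
from decimal import Decimal

# (input price, output price) in EUR per 1M tokens, copied from the
# two chat model tables. Only a few models are listed here.
CHAT_PRICES = {
    "gpt-oss-120b": (Decimal("0.15"), Decimal("0.60")),
    "llama-3.3-70b-instruct": (Decimal("0.90"), Decimal("0.90")),
    "qwen3.5-397b-a17b": (Decimal("0.66"), Decimal("3.96")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the EUR cost of one chat request.

    Assumption: cost = input_tokens * input_price + output_tokens * output_price,
    each scaled from the per-1M-token price.
    """
    input_price, output_price = CHAT_PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


# Example: a 2,000-token prompt that produces a 500-token reply.
print(request_cost("gpt-oss-120b", 2_000, 500))
```

For models with a large gap between input and output prices, such as qwen3.5-397b-a17b, the length of the reply dominates the cost far more than the length of the prompt.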
Note: embedding, reranking, and speech-to-text services don't generate output tokens. They're billed only on input. See the input pricing above.
Token usage examples
Typical output-token counts and their cost for different use cases, calculated at each model's output-token price.
GreenL usage examples
| Use case | Tokens | Cost |
|---|---|---|
| Short answer | 50 | €0.00004 |
| Paragraph | 200 | €0.00016 |
| Article | 1,000 | €0.0008 |
- Short answer: brief responses, simple Q&A.
- Paragraph: detailed explanations, summaries.
- Article: long-form content, reports.
GreenR usage examples
| Use case | Tokens | Cost |
|---|---|---|
| Analysis | 50 | €0.0000475 |
| Detailed report | 200 | €0.00019 |
| Research paper | 1,000 | €0.00095 |
- Analysis: complex reasoning tasks.
- Detailed report: in-depth analysis, research.
- Research paper: academic content, detailed analysis.
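The costs in the GreenL and GreenR usage tables follow from multiplying the token count by the output price and dividing by one million. The sketch below reproduces that arithmetic. `OUTPUT_PRICE_PER_MILLION` and `output_cost` are hypothetical names for illustration.

```python
from decimal import Decimal

# Output-token prices in EUR per 1M tokens, copied from the
# "Language model completions" table.
OUTPUT_PRICE_PER_MILLION = {
    "GreenL": Decimal("0.80"),
    "GreenR": Decimal("0.95"),
}


def output_cost(model: str, output_tokens: int) -> Decimal:
    """Return the EUR cost of generating `output_tokens` tokens."""
    return OUTPUT_PRICE_PER_MILLION[model] * output_tokens / Decimal(1_000_000)


# Print the cost for each token count used in the usage tables.
for model in ("GreenL", "GreenR"):
    for tokens in (50, 200, 1_000):
        print(f"{model}: {tokens} tokens -> EUR {output_cost(model, tokens):f}")
```

These figures cover only the generated output. The prompt that triggers each response is billed separately at the input-token price.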
Cost comparison
Input vs output token pricing.
| Model | Input / 1M | Output / 1M | Output multiplier |
|---|---|---|---|
| GreenL | €0.25 | €0.80 | 3.2× |
| GreenR | €0.35 | €0.95 | 2.7× |
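The output multiplier is the output price divided by the input price, rounded to one decimal place. A minimal sketch of that calculation follows. The rounding mode (half up) is an assumption, since the page does not say how the figures were rounded, and `PRICES` and `output_multiplier` are hypothetical names.

```python
from decimal import Decimal, ROUND_HALF_UP

# (input price, output price) in EUR per 1M tokens, from the table above.
PRICES = {
    "GreenL": (Decimal("0.25"), Decimal("0.80")),
    "GreenR": (Decimal("0.35"), Decimal("0.95")),
}


def output_multiplier(model: str) -> Decimal:
    """Return output price / input price, rounded to one decimal place."""
    input_price, output_price = PRICES[model]
    ratio = output_price / input_price
    # Assumption: half-up rounding to one decimal, matching the table's format.
    return ratio.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


for model in PRICES:
    print(f"{model}: {output_multiplier(model)}x")
```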
Pricing notes
- All prices are in EUR and exclude applicable taxes.
- Token counting follows OpenAI-compatible standards.
- Speech-to-text pricing is based on audio duration, not tokens.
- Output tokens are typically priced higher than input tokens.
- Embedding and reranking models only charge for input tokens (no output costs).
- Volume discounts are available for enterprise customers.
- Prices are subject to change with 30 days' notice.