gemma 2 2b jpn it

Available directly through the EU router or as a dedicated GPU deployment. Data stays in Europe.

gemma 2 2b jpn it is an open-source language model from Google with 2.6B parameters, hosted on European GPUs behind an OpenAI-compatible API.

google/gemma-2-2b-jpn-it
text->text · google · EU-hosted
Parameters: 2.6B
Context window: not specified
Minimum VRAM: 8 GB
POST /api/v1/chat/completions → 200 OK

Specifications

Parameters: 2.6B
Minimum VRAM: 8 GB
Architecture: Gemma2ForCausalLM (vLLM)
License: gemma
Modality: text->text
Released: September 2024
Publisher: google
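
As a rough sanity check on the 8 GB figure, the arithmetic below estimates how much memory the weights alone would take. It assumes 16-bit (2-byte) weights, which this page does not state, and the function name is purely illustrative. Whatever remains of the 8 GB would have to cover the KV cache, activations, and runtime overhead.

```python
# Back-of-the-envelope weight memory estimate.
# ASSUMPTION: weights are stored in a 16-bit format (2 bytes per parameter);
# the page gives only the parameter count and the minimum VRAM.

PARAMETERS = 2.6e9            # 2.6B parameters, from the specifications
BYTES_PER_PARAMETER = 2       # assumed 16-bit weights
MINIMUM_VRAM_GB = 8.0         # from the specifications


def weight_memory_gb(parameters: float, bytes_per_parameter: int) -> float:
    """Return the memory needed for the weights alone, in decimal gigabytes."""
    return parameters * bytes_per_parameter / 1e9


weights_gb = weight_memory_gb(PARAMETERS, BYTES_PER_PARAMETER)
headroom_gb = MINIMUM_VRAM_GB - weights_gb
print(f"weights: {weights_gb:.1f} GB, headroom: {headroom_gb:.1f} GB")
```

Under that assumption the weights take about 5.2 GB, leaving roughly 2.8 GB of headroom within the stated minimum.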

Pricing

Input (per 1M tokens): €0.03
Output (per 1M tokens): €0.06

Shared EU router, pay-per-token, scale-to-zero. Dedicated GPU deployments are billed per hour; see pricing.
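
To make the per-token pricing concrete, here is a minimal sketch that turns token counts into an estimated bill on the shared router. The rates are the ones listed above; the function and constant names are illustrative, and the sketch ignores dedicated deployments, which are billed per hour.

```python
from decimal import Decimal

# Rates from the pricing table above (shared EU router).
INPUT_EUR_PER_MILLION = Decimal("0.03")
OUTPUT_EUR_PER_MILLION = Decimal("0.06")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the pay-per-token cost in euros for the given token counts."""
    total = (
        Decimal(input_tokens) * INPUT_EUR_PER_MILLION
        + Decimal(output_tokens) * OUTPUT_EUR_PER_MILLION
    )
    return total / TOKENS_PER_MILLION


# Example: 10M input tokens and 2M output tokens.
print(estimate_cost_eur(10_000_000, 2_000_000))  # 0.30 + 0.12 = 0.42 euros
```

Decimal is used instead of float so that very small per-request amounts add up without rounding drift.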

Call it directly

Drop-in replacement for OpenAI: change only the base URL and the API key. The Anthropic format (/v1/messages) is also supported.

curl https://hostyourai.com/api/v1/chat/completions \
  -H "Authorization: Bearer hyai-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemma-2-2b-jpn-it",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
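
The same request can be sketched in Python using only the standard library. The URL, model ID, and headers mirror the curl call above; the function names and the `choices[0].message.content` response shape (the usual OpenAI chat completions layout) are assumptions, not something this page confirms.

```python
import json
import urllib.request

BASE_URL = "https://hostyourai.com/api/v1"   # base URL from this page
MODEL = "google/gemma-2-2b-jpn-it"           # model ID from this page


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request to the chat completions endpoint."""
    body = {"model": MODEL, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON reply (needs network and a valid key)."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


# Usage (not executed here; assumes an OpenAI-style response body):
#   reply = send(build_chat_request("hyai-...", "Hello!"))
#   print(reply["choices"][0]["message"]["content"])
```

Keeping request construction separate from sending makes it possible to inspect the payload without network access.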

Frequently asked questions

Can I run gemma 2 2b jpn it in the EU?

Yes. HostYourAI runs gemma 2 2b jpn it on GPUs in European data centers using vLLM. Prompts and outputs do not leave the EU, and no US cloud provider is involved in the chain.

Is hosting gemma 2 2b jpn it GDPR-compliant?

Yes. All processing takes place within the EU, a data processing agreement (DPA) is available, and the subprocessor list is public. Open-source weights also mean no training on your data.

What does gemma 2 2b jpn it cost?

Through the shared EU router you pay €0.03 per million input tokens and €0.06 per million output tokens, with no fixed costs. For high volumes or isolation, you can also run gemma 2 2b jpn it as a dedicated GPU instance billed per hour.

Is the API compatible with OpenAI?

Yes. You use the standard OpenAI SDKs with a custom base URL (https://hostyourai.com/api/v1). The Anthropic Messages API is also supported as a drop-in.
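
For the Anthropic-style route, a request might look like the sketch below. Several details are assumptions: the full URL is formed by appending `/messages` to the base URL, and the `x-api-key` and `anthropic-version` headers follow Anthropic's public API convention; this page does not say which headers the endpoint expects. The `max_tokens` field is required by the Anthropic Messages format.

```python
import json
import urllib.request

BASE_URL = "https://hostyourai.com/api/v1"   # base URL from this page
MODEL = "google/gemma-2-2b-jpn-it"           # model ID from this page


def build_messages_request(
    api_key: str, prompt: str, max_tokens: int = 256
) -> urllib.request.Request:
    """Build (but do not send) an Anthropic Messages-style POST request."""
    body = {
        "model": MODEL,
        "max_tokens": max_tokens,  # required in the Anthropic Messages format
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/messages",  # ASSUMPTION: /v1/messages under the base URL
        data=json.dumps(body).encode("utf-8"),
        headers={
            # ASSUMPTION: header names follow Anthropic's convention.
            "x-api-key": api_key,
            "anthropic-version": "2023-06-01",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

If the endpoint expects the same Bearer token header as the chat completions route, only the headers dictionary would need to change.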

Other models from Google

gemma 4 31B it qat w4a16 ct

Note: This model card is for the new versions of the Gemma 4 family optimized with Quantization-Aware Training (QAT), which allows preserving similar quality to bfloat16 while dramatically reducing the memory requirements to load the model. Four versions of the QAT checkpoints are available: Unquantized QAT checkpoints (Q4_0): Half-precision weights extracted from the QAT pipeline, ideal for custom downstream compilation and research. Available for Gemma 4 E2B, E4B, 12B, 26B A4B, and 31B, and their drafter models. GGUF (Q4_0): Ready-to-deploy formats for broad ecosystem compatibility. …

34B View model →
gemma 4 26B A4B it qat q4 0 unquantized

27B View model →
gemma 4 31B it qat q4 0 unquantized

33B View model →
gemma 4 31B

Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on small models) and generating text output. This release includes open-weights models in both pre-trained and instruction-tuned variants. Gemma 4 features a context window of up to 256K tokens and maintains multilingual support in over 140 languages.

33B View model →
gemma 4 26B A4B

27B View model →
gemma 4 26B A4B it

27B View model →

Try gemma 2 2b jpn it for free

Creating an account takes a minute. Test gemma 2 2b jpn it right away in the playground.