Instantly via the EU router or as a dedicated GPU deployment. Data stays in Europe.
Meta Llama Guard 2 8B is an open-source language model from Meta with 8B parameters, built to classify prompts and responses as safe or unsafe. It is hosted on EU GPUs via an OpenAI-compatible API.
Shared EU router, pay-per-token, scale-to-zero. Dedicated GPU deployments are billed hourly — see pricing.
✓ Verified working on 22-06-2026 — responded in 1579 ms on our EU infrastructure.
Drop-in replacement for OpenAI: change only the base URL and API key. The Anthropic format (/v1/messages) is supported too.
curl https://hostyourai.com/api/v1/chat/completions \
  -H "Authorization: Bearer hyai-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "meta-llama/Meta-Llama-Guard-2-8B",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
Yes. HostYourAI runs Meta Llama Guard 2 8B on GPUs in European datacenters via vLLM. Prompts and outputs never leave the EU and there is no US cloud provider in the chain.
Yes. All processing happens inside the EU, a Data Processing Agreement (DPA) is available, and the subprocessor list is public. The model runs from open-source weights, and your data is not used for training.
Via the shared EU router you pay €0.10 per million input tokens and €0.18 per million output tokens, with no fixed costs. For high volume or isolation you can also run Meta Llama Guard 2 8B as a dedicated hourly GPU instance.
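To get a feel for what the per-token prices mean in practice, here is a minimal sketch of the arithmetic. It assumes the two rates quoted above and nothing else: no minimums, rounding rules or taxes are modelled. The function name estimate_cost_eur is made up for this illustration.

```python
from decimal import Decimal

# Rates quoted on this page for the shared EU router (EUR per million tokens).
INPUT_EUR_PER_MILLION = Decimal("0.10")
OUTPUT_EUR_PER_MILLION = Decimal("0.18")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the shared-router cost in EUR (hypothetical helper).

    Only the two per-token rates are modelled; actual invoicing may differ.
    """
    input_cost = INPUT_EUR_PER_MILLION * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_EUR_PER_MILLION * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # 2.5M input tokens and 0.5M output tokens: 0.25 + 0.09 EUR
    print(estimate_cost_eur(2_500_000, 500_000))
```

Decimal is used instead of float so that small amounts add up exactly, which matters when you sum many requests.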
Yes. You use the standard OpenAI SDKs with a custom base URL (https://hostyourai.com/api/v1). The Anthropic Messages API is supported as a drop-in as well.
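As a rough sketch of the base-URL swap with the official OpenAI Python SDK, the snippet below reuses the base URL and model ID from this page. The environment variable HOSTYOURAI_API_KEY and the helper names client_settings and ask are assumptions made for this example, not names defined by HostYourAI. The openai package must be installed separately.

```python
import os

BASE_URL = "https://hostyourai.com/api/v1"  # base URL given on this page
MODEL = "meta-llama/Meta-Llama-Guard-2-8B"  # model ID from the curl example


def client_settings(api_key: str) -> dict:
    """Return the two settings that differ from a default OpenAI setup."""
    return {"base_url": BASE_URL, "api_key": api_key}


def ask(prompt: str, api_key: str) -> str:
    """Send one user message and return the model's text reply."""
    # Imported here so the rest of the module loads without the SDK installed.
    from openai import OpenAI  # third-party: pip install openai

    client = OpenAI(**client_settings(api_key))
    response = client.chat.completions.create(
        model=MODEL,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content


if __name__ == "__main__":
    # HOSTYOURAI_API_KEY is a hypothetical variable name chosen for this sketch.
    key = os.environ.get("HOSTYOURAI_API_KEY")
    if key:
        print(ask("Hello!", key))
```

Everything else in an existing OpenAI integration should stay the same. Only the base_url and api_key arguments change.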
Creating an account takes a minute. Test Meta Llama Guard 2 8B straight away in the playground.