22.08.2025

New LLM Models Available on HOSTKEY GPU Servers

When ordering a GPU server, you can now select pre-installed LLM models or deploy them later through your client portal.

Qwen3:32B

The largest dense model in Alibaba’s third-generation Qwen family.

  • Size: 32 billion parameters, giving strong accuracy on complex tasks.

  • Capabilities: natural language generation, document analysis, advanced reasoning and context-rich responses.

  • Use cases: enterprise-grade chatbots, multilingual support and business process automation.

  • Key advantage: superior reasoning accuracy and extended context handling.

Qwen3-Coder

A specialized Qwen model optimized for software development tasks.

  • Purpose: code generation, explanation, auto-completion and error correction.

  • Supported languages: Python, C++, Java, Go, JavaScript and more.

  • Use cases: accelerating software development, integrating into IDEs, and supporting DevOps workflows.

  • Key advantage: enhanced syntax handling and analysis of large code bases.

GPT-OSS-20B

An open-weight LLM from OpenAI with 20 billion parameters, designed for versatile applications.

  • Balance: strong performance with moderate compute requirements.

  • Capabilities: text generation, question answering and conversational AI.

  • Use cases: AI assistants, chatbots, information retrieval and content automation.

  • Key advantage: open weights under the Apache 2.0 license, highly customizable and easy to integrate.

In addition to the new models, HOSTKEY also offers GPU servers with pre-installed DeepSeek-r1-14b, Gemma-3-27b-it, Llama-3.3-70B and Phi-4-14b.
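
When choosing a GPU configuration for one of these models, a rough first check is whether the weights fit in GPU memory. The sketch below is only a back-of-the-envelope estimate: it multiplies the nominal parameter counts quoted above by an assumed precision. The function name and the 16/8/4-bit precision levels are illustrative assumptions, not HOSTKEY specifications, and the result excludes the KV cache, activations and runtime overhead, so real requirements will be higher.

```python
def weight_memory_gb(params_billion: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes).

    Hypothetical helper for illustration; ignores KV cache, activations
    and framework overhead.
    """
    total_bytes = params_billion * 1e9 * bits_per_param / 8
    return total_bytes / 1e9


# Nominal parameter counts as stated in the announcement (in billions).
MODELS = {"Qwen3:32B": 32.0, "GPT-OSS-20B": 20.0}

if __name__ == "__main__":
    for name, size in MODELS.items():
        for bits in (16, 8, 4):  # assumed precision levels
            gb = weight_memory_gb(size, bits)
            print(f"{name} at {bits}-bit: ~{gb:.0f} GB for weights")
```

Under these assumptions, a 32-billion-parameter model needs about 64 GB for weights at 16-bit precision and about 16 GB at 4-bit, which is why quantized builds are common on single-GPU servers.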

Advantages of Running LLMs on Your Own GPU Servers

  1. Full control – manage your infrastructure, execution environment and security policies.

  2. Data privacy – your data stays on your servers, without being transferred to third-party providers.

  3. Flexibility – choose the right model for your workload: general-purpose, code-focused or domain-specific.

  4. Scalability – run multiple models in parallel or distribute workloads across GPUs.

  5. No external limits – no request caps or vendor-imposed restrictions.

You can order an LLM model today either when configuring your GPU server or directly through your client portal.
