22.08.2025

New LLM Models Available on HOSTKEY GPU Servers

When ordering a GPU server, you can now select pre-installed LLM models or deploy them later through your client portal.

Qwen3:32B

The largest dense model in Alibaba’s third-generation Qwen family.

Size: 32 billion parameters, for high accuracy and strong performance on complex tasks.
Capabilities: natural language generation, document analysis, advanced reasoning and context-rich responses.
Use cases: enterprise-grade chatbots, multilingual support and business process automation.
Key advantage: superior reasoning accuracy and extended context handling.

Qwen3-Coder

A specialized Qwen model optimized for software development tasks.

Purpose: code generation, explanation, auto-completion and error correction.
Supported languages: Python, C++, Java, Go, JavaScript and more.
Use cases: accelerating software development, integrating into IDEs, and supporting DevOps workflows.
Key advantage: enhanced syntax handling and analysis of large code bases.

GPT-OSS-20B

An open-weight LLM from OpenAI with about 21 billion parameters, designed for versatile applications.

Balance: strong performance with moderate compute requirements.
Capabilities: text generation, question answering and conversational AI.
Use cases: AI assistants, chatbots, information retrieval and content automation.
Key advantage: open weights under the Apache 2.0 license, highly customizable and easy to integrate.

In addition to the new models, HOSTKEY also offers GPU servers with pre-installed DeepSeek-r1-14b, Gemma-3-27b-it, Llama-3.3-70B and Phi-4-14b.
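
As a rough sizing aid when matching one of these models to a GPU server, the memory needed for the weights alone can be approximated as parameter count times bytes per parameter. The sketch below is illustrative only: the parameter counts are read off the model names, the precision table is an assumption rather than a HOSTKEY specification, and real deployments also need memory for the KV cache, activations and runtime overhead.

```python
# Rough weights-only memory estimate: parameters x bytes per parameter.
# Parameter counts are the nominal sizes in the model names; the precision
# table is an illustrative assumption, not a HOSTKEY specification.

BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}

MODELS_BILLION_PARAMS = {
    "Qwen3:32B": 32,
    "GPT-OSS-20B": 20,
    "DeepSeek-r1-14b": 14,
    "Gemma-3-27b-it": 27,
    "Llama-3.3-70B": 70,
    "Phi-4-14b": 14,
}


def weights_gb(billion_params: float, precision: str) -> float:
    """Approximate size of the weights in GB (1 GB = 10**9 bytes)."""
    # (billion_params * 1e9 params) * bytes_per_param / 1e9 bytes per GB
    return billion_params * BYTES_PER_PARAM[precision]


if __name__ == "__main__":
    for name, size in MODELS_BILLION_PARAMS.items():
        row = ", ".join(
            f"{p}: {weights_gb(size, p):.0f} GB" for p in BYTES_PER_PARAM
        )
        print(f"{name:<16} {row}")
```

Treat the figures as a lower bound: quantized formats carry some extra metadata, and long contexts can add several more gigabytes on top of the weights.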

Advantages of Running LLMs on Your Own GPU Servers

Full control – manage your infrastructure, execution environment and security policies.
Data privacy – your data stays on your servers, without being transferred to third-party providers.
Flexibility – choose the right model for your workload: general-purpose, code-focused or domain-specific.
Scalability – run multiple models in parallel or distribute workloads across GPUs.
No external limits – no request caps or vendor-imposed restrictions.

You can order an LLM model today either when configuring your GPU server or directly through your client portal.
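
To illustrate what keeping data on your own server looks like in practice, here is a minimal sketch of sending a prompt to a locally hosted model. It assumes the pre-installed model is served through an Ollama-compatible HTTP API on the server itself; the announcement does not state which serving software is used, so the base URL, port and model tag are placeholders to adjust for your deployment.

```python
import json
import urllib.request

# Assumption: the model is served by an Ollama-compatible endpoint running on
# the GPU server itself. The URL, port and model tag are placeholders.
BASE_URL = "http://localhost:11434"


def build_request(model: str, prompt: str,
                  base_url: str = BASE_URL) -> urllib.request.Request:
    """Build a non-streaming POST request for the /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str, base_url: str = BASE_URL,
             timeout: float = 120.0) -> str:
    """Send the prompt to the local server and return the generated text."""
    request = build_request(model, prompt, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]
```

Under these assumptions, a call such as generate("qwen3:32b", "Summarize this contract clause: ...") sends the prompt only to the local endpoint, so the text never leaves the server.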
