GPU Servers with AMD Radeon RX 7900 XTX and AMD R9700

Servers powered by AMD Radeon RX 7900 XTX based on the RDNA 3 architecture and AMD R9700 based on RDNA 4 deliver high performance for artificial intelligence, 3D rendering and big data processing. This powerful and cost-effective solution is ideal for business workloads of any complexity.

⚡

NEW! GPU Servers Equipped with AMD Radeon AI PRO R9700 (32 GB) — €0.471/hour

⏰

GPU servers are available on both hourly and monthly payment plans. Read about how the hourly server rental works.

Apps for AI, ML and Data Science

Order a GPU server with pre-installed software and get a ready-to-use environment in minutes.

AI Platform All apps

PyTorch Fully featured framework for building deep learning models.

Self-hosted AI Chatbot Free and self-hosted AI Chatbot built on Ollama, Lllama3 LLM model and OpenWebUI interface.

TensorFlow Free and open-source software library for machine learning and artificial intelligence.

Apache Spark Multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

JupyterLab Web-based interactive development environment for notebooks, code, and data.

Anaconda Open ecosystem for Data science and AI development.

Apache Airflow Open-source workflow management platform for data engineering pipelines.

Advantages of AMD Radeon RX 7900 XTX

The AMD Radeon RX 7900 combines cutting-edge technology with high performance. It offers excellent power efficiency, ample video memory, and support for advanced features like second-generation Ray Tracing and Infinity Cache. This makes it an ideal choice for professional workloads, including 3D rendering, artificial intelligence (AI), and scientific computing.
Powered by RDNA 3 architecture and equipped with 24 GB GDDR6 memory, the card effortlessly handles demanding tasks such as 4K gaming and complex 3D graphics.

RDNA 3 Architecture

The graphics card features 6,144 stream processors, delivering exceptional performance for both gaming and professional applications.

Up to 24GB of GDDR6 Memory

Ample capacity for handling complex calculations.

Infinity Cache Support

Enhanced memory bandwidth for improved performance.

Second-Generation Ray Tracing

Delivers realistic graphics and enhanced visualization.

Wide Compatibility

Optimized for various professional applications.

Energy Efficiency

Lower power consumption reduces operating costs.

Scalability

Perfect for multi-GPU server configurations.

Affordable Pricing

High performance at a more competitive price than similar NVIDIA solutions. Using multiple AMD GPUs in a single server offers a cost-effective alternative to high-end NVIDIA cards.

Advantages of the AMD Radeon AI PRO R9700

The AMD Radeon AI PRO R9700 is designed for artificial intelligence workloads, local inference, 3D rendering, and large-scale machine learning models. Built on the RDNA 4 architecture, the GPU features 32 GB of GDDR6 memory and supports modern AI frameworks through the AMD ROCm platform. With high compute performance, multi-GPU scalability, and PCIe 5.0 support, it is well suited for professional workstations and AI servers with demanding computational workloads.

RDNA 4 Architecture

Modern AMD architecture with improved performance for AI workloads and professional computing.

32 GB GDDR6 Memory

Large VRAM capacity suitable for local LLM deployment, generative AI, and large dataset processing.

128 AI Accelerators

Hardware AI accelerators deliver high performance for inference and machine learning workloads.

AMD ROCm Support

Compatibility with popular AI frameworks including PyTorch, TensorFlow, and ONNX Runtime.

PCIe 5.0 Support

High-speed data transfer for modern server and workstation platforms.

64 MB Infinity Cache

Reduced memory latency and improved overall GPU performance.

Multi-GPU Scalability

Suitable for servers and workstations with multiple GPUs for AI and HPC workloads.

High AI Performance

Up to 1531 TOPS INT4 and up to 95.7 TFLOPS FP16 for AI workloads and accelerated computing.

Optimized Price-to-Performance Ratio

The AMD Radeon AI PRO R9700 offers large VRAM capacity and strong AI performance at a more affordable price compared to several professional NVIDIA solutions.

Performance: AMD Radeon RX 7900 XTX vs Nvidia RTX 4090 (Ubuntu 22.04, kernel 6.8.0, ROCm 6.3.1, CUDA 12.6)

	AMD Radeon RX 7900 XTX	Nvidia RTX 4090
Llama 3.3 70B (2K context, 54 Gb VRAM). Q4 in Ollama	Response: 12 token/s	Response: 17 token/s
Gemma 2 27B (2K context - 28 Gb VRAM). Q4 in Ollama	Response: 32 token/s	Response: 40 token/s
Gemma 2 27B (8K context — 41 Gb VRAM). Q4 in Ollama	Response: 33 token/s	Response: 42 token/s
Phi4 14B (12 Gb VRAM) 2K context. Q4 in Ollama	Response: 48 token/s	Response: 76 token/s
Qwen25-32b-Instruct. Fp16 in vLLM	End-to-End Request Latency (30 workers): 10 s	End-to-End Request Latency (30 workers): 10 s
Qwen25-32b-Instruct. Fp16 in vLLM	Combined Token Throughput (30 workers): 710 token/s	Combined Token Throughput (30 workers): 750 token/s
Qwen25-32b-Instruct. Fp16 in vLLM	Time to First Token (30 workers): 1.5 s	Time to First Token (30 workers): 2.3 s
Qwen25-32b-Instruct. Fp16 in vLLM	Inter-Token Latency (30 workers): 0.037s	Inter-Token Latency (30 workers): 0.037s
Qwen25-32b-Instruct. Fp16 in vLLM	Request per Second (30 workers): 2.1 request/s	Request per Second (30 workers): 2.3 request/s
Qwen25-32b-Instruct. Fp16 in vLLM	Tokens per Second (30 workers): 27 tokens/s	Tokens per Second (30 workers): 27.5 tokens/s

Comparison of AMD Radeon AI PRO R9700 and NVIDIA RTX 4090 AI Benchmarks Based on Public Tests

The results below are based on publicly available benchmarks and reviews using different hardware configurations, drivers, and software environments.

	AMD Radeon AI PRO R9700	NVIDIA RTX 4090
DeepSeek-R1 14B	53.5 token/s	~60-75 token/s*
DeepSeek-R1 32B	26.3 token/s	~40-65 token/s*
GPT-OSS 20B	102.4 token/s	~110-140 token/s*
VRAM Capacity	32 GB	24 GB
Power Consumption (TDP)	300W	450W
Software Platform	ROCm	CUDA

* Results may vary depending on CPU, RAM capacity, drivers, batch size, quantization, and inference framework.

AMD Radeon AI PRO R9700 Specifications

Main characteristics:

AMD RDNA 3 architecture
5 nm GPU Compute Die + 6 nm Memory Cache Die
Stream Processors: 6144
Base frequency: 1855 MHz
Boost frequency: 2500 MHz

Memory:

Memory type: GDDR6
Max memory size: 24 Gb
Memory bus width: 384 bits
Memory speed: 20 Gbps
Memory bandwidth : 960 GB/s

Specifications:

Ray Tracing:
Supports
AMD FidelityFX Super Resolution (FSR)
Infinity Cache Technology: 96 MB
AMD Smart Access Memory
AMD SmartShift Eco

AMD Radeon AI PRO R9700 Specifications

Key Specifications:

Graphics Architecture: AMD RDNA 4
AI Accelerators: 128
Stream Processors: 4096
PCI Express Interface: PCIe 5.0 x16
Typical Board Power (TBP): 300W

Memory:

Memory Type: GDDR6
Memory Capacity: 32 GB
Memory Bus Width: 256-bit
Memory Bandwidth: Up to 640 GB/s
Infinity Cache Capacity: 64 MB

Features:

Support for the AMD ROCm software platform
Ray Tracing support
AI Performance: Up to 1531 TOPS INT4
Support for multi-GPU AI systems
Optimized for AI inference and LLM workloads

Our Advantages

Reliable Data Centers
Top reliability and security provide stable operation of your servers and 99.982% uptime per year.
DDoS protection
The service is organized using software and hardware solutions to protect against TCP-SYN Flood attacks (SYN, ACK, RST, FIN, PUSH).
High-bandwidth Internet connectivity
We provide a 1-10 Gbps unmetered port. You can transfer huge datasets in minutes.
Full control
Remote server management via IPMI, iDRAC, KVM API, web-panel and etc.
Eco-friendly
Hosting in the most environmentally friendly data center in Europe.
A replacement server is always available
A fleet of substitution servers will reduce downtime when migrating and upgrading.
Quick replacement of components
In the case of component failure, we will promptly replace them.
Round-the-clock technical support
The application form allows you to get technical support at any time of the day or night. First response within 15 minutes.

What included

Traffic
The amount of traffic depends on location. All servers are deployed with 1-10 Gbps port, incoming traffic is free (fair usage). Outgoing traffic limit and rates are subject to a selected traffic plan.
Free DDoS protection
We offer basic DDoS protection free in Europe.
Customer support 24/7
Our customer technical support guarantees that our customers will receive technical assistance whenever necessary.

Our Services

Network

Security

Technical support

Other

1 /

What customers say

After launching another successful IP — HUNT: Showdown, a competitive first-person PvP bounty hunting game with heavy PvE elements, Crytek aimed to bring this amazing game for its end-users. We needed a hosting provider that can offer us high-performance servers with great network speed, latency, and 24/7 support.

Stefan Neykov Crytek

doXray has been using HOSTKEY for the development and the operation of our software solutions. Our applications require the use of GPU processing power. We have been using HOSTKEY for several years and we are very satisfied with the way they operate. New requirements are setup fast and support follows up after the installation process to check if everything is as requested. Support during operations is reliable and fast.

Wimdo Blaauboer doXray

We would like to thank HOSTKEY for providing us with high-quality hosting services for over 4 years. Ip-label has been able to conduct many of its more than 100 million daily measurements through HOSTKEY’s servers, making our meteorological coverage even more complete.

D. Jayes IP-Label

1 /

Our Ratings

4.3 out of 5

4.8 out of 5

4.0 out of 5

Configure your server

Hot deals

HOT AMD Ryzen Server — €129/month or €0.179/hour

AMD Ryzen 7950X, 16 cores, 4.5 GHz / 128 GB RAM / 2×1.92 TB U.2 NVMe SSD / 1 Gbps, 50 TB traffic

Order a server

From €259 Sale on 4th Gen AMD EPYC™ Servers!

3.25 GHz EPYC 9354 — 32 cores / 2× EPYC 9354 — 64 cores servers. Up to 1 TB RAM, and 2× 3.84 TB NVMe SSDs. 10 Gbps bandwidth and 100 TB traffic included with all servers!

Explore

High-RAM High-RAM Dedicated Servers with up to 4.6TB RAM

Choose high-RAM dedicated servers with up to 4.6 TB of RAM and 12 NVMe drives, powered by AMD EPYC 4th Gen CPUs.

Order

Hot deals Sale on pre-configured dedicated servers

Ready-to-use servers with a discount. We will deliver the server within a day of the receipt of the payment.

Order now

50% OFF Dedicated Servers for hosting providers - 7 days trial and 50% OFF

Discover affordable dedicated servers for hosting providers, situated in a top-tier Amsterdam data center in the Netherlands. 7 days trial, 50% OFF on the first 3 months, 50% OFF for a backup server.

Order a server

Web3 Web3 Dedicated Servers Infrastructure

Built for Blockchain: CPUs with16-64 cores, 1-10 Gbps, Up to 768 GB DDR5 RAM, 3.48 TB Enterprise NBMe, Global Locations

Order a server

1 /4

Solutions

GPU servers for data science

e-Commerce hosting

Finance and FinTech

Private cloud

Rendering, 3D Design and visualization

Managed colocation

GPU servers for Deep Learning

Data Centers

Get acquainted with our state-of-art and reliable data centers

Speed test

Determine how your network perform

Try before you buy

Be the judge. Take our servers for a test drive

Wide range of pre-configured servers with instant delivery and sale

Resources

Knowledge base

You can always find answers and useful tips in our Knowledge Base

FAQ

Find answers and solutions to common issues.

Technical support

Our 24/7 Support Team is always ready to help.

1 /

FAQ

How many CUDA cores are in the AMD Radeon RX 7900 XTX?

The card does not use CUDA cores, as CUDA is NVIDIA's proprietary technology. Instead, it features 6,144 Stream Processors, which serve a similar function in AMD's GPU architecture.

Is AMD Radeon RX 7900 XTX suitable for Neural Network Training?

Yes, the card supports OpenCL, ROCm and vLLM, which can be used for training, inference, chatbots and video recognition. The card is also compatible with popular machine learning frameworks like PyTorch and TensorFlow. Its performance with FP16 models is comparable to the NVIDIA RTX 4090, though it does not yet support FP8 models. For certain workloads, especially in multi-GPU configurations, the RX 7900 XTX is an excellent option, offering strong performance at a lower cost than NVIDIA alternatives.

What are the limitations of using the Radeon RX 7900 XTX in AI?

The primary limitation is the lack of full CUDA support, which can make some AI frameworks and software less compatible out of the box. This may require software adaptation or the use of an emulator like ZLUDA to run CUDA-based applications on AMD hardware.

News

26.05.2026

Blog

How Our Documentation Team Built an LLM Agent for Automated Translation from English to Other Languages

This article details how we built a custom LLM agent for translating technical documentation, featuring validation, Markdown and code preservation, Git integration, and multi-step quality checks.

19.05.2026

Blog

How to Connect to S3 Storage: A Step-by-Step Guide with Examples

A complete practical guide to connecting and working with S3-compatible object storage. Learn how to configure AWS CLI, Rclone, boto3, Cyberduck, S3 Browser, s3cmd ands3fs for backups, file management, synchronization and application integration.

15.05.2026

Blog

India Wanted to Buy a Supercomputer. They Were Denied. So They Built Their Own

In the late 1980s, India attempted to purchase a Cray Y-MP supercomputer, but the US refused to issue an export license. Instead, the country established C-DAC and built its own PARAM 8000 supercomputer within three years. We analyze how this was achieved and why the rejection by Cray ultimately worked in India's favor.

Show all News

Show all News / Blogs

1 /

Need more information or have a question?

Related GPU Services

GPU Infrastructure

Rendering and Creative Workloads

Gaming and Real-Time Graphics

High-Performance and Data Center GPUs

Cloud and Virtual GPU Solutions

AI and Machine Learning