EN
Currency:
EUR – €
Choose a currency
  • Euro EUR – €
  • United States dollar USD – $
VAT:
OT 0%
Choose your country (VAT)
  • OT All others 0%
Choose a language
  • Choose a currency
    Choose you country (VAT)
    Dedicated Servers
  • Instant
  • Custom
  • Single CPU servers
  • Dual CPU servers
  • Servers with 4th Gen EPYC
  • Servers with AMD Ryzen and Intel Core i9
  • Storage Servers
  • Servers with 10Gbps ports
  • Premium Servers
  • High-RAM Dedicated Servers
  • Servers for Solana Nodes
  • Web3 Server Infrastructure
  • Hosting virtualization nodes
  • GPU
  • Sale
  • Virtual Servers
  • Instant VPS & VDS
  • Hosting with ispmanager
  • Hosting with cPanel
  • GPU
  • Dedicated GPU server
  • VM with GPU
  • Tesla A100 80GB & H100 Servers
  • Nvidia RTX 5090
  • Nvidia RTX PRO 6000
  • GPU servers equipped with AMD Radeon
  • Sale
    Apps
    Colocation
  • Colocation in the Netherlands
  • Remote smart hands
  • Services
  • L3-L4 DDoS Protection
  • Network equipment
  • IPv4 and IPv6 address
  • Managed servers
  • SLA packages for technical support
  • Monitoring
  • Software
  • VLAN
  • Announcing your IP or AS (BYOIP)
  • USB flash/key/flash drive
  • Traffic
  • Hardware delivery for EU data centers
  • AI Chatbot Lite
  • AI Platform
  • About
  • Hostkey for Business
  • Careers at HOSTKEY
  • Server Control Panel & API
  • Data Centers
  • Network
  • Speed test
  • Hot deals
  • Contact
  • Reseller program
  • Affiliate Program
  • Grants for winners
  • Grants for scientific projects and startups
  • News
  • Our blog
  • Payment terms and methods
  • Legal
  • Abuse
  • Looking Glass
  • The KYC Verification
  • Hot Deals
    Ollama

    Free and self-hosted AI Chatbot built on Ollama, Lllama3 LLM model and OpenWebUI interface.

    Self-hosted AI Chatbot

    Personal chatbot powered by Ollama, an open source large language model Lllama3 and OpenWebUI interface running on your own server.

    Rent a virtual (VPS) or a dedicated server from HOSTKEY with a pre-installed and ready-to-use Self-hosted AI Chatbot, which can process your data and documents using various LLM models.

    You can upload the most recent versions of Phi3, Mistral, Gemma, and Code Llama models.

    Servers available in the Netherlands, Finland, Germany and Iceland.

    • Security and data privacy - all your data is stored and processed on your server, ensuring it never leaves your environment;
    • Cost efficiency - you only pay for the server rental; the operation and load of the neural network are not charged and are completely free;
    • Scalability - you can easily transfer the chatbot from one server to another, managing costs and knowing the exact expenses in advance;
    • Flexible Customization - you can tailor the model to your needs: load multiple ready-made models from leading providers or train your chatbot to generate responses based on your information and documents. You can even use multiple models simultaneously.
    4.3/5
    4.8/5
    SERVERS In action right now 5 000+

    Explore AI Chatbot plans

    AI Chatbot Lite

    € 27
    Per Month
    Payment for use of the HOSTKEY-shared GPU server

    AI Chatbot running on a HOSTKEY-shared GPU server. You get the admin rights to use AI Chatbot privately and manage your team.

    • Security and data privacy
    • Unlimited users
    • Chatbot admin rights
    • File Analysis
    • Multi-model mode
    • Server admin rights
    • Server scalability
    • Load and custom models
    • RAG

    Self-hosted AI Chatbot

    Server-based price
    Per Month
    Payment for server rental

    AI Chatbot pre-installed on your own VPS or Dedicated GPU server. You get the full admin rights over your personal server.

    • Security and data privacy
    • Unlimited users
    • Chatbot admin rights
    • File Analysis
    • Multi-model mode
    • Server admin rights
    • Server scalability
    • Load and custom models
    • RAG

    With every AI Chatbot plan, you will get pre-installed and ready-to-use models:

    • gemma2:latest 9.2B
    • llama3:latest 8.0B

    “Self-hosted AI Chatbot” plan lets you manage and load more models. This feature is not available for the “AI Chatbot Lite” plan.

    AI Chatbot Lite - a paid trial tier of the main product
    AI Chatbot Lite runs on a HOSTKEY-shared GPU server. This is a paid trial tier of the main product (Self-hosted AI Chatbot). The payment is taken for using a shared HOSTKEY-managed GPU server. AI Chatbot Lite offers basic chatbot functionality with no limits on the number of users or prompt requests. It makes AI Chatbot Lite a perfect trial plan before purchasing the Self-hosted AI Chatbot (main product).

    Corporate AI Chatbot

    An AI chatbot hosted on your own server ensures security, with all data stored and processed within your environment.

    • Data security - all data is stored and processed on your server, ensuring full control over sensitive information.
    • No external SaaS solutions - no relying on ChatGPT, Google Gemini, Microsoft Copilot or other SaaS vendors for in-house operations
    • Scalability - increase server capacity as needed

    Multi-model mode

    OpenWebUI allows you to use multiple models in a single workspace, speeding up workflow and enabling you to process a single prompt across several models simultaneously without switching windows. Just make one request and visually compare the results to select the best option.

    • Faster content generation - a single prompt is processed across several models simultaneously
    • Easy comparison - visually compare results from multiple models in one workspace
    • Less text editing - Choose the best text parts from several options without a need for self-editing

    Internal Code Review

    Hosting an AI Chatbot on your own server allows you to conduct code reviews preventing the risk of code leakage and protecting corporate confidentiality. The scalability of a self-hosted solution is very useful for handling increased workloads – you can increase a server capacity as needed.

    • Code security - conduct code reviews without the risk of code leakage
    • Scalability - forget about tokens or user limits, you pay only for server rental
    • Transparent pricing - increase server capacity as needed during the working process

    Support for Customers and Employees

    Connect an AI chatbot to your knowledge base to generate answers to frequently asked questions from your users or employees.

    • Internal assistant - use a self-hosted AI Chatbot for customer or employee support
    • Response control - maintain full control over the data, models, and request processing algorithms used to generate responses
    • Data security - ensure that your customers’ response statistics are not analyzed by third parties

    Corporate File Analysis

    Analyze corporate documents in various formats, such as pdf, csv, rst, xml, md, epub, doc, docx, xls, xlsx, ppt, and txt, while keeping all analytics on your server to ensure confidentiality.

    • Document security - corporate documents are processed internally, with no data shared externally
    • Format versatility - supports a wide range of formats like PDF, CSV, RST, XML, MD, EPUB, DOC, DOCX, XLS, XLSX, PPT, and TXT.
    • Internal data processing - documents are processed on your server with no risk of data leakage

    How to order Self-hosted AI Chatbot (main product)

    1. Choose server and license

      Choose the desired server. During the ordering process, select the AI Chatbot, network settings and other parameters.
    2. Place an order

      After placing and paying for the order, we will contact you and let you know the exact time the server is ready. The delivery time of the server depends on its type and a pre-installed software.
    3. Start working

      When the server is ready, we will send all the access details to your email. AI Chatbot will already be installed and ready to go.

    How to order AI Chatbot Lite (paid trial tier)

    1. Place an order

      Fill the AI Chatbot Lite order form on a HOSTKEY website.
    2. Make a payment

      Create a HOSTKEY account via the provided link and pass a KYC Identity verification. Then make a payment as per the invoice, which will be emailed to you.
    3. Start working

      You could start working with no additional delays with a ready-to-use AI Chatbot Lite. We have already taken care of its deployment.

    Self-hosted AI Сhatbot — officially free software

    Personal chatbot powered by Ollama, an open source large language model Lllama3 and OpenWebUI interface - a HOSTKEY solution built on officially free and open-source software.

    Ollama is an open-source project that serves as a powerful and user-friendly platform for running LLMs on your local machine. Ollama is licensed under the MIT License.

    Lllama3 is Meta's latest open-source large language model that has been scaled up to 70 billion parameters, making it one of the largest and most powerful language models in the world.

    Open WebUI is an extensible, feature-rich, and user-friendly self-hosted WebUI designed to operate entirely offline. OpenWebUI is licensed under the MIT License.

    We guarantee that our servers are running safe and original software.

    • Ordering the Self-hosted AI Chatbot you pay only for the server rental. There are no additional fees for using the software and/or its functions.
    • AI Chatbot Lite is a paid trial tier of the main product (Self-hosted AI Chatbot). The payment is taken for using a shared HOSTKEY-managed GPU server.

    FAQ

    How to install AI Сhatbot on a virtual or dedicated server?

    To install AI Сhatbot, you need to select it in the "App Marketplace" tab (AI and Machine Learning group) while ordering a server on the HOSTKEY website. Our auto-deployment system will install the software on your server.

    What is the difference between Self-hosted AI Chatbot and AI Chatbot Lite?

    “Self-hosted AI Chatbot” runs on your personal server rented at HOSTKEY. It provides full chatbot functionality with no limits on the number of users or prompt requests.

    “AI Chatbot Lite” is a paid trial tier of the main product, which runs on a shared HOSTKEY-managed GPU server. AI Chatbot Lite offers basic chatbot functionality with no limits on the number of users or prompt requests, making it a perfect trial plan before purchasing the Self-hosted AI Chatbot (main product).

    Why does AI Chatbot Lite cost money while being a trial tier?

    The payment for AI Chatbot Lite covers the use of a shared HOSTKEY-managed GPU server, meaning you don’t need to rent a server to use the software, which is free by itself.

    How do I change the AI Chatbot plan?

    You just need to order a new one and cancel the previous one. Also you can use both of them simultaneously as they operate independently from each other.

    Which are the key advantages of Self-hosted AI Сhatbot?

    • Security and data privacy - all data is stored and processed on your server, ensuring it never leaves your environment;
    • Cost efficiency - you only pay for the server rental; neural network operations are not charged and are completely free;
    • Scalability - you can easily transfer the chatbot from one server to another, managing costs and knowing the exact expenses in advance;
    • Flexible Customization - tailor the chatbot to your needs by loading multiple ready-made models from leading providers or set it up to generate responses based on your information and documents. You can even use multiple models simultaneously.

    Can I use API requests with the “AI-chatbot Lite” plan?

    Yes, with the “AI-chatbot Lite” plan, you can interact with the chatbot using API requests. To set it up, follow these instructions:
    https://docs.openwebui.com/getting-started/advanced-topics/api-endpoints
    https://github.com/ollama/ollama/blob/main/docs/api.md

    Can I use the RAG (Retrieval Augmented Generation) feature with the 'AI Chatbot Lite' plan?

    The RAG (Retrieval Augmented Generation) feature allows the chatbot to generate responses based on specific data, such as your documentation. Unfortunately, this feature is not available on the “AI Chatbot Lite” trial plan.

    To access this feature, we recommend upgrading to the “AI Chatbot on your own server” plan.

    Learn more about adding documents to the knowledge base (RAG):
    https://hostkey.com/documentation/marketplace/machine_learning/ai_chatbot/#adding-documents-to-the-knowledge-base-rag

    Which AI models can I use?

    The "AI Chatbot Lite" plan offers 2 models pre-installed by default: gemma2:latest 9.2B and llama3:latest 8.0B. You can use them either separately or simultaneously. The "AI Chatbot Lite" plan does not allow you to install or delete models.

    The "AI Chatbot on Your Own Server" plan also includes 2 pre-installed models by default: gemma2:latest 9.2B and llama3:latest 8.0B. However, with this plan, you can install and delete any models available in the Ollama library

    Self-hosted AI Сhatbot key features

    A self-hosted AI Chatbot has a number of advantages compared to popular paid services:

    Versatility of use
    Whether as a content generator, coder or a technical support specialist, you decide how to use your chatbot.
    Security and data privacy
    The LLM is deployed on our own server infrastructure, your data is completely protected and under your control. It is not shared or processed in the external environment.
    Cost efficiency
    You only pay for server rental, regardless of the number of people using the chatbot. You have no restrictions on the number of tokens, the number of requests per unit of time, etc. - the price solely depends on the leased server capacity.
    High performance
    The full power of neural networks is dedicated only to you and your employees. Even on medium-power servers, you get performance comparable to the expensive subscriptions plans of popular neural networks.
    Independence from IT service providers
    You can choose the most suitable neural network option from hundreds of open source LLMs. You can always install alternative models tailored to your needs. The version of the model used is completely controlled by you.
    Personal solutions
    With the OpenWebUI interface, you can adjust AI models parameters to fit your requirements. You can also upload new models for Ollama as needed.
    User-friendly interface
    OpenWebUI allows you to configure various parameters such as temperature, top-k and top-p to fine-tune the generated results according to your preferences.
    Retrieval Augmented Generation (RAG) support
    OpenWebUI supports RAG, allowing users to easily integrate local and web content into their chats. Add hybrid search to your LLM or search for information on websites.
    Flexible API access
    You can use Ollama and OpenWebUI to create your own applications, such as Telegram chatbots or AI software, using the API.
    Get pre-installed AI Chatbot
    on servers located in data centers across Europe, the USA, and Turkey.

    Why choose a Self-hosted AI Сhatbot server at HOSTKEY?

    • TIER III Data Centers

      Top reliability and security provide stable operation of your servers and 99.982% annual uptime.
    • DDoS protection

      The service is organized using software and hardware solutions to protect against TCP-SYN Flood attacks (SYN, ACK, RST, FIN, PUSH).
    • Round-the-clock technical support

      The application form allows you to get technical support at any time of the day or night. First response within 15 minutes.

    What customers say

    Crytek
    After launching another successful IP — HUNT: Showdown, a competitive first-person PvP bounty hunting game with heavy PvE elements, Crytek aimed to bring this amazing game for its end-users. We needed a hosting provider that can offer us high-performance servers with great network speed, latency, and 24/7 support.
    Stefan Neykov Crytek
    doXray
    doXray has been using HOSTKEY for the development and the operation of our software solutions. Our applications require the use of GPU processing power. We have been using HOSTKEY for several years and we are very satisfied with the way they operate. New requirements are setup fast and support follows up after the installation process to check if everything is as requested. Support during operations is reliable and fast.
    Wimdo Blaauboer doXray
    IP-Label
    We would like to thank HOSTKEY for providing us with high-quality hosting services for over 4 years. Ip-label has been able to conduct many of its more than 100 million daily measurements through HOSTKEY’s servers, making our meteorological coverage even more complete.
    D. Jayes IP-Label
    1 /

    Our Ratings

    4.3 out of 5
    4.8 out of 5

    Self-Hosted Ollama AI Chatbot vs SaaS Solutions: Why Go Independent with your own ollama server ?

    Picking between an AI chatbot that you host and a SaaS platform like ChatGPT or Gemini is very important when selecting an AI assistant. By using the Ollama chatbot, you’ll avoid being tied to any vendor, handle your infrastructure yourself and enjoy top-level privacy.

    Ollama Chatbot is compared to ChatGPT.

    Although ChatGPT is a powerful tool, your data is stored and processed outside your reach. In contrast, running the Ollama chatbot on your own hardware or using an AI chatbot VPS allows you to keep all data local. It is especially important in industries that are regulated such as healthcare and finance.

    A comparison between Ollama Chatbot and Gemini

    Google Gemini AI depends a lot on its ecosystem. You may not always be able to see or change how things are done. Using Ollama hosting, you can modify or swap models and set up your stack as you wish, without being limited by the ecosystem.

    Ollama Chatbot is compared to Copilot.

    GitHub Copilot is focused on code generation but lacks flexibility for broader AI interactions.щ With the Ollama chatbot, you can have more conversations and control how it reasons, remembers things and works with other systems.

    Advantages of Self-Hosted Ollama AI Chatbot on Your Server

    When you run your Ollama chatbot on your own VPS or server, you gain benefits right away and in the future.

    Data Privacy

    • Only you can see what you search for.
    • Guarantees that the system follows GDPR and HIPAA rules
    • You are responsible for all logs in your system.

    Customization

    • Adjust the chatbot’s responses and actions.
    • Use your company’s own data for training.
    • Set up the processes for deploying models.

    Pricing Model

    • There are no charges for each token or for each user.
    • Costs that are easy to predict with ollama vps or ollama server
    • Only pay for the time you spend using the service: hourly or monthly

    More Important Advantages

    • There is no throttling: You get full performance, unlike what you get with shared SaaS models.
    • Your AI chatbot VPS answers right away because there is no cloud latency.
    • You can use your own LLMs, including those that are open-source.

    For Developers and AI Enthusiasts: Build and Experiment Freely

    If you’re working on AI apps, the Ollama chatbot is the perfect partner for you.

    • Programmatic control of Ollama is possible with API access.
    • Complete flexibility in the open source model
    • Try using custom LLMs and playing with prompt engineering.
    • Use GitHub, Docker and CI/CD processes in your project.

    If you’re working on agents, integrating vector databases or making changes to the UI, a self-hosted AI chatbot allows you to do things that cloud platforms cannot.

    Ollama Chatbot Performance: Fast, Private, Reliable

    The system is designed to provide high performance for ollama hosting.

    • On 4090 GPUs, typical response time is less than 1 second.
    • It takes between 5 and 15 seconds to load a model on pre-configured servers
    • Latency benchmarks:

      • The lite model takes around 20 milliseconds.
      • Full model takes about 150ms to run.
    • The difference between VPS and dedicated server.

      • VPS is ideal for users with light needs.
      • Dedicated is known for its dependable performance on demanding jobs.

    How Businesses Use Self-Hosted AI Chatbots

    Ollama chatbots are being used by many industries to make things more efficient and improve how users interact.

    • Healthcare Providers: For patient questions and access to information within the organization
    • Legal and Compliance Departments: The Legal and Compliance Departments look over documents, draft summaries and ensure that all conversations are secure.
    • Education and Training: Intelligent tutors, FAQ answers and customized learning paths are all part of education and training.
    • Financial Institutions: keep an eye on market trends, help with customer support and make sure data is secure.
    • Software Development Companies: they review code, produce documentation and help with DevOps.
    • eCommerce and Retail: ensure customers can receive help all the time, get product recommendations and track their orders

    High-level Security with Local Control

    It is very important to have good security. When you use a self-hosted AI chatbot, you don’t have to give up anything.

    • All your data remains within your infrastructure.
    • A GDPR- or HIPAA-compliant setup is available (sample templates are available)
    • Storing logs, tracking actions and keeping model results

    Best Server Configurations with GPU Cards for a Self-Hosted AI Chatbot

    It takes only a few minutes to deploy any server. You can select from free or discounted AI software that is already installed on our marketplace. Customers can choose to be billed hourly or monthly. Save up to 40% when you choose a long-term plan.

    Dedicated Servers

    1. AI-Dedicated-1

      • CPU: AMD Ryzen 7950X
      • GPU: RTX 4090
      • RAM: 128GB DDR5
      • Storage: 2TB NVMe SSD
      • Port: 1Gbps unlimited
      • Price: €720/month | €1.20/hour
    2. AI-Dedicated-2

      • CPU: Intel Xeon Gold 6414U
      • GPU: RTX 4090
      • RAM: 256GB DDR4 ECC
      • Storage: 4TB NVMe SSD
      • Port: 1Gbps unlimited
      • Price: €980/month | €1.90/hour
    3. AI-Dedicated-Pro

      • CPU: AMD EPYC 9654P
      • GPU: RTX 5090
      • RAM: 512GB DDR5 ECC
      • Storage: 8TB NVMe SSD
      • Port: 1Gbps unlimited
      • Price: €1590/month | €3.10/hour

    VPS Plans

    1. AI-VPS-Lite

      • vCPU: 8 vCores
      • GPU: Shared RTX 4090
      • RAM: 32GB
      • Storage: 200GB NVMe
      • Port: 1Gbps
      • Price: €110/month | €0.25/hour
    2. AI-VPS-Pro

      • vCPU: 16 vCores
      • GPU: Dedicated RTX 4090
      • RAM: 64GB
      • Storage: 500GB NVMe
      • Port: 1Gbps
      • Price: €210/month | €0.40/hour
    3. AI-VPS-Max

      • vCPU: 24 vCores
      • GPU: Dedicated RTX 5090
      • RAM: 96GB
      • Storage: 1TB NVMe
      • Port: 1Gbps
      • Price: €370/month | €0.72/hour

    All of our servers are set up and ready for you to use right away. You can get the Ollama chatbot, Docker stacks and other AI tools directly from our marketplace.

    Manage your AI infrastructure entirely on your own. Sign up for Ollama hosting, run your self-hosted AI chatbot on a strong Ollama VPS or Ollama server and start in just a few minutes.

    Upload