🗃️ Manage Managed GPU Cluster
Browse Manage Managed GPU Cluster guides — covering Create a new Managed GPU Cluster, Get cluster access information, Manage GPU Cluster, and View Managed GPU Cluster list, and more.
🗃️ Modify K8S Cluster Configuration
Browse Modify K8S Cluster Configuration guides — covering Add a worker group, Edit labels and taints for a worker group, Change K8s configuration, and Change the base worker group, and more.
📄️ Load balancer service for Managed GPU Cluster
Managed GPU Cluster is built on native Kubernetes and integrates additional cloud provider components, including the FPT Cloud Controller Manager.
📄️ Deploy an application on Managed GPU Cluster
This guide walks through deploying the DeepSeek-R1 model on FPT Managed GPU Cluster using Ollama and Open-WebUI.
📄️ File Storage High Performance integration
Prerequisites for using High Performance Storage with Managed GPU Cluster.
📄️ SLURM on FPT Cloud
Introduction to SLURM and running SLURM on FPT Cloud Managed GPU Cluster.
📄️ vGPU feature in FPT Kubernetes Engine
Introduction to the vGPU feature in FPT Kubernetes Engine.
📄️ GPU time sharing/time slicing in FPT Kubernetes Engine
Introduction to the GPU time sharing/time slicing feature in FPT Kubernetes Engine.
📄️ MPS GPU sharing
MPS is an NVIDIA GPU feature that allows multiple containers to share the same physical GPU.
🗃️ Deploy GPU Workload to Managed GPU Cluster
Browse Deploy GPU Workload to Managed GPU Cluster guides — covering Fine-tuning an LLM model with Unsloth on Kubernetes, Multi-GPU example: serving an LLM with vLLM, Multi-node example: vLLM and multi-host serving, and Single GPU example: serving an LLM with vLLM, and more.
📄️ FPT Managed GPU Cluster
FPT Managed GPU Cluster