OpenShift

The Kubernetes-Native Engine for Hybrid Cloud and Enterprise AI

If you are an IT Admin looking for the «Gold Standard» of enterprise Kubernetes, Red Hat OpenShift is the conversation. Built on CRI-O and Red Hat Enterprise Linux (RHEL), OpenShift isn’t just a container orchestrator—it is a complete, hardened operational ecosystem.

1. The Core

Shifting the Paradigm with OpenShift Virtualization (Kuinverted)

OpenShift has flipped the script on traditional infrastructure. Instead of running Kubernetes inside VMs, you run VMs inside Kubernetes using OpenShift Virtualization (based on KubeVirt).

  • Unified Operations: Manage your legacy Windows or Linux VMs right alongside your modern containers in the same console.
  • Bare Metal Performance: By running OpenShift on bare metal, you eliminate the «Hypervisor Tax» ($$ and latency), giving your VMs direct access to hardware while using Kubernetes constructs (like YAML and Operators) to manage them.

2. Hybrid Multicloud

The «Open Hybrid Cloud» Blueprint

Red Hat’s strategy is built on the Advanced Cluster Management (ACM).

  • Cluster Lifecycle: Whether your OpenShift cluster is on-prem, in AWS (ROSA), or Azure (ARO), the management experience is identical.
  • Global Hub: ACM allows you to push policies, security configurations, and application updates across a global fleet of clusters from a single «Hub» cluster.
  • Submariner: Need pod-to-pod networking across different clouds? OpenShift uses Submariner to provide direct L3 connectivity between clusters in a hybrid mesh.

3. Cloud Native

Beyond «Vanilla» Kubernetes

OpenShift takes the complexity of Kubernetes and wraps it in Enterprise Operators.

  • Over-the-Air Updates: The entire stack—from the OS (RHCOS) to the monitoring (Prometheus/Grafana)—is updated via a single «Update» button, ensuring the cluster stays secure.
  • Integrated CI/CD: With OpenShift Pipelines (Tekton) and OpenShift GitOps (Argo CD) built-in, admins provide developers with a «Push-to-Production» workflow out of the box.
  • Advanced Networking: OVN-Kubernetes provides a robust, high-performance SDN that handles massive scale and complex multi-tenancy.

4. AI & RHOAI

The AI Lifecycle on OpenShift

For AI, Red Hat offers OpenShift AI (RHOAI). It is arguably the most mature platform for «AI-DevOps» (MLOps).

  • GPU as a First-Class Citizen: Using the NVIDIA GPU Operator, OpenShift automatically detects, labels, and configures drivers for your GPUs across the cluster.
  • Data Science Pipelines: RHOAI provides a collaborative environment with Jupyter Notebooks, PyTorch, and TensorFlow, all running as containers.
  • Model Serving: Once your AI model is ready, OpenShift uses KServe to deploy it as a scalable microservice, handling auto-scaling and load balancing natively.

The Admin’s Verdict

OpenShift is for the organization that wants to stop managing «Servers» and start managing «Services.» It requires a steeper learning curve (YAML, OC commands, and K8s concepts), but it offers the highest level of automation and security (SELinux, FIPS compliance) in the industry. It is the definitive choice for a Cloud-Native First strategy where AI and Hybrid Cloud are the default, not an afterthought.

Posted by oliver@olivergraf.tech, 0 comments

Proxmox VE

The Open-Source Powerhouse for Hybrid Cloud and Edge AI

If Nutanix is the «iPhone of Data Centers,» Proxmox VE is the high-performance Linux workstation you built yourself. For IT Admins who value transparency, CLI control, and avoiding the «Broadcom/VMware tax,» Proxmox has evolved into a formidable enterprise contender.

Here’s how Proxmox is addressing the current trends of hybrid cloud, cloud-native, and AI.

1. The Core

KVM, LXC, and ZFS Mastery

Proxmox VE isn’t just a hypervisor; it’s a tightly integrated stack built on Debian.

  • Dual-Virtualization: Unlike Nutanix, Proxmox allows you to run both full KVM VMs and lightweight LXC Containers side-by-side on the same node. This is a game-changer for resource density.
  • ZFS & Ceph: Proxmox offers native support for ZFS (for local resilient storage) and Ceph (for hyperconverged, distributed storage). Managing a Ceph cluster directly from the Proxmox GUI provides Nutanix-like HCI features without the proprietary licensing.

2. Hybrid Cloud

The API-First Approach

Proxmox doesn’t have a proprietary «Cloud Cluster» like NC2, but it wins through Open Standards and the Proxmox Backup Server (PBS).

  • Proxmox Backup Server (PBS): This is the secret sauce. With deduplication and incremental backups, moving workloads between an on-prem Proxmox node and a colocation or cloud-hosted Proxmox instance is seamless.
  • Terraform & Ansible: Because Proxmox is built on standard Linux bridge networking and an open API, automating your «Hybrid» strategy via Infrastructure as Code (IaC) is straightforward and well-documented.

3. Cloud Native

Kubernetes on Proxmox

While Proxmox doesn’t have a branded «NKP» equivalent, it is arguably the best platform for running custom Kubernetes stacks:

  • Cloud-Init Support: Proxmox has deep integration with Cloud-Init, allowing you to automate the deployment of Ubuntu/Talos/Fedora CoreOS nodes for your K8s clusters.
  • CSI Drivers: Using the Proxmox CSI driver, Kubernetes clusters can dynamically provision PVs (Persistent Volumes) directly from your Proxmox storage backend (Ceph or ZFS), providing enterprise-grade persistence for containerized apps.

4. AI & GPU Passthrough

The «Bare Metal» Performance

For AI workloads, Proxmox is often preferred by technical teams because it stays out of the way of the hardware.

  • PCIe Passthrough: Proxmox makes it incredibly easy to pass through NVIDIA GPUs to VMs or even LXC containers with minimal overhead. This is critical for training LLMs or running inference engines.
  • vGPU Support: With the right drivers (like mdev), you can partition a single physical GPU across multiple AI development environments.
  • Self-Hosted LLMs: Using Proxmox to run Ollama or LocalAI in an LXC container gives you near-native performance for private, on-prem AI without the complexity of a massive enterprise suite.

The Admin’s Verdict

Proxmox VE is the ultimate «Admin’s Hypervisor.» It rewards Linux expertise with unparalleled flexibility. While Nutanix focuses on «making infrastructure invisible,» Proxmox focuses on «making infrastructure accessible.» If you want to build a sovereign, high-performance stack for K8s and AI without the six-figure licensing bill, Proxmox is the architecture of choice.

Posted by oliver@olivergraf.tech, 0 comments

Beyond the Hypervisor

Architecting a Modern Stack with Nutanix NCP, NKP, and GPT-in-a-Box

For years, Nutanix was synonymous with «HCI.» But if you’re still looking at it as just a storage-plus-compute box, you’re missing the shift. Today’s Nutanix Cloud Platform (NCP) is a full-stack operating environment designed to solve the «Day 2» operations nightmare of hybrid clouds, Kubernetes clusters, and now, LLM deployments.

1. Unified Infrastructure

The AHV & AOS Advantage

At the core, the Acropolis Operating System (AOS) and the AHV Hypervisor remain the gold standard for reducing complexity.

  • Storage Fabric: Nutanix distributes metadata across the cluster, ensuring no single point of failure and linear scalability. For admins, this means no more LUNs or volumes to manage—just a single, intelligent pool of storage.
  • Microsegmentation: With Nutanix Flow, network security is baked into the hypervisor. You can implement Zero Trust policies at the VM level without complex firewall hairpining.

2. Hybrid Multicloud

Stretching the Layer 2

The real challenge for admins is the «Cloud Gap.» Nutanix Cloud Clusters (NC2) solves this by extending the Nutanix software stack natively into AWS and Azure bare-metal nodes.

  • Unified Management: Use the same Prism Central console to manage on-prem clusters and cloud-based nodes.
  • No Refactoring: Since it’s the same AOS/AHV stack, you can vMotion (or rather, «Cross-Hypervisor Disaster Recovery») workloads to the public cloud without changing IP addresses or re-architecting the app.

3. Cloud Native

Enter Nutanix Kubernetes Platform (NKP)

Kubernetes is notoriously difficult to manage at scale. NKP (built on the foundations of D2iQ) provides a production-ready CNCF-compliant distribution that simplifies life for DevOps teams:

  • Fleet Management: Manage multiple clusters across different environments from a single pane of glass.
  • Persistent Storage for Containers: Through the CSI (Container Storage Interface) driver, Nutanix Unified Storage provides block, file, and object storage directly to your pods, ensuring stateful apps (like Databases) perform at scale.

4. AI Infrastructure

GPT-in-a-Box 2.0

The board wants AI, but the security team wants data privacy. GPT-in-a-Box is Nutanix’s answer to the «Shadow AI» problem.

  • GPU Passthrough & vGPU: Optimized support for NVIDIA GPUs allows for high-density training and inference workloads.
  • Full-Stack Software: It integrates NVIDIA NIM (Inference Microservices) and open-source frameworks (PyTorch, Kubeflow) so you can deploy an LLM on-premises in hours rather than weeks.
  • Data Sovereignty: By keeping the model and the training data on your own Nutanix Files/Objects storage, you maintain 100% control over your IP.

The Admin’s Verdict

Nutanix has evolved into a «Cloud OS.» Whether you are managing legacy SQL servers, scaling a microservices-based web app on Kubernetes, or spinning up an Llama 3 instance for the dev team, the underlying logic remains consistent. It’s about moving the complexity from the human to the software.

Posted by oliver@olivergraf.tech, 0 comments

Platform9

The SaaS-Managed Shortcut to Hybrid Cloud and AI

If Nutanix is the «Private Cloud in a Box» and Proxmox is the «DIY Hypervisor,» Platform9 is the «Remote Pilot.» It’s designed for teams that want to keep their data on-prem (for cost or compliance) but want someone else to handle the «Day 2» operations—patching, monitoring, and troubleshooting the underlying stack.

1. The Architecture

SaaS-Managed KVM & K8s

Platform9 uses a Hosted Control Plane model. You provide the bare metal or VMs, install a small agent, and the Platform9 SaaS portal takes over.

  • Management as a Service: The control plane (Prism/vCenter equivalent) lives in the cloud. Platform9’s SREs (Site Reliability Engineers) monitor your local clusters 24/7, often fixing issues before you even get an alert.
  • Infrastructure Agnostic: Unlike Nutanix, Platform9 doesn’t care if you use Dell, HP, or old white-box servers. It turns any x86 hardware into a managed cloud.

2. Hybrid Cloud

The «vJailbreak» Strategy

Platform9 has positioned itself as the «Express Lane» for migrating away from legacy stacks like VMware.

  • In-Place Conversion: With tools like vJailbreak, Platform9 can convert existing VMware clusters to KVM-based Private Clouds in-place, minimizing downtime.
  • Unified Management: You can manage your on-prem KVM nodes and your public cloud (AWS/Azure) Kubernetes clusters through a single, consistent SaaS interface.

3. Cloud Native

Managed Kubernetes (PMK)

Platform9 was one of the first to offer Managed Kubernetes (PMK) on any infrastructure.

  • Zero-Touch Upgrades: Kubernetes upgrades are notoriously painful. Platform9 automates the rolling upgrade of your masters and workers, ensuring your API stays up.
  • Bare Metal K8s: For maximum performance, you can run Kubernetes directly on bare metal (no hypervisor overhead), which is the gold standard for high-throughput apps.

4. AI & Edge

GPU-Ready Private AI

As of 2026, Platform9 has leaned heavily into the Private AI trend with specific features for distributed workloads:

  • AI Hardware Sharing: Their Private Cloud Director now supports dynamic GPU sharing and predictive rebalancing, allowing multiple AI models to share the same physical hardware efficiently.
  • Edge-Optimized: Because the management is in the cloud, you can deploy «Nano-Clusters» (two-node HA) at edge locations (retail, factories) for local AI inference, while managing them all from a central dashboard.
  • DevOps Integration: With native Terraform providers, spinning up a GPU-backed environment for a data scientist becomes a single script rather than a week of ticket requests.

The Admin’s Verdict

Platform9 is the «Easy Button» for IT Admins who are understaffed or don’t want to become specialists in hypervisor internals. You get the sovereignty of on-prem with the operational ease of AWS. It’s perfect for the «Lean IT» team that needs to deliver AI and Kubernetes yesterday, without hiring five new SREs.

Posted by oliver@olivergraf.tech, 0 comments