Autoscaling

This guide covers configuring node pool and cluster autoscaling for HostedClusters. Autoscaling automatically adjusts the number of worker nodes based on workload demands.

Node Pool Autoscaling

Node pool autoscaling enables individual NodePools to automatically scale between a minimum and maximum number of nodes based on pending pod resource requests.

Enable Autoscaling on a NodePool

To enable autoscaling, set spec.autoScaling and remove spec.replicas:

oc patch nodepool -n <HOSTED_CLUSTER_NAMESPACE> <NODEPOOL_NAME> --type merge -p '{
  "spec": {
    "replicas": null,
    "autoScaling": {
      "min": 1,
      "max": 5
    }
  }
}'

Field	Description
`min`	Minimum number of nodes to maintain. Must be >= 0 and <= `max`. See note below about platform restrictions.
`max`	Maximum number of nodes the autoscaler can scale to. Must be >= 1 and >= `min`.

Note

autoScaling and replicas are mutually exclusive. When enabling autoscaling, replicas must be set to null.

Note

Scale-from-zero (min: 0) is only supported on the AWS platform. All other platforms require min >= 1.

Verify Autoscaling is Enabled

Check the AutoscalingEnabled condition on the NodePool:

oc get nodepool -n <HOSTED_CLUSTER_NAMESPACE> <NODEPOOL_NAME> -o jsonpath='{.status.conditions[?(@.type=="AutoscalingEnabled")].status}'

The output should be True.

Disable Autoscaling

To disable autoscaling and return to a fixed replica count:

oc patch nodepool -n <HOSTED_CLUSTER_NAMESPACE> <NODEPOOL_NAME> --type merge -p '{
  "spec": {
    "autoScaling": null,
    "replicas": 2
  }
}'

Cluster Autoscaling

Cluster autoscaling configures global autoscaling behavior that applies to all NodePools in a HostedCluster. This includes scale-down policies, node group balancing, and expander strategies.

Configure Cluster Autoscaling

Set spec.autoscaling on the HostedCluster:

oc patch hostedcluster -n <HOSTED_CLUSTER_NAMESPACE> <HOSTED_CLUSTER_NAME> --type merge -p '{
  "spec": {
    "autoscaling": {
      "scaling": "ScaleUpAndScaleDown",
      "maxNodesTotal": 10,
      "expanders": ["LeastWaste"],
      "scaleDown": {
        "delayAfterAddSeconds": 300,
        "unneededDurationSeconds": 600,
        "utilizationThresholdPercent": 50
      }
    }
  }
}'

Configuration Reference

Scaling Behavior

Field	Type	Default	Description
`scaling`	string	`ScaleUpAndScaleDown`	`ScaleUpOnly` or `ScaleUpAndScaleDown`. Controls whether the autoscaler can scale down nodes.
`maxNodesTotal`	int	unlimited	Maximum total nodes across all NodePools. The autoscaler will not scale beyond this limit.
`maxPodGracePeriod`	int	600	Maximum seconds to wait for graceful pod termination before scaling down.
`maxNodeProvisionTime`	string	15m	Maximum time to wait for a node to provision, in Go duration format (e.g., `15m`, `20m`).
`podPriorityThreshold`	int	-10	Pods with priority below this threshold won't trigger scale-up.

Note

Defaults listed in the configuration reference tables represent the cluster autoscaler's effective behavior when the field is omitted. The only API-enforced default is scaling, which defaults to ScaleUpAndScaleDown.

Expanders

Expanders control how the autoscaler selects which NodePool to scale when multiple NodePools can satisfy pending pods. Set via spec.autoscaling.expanders:

Expander	Description
`LeastWaste`	Selects the NodePool with the least idle CPU and memory after scaling.
`Priority`	Selects the NodePool with the highest user-defined priority.
`Random`	Selects a NodePool randomly.

Default: [Priority, LeastWaste]. Up to 3 expanders can be specified in priority order.

Note

The Priority expander uses a ConfigMap named cluster-autoscaler-priority-expander in the kube-system namespace of the guest cluster to determine NodePool priorities. The ConfigMap maps integer priorities to node group name patterns (regex). Higher values mean higher priority. See the upstream documentation for details on the ConfigMap format.

Scale Down Configuration

The scaleDown field is only valid when scaling is set to ScaleUpAndScaleDown:

Field	Type	Default	Description
`delayAfterAddSeconds`	int	600	Seconds to wait after scale-up before evaluating scale-down.
`delayAfterDeleteSeconds`	int	0	Seconds to wait after node deletion before evaluating scale-down.
`delayAfterFailureSeconds`	int	180	Seconds to wait after a failed scale-down before retrying.
`unneededDurationSeconds`	int	600	How long a node must be unneeded before it is eligible for removal.
`utilizationThresholdPercent`	int	50	Nodes with utilization below this percentage are candidates for removal. 0 means only completely idle nodes; 100 means any node can be removed.

Node Group Balancing

Field	Type	Default	Description
`balancingIgnoredLabels`	[]string	[]	Labels the autoscaler should ignore when comparing node groups for balancing. Platform-specific labels are added automatically.
`maxFreeDifferenceRatioPercent`	int	10	Maximum allowed difference in free resources between node groups to be considered similar for balancing. 0 = exact match required; 100 = any difference allowed.

How It Works

When a pod cannot be scheduled due to insufficient resources, the cluster autoscaler identifies NodePools with autoscaling enabled that can satisfy the pod's requirements.
The autoscaler selects a NodePool based on the configured expander strategy and triggers a scale-up by increasing the NodePool's replica count.
HyperShift provisions new platform-specific machine instances for the NodePool and the new nodes join the guest cluster.
When scaling is set to ScaleUpAndScaleDown, the autoscaler monitors node utilization. Nodes that remain underutilized (below utilizationThresholdPercent) for longer than unneededDurationSeconds are removed.

Monitoring

Check the current state of autoscaling:

# View NodePool autoscaling status
oc get nodepools -n <HOSTED_CLUSTER_NAMESPACE> -o wide

# Check autoscaler deployment in the control plane namespace
oc get deployment cluster-autoscaler -n <HOSTED_CLUSTER_NAMESPACE>-<HOSTED_CLUSTER_NAME>

# View autoscaler logs
oc logs deployment/cluster-autoscaler -n <HOSTED_CLUSTER_NAMESPACE>-<HOSTED_CLUSTER_NAME>