Autoscaling

Autoscaling Quiz

Quiz

Question 1 of 20 (0 answered)

Question 1

What is the default interval at which the HPA controller queries the Metrics API?

5 seconds 15 seconds 30 seconds 60 seconds

✓

Correct!

The HPA controller queries the Metrics API every 15 seconds by default to calculate desired replica counts.

✗

Incorrect

The HPA controller queries the Metrics API every 15 seconds by default to calculate desired replica counts.

This is different from the Metrics Server scrape interval.

Question 2

Which of the following are required for HPA to function properly?

Metrics Server installed Resource requests defined on pods VPA configured At least one replica running

✓

Correct!

HPA requires Metrics Server for metrics, resource requests to calculate utilization percentages, and at least one replica to scale. VPA is a separate autoscaler and not required for HPA.

✗

Incorrect

HPA requires Metrics Server for metrics, resource requests to calculate utilization percentages, and at least one replica to scale. VPA is a separate autoscaler and not required for HPA.

Think about what HPA needs to calculate utilization and make scaling decisions.

Question 3

Karpenter provisions nodes faster than Cluster Autoscaler because it provisions directly through cloud provider APIs rather than scaling node groups.

True False

✓

Correct!

Karpenter typically provisions nodes in 30-90 seconds by calling cloud APIs directly, while Cluster Autoscaler takes 2-5 minutes because it works through ASG/node group scaling.

✗

Incorrect

Karpenter typically provisions nodes in 30-90 seconds by calling cloud APIs directly, while Cluster Autoscaler takes 2-5 minutes because it works through ASG/node group scaling.

Consider the architectural difference between direct API calls and group-based scaling.

Question 4

Using the HPA formula, calculate the desired replicas:

currentReplicas = 3
currentCPU = 90%
targetCPU = 50%

desiredReplicas = ceil[currentReplicas * (currentCPU / targetCPU)]

What will this code output?

5 6 4 3

✓

Correct!

desiredReplicas = ceil[3 * (90/50)] = ceil[3 * 1.8] = ceil[5.4] = 6 pods

✗

Incorrect

desiredReplicas = ceil[3 * (90/50)] = ceil[3 * 1.8] = ceil[5.4] = 6 pods

Remember to use ceiling function on the final result.

Question 5

Which VPA update mode should be used for databases where automatic restarts are risky?

Auto Recreate Initial Off

✓

Correct!

The ‘Off’ mode only provides recommendations without automatic updates, making it safe for stateful workloads like databases where unexpected restarts could cause issues.

✗

Incorrect

The ‘Off’ mode only provides recommendations without automatic updates, making it safe for stateful workloads like databases where unexpected restarts could cause issues.

Databases need careful handling when changing resources.

Question 6

What component embedded in Kubelet collects container-level metrics like CPU, memory, and network I/O?

✓

Correct!

cAdvisor (Container Advisor) is embedded in Kubelet and collects container-level metrics from the container runtime.

✗

Incorrect

cAdvisor (Container Advisor) is embedded in Kubelet and collects container-level metrics from the container runtime.

Its full name is Container Advisor.

Question 7

Arrange the metrics collection flow from container to HPA:

Drag to arrange in the correct order

⋮⋮ Container Runtime

⋮⋮ cAdvisor

⋮⋮ Kubelet

⋮⋮ Metrics Server

⋮⋮ HPA Controller

✓

Correct!

Container Runtime runs containers → cAdvisor collects metrics → Kubelet aggregates → Metrics Server queries and aggregates cluster-wide → HPA Controller consumes metrics for scaling decisions.

✗

Incorrect

Question 8

Which are valid KEDA trigger sources for event-driven autoscaling?

AWS SQS queue length Kafka consumer lag Node CPU utilization Custom HTTP webhooks

✓

Correct!

KEDA scales based on external event sources like SQS queues, Kafka topics, and custom webhooks. Node CPU is handled by standard HPA, not KEDA’s event-driven model.

✗

Incorrect

KEDA scales based on external event sources like SQS queues, Kafka topics, and custom webhooks. Node CPU is handled by standard HPA, not KEDA’s event-driven model.

KEDA focuses on external events, not resource metrics.

Question 9

VPA and HPA can safely be used together on the same deployment scaling on the same metric (CPU).

True False

✓

Correct!

Using VPA and HPA together on the same metric can cause conflicts. VPA adjusts resource requests while HPA scales based on utilization of those requests, potentially causing unpredictable behavior.

✗

Incorrect

Using VPA and HPA together on the same metric can cause conflicts. VPA adjusts resource requests while HPA scales based on utilization of those requests, potentially causing unpredictable behavior.

Consider what happens when both try to optimize CPU at the same time.

Question 10

In Karpenter, what is the purpose of the NodeClass (EC2NodeClass)?

Defines what type of nodes to create (requirements, limits) Defines how to create nodes (AMI, networking, IAM) Monitors node utilization for consolidation Handles spot instance interruptions

✓

Correct!

NodeClass defines HOW to create nodes - cloud-specific configuration like AMI selection, subnets, security groups, and IAM roles. NodePool defines WHAT nodes to create.

✗

Incorrect

NodeClass defines HOW to create nodes - cloud-specific configuration like AMI selection, subnets, security groups, and IAM roles. NodePool defines WHAT nodes to create.

NodePool and NodeClass have distinct responsibilities.

Question 11

Complete the HPA behavior policy to prevent aggressive scale-down:

Fill in the parameter that limits scale-down to 50% of pods

behavior:
  scaleDown:
    stabilizationWindowSeconds: 300
    policies:
    - type: _____
      value: 50
      periodSeconds: 60

Your answer:

✓

Correct!

The ‘Percent’ type allows you to specify scale-down as a percentage of current replicas, preventing too many pods from being removed at once.

✗

Incorrect

The ‘Percent’ type allows you to specify scale-down as a percentage of current replicas, preventing too many pods from being removed at once.

Question 12

What is the key difference between Cluster Autoscaler and Karpenter in terms of instance selection?

Cluster Autoscaler is limited to pre-defined instance types configured in ASGs/node groups.

Karpenter dynamically selects the optimal instance type from the entire cloud provider catalog based on actual pod requirements, enabling better bin-packing and cost optimization.

Did you get it right?

✓

Correct!

✗

Incorrect

Question 13

What does Karpenter’s consolidation policy ‘WhenUnderutilized’ do?

Removes empty nodes only Combines pods from underutilized nodes and replaces with smaller instances Scales pods down when CPU is low Prevents any node removal

✓

Correct!

WhenUnderutilized actively consolidates by combining pods from multiple underutilized nodes, deleting unnecessary nodes, and potentially replacing nodes with cheaper/smaller instances.

✗

Incorrect

WhenUnderutilized actively consolidates by combining pods from multiple underutilized nodes, deleting unnecessary nodes, and potentially replacing nodes with cheaper/smaller instances.

Think about active optimization, not just cleanup.

Question 14

In the HPA formula desiredReplicas = ceil[currentReplicas * (currentMetric / targetMetric)], what mathematical function is applied to the result?

✓

Correct!

The ceiling function (ceil) is used to round up, ensuring there are always enough replicas to handle the load.

✗

Incorrect

The ceiling function (ceil) is used to round up, ensuring there are always enough replicas to handle the load.

It rounds in a specific direction.

Question 15

The Metrics Server stores historical metrics data for long-term analysis.

True False

✓

Correct!

Metrics Server stores only short-term, in-memory data with no historical retention. For historical metrics, you need a dedicated monitoring solution like Prometheus.

✗

Incorrect

Metrics Server stores only short-term, in-memory data with no historical retention. For historical metrics, you need a dedicated monitoring solution like Prometheus.

Consider its purpose as a real-time metrics aggregator.

Question 16

Which actions does Karpenter perform for cost optimization?

Delete empty nodes immediately Consolidate underutilized nodes Replace with cheaper instance types Handle spot instance interruptions

✓

Correct!

Karpenter performs all these optimizations: removes empty nodes, consolidates underutilized ones, replaces with cheaper instances when possible, and gracefully handles spot interruptions with replacement provisioning.

✗

Incorrect

Karpenter is designed for comprehensive cost optimization.

Question 17

Which VPA component is responsible for evicting pods that need resource updates?

Recommender Updater Admission Controller Metrics Server

✓

Correct!

The Updater component compares current vs recommended requests and evicts pods when updates are needed. The Admission Controller then sets new requests on recreated pods.

✗

Incorrect

The Updater component compares current vs recommended requests and evicts pods when updates are needed. The Admission Controller then sets new requests on recreated pods.

Each VPA component has a specific role in the update workflow.

Question 18

Arrange the VPA workflow steps in correct order:

Drag to arrange in the correct order

⋮⋮ Recommender analyzes metrics

⋮⋮ Updater evicts pod

⋮⋮ Admission Controller mutates new pod

⋮⋮ New pod starts with optimal requests

✓

Correct!

Recommender analyzes usage and updates recommendations → Updater sees difference and evicts pod → Admission Controller intercepts pod creation and applies recommendations → New pod runs with optimized resources.

✗

Incorrect

Question 19

Why is stabilizationWindowSeconds important for HPA scale-down behavior?

stabilizationWindowSeconds prevents “flapping” - rapid scale up/down cycles caused by temporary metric fluctuations.

By requiring metrics to stay below the threshold for the window duration (e.g., 300 seconds), HPA avoids premature scale-down that could cause capacity issues when load returns.

Did you get it right?

✓

Correct!

✗

Incorrect

Question 20

A deployment has these resource specifications. What percentage CPU utilization triggers HPA scaling?

resources:
  requests:
    cpu: 200m
  limits:
    cpu: 500m

HPA target: averageUtilization: 70

What will this code output?

When pods use > 70% of 500m (350m) When pods use > 70% of 200m (140m) When pods use > 70% of total node CPU When average across all pods > 70%

✓

Correct!

HPA calculates utilization based on resource requests, not limits. 70% of 200m request = 140m. When average CPU usage exceeds 140m per pod, HPA scales up.

✗

Incorrect

HPA calculates utilization based on resource requests, not limits. 70% of 200m request = 140m. When average CPU usage exceeds 140m per pod, HPA scales up.

Utilization percentage is calculated against requests.

Quiz Results

Score

0/0

Accuracy

Right

Wrong

Skipped

Last updated on January 6, 2026

Security Observability