Engineering • 6 min read

Introducing Kubernetes Event-driven Autoscaling (KEDA)

KEDA is an autoscaler extending the functionality of Kubernetes’s Horizontal Pod Autoscaler (HPA)

Written by: Kenny Vu

Published on: May 27, 2025

Last updated on: May 27, 2025

Kubernetes Native Horizontal Pod Autoscaling Support

Kubernetes natively supports Horizontal Pod Autoscaling: it enables engineers to define automatic scaling conditions and behaviour for target workloads by using the HorizontalPodAutoscaler object from the Kubernetes autoscaling API Group. Kubernetes uses these user-defined configurations to dynamically adjust the number of pod replicas for specified workloads based on real-time demand.
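
For illustration, here is a minimal HorizontalPodAutoscaler manifest; the Deployment name and thresholds are hypothetical:

apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: web-hpa                    # hypothetical name
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: web                      # hypothetical Deployment to scale
  minReplicas: 2
  maxReplicas: 10
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 60   # keep average CPU at ~60% of requests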

Our Horizontal Pod Autoscaling series explains how to get the most out of Kubernetes HPA.

Despite this built-in functionality, the Cloud Native Computing Foundation (CNCF) accepted Kubernetes Event-Driven Autoscaling (KEDA) as a CNCF project in March 2020. In August 2023, KEDA became a Graduated CNCF project, marking the tool as a production-ready solution for autoscaling Kubernetes workloads.

What is Kubernetes Event-driven Autoscaling (KEDA)?

In May 2019, Microsoft announced it had partnered with Red Hat to create KEDA, a tool built on top of the Kubernetes Horizontal Pod Autoscaler (HPA) rather than replacing it. KEDA is an event-driven autoscaler, focused on enabling Kubernetes applications to scale automatically in response to events occurring in an event source (a KEDA Scaler).

For example, you can have a MySQL database as an external event source. By granting KEDA access to the database with a valid query, KEDA can watch the value returned by the query, automatically scale a target Kubernetes workload up if the returned value exceeds a user-defined threshold, and then scale it back down if the value falls below the threshold.
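
A minimal ScaledObject for this scenario might look like the sketch below; the target Deployment, query and environment variable are hypothetical, and the full parameter list is on the MySQL scaler page of the KEDA documentation:

apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: worker-scaler                # hypothetical name
spec:
  scaleTargetRef:
    name: worker                     # hypothetical Deployment to scale
  minReplicaCount: 0
  maxReplicaCount: 10
  triggers:
    - type: mysql
      metadata:
        query: "SELECT COUNT(*) FROM tasks WHERE status = 'pending'"  # hypothetical query
        queryValue: "10"             # scale out while the query returns more than 10
        connectionStringFromEnv: MYSQL_CONN_STR  # hypothetical env var on the target workload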

HPA vs KEDA

KEDA uses the Kubernetes HPA to deliver some of its event-driven autoscaling capabilities, but also offers additional functionality to fill in the gaps of the Kubernetes HPA. In this section, I will explain some of the differences between HPA and KEDA.

Metric APIService Restrictions

Metrics Server, the common cluster add-on installed on many Kubernetes clusters, collects resource metrics exposed by the kubelet process running on each node. It’s simple, fast, efficient and scalable. Kubernetes HPA can automatically scale workloads based on metrics collected by the Metrics Server, such as CPU and memory usage metrics. However, the Metrics Server only serves the metrics.k8s.io API group.

Using only the Kubernetes HPA to automatically scale workloads on custom metrics (custom.metrics.k8s.io - metrics related to objects in the Kubernetes cluster) or external metrics (external.metrics.k8s.io - metrics from outside the Kubernetes cluster) coming from a single data source is also viable, but you will need to install a metrics adapter - a component that connects your data source (e.g. a monitoring system) to Kubernetes and provides access to the custom or external metrics from that data source.

However, each of these metric APIs (metrics.k8s.io, custom.metrics.k8s.io, external.metrics.k8s.io) is registered on a Kubernetes cluster as an APIService object, each representing a different type/source of metric data (some adapters register both the custom and external metrics APIs):

$ kubectl get apiservices | grep metrics
NAME                              SERVICE
v1beta1.custom.metrics.k8s.io     default/prom-adapter-prometheus-adapter
v1beta1.external.metrics.k8s.io   default/datadog-monitoring-cluster-agent-metrics-api
v1beta1.metrics.k8s.io            default/metrics-server

⚠️ At the time of writing (May 2025), there can only be one APIService (data source) tied to each metric API, but there is an issue opened in 2020 requesting support for multiple custom/external metrics servers in a cluster.

As a result, the number of data sources you can fetch custom and external metrics from is limited. One workaround is to funnel all your metrics into one location (like Prometheus) and install a Prometheus metrics adapter in your Kubernetes cluster.
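
To make the cost of that plumbing concrete, here is a sketch of a prometheus-adapter rule that turns a Prometheus counter into a custom metric the HPA can consume; the series name and query are hypothetical:

rules:
  - seriesQuery: 'http_requests_total{namespace!="",pod!=""}'   # hypothetical Prometheus series
    resources:
      overrides:
        namespace: {resource: "namespace"}
        pod: {resource: "pod"}
    name:
      matches: "^(.*)_total$"
      as: "${1}_per_second"          # exposed to the HPA as http_requests_per_second
    metricsQuery: 'sum(rate(<<.Series>>{<<.LabelMatchers>>}[2m])) by (<<.GroupBy>>)'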

In contrast, KEDA has a more flexible yet simpler approach: you just need to install KEDA and then configure KEDA to use metrics and/or events from any event source supported by KEDA.

CNCF has a blog post from members of the Alibaba Cloud EDAS team about why Alibaba Cloud chose to use KEDA for application autoscaling, which contains diagrams illustrating this problem.

KEDA extends HPA

KEDA does not replace the Kubernetes HPA; instead, it uses the HPA under the hood to provide certain features and extend its functionality without overwriting or duplicating the native autoscaling capabilities the HPA provides. If you use KEDA, you may find HorizontalPodAutoscaler objects created by KEDA, but not all KEDA features require an HPA object.

KEDA supports a wide range of event sources, known as built-in KEDA scalers, including the Kubernetes Metrics Server, meaning you can also configure KEDA to automatically scale workloads using the CPU and memory metrics provided by the Metrics Server. But instead of defining a HorizontalPodAutoscaler object, you will need to deploy a KEDA custom resource.
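
For example, a ScaledObject trigger for CPU-based scaling is a short sketch like the following; the target value is hypothetical:

triggers:
  - type: cpu
    metricType: Utilization   # or AverageValue
    metadata:
      value: "60"             # target average CPU utilisation (%)

As with the HPA, the target containers must define CPU resource requests for utilisation-based scaling to work.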

Scaling to or from Zero

One of the features KEDA provides is the ability to scale workloads to or from zero, which is especially useful when no events requiring processing are occurring and you want to optimise resource usage.

Since Kubernetes v1.16, there has been a feature gate called HPAScaleToZero to enable setting the minimum number of replicas to 0 when using custom or external metrics, but it is disabled by default at the time of writing (Kubernetes v1.32).
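
With KEDA, by contrast, scaling to zero only requires setting minReplicaCount on a ScaledObject; the values below are illustrative:

spec:
  minReplicaCount: 0    # KEDA scales the workload to zero when the scaler reports no activity
  cooldownPeriod: 300   # seconds to wait after the last activity before scaling down to zero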

Scaling Custom Resources

With KEDA you can also automatically scale workloads defined as Kubernetes custom resources, but the Custom Resource Definition (CRD) must define the /scale subresource.
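
For reference, the /scale subresource is declared in the CRD itself; the abridged, hypothetical excerpt below shows the relevant fields:

apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
spec:
  versions:
    - name: v1
      served: true
      storage: true
      subresources:
        scale:
          specReplicasPath: .spec.replicas      # where the desired replica count is set
          statusReplicasPath: .status.replicas  # where the current replica count is reported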

Metric/Event Sources

At the time of writing, KEDA has built-in support for fetching metrics and events from 74 event sources (including Azure Monitor, Kafka, RabbitMQ, AWS SQS, and more). KEDA also provides a standardised approach for configuring autoscaling for any of the supported event sources, using two Kubernetes custom resources: ScaledObject and ScaledJob.

Assuming you already have KEDA installed, the process for setting up autoscaling for a given KEDA scaler (event source) is straightforward: you only have to deploy a correctly configured ScaledObject or ScaledJob. To get the configuration right for a given scaler you can lean on KEDA’s documentation, where each built-in scaler has its own page outlining all the parameters you need to define in your ScaledObject or ScaledJob object.
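
For instance, a ScaledJob that spawns Kubernetes Jobs to drain a RabbitMQ queue could be sketched as follows; the container image, queue name and environment variable are hypothetical:

apiVersion: keda.sh/v1alpha1
kind: ScaledJob
metadata:
  name: queue-consumer             # hypothetical name
spec:
  jobTargetRef:
    template:
      spec:
        containers:
          - name: consumer
            image: example.com/consumer:latest   # hypothetical image
        restartPolicy: Never
  maxReplicaCount: 20
  triggers:
    - type: rabbitmq
      metadata:
        queueName: tasks           # hypothetical queue
        mode: QueueLength
        value: "5"                 # roughly one Job per 5 queued messages
        hostFromEnv: RABBITMQ_HOST # hypothetical env var holding the AMQP connection string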

If your desired event source is not supported, KEDA does support external scalers, but you will have to build and maintain a gRPC service that implements the same interface as a built-in scaler.
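
Once that service is running in (or reachable from) your cluster, pointing KEDA at it is just another trigger; the address below is hypothetical:

triggers:
  - type: external
    metadata:
      scalerAddress: my-external-scaler.keda.svc.cluster.local:6000  # hypothetical gRPC service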

Summary: When to use KEDA

If you only need to scale your application based on basic metrics like CPU and memory, or if you only need custom or external metrics from a single data source, then Kubernetes’s native Horizontal Pod Autoscaler is a perfectly good fit.

You should consider KEDA if:

  • You are looking to cut down on costs by making use of KEDA’s scale-to-zero feature when dealing with idle workloads or empty queues
  • Significant engineering effort would otherwise need to be invested in building, installing and maintaining the multiple services required to consolidate data into a single source that can be exposed to your Kubernetes clusters through the limited set of metrics adapters. This is especially an issue if:
    • You’re working with external systems that don’t expose metrics easily
    • Your preferred monitoring solution doesn’t have a maintained HPA metrics adapter (for example, the AWS CloudWatch adapter is deprecated).
  • You want to automatically scale a custom resource
  • You want more options for fine-tuning how your workloads are automatically scaled.

If you have decided to explore KEDA, our next blog post will explain how to get started with KEDA and the key concepts you need to know before you start using it.