Version: development

Auto Scale

Auto-scaling is a crucial technique for effective load management of service traffic. It enables service operators to automatically adjust the number of instances or resources allocated to a service based on current or expected demand and resource utilization. By doing so, auto-scaling ensures that a service can handle incoming load while optimizing the cost of running the service by allocating just the right number of resources.

Auto-scale is a core integration in Aperture that works hand in hand with the flow control capabilities to provide a comprehensive load management platform. Aperture policies allow defining auto-scalers that consider the flow control state when making scaling decisions. For example, if a load scheduler on a service sheds traffic during a sudden traffic spike, the auto-scaler can automatically add more instances to the service to handle the increased load.

With the auto-scale capability in Aperture, service operators can configure auto-scaling policies based on service overload signals, such as load shedding, in addition to resource utilization signals such as CPU, memory usage, and network I/O. This flexibility lets service operators fine-tune the auto-scaling behavior to their specific service needs. Auto-scaling policies can be set up to add or remove instances or resources based on these signals, allowing for dynamic scaling in response to changing traffic patterns.

Auto-scaling is a powerful technique that enables service operators to maintain service availability and performance while optimizing costs. In Aperture, auto-scaling is an integral component of the load management platform, working seamlessly with flow control to provide a comprehensive solution. These capabilities allow services to dynamically adjust to incoming traffic patterns, ensuring optimal performance while minimizing infrastructure costs.

Insertion

Aperture Agents interface with cloud infrastructure APIs, such as the Kubernetes API, to discover, monitor, and scale infrastructure resources. The Aperture Controller uses the information from the Agents to make informed auto-scaling decisions, which the Agents then act on.

In an agent group, the leader Agent is responsible for interfacing with the cloud infrastructure APIs. For example, by maintaining a watch on scalable Kubernetes resources, the agent group leader can monitor changes to the resource status, such as the number of replicas configured and currently deployed. The up-to-date information is then used by the Aperture Controller to make informed auto-scaling decisions.

Auto Scaler

Gradient Controller

The Gradient Controller computes a desired scale value based on a signal and a setpoint. It tries to adjust the scale value proportionally to the relative difference between the setpoint and the signal.

The gradient describes a corrective factor that should be applied to the scale value to get the signal closer to the setpoint. It's computed as follows:

\text{gradient} = \left(\frac{\text{signal}}{\text{setpoint}}\right)^{\text{slope}}

The gradient is then clamped to the range [1.0, max_gradient] for the scale-out controller and to the range [min_gradient, 1.0] for the scale-in controller.

The output of the gradient controller is computed as follows:

\text{desired\_scale} = \text{gradient}_{\text{clamped}} \cdot \text{actual\_scale}.
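The two formulas above can be sketched in a few lines of Python. This is a minimal illustration of the arithmetic only, not Aperture's implementation: the function names, the `scale_out` flag, and the default values for `min_gradient` and `max_gradient` are assumptions made for this example.

```python
def clamp(value: float, low: float, high: float) -> float:
    """Restrict value to the closed interval [low, high]."""
    return max(low, min(high, value))


def desired_scale(
    signal: float,
    setpoint: float,
    actual_scale: float,
    slope: float,
    *,
    scale_out: bool,
    min_gradient: float = 0.5,  # hypothetical default, for illustration only
    max_gradient: float = 2.0,  # hypothetical default, for illustration only
) -> float:
    """Compute the desired scale from the gradient formula in the text."""
    # gradient = (signal / setpoint) ^ slope
    gradient = (signal / setpoint) ** slope

    if scale_out:
        # A scale-out controller never shrinks the scale: clamp to [1.0, max_gradient].
        clamped = clamp(gradient, 1.0, max_gradient)
    else:
        # A scale-in controller never grows the scale: clamp to [min_gradient, 1.0].
        clamped = clamp(gradient, min_gradient, 1.0)

    # desired_scale = clamped gradient * actual_scale
    return clamped * actual_scale


if __name__ == "__main__":
    # Signal 50% above the setpoint, slope 1.0, 4 replicas currently running.
    print(desired_scale(90.0, 60.0, 4, 1.0, scale_out=True))
```

With a signal 50% above the setpoint and a slope of 1.0, the gradient is 1.5, so a scale-out controller asks for 1.5 times the current scale. When the signal is below the setpoint, the same scale-out controller clamps the gradient to 1.0 and leaves the scale unchanged. Rounding the result to a whole number of replicas is left out of this sketch.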

Pod Scaler

Kubernetes Object Selector

Live Preview of Kubernetes Control Points

The Kubernetes resources identified by a Kubernetes Object Selector are called Kubernetes Control Points. These are the subset of resources in a Kubernetes cluster that can be scaled in or out. Aperture Agents perform automated discovery of Kubernetes Control Points in a cluster.

Use the aperturectl auto-scale control-points CLI command to list active control points.

For example:

aperturectl auto-scale control-points --kube

Returns:

AGENT GROUP   NAME                                                NAMESPACE             KIND
default       coredns                                             kube-system           Deployment
default       coredns-5d78c9869d                                  kube-system           ReplicaSet
default       gateway                                             istio-system          Deployment
default       gateway-868c757988                                  istio-system          ReplicaSet
default       istiod                                              istio-system          Deployment
default       istiod-6d9df7fb7                                    istio-system          ReplicaSet
default       local-path-provisioner                              local-path-storage    Deployment
default       local-path-provisioner-6bc4bddd6b                   local-path-storage    ReplicaSet
default       service1-demo-app                                   demoapp               Deployment
default       service1-demo-app-7b4bc9bdcd                        demoapp               ReplicaSet
default       service2-demo-app                                   demoapp               Deployment
default       service2-demo-app-677bb57574                        demoapp               ReplicaSet
default       service3-demo-app                                   demoapp               Deployment
default       service3-demo-app-58656dcf95                        demoapp               ReplicaSet
default       wavepool-generator                                  demoapp               Deployment
default       wavepool-generator-5b4578bdd9                       demoapp               ReplicaSet
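For scripting, the tabular output above can be turned into records. The following is a rough sketch that assumes the layout shown here, with columns separated by two or more spaces; the helper name `parse_control_points` is hypothetical and is not part of aperturectl.

```python
import re


def parse_control_points(output: str) -> list[dict[str, str]]:
    """Parse the control-points table into one dict per row.

    Assumes columns are separated by two or more spaces, as in the
    sample output shown in the text.
    """
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    if not lines:
        return []

    # "AGENT GROUP" contains a single space, so split on runs of 2+ spaces.
    header = [
        name.lower().replace(" ", "_")
        for name in re.split(r"\s{2,}", lines[0])
    ]
    return [dict(zip(header, re.split(r"\s{2,}", line))) for line in lines[1:]]


if __name__ == "__main__":
    sample = """\
AGENT GROUP   NAME                           NAMESPACE   KIND
default       service1-demo-app              demoapp     Deployment
default       service1-demo-app-7b4bc9bdcd   demoapp     ReplicaSet
"""
    # Print only the Deployments found in the sample.
    for row in parse_control_points(sample):
        if row["kind"] == "Deployment":
            print(row["namespace"], row["name"])
```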
