
Auto Scaling

Auto Scaling automatically adjusts your resources to match demand. The service dynamically increases or decreases the number of VCS instances under the conditions you designate, which prevents the resource shortages or service interruptions that growing workloads can cause and improves the quality and efficiency of your service.

When the workload rises above the upper threshold you set, the number of instances increases; when it falls below the lower threshold, the number of instances decreases immediately. Combined with load balancing, traffic is distributed evenly across the instances, which reduces the cost of idle VCS instances.

info
  • For the permission differences between a Tenant Admin and a Tenant User when using VCS instances, please refer to User roles and permissions.

Create an Auto Scaling template

You can create an Auto Scaling template and connect it to multiple VCS instances.

  • Click Auto Scaling from the service list to go to the Auto Scaling Management page, and click +CREATE.

tip

Click the asterisk to the left of a service in "All Services" to pin it to "My Favorite Services", so that frequently used services are quicker to reach.

  • Enter the following setting information, and then click Next: REVIEW & CREATE>:
    • Name: Enter a name for the Auto Scaling.
    • Description (Optional): Enter a description for the Auto Scaling.
    • Meter name: Select the meter, which is the resource indicator Auto Scaling uses to decide whether to increase or decrease the number of VCS instances. Four meters are available: cpuutil (CPU usage), memory.usage (memory usage), network.incoming.bytes.rate (network incoming rate), and network.outgoing.bytes.rate (network outgoing rate).
    • Upper Threshold: Set the upper threshold of the meter. For example, with cpuutil as the meter and 80 as the upper threshold, the number of VCS instances increases when the CPU usage exceeds 80%.
    • Lower Threshold: Set the lower threshold of the meter. When the meter falls below this threshold, the number of VCS instances decreases.
    • Max VCS instance no.: Set the maximum number of VCS instances. The default value is 2 (Auto Scaling requires at least 2 VCS instances).

  • Review the settings of Auto Scaling and project credit information, then click CREATE.
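
The template settings and the threshold behavior described above can be sketched in code. This is a minimal illustration, not the service's implementation: the class `AutoScalingTemplate`, its field names, and the function `scaling_decision` are hypothetical. Only the meter names, the meaning of the two thresholds, and the minimum of 2 for the maximum instance count come from the settings above. The sketch also assumes that the lower threshold must be below the upper threshold, that scaling happens one instance at a time, and that at least one instance always remains.

```python
from dataclasses import dataclass

# Meter names listed in the template settings.
METERS = {
    "cpuutil",
    "memory.usage",
    "network.incoming.bytes.rate",
    "network.outgoing.bytes.rate",
}


@dataclass(frozen=True)
class AutoScalingTemplate:
    """Hypothetical in-memory model of an Auto Scaling template."""
    name: str
    meter_name: str
    upper_threshold: float
    lower_threshold: float
    max_instances: int = 2  # documented default value
    description: str = ""   # optional in the form

    def __post_init__(self):
        if self.meter_name not in METERS:
            raise ValueError(f"unknown meter: {self.meter_name}")
        # Assumption: the form expects lower < upper.
        if self.lower_threshold >= self.upper_threshold:
            raise ValueError("lower threshold must be below upper threshold")
        if self.max_instances < 2:
            raise ValueError("Auto Scaling requires at least 2 VCS instances")


def scaling_decision(template, meter_value, current_instances):
    """Return +1 to add an instance, -1 to remove one, 0 to do nothing.

    Assumptions: one instance is added or removed per decision, and the
    group never shrinks below a single instance.
    """
    if meter_value > template.upper_threshold:
        return 1 if current_instances < template.max_instances else 0
    if meter_value < template.lower_threshold:
        return -1 if current_instances > 1 else 0
    return 0
```

For example, with `cpuutil` as the meter, 80 as the upper threshold, and 20 as the lower threshold, `scaling_decision(template, 90, 1)` returns `1`, while the same reading with 2 instances already running returns `0` because the default maximum of 2 has been reached.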


Manage Auto Scaling

View Auto Scaling information or delete an Auto Scaling.

  • On the Auto Scaling Management page, the most recently created Auto Scaling is listed at the top. Click an entry in the Auto Scaling list to view its details, or to execute DELETE or REFRESH.


Attach Auto Scaling to a VCS instance

The attached Auto Scaling should be used with load balancing to automatically balance the workloads of VCS instances.

  • Click Virtual Compute Service (VCS) from the service list to go to the VCS Instance Management page, then click the instance you want to attach Auto Scaling to and go to its VCS Instance Details page. Then, click the ATTACH button next to Auto Scaling.

info

The state of the VCS instance must be Ready before you can attach Auto Scaling to the instance or detach it from the instance.

  • Auto Scaling has to be used with a load balancer, which evenly distributes traffic to each VCS instance in the group to achieve high service availability.
  • When the Auto Scaling setting window pops up, select the name of the Auto Scaling, the notification link service (optional), the load balancer name, and the port information, then click OK.
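
The attach preconditions above can be summarized as a simple check. This is a hedged sketch only: `AttachRequest` and `attach_problems` are hypothetical names, not part of any service API, and modelling the "port information" as a single port number between 1 and 65535 is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AttachRequest:
    """Hypothetical container for the fields of the Auto Scaling setting window."""
    auto_scaling_name: str
    load_balancer_name: str
    port: int                                 # assumption: a single port number
    notification_link: Optional[str] = None   # optional in the window


def attach_problems(instance_state: str, request: AttachRequest) -> List[str]:
    """Return the reasons an attach cannot proceed; an empty list means it can."""
    problems = []
    if instance_state != "Ready":
        problems.append("VCS instance must be in the Ready state")
    if not request.auto_scaling_name:
        problems.append("an Auto Scaling name is required")
    if not request.load_balancer_name:
        problems.append("a load balancer is required")
    if not 1 <= request.port <= 65535:
        problems.append("port must be between 1 and 65535")
    return problems
```

Returning a list of problems rather than a single boolean lets a caller report every unmet condition at once, for example both a non-Ready instance and a missing load balancer.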