Zero Config Scaling

Auto-Scaling

Q: How fast does auto-scaling respond to traffic spikes?

Pre-warmed instances activate in under 10 seconds on Pro and Scale plans. Starter plan scales in approximately 30 seconds. Zero requests are dropped during scale-up.

Q: Do I need to configure auto-scaling?

No. Auto-scaling is enabled by default on every plan. HostAgentes monitors request rate, latency, and queue depth automatically.

Q: Will I be charged for auto-scaling?

Auto-scaling is included in your plan. Starter has basic scaling, Pro has full scaling with unlimited requests, and Scale offers dedicated instances.

Q: What happens when traffic drops?

Instances automatically scale down when CPU utilization stays below 20% for 5 minutes. You never pay for idle capacity.

Q: Can I set scaling limits?

Yes, on Pro and Scale plans. Set maximum instance counts, budget-based limits, or configure custom scaling rules through the dashboard.

Your agents handle 10 requests or 10,000 — without you touching a thing.

How It Works

HostAgentes monitors every request in real-time. When traffic spikes, pre-warmed instances activate in under 10 seconds. When traffic subsides, excess instances scale down automatically.

<10s

Scale-up time

Configuration needed

Requests dropped

Scaling Triggers

Request rate

Exceeds 80% of capacity

Add instances

Response latency (p95)

Exceeds 2x baseline

Add instances

Queue depth

More than 10 waiting

Add instances

CPU utilization

Below 20% for 5 min

Remove instances

By Plan

Starter

Basic auto-scaling
Up to 50 req/min
~30s scale-up

Pro

Full auto-scaling
Unlimited requests
~10s scale-up

Scale

Priority scaling
Dedicated instances
~5s scale-up

Frequently Asked Questions

How fast does auto-scaling respond to traffic spikes?

Pre-warmed instances activate in under 10 seconds on Pro and Scale plans. Starter plan scales in approximately 30 seconds. Zero requests are dropped during scale-up.

Do I need to configure auto-scaling?

No. Auto-scaling is enabled by default on every plan. HostAgentes monitors request rate, latency, and queue depth automatically and adjusts capacity in real-time.

Will I be charged for auto-scaling?

Auto-scaling is included in your plan. Starter has basic scaling (up to 50 req/min), Pro has full scaling with unlimited requests, and Scale offers dedicated instances with priority scaling.

What happens when traffic drops?

Instances automatically scale down when CPU utilization stays below 20% for 5 minutes. You never pay for idle capacity.

Can I set scaling limits?

Yes, on Pro and Scale plans. Set maximum instance counts, budget-based limits, or configure custom scaling rules through the dashboard.

Never worry about traffic spikes again

Auto-scaling included on every plan. Start with a free trial.

See Plans

Auto-Scaling

How It Works

Scaling Triggers

By Plan

Frequently Asked Questions

Never worry about traffic spikes again

We use cookies