What is soft capping on the mainframe?

Soft capping uses the Workload Manager and a defined or group capacity limit to hold an LPAR or set of LPARs at a chosen MSU ceiling measured against the rolling four hour average. Unlike hard capping it allows short bursts above the limit while keeping the four hour average at or below the cap. Because sub-capacity software is billed on that average, holding it down with a well placed cap lowers the monthly license charge directly.

Does soft capping slow the mainframe down?

Not when it is aimed correctly. Soft capping works on the rolling four hour average, not on instantaneous capacity, so the system can still run above the cap for short periods. The art is placing the cap so it constrains deferrable work in the costly peak window while leaving the online day with the capacity it needs. A cap aimed at the wrong window hurts service, a cap aimed at the right one only cuts the bill.

Which soft capping pattern should we use?

It depends on where the peak comes from. Group capacity across LPARs suits estates where workloads can share a single pooled ceiling and balance among themselves. A defined capacity cap on a specific LPAR suits a contained workload with a clear peak. Time based capping suits a predictable batch peak that can be held only during the window that sets the bill. Most large estates end up combining all three.

Soft Capping Wins: Three Patterns That Cut Peaks

The bill is the peak. Soft capping owns the peak.

Sub-capacity software on z/OS is billed on the rolling four hour average, the R4HA, which the system recalculates continuously from MSU consumption. The monthly license charge tracks the highest R4HA each product reaches during the reporting month. Soft capping is the mechanism that lets you choose that peak rather than discover it. Using the Workload Manager with a defined or group capacity limit, the LPAR or LPAR group is held at a chosen MSU ceiling measured against the four hour average, so the number SCRT reports for billing is the one you set.

The crucial point is that soft capping works on the average, not on instantaneous capacity. The system can still burst above the cap for short periods; only the four hour average is held at or below the limit. That is what separates it from hard capping and from a crude performance throttle. Aimed at the right window it cuts the bill and the online day never notices. Aimed at the wrong one it hurts service for no saving. Read this with our sub-capacity vs full capacity explainer and our MSU optimization service.

Window	Uncapped R4HA peak	Capped R4HA peak	What changed
Online morning	620	620	Left untouched; the online day keeps its capacity
Midday batch overlap	710	640	Deferrable batch held below the cap during the costly window
Evening batch	540	540	Off peak, no cap needed
Billed monthly peak	710	640	The number SCRT reports falls by 70 MSU

Window

Uncapped R4HA peak

Capped R4HA peak

What changed

Online morning

620

Left untouched; the online day keeps its capacity

Midday batch overlap

710

640

Deferrable batch held below the cap during the costly window

Evening batch

540

Off peak, no cap needed

Billed monthly peak

710

640

The number SCRT reports falls by 70 MSU

Soft capping wins: three patterns that cut peaks.

Group capacity across LPARs

Defined capacity on a contained LPAR

Time based capping on the batch window

What is soft capping?

Will it slow the system?

Which pattern fits us?

How does this connect to the renewal?

Peak setting your bill? Cap the window, not the workload.