Licensing · Soft Capping

Soft Capping and Defined Capacity, Explained.

A defined capacity limit is the single most direct lever on a sub-capacity software bill: it holds the billable peak below the ceiling the machine could reach. Set it well and you cut cost with no business impact. Set it badly and you throttle the work that pays for the machine. The whole craft is knowing where the line sits.

A ceiling on the billable peak.

Soft capping is the use of a defined capacity limit, enforced by z/OS Workload Management together with PR/SM, to hold an LPAR's billable MSU consumption at or below a set figure. It works against the rolling four hour average, the same window that sub-capacity Monthly License Charge software is billed on, which is exactly why it is a cost lever and not merely a throttle.

Defined capacity is the MSU number you set on a single LPAR. Group capacity applies the same mechanic to a group of LPARs on one machine, summing their four hour averages against one shared limit. Both differ from hard capping, which limits actual processor access by weight or logical processor count regardless of demand. Soft capping allows short bursts above the line as long as the four hour average stays controlled, so it protects responsiveness while still holding the peak that sets the bill.

The worked saving

One limit, one month.

An uncapped LPAR lets its four hour average climb wherever the busiest window takes it. Because the sub-capacity bill tracks that peak per product, the busiest four hours of the month set the charge for the whole month. A defined capacity limit pulls the billed peak down to the line you choose. Take an LPAR whose uncapped peak reaches 480 MSU but whose work could be held at 400 with no material service impact.

StateBillable peakWhat the cap does
Uncapped480 MSUPeak set by the busiest window
Defined capacity at 400400 MSUFour hour average held at the line
Billable MSU removed80 MSUAcross every sub-capacity product on the LPAR

Illustrative MSU figures. The cap reduces the billed peak for every sub-capacity product running on that LPAR, so the effect compounds across the stack.

The 80 MSU removed is not 80 MSU of lost work. It is 80 MSU of peak that the work did not actually need to run at, recovered by holding the four hour average flat through the busy window and letting the workload smooth across it. Because every sub-capacity product on the LPAR is billed against the same controlled peak, one well placed limit moves the charge for the whole stack at once. Multiply across LPARs and products and a disciplined capping policy is frequently the largest single line of avoidable spend on an unmanaged estate.

The four capping modes

Four ways to hold the line.

Capping is not one switch. There are four modes and they trade cost control against responsiveness differently. Choosing the wrong one either leaves money on the table or starves the work.

ModeWhat it limitsBest for
Hard cap (PR/SM)Actual processor access by weightAbsolute ceilings, test partitions
Defined capacity (soft)Single LPAR 4 hour average vs MSUPer LPAR cost control with burst headroom
Group capacity (soft)Group of LPARs, summed 4 hour averagesSharing a ceiling across related LPARs
Absolute cappingCaps to the MSU limit regardless of 4HRAFirm ceilings where 4HRA enforcement is too loose

Soft modes enforce only while the long term four hour average is at or above the limit, which is what preserves short burst capacity.

The art is matching the mode to the workload. A predictable batch LPAR tolerates a tight defined capacity limit. A latency sensitive online LPAR needs burst headroom and may prefer group capacity so a quiet sibling lends it room. Absolute capping suits cases where the four hour average enforcement is too permissive and a firmer ceiling is required. The point of the matrix is that a single sitewide capping rule is almost always wrong: it overcharges the steady workloads and strangles the spiky ones. The optimization is per LPAR.

How to optimize it.

Levers on defined capacity

  • Measure the real four hour peaks per LPAR before setting any limit, so the cap lands just above genuine demand rather than on a guess
  • Cap per LPAR, not sitewide: match tight limits to predictable workloads and burst headroom to latency sensitive ones
  • Use group capacity where related LPARs can share a ceiling, letting quiet partitions lend room to busy ones
  • Pair capping with WLM policy and workload scheduling so the cap is held by moving work, not by degrading it
  • Revisit limits after any workload or hardware change: a cap set for last year's estate quietly throttles or overcharges this year's

Across 500+ engagements and $180M+ of negotiated mainframe spend, the estates that treat capping as a continuous tuning discipline pay materially less than those that set a limit once and forget it, or worse, run uncapped because nobody owns the peak. Defined capacity is the most direct lever on a sub-capacity bill. It just has to be set with data, not with caution.

Frequently asked

Q1

What is soft capping?

The use of a defined capacity limit, enforced by z/OS Workload Management with PR/SM, to hold an LPAR's billable MSU consumption at or below a set figure. It works against the rolling four hour average, so it limits the peak that sub-capacity software charges are calculated against rather than instantaneous capacity.

Q2

Soft capping versus hard capping?

Hard capping limits actual processor access by weight or logical processor count, so the LPAR can never exceed the cap. Soft capping limits the four hour average against a defined MSU figure, allowing short bursts above the cap as long as the average stays controlled. Soft capping protects responsiveness; hard capping is absolute.

Q3

What is defined capacity?

The MSU figure you set on an LPAR to soft cap it. Workload Management compares the LPAR's four hour average against that figure and limits consumption when the average reaches the limit. Group capacity applies the same idea to a group of LPARs on one machine, summing their four hour averages against a single group limit.

Q4

Does soft capping reduce cost?

It can, because the sub-capacity bill tracks the four hour peak per product and a defined capacity limit holds that peak below where an uncapped busy window would drive it. The saving is weighed against service level impact during genuine spikes, which is why the limit is a tuning decision, not a set and forget one.

Related

All licensing concepts →
01 The rolling four hour average

The window the cap holdsWhy billing tracks one four hour peak, the thing defined capacity limits. R4HA explained →

02 Group capacity limits

Share a ceilingHow a single limit across related LPARs lets quiet partitions lend room. Group capacity limits →

03 IBM MSU optimization

Put capping to workThe service that sets and tunes capping policy to cut the bill safely. IBM mainframe MSU optimization →

The cap is the lever. We set it where it cuts cost without touching service.

Get expert help

Audit notice or renewal under 18 months out? We mobilize within 48 hours.

Get expert help