IBM Z System Recovery Boost Overview
IBM Z System Recovery Boost Overview
Acrobat bookmark
Jannie Houlbjerg
Kazuhiro Nakajima
Paul Schouten
Redpaper
Introducing IBM Z System Recovery Boost
This IBM® Redpaper publication introduces System Recovery Boost (SRB), which is a
firmware feature that was introduced with IBM z15™ and enhanced on IBM z16™ and IBM
z17™. SRB delivers substantially faster system shutdown and restart, short duration
Recovery Process Boosts (RPBs) for sysplex events, and fast catch-up of an accumulated
backlog of mission-critical workloads after specific events (such as system restart).
Note: SRB is a firmware feature that is available on IBM z15, IBM z16, and IBM z17
central processor complexes (CPCs), and it requires operating system support for use. It is
not available on older systems. Always check the latest firmware and software for available
SRB functions.
IBM Z® naming: The IBM Z server generations that are described in this paper are
available as the following machine types and models (MTMs):
Machine Type 3931 (M/T 3931) Model A01, which is identified as IBM z16 Model A01
Machine Type 3932 (MT 3932) Models A02/AGZ which are identified as IBM z16
Models A02, and IBM z16 Model AGZ.
Machine Type 8561 (M/T 8561) Model T01, which is identified as IBM z15 Model T01.
Machine Type 8562 (M/T 8562) Model T02, which is identified as IBM z15 model T02.
Machine Type 9175 (M/T 9175) Model ME1, which is identified as IBM z17 model ME1.
Starting with IBM z15, System Recovery Boost can deliver higher processor capacity for a
limited time after an IPL, while shutting down a system, or during specific, short-term sysplex
recovery events. The increased capacity can be provided in one or more of the following
ways:
In an LPAR on a sub-capacity machine, by utilizing the full speed for the general purpose
processors of the boosted partition (speed boost)
Dispatching some work on z Integrated Information Processors (zIIP), even if the work is
not eligible for zIIP (zIIP boost)
Configure online zIIPs that are in the standby state for the duration of the boost period
(zIIP boost)
Note: These three basic functions of System Recovery Boost continue to be fully
supported on the IBM z15 and above. System Recovery Boost Upgrade is not offered on
IBM z17. Customers who have already purchased System Recovery Boost Upgrade on
z15 or z16 can continue to use it.
3
SRB provides boosted processor capacity to mitigate the impact on workload processing
after short-term recovery events in a sysplex, and it restores normal steady-state sysplex
operation as quickly as possible.
1 Supported operating system images that are enabled for boost and running in an LPAR.
Speed Boost
When the CPC is configured as a subcapacity model, LPARs that are running in a boost
period can access the Speed Boost.
This feature requires operating system opt-in and support. At the time of writing, IBM z/OS,
IBM z/VM, z/VSE, 21st Century VSEn , and z/TPF can use the subcapacity boost. For more
information, see “Software support” on page 17.
5
Note: Consider the following points:
Speed Boost applies to general-purpose processors (CPs) only. All other engines run at
full capacity (IFLs, zIIPs, and Internal Coupling Facilities (ICFs)).
The following subcapacity models are available:
– IBM z15 T01 subcapacity CP is available for up to 34 CPs with three subcapacity
levels, 4xx, 5xx, and 6xx, where xx is 1 - 34. Model 7xx is a full capacity CP model.
– IBM z15 T02 subcapacity CP is available for up to six CPs with 26 subcapacity
levels, A0x - Y0x, where x is 1 - 6. Model Z0x is a full capacity CP model.
– IBM z16 A01 subcapacity CP is available for up to 39 CPs with three subcapacity
levels, 4xx, 5xx, and 6xx, where xx is 1 - 39. Model 7xx is a full capacity CP model.
– IBM z17 ME1 subcapacity CP is available for up to 43 CPs with three subcapacity
levels, 4xx, 5xx and 6xx, where xx is 1 - 43. Model 7xx is a full capacity CP model.
In this example, three LPARs are defined in IBM z16 A01 model 403. In normal operation, all
work is dispatched on subcapacity CPs.
When LPAR c enters a boost period, work that is dispatched from LPAR z runs at CP7 (full
capacity). Other LPARs continue to be dispatched at CP4 (subcapacity). One boost period is
started at LPAR c shutdown, and a new boost period is started at IPL (of LPAR c). At the end
of the IPL boost period, LPAR c returns to normal operation at CP4 (subcapacity).
In this period, the system can use following processors to run a CP workload:
Entitled purchased CPs.
Entitled purchased zIIPs.
If more logical zIIPs are available and configured in the LPAR profile while in the boost period,
the images bring more logical zIIP processors online to use the extra physical zIIP capacity.
After the boost period ends, the z/OS dispatching of work on CPs versus zIIPs returns to
normal.
Important: With proper configuration, z/OS can automatically bring extra logical
processors on at the start of a boost period (based on resources availability) and
automatically take them off again at the end of the boost period.
In this example, three LPARs are defined on the IBM z16 A01 model 703 with two zIIPs. Two
zIIPs are shared between LPAR b and LPAR c.
During normal operation, only zIIP eligible work is dispatched to the zIIPs. When LPAR enters
a boost period, general-purpose work and zIIP eligible work might be dispatched to the zIIPs.
7
When the boost period ends, only zIIPs eligible work is dispatched to the zIIPs.
In a sysplex, Workload Manager (WLM) sysplex routing starts to route work away from a
system after the shutdown PROC is started to accelerate shutdown.
All z/OS and middleware processing during the shutdown boost period benefits from higher
capacity CP processors or extra parallelism that is provided by zIIPs and allows CP work to
run on zIIPs.
Figure 4 Example of a shutdown boost in a subcapacity model (IBM z16 A01 shown)
In this example, three LPARs are defined in the IBM z16 A01 model 403 with two zIIPs. Two
zIIPs are assigned to LPAR b and LPAR c. During normal operation, all CP work is dispatched
at subcapacity (CP4), but only zIIP eligible work is dispatched to zIIPs.
Firmware changes on the Hardware Management Console (HMC) and Support Elements
(SEs) support greater parallelism and performance improvements in the hardware API
services, and GDPS has changed the processing and scripting to take advantage of the
firmware improvement. GDPS itself also does more work in parallel when possible.
For z/OS and GDPS support and requirements, see “Software support” on page 17.
Notes:
All these capabilities are available for IBM z16 and IBM z17 ME1.
Some of these capabilities are available for IBM z15 with a specific (concurrently
installable) LPAR Microcode Change Level (MCL). For more information,
see IBM ResourceLink (an IBMid is required.)
At the time of writing, z/OS can exploit these features. For more information, see
“Software support” on page 17.
With enhanced SRB support, IBM is extending boost technologies to provide short-term RPB
acceleration for specific sysplex recovery events.
Currently, these process recovery events often cause short-duration workload impacts or
workload spikes while the system is busy recovering. Recovery affects the normal execution
of the client workload until recovery processing completes.
The RPB is designed to provide boosted processor capacity to mitigate short-term recovery
impacts and restore steady-state sysplex operation as quickly as possible following specific
recovery events, and to provide boosted processor capacity for a short period following
restoration of steady-state operation to help with workload catch-up from the recovery event.
When the events occur, sometimes an RPB is automatically initiated, and sometimes more
configuration or setup is required to initiate the RPB, depending on what the event is.
When an event occurs, z/OS manages the RPBs internally with the operating system initiating
the boosts as these recovery events take place, and only on the images that are affected by
these events. For certain events, additional configuration or setup is required to initiate RPB.
If RPBs happen to “overlap” (a second RPB occurs before a first one uses its entire boost
period), then the overlapping boosts are merged and the boost period may be extended to
allow the full boost period for the second recovery process. For setup considerations, see
“z/OS” on page 12.
9
z/OS supports a dynamic enable and disable capability for all the RPB candidates, which
provides more control over the applicability and usage of RPBs, perhaps at different times of
day or under different operating conditions. For more information, see “z/OS” on page 12.
Note: SVC Dump Boost is only available on IBM z16 IBM z17 ME1.
Note: Customer-identified Middleware Restart Boost is available only on IBM z16 and
IBM z17.
Note: HyperSwap Configuration Load Boost is available only on IBM z16 and IBM z17.
Operational considerations
Here are several operational considerations:
During an RPB period, WLM does not route work away from the system (as it does during
a shutdown boost) or toward the system (as it does during a startup boost). WLM ignores
short-duration recovery boosts for workload routing purposes.
When bringing reserved logical zIIP processors online and offline at the start and end of
an RPB period, z/OS limits the number of “transient” zIIPs brought online and offline
automatically to at most two (more transient zIIPs during IPL and shutdown boost periods
may be configured).
z/OS starts and ends a new IBM System Management Facility (SMF) interval during an
RPB period, but when two or more recovery processes boost “overlap”, they are merged
into a single boost period and a single SMF interval.
z/OS issues ENF signals and console messages when starting, extending, or stopping an
RPB.
For z/OS, a system command or PROC (IEABE) is provided to allow for early opt-out of an
IPL or shutdown boost if wanted.
z/OS does not permit overlap between the RPBs and the longer image-level startup and
shutdown boosts:
– RPBs are not initiated while an image-level startup boost is still in progress because
the system is already boosted.
– If an RPB is in progress when a system image-level shutdown is initiated, then z/OS
“cancels” the in-progress RPB and initiates the shutdown boost period for system
shutdown.
– If more transient zIIPs were already online during the RPB, z/OS potentially must bring
more online for the shutdown boost, up to the full quota of reserved logical zIIPs.
11
Important: The base SRB capability is built in to the firmware and does not require
ordering any other features.
z/OS
You can configure a z/OS system-level parameter (IEASYSxx) to control whether a specific
z/OS image opts in to any of the available processor boosts, as shown in the following
example:
BOOST=SYSTEM | zIIP | SPEED | NONE
You can configure the RPB - SVC Dump by using RPBMINSZ in the CHNGDUMP parameter. For
more information, see Managing Recovery Process Boost.
You can configure the RPB - Customer-identified Middleware Restart Boost through your
WLM Service Definition. WLM specification is required to designate any Middleware Restart
Boost for started tasks (STCs) only. The boost is identified in WLM Classification Rules, with
Boost=Y | N.
For all the RPB candidates, you can configure these boosts as ENABLED or DISABLED by using
the IEASRB PROC, as shown in the following example:
S IEASRB,[CLASS=RP,]REQ=DISABLE|D to dynamically disable Recovery Process boost
S IEASRB,[CLASS=RP,]REQ=ENABLE|E to dynamically enable Recovery Process boost
For more information, see z/OS 2.5: System Recovery Boost and z/OS 3.1 System Recovery
Boost.
If you want to use offline zIIPs or extra zIIPs that are provided by the SRB record, you must
define reserved zIIPs in the image profile, as shown in Figure 5.
You also should review LPAR weights and storage allocation to ensure that they meet your
system requiremenCentury s.
z/VM
SRB is enabled by default. It can be controlled by the FEATURES statement in the system
configuration file, and it can be ENABLED and DISABLED, as shown in the following example:
FEATURES ENABLE RECOVERY_BOOST
FEATURES ENABLE RECOVERY_BOOST
z/OS
The D IPLINFO,BOOST,STATE command shows an image’s current boost state, both for startup
and shutdown boosts, and the RPBs, including details such as time used, remaining time,
and other details within the last 24-hour window, as shown in Example 1.
In addition, the DISPLAY M=CPU command is enhanced with the following parameters (see
Example 2):
I indicates zIIPs.
B indicates (transient) boost zIIPs. This CPU was configured online at the start of the boost
period, and is configured offline when the boost ends.
13
Example 2 CPU information for transient zIIPs
SY1 IEE174I 09.58.10 DISPLAY M 328
PROCESSOR STATUS
ID CPU SERIAL
00 + 0449D74381
01 + 1449D74381
02 +I 2449D74381
03 +B 3449D74381
04 +I 4449D74381
DISPLAY DUMP,OPTIONS can provide information about the RPBMINSZ threshold value that is
set to control boosts.
A number of IEA-messages are introduced with SRB. Messages are issued when a boost is
started and when a boost ends. These messages include information such as which type of
boost (Shutdown, IPL, or RPB) is activated. In an RPB situation, more information is
available, such as requester name (for example, ‘Sysplex partitioning’). Messages also are
issued if SRB cannot be activated due to an error or limits having been reached, or if SRB or
RPB is DISABLED. For more information, see z/OS 2.5: System Recovery Boost and z/OS 3.1
System Recovery Boost.
A new SMF interval starts whenever a boost starts or ends. SMF 90-40 is generated when a
boost event occurs (starting or stopping a boost). SMF 90-40 contains information about the
boost. SMF 90-40 is generated every 24 hour, and it contains summarized information about
RPB, such as enablement or disablement of state and usage. SMF types 30, 70-1, 72-3, and
89-1, 89-2, and 99-1 all receive a boost information byte that informs you whether a boost is
active, and if so, which type. During the boost, SMF records that contain capacity information,
such as 30, 70-1, 72-3, and 89, reflect the unboosted capacity of the image, not the effective
boosted capacity. For more information, see z/OS 2.5: MVS System Management Facilities
(SMF) and z/OS 3.1 MVS System Management Facilities (SMF).
HMC
When an LPAR is in the boost period, you can confirm the status of the SRB in the HMC or
SE Partition Image Details window, as shown in Figure 6 on page 15. During the boost
period, Processor Boost is shown as Active.
15
Also, the processor boost status is shown in HMC Monitors Dashboard, as shown in Figure 7.
Automation
Your automation product can be used in the following ways to automate and control SRB
activities:
Dynamically modify LPAR weights, which might be needed to modify or “skew” the sharing
of physical zIIP capacity during a boost period.
Drive the invocation of IEASDBS that indicates the beginning of a shutdown process.
Use new composite hardware API reconfiguration actions.
Control the level of parallelism that is present in the workload at start (for example, starting
middleware regions) and shutdown. Automation can pace or throttle these activities to
varying degrees. With boost, less pacing or more parallelism might be wanted.
Automate the new z/OS messages that are issued at the start or end of boost periods to
take whatever actions are needed.
Drive IEASRB proc invocations to enable or disable RPBs at certain times of day or as
otherwise needed.
Pricing
In this section, the available pricing options are described.
Software pricing
A boost should not increase a customers’ IBM software costs regardless of whether the client
is using 4HRA Pricing, Solution Pricing, or Consumption-based Pricing.
Software support
At the time of writing, the following software is supported by SRB:
IBM z/OS: V2R3, V2R4, V2R5, z/OS V3R1, and z/OS V3R2 with PTFs. Some of the RPBs
that are provided in Stage 2 have support in z/OS V2R3. The new Stage 3 RPBs require
z/OS V2R4 or later.
Note: z/OS APARs are associated with the FIXCAT for SRB (the FIXCAT name is
[Link]).
Note: IBM z16 A01 is the last IBM Z family that supports IBM zVSE 6.2.
Table 2 Image modes that are supported on IBM z15 and IBM z16
Image mode Engine type Operating system Second-level guest
(configurable) (primary)
17
Image mode Engine type Operating system Second-level guest
(configurable) (primary)
z/VM IFL, CP, zIIP, z/VM (IFL or CP) Linux on IBM Z (CP or
and ICF IFL)
CFCC (CP or ICF)
z/OS (CP and zIIP)
z/VSE (CP)
21st Century VSEn (CP)
z/TPF (CP)
Jannie Houlbjerg is a Systems Programmer working at JN Data in Denmark. She has more
than 20 years of experience in the IBM® Z field. Her areas of expertise include IBM Z
hardware and infrastructure, IBM Parallel Sysplex®, connectivity, performance, IBM GDPS,
and technical project management and documentation. Jannie has co-authored several
IBM Redbooks® publications.
Kazuhiro Nakajima is a Senior IT Specialist at IBM Japan. He has more than 30 years at
IBM Japan, and has been active as an advanced subject matter expert (SME) of IBM Z
products for over 20 years. His areas of expertise include IBM Z hardware, performance,
z/OS, and connectivity. Kazuhiro has co-authored several IBM Z configuration setup
IBM Redbooks publications, from IBM zEC12 to IBM z14®.
Paul Schouten is an IBM Z Client Technical Specialist who is based in Sydney, Australia.
During his 40 years supporting mainframe systems, he has performed several roles, including
Certified IT Architect, Systems Software Developer, and Systems Programming. He has
extensive experience developing and documenting high availability (HA) solutions for
IBM Enterprise customers.
The project that created this IBM Redpaper publication was managed by Octavian Lascu,
Redbooks Project Leader, Poughkeepise Center.
Find out more about the residency program, browse the residency index, and apply online at:
[Link]/redbooks/[Link]
19
Stay connected to IBM Redbooks
Look for us on LinkedIn:
[Link]
Explore new Redbooks publications, residencies, and workshops with the IBM Redbooks
weekly newsletter:
[Link]
Stay current on recent Redbooks publications with RSS Feeds:
[Link]
This information was developed for products and services offered in the US. This material might be available
from IBM in other languages. However, you may be required to own a copy of the product or product version in
that language in order to access it.
IBM may not offer the products, services, or features discussed in this document in other countries. Consult
your local IBM representative for information on the products and services currently available in your area. Any
reference to an IBM product, program, or service is not intended to state or imply that only that IBM product,
program, or service may be used. Any functionally equivalent product, program, or service that does not
infringe any IBM intellectual property right may be used instead. However, it is the user’s responsibility to
evaluate and verify the operation of any non-IBM product, program, or service.
IBM may have patents or pending patent applications covering subject matter described in this document. The
furnishing of this document does not grant you any license to these patents. You can send license inquiries, in
writing, to:
IBM Director of Licensing, IBM Corporation, North Castle Drive, MD-NC119, Armonk, NY 10504-1785, US
This information could include technical inaccuracies or typographical errors. Changes are periodically made
to the information herein; these changes will be incorporated in new editions of the publication. IBM may make
improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time
without notice.
Any references in this information to non-IBM websites are provided for convenience only and do not in any
manner serve as an endorsement of those websites. The materials at those websites are not part of the
materials for this IBM product and use of those websites is at your own risk.
IBM may use or distribute any of the information you provide in any way it believes appropriate without
incurring any obligation to you.
The performance data and client examples cited are presented for illustrative purposes only. Actual
performance results may vary depending on specific configurations and operating conditions.
Information concerning non-IBM products was obtained from the suppliers of those products, their published
announcements or other publicly available sources. IBM has not tested those products and cannot confirm the
accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the
capabilities of non-IBM products should be addressed to the suppliers of those products.
Statements regarding IBM’s future direction or intent are subject to change or withdrawal without notice, and
represent goals and objectives only.
This information contains examples of data and reports used in daily business operations. To illustrate them
as completely as possible, the examples include the names of individuals, companies, brands, and products.
All of these names are fictitious and any similarity to actual people or business enterprises is entirely
coincidental.
COPYRIGHT LICENSE:
This information contains sample application programs in source language, which illustrate programming
techniques on various operating platforms. You may copy, modify, and distribute these sample programs in
any form without payment to IBM, for the purposes of developing, using, marketing or distributing application
programs conforming to the application programming interface for the operating platform for which the sample
programs are written. These examples have not been thoroughly tested under all conditions. IBM, therefore,
cannot guarantee or imply reliability, serviceability, or function of these programs. The sample programs are
provided “AS IS”, without warranty of any kind. IBM shall not be liable for any damages arising out of your use
of the sample programs.
The following terms are trademarks or registered trademarks of International Business Machines Corporation,
and might also be trademarks or registered trademarks in other countries.
Db2® IBM z16™ z/VM®
GDPS® Parallel Sysplex® z/VSE®
HyperSwap® Redbooks® z15™
IBM® Redbooks (logo) ® z16™
IBM Z® System z®
IBM z14® z/OS®
The registered trademark Linux® is used pursuant to a sublicense from the Linux Foundation, the exclusive
licensee of Linus Torvalds, owner of the mark on a worldwide basis.
Other company, product, or service names may be trademarks or service marks of others.
REDP-5563-02
ISBN 0738460796
Printed in U.S.A.
®
[Link]/redbooks