PCI Manual
PCI Manual
SA23-1323-03
PCI-X SCSI RAID Controller
SA23-1323-03
Note
Before using this information and the product it supports, be sure to read the general information
under ″Product Warranties and Notices″ included with your system unit.
iii
Chapter 5. PCI-X SCSI RAID Controller Maintenance . . . . . . . . . . . . . . . . . . 31
Do’s and Don’ts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Updating the PCI-X SCSI RAID Controller Microcode . . . . . . . . . . . . . . . . . . . 32
Changing pdisks to hdisks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Rechargeable Battery Maintenance . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Displaying Rechargeable Battery Information . . . . . . . . . . . . . . . . . . . . . 33
Forcing a Rechargeable Battery Error . . . . . . . . . . . . . . . . . . . . . . . . 34
Replacing the Rechargeable Cache Battery Pack . . . . . . . . . . . . . . . . . . . . 35
Separating a Removable Cache Card From the Base Card on Type 2780 and 571E Cards . . . . . 38
Attach the Removable Cache Card to the Base Card . . . . . . . . . . . . . . . . . . 45
Replacing the Cache Directory Card . . . . . . . . . . . . . . . . . . . . . . . . . 49
Separating the 571F/575B Card Set and Moving the Cache Directory Card . . . . . . . . . . . 52
Replacing pdisks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
Recovery Procedures for Disk Failures . . . . . . . . . . . . . . . . . . . . . . . . 59
RAID Level 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
RAID Level 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
RAID Level 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
RAID Level 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Contents v
vi PCI-X SCSI RAID Controller: Reference Guide for AIX
Safety Information
DANGER
An electrical outlet that is not correctly wired could place hazardous voltage on metal parts of
the system or the devices that attach to the system. It is the responsibility of the customer to
ensure that the outlet is correctly wired and grounded to prevent an electrical shock.
Before installing or removing signal cables, ensure that the power cables for the system unit
and all attached devices are unplugged.
When adding or removing any additional devices to or from the system, ensure that the power
cables for those devices are unplugged before the signal cables are connected. If possible,
disconnect all power cables from the existing system before you add a device.
Use one hand, when possible, to connect or disconnect signal cables to prevent a possible
shock from touching two surfaces with different electrical potentials.
During an electrical storm, do not connect cables for display stations, printers, telephones, or
station protectors for communication lines.
vii
viii PCI-X SCSI RAID Controller: Reference Guide for AIX
Handling Static Sensitive Devices
Attention: Static electricity can damage this device and your system unit. To avoid damage, keep this
device in its anti-static protective bag until you are ready to install it. To reduce the possibility of
electrostatic discharge, follow the precautions listed below:
v Limit your movement. Movement can cause static electricity to build up around you.
v Handle the device carefully, holding it by its edges or its frame.
v Do not touch solder joints, pins, or exposed printed circuitry.
v Do not leave the device where others can handle and possibly damage the device.
v While the device is still in its anti-static package, touch it to an unpainted metal part of the system unit
for at least two seconds. (This drains static electricity from the package and from your body.)
v Remove the device from its package and install it directly into your system unit without setting it down. If
it is necessary to set the device down, place it on its static-protective package. (If your device is an
adapter, place it component-side up.) Do not place the device on your system unit cover or on a metal
table.
v Take additional care when handling devices during cold weather, as heating reduces indoor humidity
and increases static electricity.
ix
x PCI-X SCSI RAID Controller: Reference Guide for AIX
About This Book
This book provides usage and maintenance information regarding the PCI-X SCSI RAID Controller.
Chapters 1 through 4 of this book contain general information that is intended for all users of this product.
Chapter 5 contains service information intended for a service representative specifically trained on the
system unit and subsystem being serviced. Use this book in conjunction with your specific system unit and
operating system documentation.
ISO 9000
ISO 9000 registered quality systems were used in the development and manufacturing of this product.
Highlighting
The following highlighting conventions are used in this book:
Bold Identifies commands, subroutines, keywords, files, structures, directories, and other items
whose names are predefined by the system. Also identifies graphical objects such as buttons,
labels, and icons that the user selects.
Italics Identifies parameters whose actual names or values are to be supplied by the user.
Monospace Identifies examples of specific data values, examples of text similar to what you might see
displayed, examples of portions of program code similar to what you might write as a
programmer, messages from the system, or information you should actually type.
This document may describe hardware features and functions. While the hardware supports them, the
realization of these features and functions depends upon support from the operating system. AIX provides
this support. If you are using another operating system, consult the appropriate documentation for that
operating system regarding support for those features and functions.
Related Publications
The following publications contain related information:
v System unit documentation for information specific to your hardware configuration
v RS/6000 Eserver pSeries Adapters, Devices, and Cable Information for Multiple Bus Systems, order
number SA38-0516, contains information about adapters, devices, and cables for your system.
v RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems, order number SA38-0509,
contains diagnostic information, service request numbers (SRNs), and failing function codes (FFCs).
Available on the internet at [Link]
v AIX System Management Guide: Operating System and Devices and AIX Installation Guide and
Reference, available on the internet at [Link]
v The RAIDbook: A Handbook of Storage Systems Technology, Edition 6, Editor: Paul Massiglia
xi
Trademarks
The following terms are trademarks of International Business Machines Corporation in the United States,
other countries, or both:
v AIX
v AIX 5L
v Eserver
v PowerPC
v pSeries
v RS/6000
Other company, product, and service names may be trademarks or service marks of others.
Note: This guide refers to various hardware and software features and functions. The realization of these
features and functions depends on the limitations of your hardware and software. AIX supports all
functions mentioned. If you are using another operating system, consult the appropriate
1
documentation for that operating system regarding support for the mentioned features and
functions. This guide also uses the terms channel and bus interchangeably.
Disk Arrays
RAID technology is used to store data across a group of disks known as a disk array. Depending on the
RAID level selected, this storage technique provides the data redundancy required to keep data secure
and the system operational. If a disk failure occurs, the disk can usually be replaced without interrupting
normal system operation. Disk arrays also have the potential to provide higher data transfer and input and
output (I/O) rates than those provided by single large disks.
Each disk array can be used by AIX in the same way as it would a single non-RAID disk. For example,
after creating a disk array, you can create a file system on the disk array or use AIX commands to make
the disk array available to the system by adding the disk array to a volume group.
The PCI-X SCSI RAID Controller is managed by the PCI-X SCSI Disk Array Manager. The PCI-X SCSI
Disk Array Manager serves as the interface to the RAID configuration. It is also responsible for the
monitoring and recovery features of the controller.
If a disk array is to be used as the boot device, it may be required to prepare the disks by booting from
the Diagnostic CD and creating the disk array before installing AIX. You might want to perform this
procedure when the original boot drive is to be used as part of a disk array.
RAID level 0 offers a high potential I/O rate, but it is a nonredundant configuration. As a result, there is no
data redundancy available for the purpose of reconstructing data in the event of a disk failure. There is no
error recovery beyond what is normally provided on a single disk. Unlike other RAID levels, the array
controller never marks a RAID level 0 array as Degraded as the result of a disk failure. If a physical disk
fails in a RAID level 0 disk array, the disk array is marked as Failed. All data in the array must be backed
up regularly to protect against data loss.
If a disk fails in a RAID level 5 array, you can continue to use the array normally. A RAID level 5 array
operating with a single Failed disk is said to be operating in Degraded mode. Whenever data is read from
a Degraded disk array, the array controller recalculates the data on the Failed disk by using data and
parity blocks on the operational disks. If a second disk fails, the array will be placed in the Failed state and
will not be accessible.
RAID Level 6
RAID level 6 stripes data across all disks in the array. RAID level 6 also writes array “P” and “Q” parity
data. The P and Q parity data, which is based on Reed Solomon algorithms, is spread across all the
disks. For a RAID level 6 array of four disks, array data and parity information are written in the following
pattern:
If one or two disks fail in a RAID level 6 array, you can continue to use the array normally. A RAID level 6
array operating with a one or two Failed disks is said to be operating in Degraded mode. Whenever data
is read from a Degraded disk array, the array controller recalculates the data on the Failed disk(s) by using
data and parity blocks on the operational disks. A RAID level 6 array with a single Failed disk has similar
protection to that of a RAID level 5 array with no disk failures. If a third disk fails, the array will be placed
in the Failed state and will not be accessible.
RAID level 10 tolerates multiple disk failures. If one disk in each mirrored pair fails, the array will still be
functional, operating in Degraded mode. You can continue to use the array normally because for each
Failed disk, the data is stored redundantly on its mirrored pair. However, if both members of a mirrored
pair fail, the array will be placed in the Failed state and will not be accessible.
When a RAID level 10 disk array is created, the controller will automatically attempt to select the disks for
each mirrored pair from a different SCSI bus. For example, if four disks selected for the disk array reside
on one of the controller’s SCSI buses and another four disks selected reside on another of the controller’s
SCSI buses, the controller will automatically attempt to create each mirrored pair from one disk on each
SCSI bus. In the event of a SCSI bus failure, each mirrored pair will continue to operate in a Degraded
mode.
Note: If disks of different capacities are used in the same array, all disks are treated as if they have the
capacity of the smallest disk.
RAID Level Data Redundancy Usable Disk Read Performance Write Performance
Capacity
RAID 0 None 100% Very Good Excellent
RAID 5 Very Good 67% to 94% Very Good Good
RAID 6 Excellent 50% to 89% Very Good Fair to Good
RAID 10 Excellent 50% Excellent Very Good
RAID Level 0
Does not support data redundancy, but provides a potentially higher I/O rate.
RAID Level 5
Creates array parity information so that the data can be reconstructed if a disk in the array fails.
Provides better capacity than RAID level 10 but possibly lower performance.
RAID Level 6
Creates array “P” and “Q” parity information, using Reed Solomon algorithms, so that the data can
be reconstructed if one or two disks in the array fail. Provides better data redundancy than RAID
level 5 but with slightly lower capacity and possibly lower performance. Provides better capacity
than RAID level 10 but possibly lower performance.
RAID Level 10
Stores data redundantly on mirrored pairs to provide maximum protection against disk failures.
Provides generally better performance than RAID level 5 or 6, but has lower capacity.
Stripe-Unit Size
With RAID technology, data is striped across an array of physical disks. This data distribution scheme
complements the way the operating system requests data. The granularity at which data is stored on one
disk of the array before subsequent data is stored on the next disk of the array is called the stripe-unit
size. The collection of stripe units, from the first disk of the array to the last disk of the array, is called a
stripe.
You can set the stripe-unit size of a PCI-X SCSI Disk Array to 16 KB, 64 KB, or 256 KB. You may be able
to maximize the performance of your PCI-X SCSI Disk Array by setting the stripe-unit size to a value that
PCI-X SCSI Disk Arrays are assigned names using the hdisk form, the same as other disk storage units in
AIX. These names are automatically assigned whenever you create a disk array. The names are deleted
when you delete the disk array. The individual physical disks that comprise disk arrays or serve as
candidates to be used in disk arrays are represented by pdisk names. A pdisk is a disk that is formatted to
522 bytes/sector. Disks that are formatted to 512 bytes/sector are assigned names using the hdisk form.
These disks must be formatted to 522 bytes/sector before they can be used in disk arrays. For additional
information about formatting 512 bytes/sector hdisks to make them available for use in PCI-X SCSI Disk
Arrays, see “Preparing Disks for Use in PCI-X SCSI Disk Arrays” on page 16.
The List PCI-X SCSI Disk Array Configuration option in the PCI-X SCSI Disk Array Manager can be
used to display these pdisk and hdisk names, along with their associated location codes. For details
regarding how to view the disk array information, see “Viewing the Disk Array Configuration” on page 17.
The following sample output is displayed when the List PCI-X SCSI Disk Array Configuration option is
invoked.
-----------------------------------------------------------------------------
Name Location State Description Size
-----------------------------------------------------------------------------
sisioa1 30-60 Available PCI-XDDR Dual Channel U320 SCSI RAID Adapter
The third column in the preceding output displays the state of the device.
pdisk Descriptions
The fourth column in the preceding output is a description of the device. For an array, the description
indicates the RAID level of the array. The description of a pdisk indicates whether the disk is configured as
an Array Member, Hot Spare, or an Array Candidate.
Array Member
A 522 bytes/sector pdisk that is configured as a member of an array.
Hot Spare
A 522 bytes/sector pdisk that can be used by the controller to automatically replace a Failed disk
in a Degraded RAID level 5, 6, or 10 disk array. A Hot Spare disk is useful only if its capacity is
greater than or equal to the capacity of the smallest disk in an array that becomes Degraded. For
more information about hot spare disks, see “Using Hot Spare Disks” on page 20.
Array Candidate
A 522 bytes/sector pdisk that is a candidate for becoming an Array Member or a Hot Spare.
Auxiliary Cache
The Auxiliary Write Cache adapter (AWC) provides a duplicate, non-volatile copy of write cache data of the
RAID controller to which it is connected. Protection of data is enhanced by having two battery backed
(non-volatile) copies of write cache each stored on separate adapters. If a failure occurs to the write cache
portion of the RAID controller, or the RAID controller itself fails in such a way that the write cache data is
not recoverable, the AWC adapter provides a backup copy of the write cache data to prevent data loss
during the recovery of the failed RAID controller. The cache data is recovered to the new replacement
Should a failure of either the RAID controller or the Auxiliary Cache occur, it is extremely important that the
Maintenance Analysis Procedures (MAPs) for the Service Request Numbers (SRNs) in the AIX error log
be followed precisely. Needed service information can be found in Chapter 6, “Problem Determination and
Recovery,” on page 61.
The RAID controller and the AWC adapter each require a PCI-X slot and are required to be in the same
partition. The two adapters are connected by an internal SCSI connection. This dedicated SCSI bus runs
in Ultra4 mode (U320).
If the AWC adapter itself fails or the link between the two adapters fails, the RAID controller will stop
caching operations, destage existing write cache data to disk, and run in a performance degraded mode.
Once the AWC is replaced or the link is reestablished, the RAID controller will automatically recognize the
AWC, synchronize the cache area, resume normal caching function, and resume writing the duplicate
cache data to the AWC.
The AWC adapter is typically used in conjunction with RAID protection. RAID functions are not affected by
the attachment of an AWC. Because the AWC does not control other devices over the bus and
communicates directly with it’s attached RAID controller directly over a dedicated SCSI bus, it has little, if
any, performance impact on the system.
The AWC adapter may be directly mated with its associated storage I/O adapter (i.e. 571F/575B adapters)
or the AWC adapter may be standalone (i.e. 574F) with a SCSI cable connecting it to its associated
storage I/O adapter (i.e. 2780 or 571E).
Installation
The following is the required Order of Installation when creating an Auxiliary Cache environment. It is
critical to pay close attention to the information contained below and follow the steps exactly as written to
avoid problems during installation.
+--------------------------------------------------------------------------------–
| COMMAND STATUS
|
|Command: OK stdout: yes stderr: no
|
|Before command completion, additional instructions may appear below.
|
|-----------------------------------------------------------------------
|Name Location State Description Size
|-----------------------------------------------------------------------
|sisioa5 0J-08 Available PCI-XDDR Auxiliary Cache Adapter
| scsi20 0J-08-01-06,0 Linked Target on remote adapter is 03-07,0
|
|
+--------------------------------------------------------------------------------–
Software for the PCI-X SCSI RAID Controller is packaged in installp format and distributed as part of the
base AIX installation media, AIX update media, and through the web-based Fix Delivery Center for AIX.
This information is an overview of the AIX software support required for the PCI-X SCSI RAID Controller.
For complete information related to the installation and maintenance of AIX, refer to the AIX Installation
Guide and Reference.
The PCI-X SCSI RAID Controller executes onboard microcode. The AIX command lsmcode can be used
to determine the level of onboard microcode being used by the controller. Although a version of controller
microcode may be distributed along with AIX, this does not necessarily represent the most recent version
of microcode available for the controller. For more information regarding obtaining and downloading new
PCI-X SCSI RAID Controller microcode, see “Updating the PCI-X SCSI RAID Controller Microcode” on
page 32.
Attention: Ensure the adapters are updated with the latest microcode from the code download website as part of
the initial installation. Go to [Link]
To verify that the device support package for the PCI-X SCSI RAID Controller is installed, type as an
example:
lslpp -l [Link].14107802*
13
Output from this command will indicate if device support software for the PCI-X SCSI RAID Controller is
installed, and if so, what the corresponding levels of each fileset are.
If the output indicates that no filesets of this name are installed, you must install the appropriate package
in order for the PCI-X SCSI RAID Controller to be made available for use. This software package is
available as part of the base AIX installation media, AIX update media, and through the web-based Fix
Delivery Center for AIX.
Over time, it may become necessary to install software updates in order to have the very latest available
level of device software support for the PCI-X SCSI RAID Controller. Updates to the device support
software are packaged, distributed, and installed through the same mechanisms used for other portions of
the AIX base operating system. The standard AIX technical support procedures can be used to determine
the latest available level of device software support for the PCI-X SCSI RAID Controller.
You can also use a SMIT fast path to start the PCI-X SCSI Disk Array Manager. On the AIX command line,
type smit pxdam, and press Enter.
If a disk array is to be used as the boot device, you might need to prepare the disks by booting from the
Diagnostic CD and creating the disk array before installing AIX. You might want to perform this procedure
when the original boot drive is to be used as part of a disk array.
To start the PCI-X SCSI Disk Array Manager from Diagnostics, do the following:
1. Start Diagnostics and on the Function Selection screen, select Task Selection.
2. Select RAID Array Manager and press Enter.
3. Select PCI-X SCSI Disk Array Manager and press Enter.
15
Preparing Disks for Use in PCI-X SCSI Disk Arrays
Before a disk can be used in a PCI-X SCSI Disk Array, it must be a 522 bytes/sector Array Candidate
pdisk. To create an Array Candidate pdisk and format it to 522 byte sectors, do the following:
1. Start the PCI-X SCSI Disk Array Manager by following the steps in “Using the PCI-X SCSI Disk Array
Manager” on page 15.
2. Select Create an Array Candidate pdisk and Format to 522 Byte Sectors.
3. Select the appropriate PCI-X SCSI RAID Controller.
4. Select the disks that you want to prepare for use in the PCI-X SCSI Disk Arrays.
Attention: Continuing with this option will format the disks. All data on the disks will be lost. Some
disks require that their microcode be updated to the latest level before being formatted to 522 byte
sectors. Also, some disks do not support being formatted to 522 byte sectors. The PCI-X SCSI Disk
Array Manager will inform the user of these situations when they exist.
5. A message will display asking if you want to continue. To proceed with the format, select OK or press
Enter to continue. To return to the previous menu without formatting the disks, select Cancel.
After the formatting is complete, the disks will be Array Candidate pdisks and will be ready for use in
PCI-X SCSI Disk Arrays. This operation will also zero all the data on the disks. The PCI-X SCSI RAID
Controller keeps track of the disks that have their data zeroed. These Zeroed Array Candidate pdisks can
be used to create a disk array that will be immediately protected against disk failures, and they are the
only disks that can be added to an existing disk array. An Array Candidate pdisk will lose its Zeroed state
after it has been used in an array or is unconfigured. It will also lose its Zeroed state after the system has
been rebooted. To return an Array Candidate pdisk to the Zeroed state, follow the steps previously
described in this section for preparing disks for use in disk arrays.
7. A SMIT Dialog Screen summarizes your selections. Press Enter to create the array.
You can now add the disk array to a volume group. Logical volumes and file systems can also be created.
Use standard AIX procedures to perform these tasks, and treat the array in the same way that you would
treat any hdisk.
------------------------------------------------------------------------
Name Location State Description Size
------------------------------------------------------------------------
sisioa1 30-60 Available PCI-XDDR Dual Channel U320 SCSI RAID Adapter
The controller’s name, location, status, and description are displayed first. Each PCI-X SCSI disk array
hdisk is displayed with its Array Member pdisks directly underneath it.
v The first column of output is the name of the disk array (hdisk) or physical disk (pdisk).
v The second column of output is the device’s location code. The format of the location code is
AB-CD-EF-G,H. The AB-CD fields identify the location of the PCI-X SCSI RAID Controller. The EF value
identifies the Controller’s SCSI bus that the device is attached to. For a disk array this value will be ff
which indicates a logical bus for RAID devices. The G,H values are the SCSI ID and Logical Unit
Number (LUN) of the device.
v The third column of the above output is the state of the disk array or pdisk. For information about the
possible disk array and pdisk states, see “Disk Array Overview” on page 9. For 512 byte/sector
standalone disks (hdisks), this column is the AIX device state (for example, Available or Defined).
v The fourth column is a description of the device. For a disk array, the description is the RAID level of
the array. For a pdisk, the description can be Array Candidate, Hot Spare, or Array Member.
Array Candidate pdisks and Hot Spare pdisks are displayed at the bottom of this screen. The pdisk names
are displayed, along with location, state, description, capacity, and long-running command status. Any 512
bytes/sector standalone disks (hdisks) are displayed, along with location, state, description, and capacity.
To preserve the data on the disk array, you must first back up all files in the logical volumes and file
systems on the disk array before removing the disk array from its volume group. To delete the array, do
the following:
1. Start the PCI-X SCSI Disk Array Manager by following the steps in “Using the PCI-X SCSI Disk Array
Manager” on page 15.
2. Select Delete a PCI-X SCSI Disk Array.
3. Select the PCI-X SCSI RAID Controller.
4. Select the disk array to delete.
When the disk array has been deleted, any Active Array Member pdisks will become Active Array
Candidate pdisks.
A screen summarizes your selections and indicates the performance penalty associated with this
option.
6. Press Enter to add the disks to the array. To enable higher level components in the system to use the
increased capacity of the disk array, additional steps might be needed.
The disk state changes to Hot Spare. On subsequent disk failures, reconstruction of Failed disks will occur
automatically for RAID level 5, 6, and 10 disk arrays.
Note: If there is a Degraded disk array at the time that a Hot Spare is configured, reconstruction of the
Failed disk begins automatically.
[Entry Fields]
SCSI Adapter scsi8
Description PCI-X U320 SCSI RAID >
Status Available
Location 30-60-00
Adapter card SCSI ID [7] +#
Wide bus enabled yes +
Maximum SCSI Bus Speed in MB/s 320 +
QAS (Quick Arbitration) Capability Enable +
Apply change to DATABASE only no +
[Entry Fields]
RAID Array hdisk15
Description SCSI RAID 5 Disk Arra>
Status Available
Location 30-60-ff-0,1
Serial Number 12D0E934
Physical volume identifier none
Queue DEPTH 16
Size in Megabytes 109200
RAID Level 5
Stripe Size in KB 64
v The Physical volume identifier field is a unique value assigned to the hdisk if the disk array is a
member of a volume group. If the disk array is not a member of a volume group, this field value is none.
v The Queue DEPTH field is the depth of the command queue used for this disk array.
v The Size in Megabytes field represents the usable capacity of the disk array. For information about
calculating capacities for each RAID level, see “Supported RAID Levels” on page 3.
v The RAID Level field is the level of protection chosen for this array.
v The Stripe Size in KB field is the number of contiguous kilobytes which will be written to a single disk
before switching to the next disk in the disk array. It provides the host with a method to tune data
striping according to the typical I/O request size.
You cannot change any of the attributes on this screen. The RAID level and stripe size must be specified
when the array is created.
[Entry Fields]
Disk pdisk3
Description Physical SCSI Disk Drive
Status Available
Location 30-60-00-10,0
Serial Number E3V1W93B
Vendor and Product ID IBM IC35L036UCDY10>
Service Level
Size in Megabytes 60400
Format Timeout in minutes [120] +#
Use of the High Availability feature requires adapter and AIX software support. The adapters which support
this High Availability feature are identified in Table 1 on page 1. The AIX software support levels required
for the High Availability feature are identified in Table 2 on page 13. These adapters support being either in
a single adapter configuration or as part of this dual adapter (High Availability) configuration. The adapter
firmware will determine the correct operating mode based upon the presence or absence of another
compatible adapter, and will automatically change the operating mode as needed. No mode jumpers or
special configuration settings are needed to configure the adapter.
This function is only supported for connections to disk arrays between two systems or partitions. It is not
supported for connections to disk arrays from two adapters in the same partition or system.
One adapter functions as the primary adapter and performs direct management of the physical devices.
The second adapter serves as a client of the primary adapter and does not perform reads or writes to
devices directly. Commands directed to disk arrays through the secondary adapter are forwarded through
the SCSI bus to the primary adapter for execution. The secondary adapter detects if the primary adapter
goes offline and transitions itself to become the primary adapter. When the original primary adapter comes
back online, it now serves as the secondary adapter, unless it was previously designated as the preferred
primary adapter.
When in a High Availability configuration, the adapter behavior is altered to enable easier transition of the
disk arrays between adapters in the event of an adapter failure (or for other causes). Write cache on the
adapters is disabled in a High Availability configuration which impacts performance. Also, in a High
Availability configuration, parity footprints are kept in reserved sectors of the device in addition to
nonvolatile adapter memory. Both of these are different than the typical adapter behavior and are
applicable only if two adapters are present. The adapter will automatically switch its behavior as needed
based on the current configuration.
25
Comparison of JBOD (512 byte/sector standalone disks (hdisks))
versus Disk Arrays (522 bytes/sector disks (pdisks)) in High
Availability Configurations
JBOD (512 byte/sector standalone disks (hdisks))
The interconnection cable, which has VHDCI connectors, is provided with the external subsystem.
C1 C0
T1 C1
S T
A A
T0 C0 T T T
S
T T
The interconnection cable, which has VHDCI connectors, is provided with the external subsystem.
C1 C0 C0 C1
C1 C1
T1 S T
A A
T0 C0 T T C0 T
Primary Secondary
S
T T
Disk Arrays
Performance Considerations
Write cache on the PCI-X SCSI Raid adapters is disabled in an High Availability configuration. The
performance impacts from the lack of write cache function will be greater when running Raid-5 or Raid-6
arrays. Performance impacts will be somewhat less if running Raid-0 or Raid-10 arrays. Disk array access
via the secondary adapter must be routed through the primary adapter which will also affect read/write
performance. Additionally, Raid parity footprints being kept/updated on disk for Raid 5,6,10 arrays, will
cause performance impacts.
Installation
The following is the required Order of Installation when creating a Multi-Initiator High-Availability
environment. It is critical to pay close attention to the information contained below and follow the steps
exactly as written to avoid problems during installation.
1. Before proceeding with physically installing the high availability configuration, you must update to the
latest adapter microcode from the code download website, and to the required levels of both the AIX
Note: Some disk enclosures may require a power cycle for the changes to take affect.
6. Power on system or partition and verify proper port cabling, and the function of adapters and disk
arrays. Refer to “Viewing the Disk Array Configuration” on page 17. The output displayed will be similar
to the following. - Verify that like bus numbered busses are connected on the primary and remote
(secondary) adapters and the SCSI IDs are set to 05 and 06 on each adapter as highlighted below.
COMMAND STATUS
-----------------------------------------------------------------------
Name Location State Description Size
-----------------------------------------------------------------------
sisioa0 1V-08 Primary PCI-X Dual Channel U320 SCSI RAID Adapter
scsi4 1V-08-00-06,0 Linked Target on remote adapter is 00-05,0
scsi5 1V-08-01-06,0 Linked Target on remote adapter is 01-05,0
[Entry Fields]
SCSI Adapter sisioa0
Description PCI-XDDR Quad Channel>
Status Available
Location 1A-08
Operating mode Primary Adapter
Preferred operating mode [No Preference]
Link Operational No
Remote Adapter Vendor and Product ID
Remote Adapter Serial Number
8. Any new disk arrays can now be created from the primary adapter. See “Creating a PCI-X SCSI Disk
Array” on page 16. If new disk arrays are created, the configuration manager must be run on the
system or partition that contains the secondary adapter in order for it to recognize the new disk arrays.
9. If any configuration changes were made during the installation, for example in the previous step, or are
made at a later time, the configuration manager must be run on the system or partition containing the
secondary adapter in order for it to recognize the new configuration.
Note: pdisks that are a Failed member of a Degraded disk array can be replaced and the disk array
reconstructed while the system continues to run.
v You can physically move pdisks from one adapter to another. However, if the pdisks are members of a
disk array, be sure to move all the disks as a group. Prior to attempting a disk movement, ensure that
the disk array is not in a Degraded state because of a disk failure.
v When physically removing pdisks that are members of a disk array and there is no need to preserve
data and no intent to use the disk array again, delete the disk array before removing the disks. This
action avoids disk-array related problems the next time that these disks are used.
v Always use the SCSI and SCSI RAID Hot Plug Manager to remove and replace a pdisk. For
instructions on how to remove and replace a disk, see “Replacing pdisks” on page 58.
v If a disk array is being used as a boot device and the system fails to boot because of a suspected disk
array problem, boot using the Standalone Diagnostic media. Error Log Analysis, AIX error logs, the
PCI-X SCSI Disk Array Manager, and other tools are available on the Standalone Diagnostics to help
determine and resolve the problem with the disk array.
v Do not attempt to correct problems by swapping adapters and disks unless you are directed to do so by
the service procedures. Use Error Log Analysis to determine what actions to perform, and when
appropriate, follow the appropriate MAPs for problem determination. If multiple errors occur at
approximately the same time, look at them as a whole to determine if there is a common cause. For
additional information regarding problem determination, see Chapter 6, “Problem Determination and
Recovery,” on page 61.
v Do not confuse the Cache Directory Card, which is a small rectangular card with a round button-shaped
battery, for a cache card. The nonvolatile write cache memory is integrated into the main adapter card
or integrated into the removable cache card on the newest adapters. The memory itself is not
removable, and is battery-backed by the large, rechargeable Cache Battery Pack. The Cache Directory
Card contains only a secondary copy of the write cache directory, no cache data, and should never be
removed except under very specific cases as described in the MAPs.
v Do not unplug or exchange a Cache Battery Pack without following the procedures as outlined in this
section or in the MAPs.
v When invoking diagnostic routines for a PCI-X SCSI RAID Controller, it is recommended that Problem
Determination (PD) mode be used instead of System Verification (SV) mode unless there is a specific
reason to use SV mode (for example, you were directed to run SV mode by a MAP).
v After diagnostic routines for a PCI-X SCSI RAID Controller are run in SV mode, it is recommended to
run diagnostics in PD mode to ensure that new errors are analyzed. These actions should be performed
especially when using Standalone Diagnostic media.
31
Updating the PCI-X SCSI RAID Controller Microcode
To determine if an update is needed for your PCI-X SCSI RAID controller, follow the directions at:
[Link] If updates are needed, download
instructions are also located at that Web address.
Note: For either an Auxiliary Cache adapter or a storage I/O adapter which attaches to an Auxiliary Cache
adapter, it is recommended that both the storage I/O adapter and Auxiliary Cache adapter be
updated.
Note: pdisks which are members of a disk array or are hot spares cannot be changed to standalone
hdisks.
Rechargeable battery maintenance tasks described in this section include displaying rechargeable battery
information, forcing a rechargeable battery error, and replacing the rechargeable Cache Battery Pack.
The following are the fields displayed on the rechargeable battery information screen:
RAID Adapter
The name of the selected controller.
Battery Type
The type of rechargeable Cache Battery Pack.
Battery State
Indicates if an error condition currently exists related to the rechargeable Cache Battery Pack. The
possible values for this field are:
No battery warning/error
No warning or error condition currently exists.
Warning condition
A warning condition currently exists and an error has been logged.
Error condition
An error condition currently exists and an error has been logged.
Note: Some rechargeable Cache Battery Packs are negatively affected by higher temperatures
and thus are prorated based on the amount of time that they spend at various ambient
temperatures.
Estimated time to warning (days)
Estimated time, in units of days, until a message is issued indicating that the replacement of the
rechargeable Cache Battery Pack should be scheduled.
Estimated time to error (days)
Estimated time, in units of days, until an error is reported indicating that the rechargeable Cache
Battery Pack must be replaced.
Concurrently maintainable battery pack
Indicates that the rechargeable Cache Battery Pack can be replaced while the adapter continues
to operate.
Battery pack can be safely replaced
Indicates that the adapter’s write cache has been disabled and the rechargeable Cache Battery
Pack can be safely replaced.
Ensure that you have the correct type and quantity of Cache Battery Packs to do the replacement. To
resume normal operations, replace the Cache Battery Pack.
The Cache Battery Pack for 571F storage I/O adapter and 575B Auxiliary Cache adapter is contained in a
single battery FRU that is physically located on the 575B Auxiliary Cache adapter. The functions of forcing
a battery pack error and starting adapter cache on either adapter in the card set will result in the same
function automatically being performed on the other adapter in the card set.
Note: Using this option places the battery into the error state, which requires it to be replaced.
6. Press Enter.
7. Replace the Cache Battery Pack as described in “Replacing the Rechargeable Cache Battery Pack.”
Note: When replacing the Cache Battery Pack, the battery must be disconnected for at least 60 seconds
before connecting the new battery. This is the minimum amount of time needed for the card to
recognize that the battery has been replaced.
CAUTION:
The battery is a nickel metal hydride or lithium ion battery. To avoid possible explosion, do not
burn the battery. Exchange it with the IBM approved part. Recycle or discard the battery as
instructed by local regulations. In the United States, IBM has a process for the collection of this
battery. For information, call 1-800-426-4333. Have the IBM part number for the battery unit
available when you call.
Attention: If the cache battery pack is not already in the error state, follow the steps described in
“Forcing a Rechargeable Battery Error” on page 34 before proceeding to prevent data loss. After the
battery is in the error state, replace it with a new battery by doing the following:
4. Remove the plug that connects the battery unit and the card.
Note: The plug fits in the board only one way so that it cannot be inserted incorrectly during the
replacement steps of this procedure.
5. Do one of the following:
v For cards that have a hard plastic casing around the battery, squeeze the casing A to remove the
battery unit from the card. Remove the battery unit from the battery casing. Save the battery casing
to reuse on the replacement battery pack.
v For batteries that are attached to the card with rivets, work from the back side of the card to remove
the two pins (1) from the inside of the rivets. Working from the front side of the card, release the
plastic rivets. A hint in releasing the plastic rivets is to use a medium retractable ball point pen, in
the retracted position, to push the rivets through the tab of the battery pack.
Note: Ensure that the Cache Battery Pack is disconnected for at least 60 seconds before
connecting the new battery. This is the minimum amount of time needed for the card to
recognize that the battery has been replaced.
RZAR6585-2
3. For all except the 571F/575B card set, remove the plug that connects the battery unit and card by
squeezing the retaining latch while gently pulling on the plug.
Note: The plug fits in the board only one way so that it cannot be inserted incorrectly during the
replacement phase.
4. Remove the battery unit by gently pulling it out.
5. Install the new Cache Battery Pack by reversing this procedure.
6. Restart the adapter’s write cache by doing the following:
v Navigate to the PCI-X SCSI Disk Array Manager by using the steps in “Using the PCI-X SCSI Disk
Array Manager” on page 15.
v Select Diagnostics and Recovery Options.
v Select Controller Rechargeable Battery Maintenance.
v Select Start Adapter Cache.
v Select your PCI-X SCSI RAID Controller whose battery you just replaced.
v Press Enter.
Attention: To avoid loss of cache data, do not disconnect the cache battery during this procedure.
All cards are sensitive to electrostatic discharge. See “Handling Static Sensitive Devices” on page ix.
Attention: This procedure should only be performed if directed from an isolation procedure or a
maintenance analysis procedure (MAP).
7. To separate the cards, turn the jack screw counterclockwise until you feel the cards separate.
Important: After separating the base card from the cache card, turn the jack screw clockwise to
reseat it. then go to “Attach the Removable Cache Card to the Base Card” on page 45.
12. Loosen screw B one full turn. Do not remove this screw.
Note: Observe the keyhole slot located below screw B. In step 15 on page 44 when asked to
separate the cache card, you will need to have the keyhole slot clear the retaining screw.
14. Turn the jack screw counterclockwise to begin separating the cards. Continue turning the jack screw
until you feel the cards separate.
16. Separate the base card from the removable cache card.
Note: After separating the base card from the cache card, turn the jack screw clockwise to reseat it.
Note: To help align the card connector, view the jack screw through the hole.
b. Press down on the stiffener plate to fully seat the connector.
c. Press down on the area of the card over the connector to fully seat the connector.
Attention: All cards are sensitive to electrostatic discharge. See “Handling Static Sensitive Devices” on
page ix.
v To replace a non-571E cache directory card, do the following:
1. Remove the card using the card remove and replace procedure for the model or expansion unit on
which you are working.
2. Locate the cache directory card B. It is a small rectangular card mounted on the I/O card (see the
following figures for your adapter).
3. Unseat the connector on the cache directory card by wiggling the two corners furthest from the
mounting peg using a rocking motion. Then, pivot the cache directory card back over the mounting
peg to disengage the cache directory card from the mounting peg.
4. Install the replacement card by seating it on the connector and mounting peg.
RZAR6586-1
IPHAU609-0
2. Loosen the retaining screw located on the cache directory card one full turn. Do not remove the
retaining screw.
3. Unseat the cache directory card from the connector. To unseat and remove the cache directory card,
pull the cache directory card in the direction of the plastic card handle. You might need to pull the
card on one side then the other to help disengage the connector. When you have disengaged the
connector, lift the cache directory card up and off the retaining screw.
4. Install the replacement cache directory card. To install the replacement cache directory card, place it
down over the retaining screw, then slide it away from the plastic card handle to fully seat it in the
connector.
Separating the 571F/575B Card Set and Moving the Cache Directory
Card
Detailed instructions for separating the card set and moving the cache directory card. To avoid loss of
cache data, you must not remove the cache battery during this procedure.
Note:
To complete this procedure, you will need a T-10 TORX bit or driver.
Attention: To avoid loss of cache data, you must not remove the cache battery during this procedure.
To separate the 571F/575B card set and move the cache directory card, do the following:
Attention: All cards are sensitive to electrostatic discharge. See “Handling Static Sensitive Devices” on
page ix before beginning this procedure.
1. Label both sides of the card before separating.
2. Go to the next step if you are not servicing a 571F/575B card set containing a light pipe assembly. If
you are servicing a 571F/575B card set that contains a light pipe assembly, you will need to remove
the light pipes. To remove the light pipes, do the following:
a. Place the 571F/575B card set adapter on an ESD protective surface.
b. Remove the light pipe retaining screw C from the 571F/575B card set.
c. Slide the light pipe assembly D from between the 571F/575B cards.
d. Put both the screw and the light pipe assembly in a safe place.
3. Place the 571F/575B card set adapter on an ESD protective surface and orient it as shown.
5. To prevent possible card damage, first loosen all 5 retaining screws C BEFORE removing them. After
all five retaining screws have been loosened, remove the screws C from the 571F storage adapter.
7. Turn the 571F storage adapter over so the components are facing up and locate the cache directory
card D on the 571F storage adapter. It is the small rectangular card mounted on the I/O card.
9. Move the cache directory card to the replacement 571F storage adapter and seat it on the connector
and mounting pegs.
10. To reassemble the cards do the above procedure in reverse order. To prevent possible card damage,
insert all 5 screws C before tightening any of them.
Note: If you are installing the 571F/575B card set adapter into a cassette, do the following:
a. Remove the adapter handle B as shown in the following illustration.
b. If applicable, remove the light-pipe assembly.
c. Remove the two plastic covers from the SCSI jumper cable A as shown in the following
figure:
Replacing pdisks
Replace Failed pdisks as soon as possible, even if a reconstruction was initiated with a hot spare by the
controller. The Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure Device option in
the SCSI and SCSI RAID Hot Plug Manager can be used to replace Failed pdisks. The PCI-X SCSI Disk
Array Manager provides a shortcut to the SCSI and SCSI RAID Hot Plug Manager. The SCSI and SCSI
RAID Hot Plug Manager allows you to perform the following tasks:
v List all SCSI hot swap slots and their contents
v Set the identify indication for a device attached to a SCSI hot swap enclosure device
v Add a device to a SCSI hot swap enclosure device
v Replace and remove a hot swap device
v Configure added and replaced devices
To start the SCSI and SCSI RAID Hot Plug Manager, do the following:
1. Navigate to the PCI-X SCSI Disk Array Manager by using the steps in “Using the PCI-X SCSI Disk
Array Manager” on page 15.
2. Select Diagnostics and Recovery Options.
3. Select SCSI and SCSI RAID Hot Plug Manager.
4. Select the action you want to take, and follow the instructions on the screen to complete the selected
operation.
RAID Level 0
A single disk failure will cause a RAID level 0 array to change to the Failed state. After deleting the logical
volumes and file systems on the disk array and removing the disk array from the volume group, delete the
disk array, replace the Failed disk, then re-create the disk array. You must then add the disk array back to
a volume group, re-create the disk array logical volumes and file systems, and copy the data back to the
restored disk array from your backup media.
RAID Level 5
Single-Disk Failure
Whenever a single disk in a RAID level 5 disk array transitions to the Failed state, the disk array status
changes to Degraded. The disk array remains functional because the data on the Failed disk can be
reconstructed using parity and data on the remaining disks.
Whenever a disk is marked as Failed, replace it as soon as possible. If a hot spare disk was defined, the
controller may reconstruct the data on the disk automatically. If a hot spare reconstruct was not initiated by
the controller, replace the Failed disk and then initiate a reconstruct.
Multiple-Disk Failures
If a second disk in a RAID level 5 disk array fails, the array will be marked as Failed and the data will not
be accessible. After deleting the logical volumes and file systems on the disk array and removing the disk
array from the volume group, delete the disk array, replace the Failed disks, then re-create the disk array.
You must then add the disk array back to a volume group, re-create the disk array logical volumes and file
systems, and copy the data back to the restored disk array from your backup media.
RAID Level 6
Single-Disk or Dual-Disk Failure
Whenever one or two disks in a RAID level 6 disk array transitions to the Failed state, the disk array
status changes to Degraded.. The disk array remains functional because the data on the Failed disk can
be reconstructed using “P” and “Q” parity and data on the remaining disks.
Whenever a disk is marked as Failed, replace it as soon as possible. If a hot spare disk was defined, the
controller may reconstruct the data on the disk automatically. If a hot spare reconstruct was not initiated by
the controller, replace the Failed disk and then initiate a reconstruct.
Three-Disk Failures
If a third disk in a RAID level 6 disk array fails, the array will be marked as Failed and the data will not be
accessible. After deleting the logical volumes and file systems on the disk array and removing the disk
array from the volume group, delete the disk array, replace the Failed disks, then re-create the disk array.
You must then add the disk array back to a volume group, re-create the disk array logical volumes and file
systems, and copy the data back to the restored disk array from your backup media.
RAID Level 10
Single-Disk Failure
Whenever a single disk in a RAID level 10 disk array transitions to the Failed state, the disk array status
changes to Degraded. The disk array remains functional because the data on the Failed disk is also stored
on the other member of its mirrored pair.
Whenever a disk is marked as Failed, replace it as soon as possible. If a hot spare disk was Defined, the
controller may reconstruct the data on the disk automatically. If a hot spare reconstruct was not initiated by
the controller, then initiate a reconstruct on the Failed disk following replacement.
Multiple-Disk Failures
If each Failed disk belongs to a different mirrored pair, then the array will be in the Degraded state
because the data on each Failed disk is still available from its mirrored pair. The recovery procedure for
this case is the same as the single-disk failure case. After you replace the Failed disks, initiate reconstruct
operations.
When both members of a mirrored pair fail, the disk array status changes to Failed. After deleting the
logical volumes and file systems on the disk array and removing the disk array from the volume group,
delete the disk array, replace the Failed disks, then re-create the disk array. You must then add the disk
array back to a volume group, re-create the disk array logical volumes and file systems, and copy the data
back to the restored disk array from your backup media.
If a problem arises related to disk arrays and associated pdisks, use the following to identify the problem:
v Information presented by the Error Log Analysis (ELA)
v Hardware error logs viewed using the Display Hardware Error Report diagnostic task
v Disk array hdisk and pdisk status, viewed using the PCI-X SCSI Disk Array Manager
Error Log Analysis (ELA) analyzes errors presented by the adapter and recommends actions that need to
be performed to correct the errors. It is sometimes recommended that you perform a Maintenance Analysis
Procedure (MAP) to further determine what actions should be taken to resolve the problem. This chapter
provides many of these MAPs.
The MAPs contained in this chapter are intended to address only problems directly related to disk arrays
and SCSI bus-problem isolation. MAPs related to other device or adapter problems, when applicable, are
located in other system documentation.
Considerations
Read the following before using these problem determination and recovery procedures:
v If a disk array is being used as a boot device and the system fails to boot because of a suspected
disk-array problem, boot using the Standalone Diagnostic media. Error Log Analysis, AIX error logs, the
PCI-X SCSI Disk Array Manager, and other tools are available on the Standalone Diagnostics to help
determine and resolve the problem with the disk array.
v When invoking diagnostic routines for a PCI-X SCSI RAID Controller, it is recommended that Problem
Determination (PD) mode be used instead of System Verification (SV) mode unless there is a specific
reason to use SV mode (for example, you were directed to run SV mode by a MAP).
v After diagnostic routines for a PCI-X SCSI RAID Controller are run in SV mode, it is recommended to
run diagnostics in PD mode to ensure that new errors are analyzed. These actions should be performed
especially when using Standalone Diagnostic media.
Location Codes
Much of the location information contained in the hardware error logs is in the format of AIX location
codes.
The AIX location code format for a SCSI device is: AB-CD-EF-G,H where
v AB-CD identifies the PCI-X SCSI RAID Controller.
v EF identifies the Controller’s SCSI bus that the device is attached to.
Note: For a disk array this value will be ff, which indicates a logical bus for RAID devices.
v G,H are the SCSI ID and logical unit number (LUN) of the device, respectively.
Often only a portion of the AIX location code is shown. For example:
v AB-CD identifies a PCI-X SCSI RAID Controller.
v AB-CD-EF identifies a PCI-X SCSI RAID Controller’s SCSI bus.
v EF-G,H identifies the SCSI bus, SCSI ID, and LUN of an attached disk.
61
Identifying the Disk Array Problem
A disk array problem is uniquely identified by a Service Request Number (SRN). An SRN is in the format
nnnn - rrrr, where the first four digits of the SRN preceding the dash (-) is known as the Failing Function
Code (FFC, for example 2523) and the last four digits of the SRN following the dash (-) is known as the
reason code. The reason code indicates the specific problem that has occurred and must be obtained in
order to determine which MAP to use.
An SRN is provided by Error Log Analysis, which directs you to the MAPs contained in this chapter. To
obtain the reason code (last four digits of the SRN) from an AIX error log, see “Finding an SRN Given an
AIX Error Log” on page 97.
The SRN describes the problem that has been detected and should be considered the primary means of
identifying a problem. However, the List PCI-X SCSI Disk Array Configuration option within the PCI-X
SCSI Disk Array Manager is also useful in identifying a problem or confirming a problem described by
Error Log Analysis. For additional information about the PCI-X Disk Array Manager, see “Using the PCI-X
SCSI Disk Array Manager” on page 15.
Obtain the SRN and proceed to the next section to obtain a more detailed description of the problem and
to determine which MAP to use.
Note: The following table includes only SRNs that are associated with Maintenance Analysis Procedures
contained in this document. A complete list of SRNs can be found in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Table 3.
SRN Description MAP
nnnn - 8008 Permanent Cache Battery Pack failure MAP 3000
nnnn - 8009 Impending Cache Battery Pack failure MAP 3000
nnnn - 9000 Controller detected device error during configuration MAP 3090
nnnn - 9001 discovery
nnnn - 9002
nnnn - 9008 Controller does not support function expected for one or MAP 3030
more disks
nnnn - 9010 Cache data associated with attached disks cannot be found MAP 3020
nnnn - 9011 Cache data belongs to disks other than those attached MAP 3090
nnnn - 9020 Two or more disks are missing from a RAID-5 or RAID-6 MAP 3011
nnnn - 9021 Disk Array
nnnn - 9022
nnnn - 9023 One or more Disk Array members are not at required MAP 3012
physical locations
nnnn - 9024 Physical location of Disk Array members conflict with another MAP 3090
Disk Array
nnnn - 9025 Incompatible disk installed at degraded disk location in Disk MAP 3010
Array
nnnn - 9026 Previously degraded disk in Disk Array not found at required MAP 3090
physical location
MAP 3000
Use this MAP to resolve the following problems:
v Permanent Cache Battery Pack failure (SRN nnnn - 8008)
v Impending Cache Battery Pack failure (SRN nnnn - 8009)
Step 3000-1
Prior to replacing the Cache Battery Pack, it must be forced into an error state. This will ensure that write
caching is stopped prior to replacing the battery pack thus preventing possible data loss.
1. Follow the steps described in “Forcing a Rechargeable Battery Error” on page 34.
2. Go to Step 3000-2
Step 3000-2
Follow the steps described in “Replacing the Rechargeable Cache Battery Pack” on page 35.
When the problem is resolved, then go to MAP 0410: Repair Checkout, in RS/6000 pSeries Diagnostic
Information for Multiple Bus Systems.
MAP 3010
Use this MAP to resolve the following problems:
v Incompatible disk installed at the degraded disk location in Disk Array (SRN nnnn - 9025)
v Disk Array is degraded due to a missing or failed disk (SRN nnnn - 9030)
v Automatic reconstruction initiated for a Disk Array (SRN nnnn - 9031)
v Disk Array is degraded due to a missing or failed disk (SRN nnnn - 9032)
Step 3010-1
Identify the disk array by examining the hardware error log. The hardware error log may be viewed as
follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem that sent
you here and select it.
Step 3010-2
View the current disk array configuration as follows:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select List PCI-X SCSI Disk Array Configuration.
3. Select the PCI-X SCSI RAID Controller identified in the hardware error log.
4. Go to Step 3010-3.
Step 3010-3
Does a disk array have a state of Degraded?
NO Go to Step 3010-4.
YES Go to Step 3010-5.
Step 3010-4
The affected disk array should have a state of either Rebuilding or Optimal due to the use of a hot spare
disk.
Identify the Failed disk, which is no longer a part of the disk array, by finding the pdisk listed at the bottom
of the screen that has a state of either Failed or RWProtected. Using appropriate service procedures, such
as use of the SCSI and SCSI RAID Hot Plug Manager, remove the failed disk and replace it with a new
disk to use as a hot spare:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Diagnostics and Recovery Options .
3. Select SCSI and SCSI RAID Hot Plug Manager.
4. Select Identify a Device Attached to an SCSI Hot Swap Enclosure Device.
5. Choose the location for the device you want to remove or install.
Note: The visual indicator on the Device will blink at the Identify rate.
6. If you are removing a device:
a. Select Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure Device.
Note: The visual indicator on the device will blink at the Remove rate.
b. Remove the device.
else, if installing a device:
a. Select Attach a Device to an SCSI Hot Swap Enclosure Device.
If a new disk is not listed as a pdisk, it may first need to be prepared for use in a disk array. Do the
following:
1. Start the PCI-X SCSI Disk Array Manager
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Create an Array Candidate pdisk and Format to 522 Byte Sectors.
3. Select the PCI-X SCSI RAID Controller.
4. Select the disk(s) from the list that you want to prepare for use in the disk arrays.
In order to make the new disk usable as a hot spare, do the following:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen .
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Change/Show PCI-X SCSI pdisk Status.
3. Select Create a Hot Spare.
4. Select the PCI-X SCSI RAID Controller.
5. Select the pdisk that you want to designate as a hot spare.
Note: Hot spare disks are useful only if their capacity is greater than or equal to that of the smallest
capacity disk in a disk array that becomes Degraded.
When the problem is resolved, go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3010-5
Identify the failed disk by finding the pdisk listed for the degraded disk array that has a state of Failed.
Using appropriate service procedures, such as use of the SCSI and SCSI RAID Hot Plug Manager,
remove the failed disk and replace it with a new disk to use in the disk array. The SCSI and SCSI RAID
Hot Plug Manager can be invoked as follows:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Diagnostics and Recovery Options.
3. Select SCSI and SCSI RAID Hot Plug Manager.
4. Select Identify a Device Attached to an SCSI Hot Swap Enclosure Device.
5. Choose the location for the device you wish to remove/install.
Note: The visual indicator on the device will blink at the Identify rate.
6. If removing a device:
a. Select Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure Device.
Note: The visual indicator on the device will blink at the Remove rate.
b. Remove the device.
else, if installing a device:
66 PCI-X SCSI RAID Controller: Reference Guide for AIX
a. Select Attach a Device to an SCSI Hot Swap Enclosure Device.
Note: The visual indicator on the device will blink at the Remove rate.
b. Insert the device.
To bring the disk array back to a state of Optimal, do the following:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Reconstruct a PCI-X SCSI Disk Array.
3. Select the failed pdisk to reconstruct.
Note: The replacement disk should have a capacity that is greater than or equal to that of the smallest
capacity disk in the degraded disk array.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3011
Use this MAP to resolve the following problems:
v Two or more disks are missing from a RAID-5 or RAID-6 Disk Array (SRN nnnn - 9020 / nnnn - 9021 /
nnnn - 9022)
v One or more disk pairs are missing from a RAID-10 Disk Array (SRN nnnn - 9060)
v One or more disks are missing from a RAID-0 Disk Array (SRN nnnn - 9061 / nnnn - 9062)
Step 3011-1
Identify the disks missing from the disk array by examining the hardware error log. The hardware error log
may be viewed as follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, the missing disks are those
listed under Array Member Information with an Actual Location of *unkwn*.
7. Go to Step 3011-2.
Step 3011-2
There are three possible ways to correct the problem. Perform only one of the following three options,
listed in the order of preference:
v Locate the identified disks and install them in the correct physical locations (that is the Expected
Locations) in the system. Perform only one of the following two options:
– IPL the system or logical partition
– Unconfigure and reconfigure the adapter by performing the following:
1. Unconfigure the adapter.
Step 3012-1
Identify the disks which are not at their required physical locations by examining the hardware error log.
The hardware error log may be viewed as follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view.
Viewing the hardware error log, the disks which are not at their required locations are those listed
under Array Member Information with an Expected Location and Actual Location which do not match.
Note: An Actual Location of *unkwn* is acceptable, and no action is needed to correct it. This *unkwn*
location should only occur for the disk array member that corresponds to the Degraded Disk
S/N.
7. Go to Step 3012-2.
Step 3012-2
There are three possible ways to correct the problem. Perform only one of the following three options,
listed in the order of preference:
v Locate the identified disks and install them in the correct physical locations (that is the Expected
Locations) in the system. Perform only one of the following two options:
– IPL the system or logical partition
– Unconfigure and reconfigure the adapter by performing the following:
1. Unconfigure the adapter.
a. Start the PCI-X SCSI Disk Array Manager.
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Unconfigure an Available PCI-X SCSI RAID Controller.
2. Configure the adapter.
a. Start the PCI-X SCSI Disk Array Manager
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Configure a Defined PCI-X SCSI RAID Controller.
MAP 3013
Use this MAP to resolve the following problem:
Disk array is or would become degraded and parity data is out of synchronization (SRN nnnn - 9027)
Step 3013-1
Identify the adapter and disks by examining the hardware error log. The hardware error log may be viewed
as follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, if the disk array member which
corresponds to the Degraded Disk S/N has an Actual Location of *unkwn* and is not physically
present, it may be helpful to find this disk.
7. Go to Step 3013-2.
Step 3013-3
There are three possible ways to correct the problem. Perform only one of the following three options,
listed in the order of preference:
v Restore the adapter and disks back to their original configuration. Perform onlyone of the following two
options:
– IPL the system or logical partition
– Unconfigure and reconfigure the adapter by performing the following:
1. Unconfigure the adapter.
a. Start the PCI-X SCSI Disk Array Manager.
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Unconfigure an Available PCI-X SCSI RAID Controller.
2. Configure the adapter.
a. Start the PCI-X SCSI Disk Array Manager
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Configure a Defined PCI-X SCSI RAID Controller.
When the problem is resolved then go to MAP 0410: Repair Checkout, inRS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
v Delete the disk array, as follows:
Attention: All data on the disk array will be lost.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Delete a PCI-X SCSI Disk Array.
3. Select the PCI-X SCSI RAID Controller.
4. Select the disk array to delete.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
v Format the remaining members of the disk array, as follows
Attention: All data on the disk array will be lost.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
MAP 3020
Use this MAP to resolve the following problem:
Cache data associated with attached disks cannot be found (SRN nnnn - 9010)
Step 3020-1
Has the server been powered off for several days?
NO Go to Step 3020-2.
YES Go to Step 3020-7.
Step 3020-2
Are you working with a 571F/575B card set?
NO Go to Step 3020-4.
YES Go to Step 3020-3.
Step 3020-3
Using the appropriate service procedures, remove the 571F/575B card set. Create and install new card set
with the following parts installed on it:
Note: Label all parts (original and new) before moving them from one place to another.
v The new replacement 571F storage I/O adapter.
v The cache directory card from the original 571F storage I/O adapter.
v The original 575B Auxiliary Cache adapter.
Notes:
1. Refer to “Separating the 571F/575B Card Set and Moving the Cache Directory Card” on page 52.
2. Ensure that the SCSI cable and the battery power cable on the top edge of the storage I/O side of the
card set are connected to the top edge of the Auxiliary Cache side of the card set.
Go to Step 3020-5..
Step 3020-4
Note: Label all parts (original and new) before moving them from one place to another.
Using the appropriate service procedures, remove the I/O adapter. Install the new replacement storage I/O
adapter with the following parts installed on it:
v The cache directory card from the original storage I/O adapter. (Refer to “Replacing the Cache Directory
Card” on page 49.
v The removable cache card from the original storage I/O adapter (This only applies to 571E and some
2780 I/O adapters. Refer to “Separating a Removable Cache Card From the Base Card on Type 2780
and 571E Cards” on page 38.
v If the storage I/O adapter is attached to a 574F Auxiliary Cache adapter, ensure that the SCSI cable on
the last port of the storage I/O adapter is connected to the Auxiliary Cache adapter.
NO Go to Step 3020-8.
YES Go to Step 3020-6.
Step 3020-6
Was the new SRN nnnn-9050?
Attention: Data might be lost. When an Auxiliary Cache adapter connected to the RAID Controller logs
a nnnn - 9055 SRN in the hardware error log, the Reclaim does not result in lost sectors. Otherwise, the
Reclaim does result in lost sectors.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Diagnostics and Recovery Options.
3. Select Reclaim Controller Cache Storage.
4. Select the PCI-X SCSI RAID Controller.
5. Confirm that you wish to proceed.
Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is
displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost and the
system operator may want to restore data after this procedure is completed.
6. Go to Step 3020-8.
YES Contact your Service Support organization
Step 3020-7
If the server has been powered off for several days after an abnormal power-down, the cache battery pack
may be depleted. Do not replace the adapter or the cache battery pack. Reclaim the controller cache
storage as follows:
Attention: Data might be lost. When an Auxiliary Cache adapter connected to the RAID Controller logs a
nnnn - 9055 SRN in the hardware error log, the Reclaim does not result in lost sectors. Otherwise, the
Reclaim does result in lost sectors.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Diagnostics and Recovery Options.
3. Select Reclaim Controller Cache Storage.
4. Select the PCI-X SCSI RAID Controller.
5. Confirm that you wish to proceed.
Note: On the Reclaim Controller Cache Storage results screen, the number of lost sectors is
displayed. If the number is 0, there is no data loss. If the number is not 0, data has been lost
and the system operator may want to restore data after this procedure is completed.
Step 3020-8
Are you working with a 571F/575B card set?
NO Go to Step 3020-10.
YES Go to Step 3020-9.
Step 3020-9
Using the appropriate service procedures, remove the 571F/575B card set. Create and install new card set
with the following parts installed on it:
Note: Label all parts (original and new) before moving them around.
v The new 571F storage I/O adapter.
v The cache directory card from the new 571F storage I/O adapter.
v The new 575B Auxiliary Cache adapter.
Notes:
1. Refer to “Separating the 571F/575B Card Set and Moving the Cache Directory Card” on page 52.
2. Ensure that the SCSI cable and the battery power cable on the top edge of the storage I/O side of the
card set are connected to the top edge of the Auxiliary Cache side of the card set.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems
Step 3020-10
Using the appropriate service procedures, remove the I/O adapter. Install the new replacement storage I/O
adapter with the following parts installed on it:
v The cache directory card from the new storage I/O adapter. (Refer to “Replacing the Cache Directory
Card” on page 49.
v The removable cache card from the new storage I/O adapter (This only applies to 571E and some 2780
I/O adapters. Refer to “Separating a Removable Cache Card From the Base Card on Type 2780 and
571E Cards” on page 38.
v If the storage I/O adapter is attached to a 574F Auxiliary Cache adapter, ensure that the SCSI cable on
the last port of the storage I/O adapter is connected to the Auxiliary Cache adapter.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems
MAP 3021
Use this MAP to resolve the following problem:
RAID controller resources not available due to previous problems (SRN nnnn - 9054)
Step 3021-1
Perform the following:
1. Remove any new or replacement disks which have been attached to the adapter.
2. Take action on the other errors which have occurred at the same time as this error.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3030-1
Identify the affected disks by examining the hardware error log. The hardware error log may be viewed as
follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, the Device Errors Detected field
indicates the total number of disks which are affected. The Device Errors Logged field indicates the
number of disks for which detailed information is provided. Under the Device heading, the Location,
Vendor/Product ID, and S/N are provided for up to three disks. Additionally, the Controller Type and
S/N for each of these disks indicates the adapter to which the disk was last attached when it was
operational.
7. Go to Step 3030-2.
Step 3030-2
Have the adapter card or disks been physically moved recently?
Step 3030-3
There are two possible ways to correct the problem. Perform only one of the following two options, listed
in the order of preference:
v Restore the adapter and disks back to their original configuration. Perform onlyone of the following two
options:
– IPL the system or logical partition
– Unconfigure and reconfigure the adapter by performing the following:
1. Unconfigure the adapter.
a. Start the PCI-X SCSI Disk Array Manager.
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Unconfigure an Available PCI-X SCSI RAID Controller.
2. Configure the adapter.
a. Start the PCI-X SCSI Disk Array Manager
1) Start Diagnostics and select Task Selection on the Function Selection screen.
MAP 3031
Use this MAP to resolve the following problem:
Required cache data cannot be located for one or more disks (SRN nnnn - 9050)
Step 3031-1
Did you just exchange the adapter as the result of a failure?
NO Go to Step 3031-5.
YES Go to Step 3031-2.
Step 3031-2
Are you working with a 571F/575B card set?
NO Go to Step 3031-4.
YES Go to Step 3031-3.
Step 3031-3
Using the appropriate service procedures, remove the 571F/575B card set. Create and install new card set
with the following parts installed on it:
Note: Label all parts (original and new) before moving them from one place to another.
v The new replacement 571F storage I/O adapter.
v The cache directory card from the original 571F storage I/O adapter.
v The original 575B Auxiliary Cache adapter.
Notes:
1. Refer to “Separating the 571F/575B Card Set and Moving the Cache Directory Card” on page 52.
2. Ensure that the SCSI cable and the battery power cable on the top edge of the storage I/O side of the
card set are connected to the top edge of the Auxiliary Cache side of the card set.
Go to Step 3031-10.
Note: Label all parts (old and new) before moving them around.
Using the appropriate service procedures, remove the I/O adapter. Install the new replacement storage I/O
adapter with the following parts installed on it:
v The cache directory card from the original storage I/O adapter. (Refer to “Replacing the Cache Directory
Card” on page 49.
v The removable cache card from the original storage I/O adapter (This only applies to 571E and some
2780 I/O adapters. Refer to “Separating a Removable Cache Card From the Base Card on Type 2780
and 571E Cards” on page 38.
v If the storage I/O adapter is attached to a 574F Auxiliary Cache adapter, ensure that the SCSI cable on
the last port of the storage I/O adapter is connected to the Auxiliary Cache adapter.
Go to Step 3031-10.
Step 3031-5
Identify the affected disks by examining the hardware error log. The hardware error log may be viewed as
follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, the Device Errors Detected field
indicates the total number of disks which are affected. The Device Errors Logged field indicates the
number of disks for which detailed information is provided. Under the Device heading, the Location,
Vendor/Product ID, and S/N are provided for up to three disks. Additionally, the Controller Type and
S/N for each of these disks indicates the adapter to which the disk was last attached when it was
operational.
7. Go to Step 3031-6.
Step 3031-6
Have the adapter card or disks been physically moved recently?
Step 3031-7
Is the data on the disks needed for this or any other system?
NO Go to Step 3031-9.
Step 3031-8
The adapter and disks, identified above, must be reunited so that the cache data can be written to the
disks.
Once the cache data is written to the disks and the system is powered off normally, the adapter and/or
disks may be moved to another location.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3031-9
There are three possible ways to correct the problem. Perform only one of the following three options,
listed in the order of preference:
v Reclaim Controller Cache Storage by performing the following:
Attention: All data on the disk array will be lost.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Diagnostics and Recovery Options.
3. Select Reclaim Controller Cache Storage.
4. Select the PCI-X SCSI RAID Controller.
5. Confirm that you will Allow Unknown Data Loss.
6. Confirm that you wish to proceed.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
v If the disks are members of a disk array, delete the disk array.
Attention: All data on the disk array will be lost.
This may be performed as follows:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Delete a PCI-X SCSI Disk Array.
3. Select the PCI-X SCSI RAID Controller.
4. Select the disk array to delete.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
v Format the disks, as follows:
Attention: All data on the disks will be lost.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
78 PCI-X SCSI RAID Controller: Reference Guide for AIX
2. Select Diagnostics and Recovery Options.
3. Select Format Physical Disk Media (pdisk).
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3031-10
Has a new SRN nnnn-9010 or nnnn-9050 occurred?
NO Go to Step 3031-12.
YES Go to Step 3031-11.
Step 3031-11
Was the new SRN nnnn-9050?
Attention: Data might be lost. When an Auxiliary Cache adapter connected to the RAID Controller logs
a nnnn - 9055 SRN in the hardware error log, the Reclaim does not result in lost sectors. Otherwise, the
Reclaim does result in lost sectors.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Diagnostics and Recovery Options.
3. Select Reclaim Controller Cache Storage.
4. Select the PCI-X SCSI RAID Controller.
5. Confirm that you wish to proceed.
Note: On the Reclaim controller cache storage results screen, the number of lost sectors is displayed.
If the number is 0, there is no data loss. If the number is not 0, data has been lost and the system
operator may want to restore data after this procedure is completed.
6. Go to Step 3031-12.
YES Contact your Service Support organization
Step 3031-12
Are you working with a 571F/575B card set?
NO Go to Step 3031-14.
YES Go to Step 3031-13.
Step 3031-13
Using the appropriate service procedures, remove the 571F/575B card set. Create and install new card set
with the following parts installed on it:
Note: Label all parts (original and new) before moving them from one place to another.
v The new 571F storage I/O adapter.
v The cache directory card from the new 571F storage I/O adapter.
v The new 575B Auxiliary Cache adapter.
Notes:
1. Refer to “Separating the 571F/575B Card Set and Moving the Cache Directory Card” on page 52.
Step 3031-14
Using the appropriate service procedures, remove the I/O adapter. Install the new replacement storage I/O
adapter with the following parts installed on it:
v The cache directory card from the new storage I/O adapter. (Refer to “Replacing the Cache Directory
Card” on page 49.
v The removable cache card from the NEW storage I/O adapter (This only applies to 571E and some
2780 I/O adapters. Refer to “Separating a Removable Cache Card From the Base Card on Type 2780
and 571E Cards” on page 38.
v If the storage I/O adapter is attached to a 574F Auxiliary Cache adapter, ensure that the SCSI cable on
the last port of the storage I/O adapter is connected to the Auxiliary Cache adapter.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3032
Use this MAP to resolve the following problem:
Cache data exists for one or more missing or failed disks (SRN nnnn - 9051)
Step 3032-1
Identify the affected disks by examining the hardware error log. The hardware error log may be viewed as
follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, the Device Errors Detected field
indicates the total number of disks which are affected. The Device Errors Logged field indicates the
number of disks for which detailed information is provided. Under the Device heading, the Location,
Vendor/Product ID, and S/N are provided for up to three disks. Additionally, the Controller Type and
S/N for each of these disks indicates the adapter to which the disk was last attached when it was
operational.
7. Go to Step 3030-2.
NO Go to Step 3032-3.
YES Go to Step 3032-6.
Step 3032-3
Is the data on the disks (and thus the cache data for the disks) needed for this or any other system?
NO Go to Step 3032-7.
YES Go to Step 3032-4.
Step 3032-4
Have the adapter card or disks been physically moved recently?
Step 3032-5
The adapter and disks must be reunited so that the cache data can be written to the disks.
After the cache data is written to the disks and the system is powered off normally, the adapter and/or
disks may be moved to another location.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3032-6
Take action on the other errors that have occurred at the same time as this error.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3032-7
Attention: Data will be lost.
MAP 3033
Use this MAP to resolve the following problems:
v Disk has been modified after last known status (SRN nnnn - 9090)
v Incorrect disk configuration change has been detected (SRN nnnn - 9091)
Step 3033-1
Perform onlyone of the following two options:
v IPL the system or logical partition
v Unconfigure and reconfigure the adapter by performing the following:
1. Unconfigure the adapter.
a. Start the PCI-X SCSI Disk Array Manager.
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Unconfigure an Available PCI-X SCSI RAID Controller.
2. Configure the adapter.
a. Start the PCI-X SCSI Disk Array Manager
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Configure a Defined PCI-X SCSI RAID Controller.
Take action on the other errors which have occurred at the same time as this error.
When the problem is resolved then go to MAP 0410: Repair Checkout, inRS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3034
Use this MAP to resolve the following problem:
Disk requires Format before use (SRN nnnn - 9092)
Step 3034-1
Identify the affected disks by examining the hardware error log. The hardware error log may be viewed as
follows:
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, the Device Errors Detected field
indicates the total number of disks which are affected. The Device Errors Logged field indicates the
number of disks for which detailed information is provided. Under the Device heading, the Location,
Vendor/Product ID, and S/N are provided for up to three disks. Additionally, the Controller Type and
S/N for each of these disks indicates the adapter to which the disk was last attached when it was
operational.
7. Go to Step 3030-2.
Step 3034-2
Are there other disk or adapter errors which have occurred at about the same time as this error?
NO Go to Step 3034-3.
YES Go to Step 3034-5.
Step 3034-3
Have the adapter card or disks been physically moved recently?
NO Go to Step 3034-4.
YES Go to Step 3034-6.
Step 3034-4
Is the data on the disks not needed for this or any other system and you wish to continue to use them with
this adapter?
NO Go to Step 3034-6.
YES Go to Step 3034-7.
Step 3034-5
Take action on the other errors which have occurred at the same time as this error.
When the problem is resolved then go to MAP 0410: Repair Checkout, inRS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3034-6
Perform only one of the following actions that is most applicable to your situation:
v Perform only one of the following two options:
– IPL the system or logical partition
– Unconfigure and reconfigure the adapter by performing the following:
1. Unconfigure the adapter.
a. Start the PCI-X SCSI Disk Array Manager.
Step 3034-7
There are two possible ways to correct the problem. Perform only one of these options.
v Format the disks.
Attention: All data on the disks will be lost.
This may be performed as follows:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
Note: In some rare scenarios, deleting the disk array will not have no effect on a disk and the disk
must be formatted instead.
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Delete a PCI-X SCSI Disk Array.
3. Select the PCI-X SCSI RAID Controller.
4. Select the disk array to delete.
When the problem is resolved then go to MAP 0410: Repair Checkout, inRS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3035
Use this MAP to resolve the following problem:
Disk media format bad (SRN nnnn - FFF3)
Step 3035-1
Identify the affected disk by examining the hardware error log. The hardware error log may be viewed as
follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapter resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, under the Disk Information
heading, the Location, Vendor/Product ID, and S/N are provided for the disk.
7. Go to Step 3035-2.
When the problem is resolved then go to MAP 0410: Repair Checkout, inRS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3036
Use this MAP to resolve the following problem:
Identify disk to be replaced (SRN nnnn - 9200)
You are sent here when a pdisk (that is physical disk in 522 bytes/sector format) is to be replaced,
however, the location of this disk was not provided.
Step 3036-1
Identify the failing disk by examining the hardware error log. The hardware error log may be viewed as
follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. Viewing the hardware error log, under the Disk Information
heading, the Location, Vendor/Product ID, and S/N are provided for the disk.
7. Go to Step 3036-2.
Step 3036-2
Using appropriate service procedures, such as use of the SCSI and SCSI RAID Hot Plug Manager,
remove the failed disk and replace it with a new disk. The SCSI and SCSI RAID Hot Plug Manager can be
invoked as follows:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select Diagnostics and Recovery Options.
3. Select SCSI and SCSI RAID Hot Plug Manager.
4. Select Identify a Device Attached to an SCSI Hot Swap Enclosure Device.
Note: The visual indicator on the Device will blink at the Identify rate.
6. If removing a device:
a. Select Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure Device
Note: The visual indicator on the Device will blink at the Remove rate.
b. Remove the device
else if installing a device:
a. Select Attach a Device to an SCSI Hot Swap Enclosure Device
Note: The visual indicator on the Device will blink at the Remove rate.
b. Insert the device
7. Go to Step 3036-3.
Step 3036-3
1. Run diagnostics in system verification mode on the adapter
2. Take action on any other errors which may have surfaced due to removing the disk, if any, such as for
degraded disk arrays
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3040
Use this MAP to resolve the following problem:
Multiple controllers connected in an invalid configuration (SRN nnnn - 9073)
Step 3040-1
Was the error logged by a 574F Auxiliary Cache adapter, a storage I/O adapter which attaches an
Auxiliary Cache adapter (for example, 2780 or 571E), or by a 571F/575B card set?
NO Go to Step 3040-5.
YES Go to Step 3040-2.
Step 3040-2
Are you working with a 571F/575B card set?
NO Go to Step 3040-4.
YES Go to Step 3040-3.
Step 3040-3
You must configure both the 571F and the 575B in the same partition.
Are both the 571F side of the card set and the 575B side of the card set configured in the same partition?
NO Change the configuration so that the same parition controls both cards in the card set.
YES Replace the entire card set.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Note: The adapter which is logging this error will run in a performance degraded mode, without
caching, until the unsupported Auxiliary Cache adapter is removed.
v A storage I/O adapter which does not support Auxiliary Cache is attached to an Auxiliary Cache adapter.
Verify that the storage I/O adapter is listed in Table 1 with Auxiliary Cache support marked as YES. If
the storage I/O adapter does not support Auxiliary Cache then the Auxiliary Cache must be
disconnected from the storage I/O adapter.
Determine which of these is the cause of your specific error and take the appropriate actions listed. If this
does not correct the error, contact your next level of support.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
Step 3040-5
There are two primary reasons for receiving this error (SRN nnnn - 9073).
v An adapter in a multi-initiator and high-availability environment sees more than one other adapter
connected in the configuration. Only two adapters are supported connected together in the multi-initiator
and high-availability configuration. Work with the customer to identify and correct the invalid
configuration.
v Incompatible adapters are connected in a multi-initiator and high-availability environment. One or both of
the adapters do not support attachment in a multi-initiator configuration. Verify the adapters logging this
error are listed in Table 1 with multi-initiator support mark as YES.
Determine which of these is the cause of your specific error and take the appropriate actions listed. If this
does not correct the error, contact your next level of support.
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3041
Use this MAP to resolve the following problem:
Multiple controllers not capable of similar functions or controlling the same set of devices (SRN nnnn -
9074).
Step 3041-1
To obtain the reason/description for this failure, you must find the formatted error information in the AIX
error log. This should also contain information about the other connected adapter (Remote Adapter Fields).
Display the hardware error log. The hardware error log may be viewed as follows:
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view. When viewing the hardware error log, the Detail Data section
contains the “Reason for Failure” and “Remote Adapter” information.
Step 3041-2
Find the Reason for Failure and information for the other attached adapter (Remote Adapter) shown in the
error log, and perform the action listed for the Reason in the following table:
Table 4.
Adapter on which to
Reason for Failure Description Action perform the action
Secondary does not support Logged by the secondary Customer needs to upgrade Physically change the type
RAID level being used by adapter. For example, the the type of secondary of adapter which logged the
the Primary. Primary has a Raid-6 array adapter or change the RAID error.
and the Secondary does level of the array on the
not support Raid-6. Primary to a level that is Change RAID level on
supported by the Primary adapter (Remote
secondary. adapter indicated in the
error log).
Secondary does not support Secondary detected some Customer may need to Adapter which logged the
disk function being used by device function/attribute that upgrade the Adapter micro error.
the Primary. it does not support. code or upgrade the type of
secondary adapter.
Secondary is unable to find Secondary logging the Verify the connections to Adapter which logged the
devices found by the error, cannot see all the the devices from the error.
Primary devices that the Primary adapter logging the error.
sees.
View the Disk Array
Configuration screens to
determine the SCSI bus
with the problem.
Secondary found devices Secondary adapter sees Verify the connections to Remote adapter indicated in
not found by the Primary more devices than the the devices from the the error log.
Primary. After this error is Remote Adapter as
logged, automatic failover indicated in the error log.
will occur.
Secondary bus not SCSI connections from the Verify connections and Either adapter.
connected to the same adapter to the devices are re-cable SCSI connections
numbered bus on Primary incorrect. Both adapters as necessary.
must be connected to the
same devices on the same
adapter SCSI bus
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 eServer pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3050
Use the following to perform SCSI bus problem isolation.
Considerations:
v Remove power from the system before connecting and disconnecting cables or devices, as appropriate,
to prevent hardware damage or erroneous diagnostic results.
v Some systems have SCSI and PCI-X bus interface logic integrated onto the system boards and use a
pluggable RAID Enablement Card (a non-PCI form factor card) for these SCSI/PCI-X buses. An
example of such a RAID Enablement Card is FC 5709. For these configurations, replacement of the
RAID Enablement Card is unlikely to solve a SCSI bus-related problem because the SCSI bus interface
logic is on the system board.
v Some adapters provide two connectors, one internal and one external, for each SCSI bus. For this type
of adapter, it is not acceptable to use both connectors for the same SCSI bus at the same time. SCSI
bus problems are likely to occur if this is done. However, it is acceptable to use an internal connector
for one SCSI bus and an external connector for another SCSI bus. The internal and external connectors
are labeled to indicate which SCSI bus they correspond to.
v When two adapters are connected in a Multi-Initiator and High-Availability configuration, as described in
Chapter 4, “Multi-Initiator and High-Availability,” on page 25, each adapter’s SCSI ID must be set to a
different value when connected to shared disk enclosures. If the SCSI IDs are not set properly, many
SCSI bus problems can occur.
Attention: Replacing RAID adapters is not recommended without assistance from your service support
organization when SCSI bus problems exist. Because the adapter may contain non-volatile write cache
data and configuration data for the attached disk arrays, additional problems can be created by replacing a
adapter when SCSI bus problems exist.
Attention: Removing functioning disks in a disk array is not recommended without assistance from your
service support organization. A disk array may become degraded or failed if functioning disks are removed
and additional problems may be created.
If the boot device is not on this adapter and the device to be removed or installed is a hot-swap device,
follow this procedure. If these conditions do not apply to your situation, see the else section located at the
end of this procedure.
1. Remove or install the device by invoking the SCSI and SCSI RAID Hot Plug Manager.
a. Start the PCI-X Disk Array Manager.
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
Note: The visual indicator on the device will blink at the Identify rate.
f. If removing a device:
1) Select Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure Device.
Note: The visual indicator on the device will blink at the Remove rate.
2) Remove the device.
3) Label the device with the slot it was removed from to ensure it can be reinstalled in the same
location.
else if installing a device:
1) Select Attach a Device to an SCSI Hot Swap Enclosure Device.
Note: The visual indicator on the device will blink at the Remove rate.
2) Insert the device.
2. Unconfigure the adapter
a. Start the PCI-X SCSI Disk Array Manager.
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Unconfigure an Available PCI-X SCSI RAID Controller.
3. Configure the adapter.
a. Start the PCI-X SCSI Disk Array Manager.
1) Start Diagnostics and select Task Selection on the Function Selection screen.
2) Select RAID Array Manager.
3) Select PCI-X SCSI Disk Array Manager.
b. Select Diagnostics and Recovery Options.
c. Select Configure a Defined PCI-X SCSI RAID Controller.
else:
1. Power off the system/logical partition
2. Remove/install the device(s)
3. Power on the system/logical partition
Step 3050-1
Identify the SCSI bus which the problem is occurring on by examining the hardware error log. The
hardware error log may be viewed as follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapter resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Step 3050-2
Have recent changes been made to the SCSI configuration?
NO Go to Step 3050-5.
YES Go to Step 3050-3.
Step 3050-3
Check for the following problems:
v Address conflicts between devices
v Cabling problems such as; configurations that exceed the maximum cable lengths, missing termination,
or excessive termination
v Both internal and external connectors for this SCSI bus are being used at the same time (only one
should have a cable attached)
v If in a Multi-Initiator High-Availability configuration, verify correct cabling and SCSI ID settings as
described in Chapter 4, “Multi-Initiator and High-Availability,” on page 25.
For more details about supported SCSI cabling, refer to RS/6000 Eserver pSeries Adapters, Devices,
and Cable Information for Multiple Bus Systems.
NO Go to Step 3050-5.
YES Go to Step 3050-4.
Step 3050-4
1. Power off the system or logical partition.
2. Correct the problem.
3. Power on the system or logical partition, and run diagnostics in system verification mode on the
adapter.
Did a SCSI bus-related failure occur?
NO Go to Step 3050-16.
YES Go to Step 3050-5.
Step 3050-5
Is problem related to the thermal fuse (that is SRN nnnn-719)?
NO Go to Step 3050-7.
YES Go to Step 3050-6.
If the same error persists, replace the components of the failing SCSI bus in the following order. Wait 10
seconds for the thermal fuse to reset itself between steps.
1. Cable (if present)
2. DASD backplane (if present)
3. System board (if SCSI bus interface logic is on the system board)
To replace a component and verify that the problem was corrected, do the following:
1. Power off the system or logical partition.
2. Replace a component listed above.
3. Power on the system or logical partition, and run diagnostics in system verification mode on the
adapter.
Did you correct the problem?
NO Go to Step 3050-7.
YES Go to Step 3050-16.
Step 3050-7
Determine if any of the disk arrays on the adapter are in a Degraded state as follows:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select List PCI-X SCSI Disk Array Configuration.
3. Select the PCI-X SCSI RAID Controller identified in the hardware error log.
Does any disk array have a State of Degraded?
NO Go to Step 3050-9.
YES Go to Step 3050-8.
Step 3050-8
1. Identify the failed disk(s) by first finding disk arrays with a state of Degraded and then a pdisk for that
disk array which has a state of Failed.
2. Remove the failed disk from each degraded disk array by using the “Common Device Removal and
Installation Procedure” on page 90.
3. Run diagnostics in system verification mode on the adapter.
Did a SCSI bus related failure occur?
NO Go to Step 3050-16.
YES Go to Step 3050-9.
NO Go to Step 3050-12.
YES Go to Step 3050-10.
Step 3050-10
1. Remove one of the non-essential removable media devices by using the “Common Device Removal
and Installation Procedure” on page 90.
2. Run diagnostics in system verification mode on the adapter.
Did a SCSI bus related failure occur?
NO Go to Step 3050-11.
YES Go to Step 3050-9.
Step 3050-11
The last removable media device removed from the SCSI bus may be the cause of the SCSI bus
problems. Follow the repair procedures for that device.
Go to Step 3050-16.
Step 3050-12
Are there any non-essential disks which are not disk array members (for example, 512 byte/sector
standalone disks, hot spare disks, or Array Candidates) on the SCSI bus?
NO Go to Step 3050-15.
YES Go to Step 3050-13.
Step 3050-13
1. Remove one of the non-essential disks devices by using the “Common Device Removal and
Installation Procedure” on page 90.
2. Run diagnostics in system verification mode on the adapter.
Did a SCSI bus related failure occur?
NO Go to Step 3050-14.
YES Go to Step 3050-12.
Step 3050-14
The last disk removed from the SCSI bus may be the cause of the SCSI bus problems. Follow the repair
procedures for that device.
Go to Step 3050-16.
Step 3050-15
Contact your service support organization.
MAP 3051
Use the following to determine what other FRUs, besides a disk, may need to be replaced in order to
solve a problem.
You are sent here when a pdisk (that is physical disk in 522 bytes/sector format) was identified as the
primary FRU to replace in order to solve a problem. However, if replacing the disk did not resolve the
problem then other FRUs may need to be replaced.
Considerations:
v Remove power from the system before connecting and disconnecting cables or devices, as appropriate,
to prevent hardware damage or erroneous diagnostic results.
v Keep in mind that some systems have SCSI and PCI-X bus interface logic integrated onto the system
boards and use a pluggable RAID Enablement Card (a non-PCI form factor card) for these SCSI/PCI-X
busses. An example of such a RAID Enablement Card is FC 5709. For these configurations,
replacement of the RAID Enablement Card is unlikely to solve a SCSI bus related problem since the
SCSI bus interface logic is on the system board.
v Some adapters provide two connectors, one internal and one external, for each SCSI bus. For this type
of adapter, it is not acceptable to use both connectors for the same SCSI bus at the same time. SCSI
bus problems are likely to occur if this is done. However, it is acceptable to use an internal connector
for one SCSI bus and an external connector for another SCSI bus. The internal and external connectors
are labeled to indicate which SCSI bus they correspond to.
Attention: Replacing RAID adapters is not recommended without assistance from your service support
organization when SCSI bus problems exist. Because the adapter may contain non-volatile write cache
data and configuration data for the attached disk arrays, additional problems can be created by replacing a
adapter when SCSI bus problems exist.
Attention: Removing functioning disks in a disk array is not recommended without assistance from your
service support organization. A disk array may become degraded or failed if functioning disks are removed
and additional problems may be created.
Step 3051-1
Identify the SCSI bus which the problem is occurring on by examining the hardware error log. The
hardware error log may be viewed as follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapter resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Step 3051-2
While the error persists, replace the components of the failing SCSI bus in the following order.
1. Cable (if present)
2. Adapter (if SCSI bus interface logic is on the adapter) or system board (if SCSI bus interface logic is
on the system board)
3. DASD backplane (if present)
To replace a component and see if the problem was corrected, do the following:
1. Power off the system/logical partition
2. Replace a component listed above
3. Power on the system/logical partition, and run diagnostics in system verification mode on the adapter
When the problem is resolved then go to MAP 0410: Repair Checkout, in RS/6000 Eserver pSeries
Diagnostic Information for Multiple Bus Systems.
MAP 3090
The problem that occurred is uncommon or complex to resolve. Information should be gathered and
assistance obtained from your service support organization.
Step 3090-1
Record the hardware error log. The hardware error log may be viewed as follows:
1. Start Diagnostics and select Task Selection on the Function Selection screen.
2. Select Display Hardware Error Report.
3. Select Display Hardware Errors for PCI-X SCSI RAID Adapters.
4. Select the adapter resource, or select all adapters resources if the adapter resource is not known.
5. On the Error Summary screen, look for an entry with a SRN corresponding to the problem which sent
you here and select it.
Note: If multiple entries exist for the SRN, some entries could be older versions or a problem has
occurred on multiple entities (such as adapters, disk arrays, and devices). Older entries can be
ignored, however, this MAP may need to be used multiple times if the same problem has
occurred on multiple entities.
6. Select the hardware error log to view.
7. Go to Step 3090-2.
Step 3090-2
Collect any hardware error logged about the same time for the adapter.
Go to Step 3090-3.
Step 3090-3
Collect the current disk array configuration. The disk array configuration may be viewed as follows:
1. Start the PCI-X SCSI Disk Array Manager.
a. Start Diagnostics and select Task Selection on the Function Selection screen.
96 PCI-X SCSI RAID Controller: Reference Guide for AIX
b. Select RAID Array Manager.
c. Select PCI-X SCSI Disk Array Manager.
2. Select List PCI-X SCSI Disk Array Configuration.
3. Select the PCI-X SCSI RAID Controller identified in the hardware error log.
4. Go to Step 3090-4.
Step 3090-4
Contact your service support organization.
Note: Only Error IDs of the form SISIOA_xxxx are potentially related to disk arrays.
3. Locate the SENSE DATA in the Detail Data.
4. Identify the bytes 24-27 of the SENSE DATA from the 32 bytes shown.
Note: Use the following example AIX Error Log to help you identify bytes 24-27.
5. The first four digits of the SRN, known as the Failing Function Code (FFC), can be found in the
following table:
6. The second four digits of the SRN, known as the Reason Code is equal to bytes 26-27 of the SENSE
DATA
Description
DISK ARRAY PROTECTION SUSPENDED
Recommended Actions
PERFORM PROBLEM DETERMINATION PROCEDURES
Detail Data
SENSE DATA
00FF FFFF 1900 00F0 066B 0200 0201 0000 0309 0025 0000 0004 5703 9030 0000 0000
^ ^
| |
| Last 4-digits of SRN (bytes 26-27)
CCIN of Controller (bytes 24-25)
ARRAY INFORMATION
Location S/N RAID Level
ff-0,0 23897FFD 5
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission
limits. Neither the provider nor the manufacturer is responsible for any radio or television interference
caused by using other than recommended cables and connectors or by unauthorized changes or
modifications to this equipment. Unauthorized changes or modifications could void the user’s authority to
operate the equipment.
This device complies with Part 15 of the FCC Rules. Operation is subject to the following two conditions:
(1) this device may not cause harmful interference, and (2) this device must accept any interference
received, including interference that may cause undesired operation.
This product has been tested and found to comply with the limits for Class A Information Technology
Equipment according to CISPR 22 / European Standard EN 55022. The limits for Class A equipment were
derived for commercial and industrial environments to provide reasonable protection against interference
with licensed communication equipment.
Attention: This is a Class A product. In a domestic environment this product may cause radio
interference in which case the user may be required to take adequate measures.
The network adapter interfaces housed within this equipment are approved separately, each one having its
own independent approval number. These interface adapters, supplied by the manufacturer, do not use or
contain excessive voltages. An excessive voltage is one which exceeds 70.7 V peak ac or 120 V dc. They
99
interface with this equipment using Safe Extra Low Voltages only. In order to maintain the separate
(independent) approval of the manufacturer’s adapters, it is essential that other optional cards, not
supplied by the manufacturer, do not use main voltages or any other excessive voltages. Seek advice from
a competent engineer before installing other adapters not supplied by the manufacturer.
VCCI Statement
The following is a summary of the VCCI Japanese statement in the box above.
This is a Class A product based on the standard of the Voluntary Control Council for Interference by
Information Technology Equipment (VCCI). If this equipment is used in a domestic environment, radio
disturbance may arise. When such trouble occurs, the user may be required to take corrective actions.
Warning: This is a Class A product. In a domestic environment this product may cause radio interference
in which case the user will be required to take adequate measures.
Dieses Gerät erfüllt die Bedingungen der EN 55022 Klasse A. Für diese von Geräten gilt folgende
Bestimmung nach dem EMVG:
Hinweis
Dieses Genehmigungsverfahren ist von der Deutschen Bundespost noch nicht veröffentlicht worden.
The manufacturer may not offer the products, services, or features discussed in this document in other
countries. Consult the manufacturer’s representative for information on the products and services currently
available in your area. Any reference to the manufacturer’s product, program, or service is not intended to
state or imply that only that product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any intellectual property right of the manufacturer may be used
instead. However, it is the user’s responsibility to evaluate and verify the operation of any product,
program, or service.
The manufacturer may have patents or pending patent applications covering subject matter described in
this document. The furnishing of this document does not give you any license to these patents. You can
send license inquiries, in writing, to the manufacturer.
The following paragraph does not apply to the United Kingdom or any country where such
provisions are inconsistent with local law: THIS MANUAL IS PROVIDED ″AS IS″ WITHOUT
WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A
PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain
transactions; therefore, this statement may not apply to you.
This information could include technical inaccuracies or typographical errors. Changes are periodically
made to the information herein; these changes will be incorporated in new editions of the publication. The
manufacturer may make improvements and/or changes in the product(s) and/or the program(s) described
in this publication at any time without notice.
Information concerning products made by other than the manufacturer was obtained from the suppliers of
those products, their published announcements, or other publicly available sources. The manufacturer has
not tested those products and cannot confirm the accuracy of performance, compatibility or any other
claims related to products made by other than the manufacturer. Questions on the capabilities of products
made by other than the manufacturer should be addressed to the suppliers of those products.
103
104 PCI-X SCSI RAID Controller: Reference Guide for AIX
Appendix C. AIX Command-Line Interface
Many tasks used to manage the PCI-X SCSI RAID Controller can be performed by using the AIX
command line instead of using the PCI-X SCSI Disk Array Manager as described throughout this guide.
The following table summarizes these commands.
105
106 PCI-X SCSI RAID Controller: Reference Guide for AIX
Index
A hot spare
adding disks 20
adding disk to existing array 19
deleting disks 21
AIX command-line interface 105
Hot Spare Disks 20
avoiding shock hazard vii
B L
location codes 61
battery
displaying information 33
error, forcing 34
bus settings
M
viewing and changing 21 Maintenance Analysis Procedures (MAPs) 64
maintenance tips 31
MAP 3000 64
C MAP 3010 64
MAP 3011 67
Cache Battery Pack
MAP 3012 69
replacing 35
MAP 3013 70
capacities, disk array 8
MAP 3020 72
checkpoints 64
MAP 3021 74
command-line interface 105
MAP 3030 75
common tasks 15
MAP 3031 76
controller microcode, updating 32
MAP 3032 80
creating disk array 16
MAP 3033 82
creating disks hot spare 20
MAP 3034 82
MAP 3035 85
MAP 3036 86
D MAP 3040 87, 88
deleting disks hot spare 21 MAP 3050 90
deleting, disk array 19 MAP 3051 95
description MAP 3090 96
RAID Level 0 4
RAID Level 10 7
RAID Level 5 5
disk array
O
overview, controller 1
adding disk to an existing 19
overview, disk arrays 9
deleting 19
viewing configuration 17
disk array problems
identifying 62
P
disk array settings PCI-X SCSI Disk Array Manager, using 15
viewing 23 pdisk descriptions 10
disk array settings, viewing 23 pdisk settings
disk array states 9 viewing 23
disk array, capacities 8 pdisks
disk arrays, definitions 2 replacing 58
disk arrays, overview 9 pdisks, changing to hdisks 32
physical disks (pdisks), states 10
preparing disks for disk array use 16
E problem determination 61
examples
AIX error log 98
R
RAID Level
H recovery procedures
RAID Level 0 4
59
hdisks, changing from pdisks 32
recovery procedures 59
highlighting xi
RAID Level 10 7
107
RAID Level 10 (continued)
recovery procedures 60
RAID Level 5 5
RAID Levels 3, 8
rechargeable battery error, forcing 34
rechargeable battery information, displaying 33
Rechargeable Battery Maintenance 33
rechargeable Cache Battery Pack, replacing 35
recovery 61
recovery procedures
RAID Level 0 59
RAID Level 10 60
RAID Level 5 59
RAID Level 6 59
related publications xi
S
Service Request Number (SRN) Table 62
shock hazard, avoiding vii
software, verifying installation 13
SRN
finding 97
table 62
states
disk arrays (hdisks) 9
physical disks (pdisks) 10
stripe-unit size 8
summary, RAID Level 8
V
verifying installation of controller software 13
viewing
disk array settings 23
viewing and changing, bus settings 21
viewing disk array configuration 17
We appreciate your comments about this publication. Please comment on specific errors or omissions, accuracy,
organization, subject matter, or completeness of this book. The comments you send should pertain to only the
information in this manual or product and the way in which the information is presented.
For technical questions and information about products and prices, please contact your IBM branch office, your IBM
business partner, or your authorized remarketer.
When you send comments to IBM, you grant IBM a nonexclusive right to use or distribute your comments in any
way it believes appropriate without incurring any obligation to you. IBM or any other organizations will only use the
personal information that you supply to contact you about the issues that you state on this form.
Comments:
If you would like a response from IBM, please fill in the following information:
Name Address
Company or Organization
_ _ _ _ _ _ _Fold
_ _ _and
_ _ _Tape
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _Please
_ _ _ _ _do
_ _not
_ _ staple
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _Fold
_ _ _and
_ _ Tape
______
NO POSTAGE
NECESSARY
IF MAILED IN THE
UNITED STATES
Information Development
Department H6DS-905-6C006
11501 Burnet Road
Austin, TX 78758-3493
_________________________________________________________________________________________
Fold and Tape Please do not staple Fold and Tape
Cut or Fo
SA23-1323-03 Along Lin
Printed in USA
SA23-1323-03