Bull Escala BL460 Troubleshooting guide

Escala BL460
Problem Determination and
Service Guide
ESCALA Blade
REFERENCE
86 A7 81FB 00
Hardware
October 2009
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
The following copyright notice protects this book under copyright laws, which prohibit actions including, but not limited
to, copying, distributing, modifying, and making derivative works.
Copyright Bull SAS 2009
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this
book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the rights of the proprietors of the trademarks mentioned in this manual.
All brand names and software and hardware product names are subject to trademark and/or patent protection.
Quoting of brand and product names is for information purposes only and does not represent trademark misuse.
The information in this document is subject to change without notice. Bull will not be liable for errors
contained herein, or for incidental or consequential damages in connection with the use of this material.
Preface
Table of Contents
List of Figures.......................................................................................................... iv
List of Tables ............................................................................................................ v
Safety ...................................................................................................................vii
Safety statements.......................................................................................................................... viii
Guidelines for trained service technicians........................................................................................ xiv
Inspecting for unsafe conditions ..................................................................................................... xiv
Guidelines for servicing electrical equipment .................................................................................... xv
Chapter 1. Introduction .......................................................................................1
1.1 Related documentation........................................................................................................ 1
1.2 Notices and statements in this documentation ........................................................................ 2
1.3 Features and specifications.................................................................................................. 3
1.4 Supported DIMMs .............................................................................................................. 5
1.5 Blade server control panel buttons and LEDs.......................................................................... 7
1.6 Turning on the blade server ............................................................................................... 10
1.7 Turning off the blade server............................................................................................... 11
1.8 System-board layouts........................................................................................................ 12
1.9 System-board connectors................................................................................................... 12
1.10 System-board LEDs ........................................................................................................... 14
Chapter 2. Diagnostics..................................................................................... 15
2.1 Diagnostic tools ............................................................................................................... 16
2.2 Collecting dump data ....................................................................................................... 17
2.3 Location codes................................................................................................................. 18
2.4 Reference codes............................................................................................................... 19
2.4.1 System reference codes (SRCs) ................................................................................. 20
2.4.2 POST progress codes (checkpoints) ........................................................................... 71
2.4.3 Service request numbers (SRNs).............................................................................. 109
2.5 Error logs ...................................................................................................................... 147
2.6 Checkout procedure ....................................................................................................... 148
2.6.1 About the checkout procedure ................................................................................ 148
2.6.2 Performing the checkout procedure ......................................................................... 149
2.7 Verifying the partition configuration..................................................................................152
2.8 Running the diagnostics program .....................................................................................152
2.8.1 Starting AIX concurrent diagnostics .........................................................................152
2.8.2 Starting stand-alone diagnostics from a CD ..............................................................152
2.8.3 Starting stand-alone diagnostics from a NIM server...................................................153
2.8.4 Using the diagnostics program ...............................................................................155
2.9 Boot problem resolution...................................................................................................156
2.10 Troubleshooting tables ....................................................................................................158
2.10.1 General problems .................................................................................................158
2.10.2 Drive problems .....................................................................................................158
2.10.3 Intermittent problems..............................................................................................159
2.10.4 Keyboard problems...............................................................................................159
2.10.5 Management module service processor problems .....................................................160
2.10.6 Memory problems .................................................................................................161
2.10.7 Microprocessor problems .......................................................................................161
2.10.8 Monitor or video problems .....................................................................................162
2.10.9 Network connection problems ................................................................................163
2.10.10 PCI expansion card (PIOCARD) problem isolation procedure .....................................164
2.10.11 Optional device problems ......................................................................................165
2.10.12 Power problems ....................................................................................................165
2.10.13 POWER® Hypervisor (PHYP) problems ....................................................................167
2.10.14 Service processor problems ....................................................................................169
2.10.15 Software problems ................................................................................................181
2.10.16 Universal Serial Bus (USB) port problems .................................................................181
2.11 Light path diagnostics .....................................................................................................182
2.11.1 Viewing the light path diagnostic LEDs.....................................................................182
2.11.2 Light path diagnostics LEDs.....................................................................................184
2.12 Firmware problem isolation .............................................................................................185
2.13 Recovering the system firmware .......................................................................................186
2.13.1 Starting the PERM image........................................................................................186
2.13.2 Starting the TEMP image ........................................................................................186
2.13.3 Recovering the TEMP image from the PERM image....................................................187
2.13.4 Verifying the system firmware levels ........................................................................187
2.13.5 Committing the TEMP system firmware image ...........................................................188
2.14 Solving shared Bull Blade Chassis – Enterprise resource problems........................................188
2.14.1 Solving shared keyboard problems .........................................................................189
2.14.2 Solving shared media tray problems........................................................................190
2.14.3 Solving shared network connection problems ...........................................................192
2.14.4 Solving shared power problems..............................................................................193
2.14.5 Solving shared video problems ...............................................................................193
2.15 Solving undetermined problems .......................................................................................194
2.16 Calling Bull for service ....................................................................................................196
Chapter 3. Parts listing....................................................................................197
Chapter 4. Removing and replacing blade server components ............................... 201
4.1 Installation guidelines ..................................................................................................... 201
4.1.2 System reliability guidelines ................................................................................... 202
4.1.3 Handling static-sensitive devices ............................................................................. 202
4.1.4 Returning a device or component............................................................................ 203
4.2 Removing the blade server from a Bull Blade Chassis - Enterprise......................................... 203
4.3 Installing the blade server in a Bull Blade Chassis - Enterprise.............................................. 204
4.4 Removing and replacing Tier 1 CRUs ............................................................................... 206
4.4.1 Removing the blade server cover ............................................................................ 206
4.4.2 Installing and closing the blade server cover ............................................................ 208
4.4.3 Removing the bezel assembly................................................................................. 209
4.4.4 Installing the bezel assembly .................................................................................. 210
4.4.5 Removing a drive.................................................................................................. 211
4.4.6 Installing a drive ................................................................................................... 212
4.4.7 Removing a memory module .................................................................................. 213
4.4.8 Installing a memory module.................................................................................... 214
4.4.9 Removing the management card............................................................................. 216
4.4.10 Installing the management card .............................................................................. 217
4.4.11 Removing and installing an I/O expansion card....................................................... 219
4.4.12 Removing the battery............................................................................................. 224
4.4.13 Installing the battery .............................................................................................. 225
4.4.14 Removing the disk drive tray .................................................................................. 227
4.4.15 Installing the hard disk drive tray ............................................................................ 228
4.5 Replacing the Tier 2 system-board and chassis assembly .................................................... 229
Chapter 5. Configuring .................................................................................. 231
5.1 Updating the firmware.................................................................................................... 231
5.2 Configuring the blade server ........................................................................................... 233
5.3 Using the SMS utility....................................................................................................... 234
5.3.1 Starting the SMS utility........................................................................................... 234
5.3.2 SMS utility menu choices ....................................................................................... 234
5.4 Creating a CE login ....................................................................................................... 235
5.5 Configuring the Gigabit Ethernet controllers ...................................................................... 235
5.6 Blade server Ethernet controller enumeration ..................................................................... 236
5.7 MAC addresses for host Ethernet adapters ........................................................................ 237
5.8 Updating IBM System Director ......................................................................................... 238
Appendix A. Getting help and technical assistance.................................................... 239
Before you call........................................................................................................................... 239
Using the documentation............................................................................................................. 239
Appendix B. Notices.............................................................................................241
Important Notes .........................................................................................................................241
Product recycling and disposal.....................................................................................................242
Electronic emission notices...........................................................................................................243
Industry Canada Class A emission compliance statement ................................................................243
Australia and New Zealand Class A statement ..............................................................................243
United Kingdom telecommunications safety requirement..................................................................243
European Union EMC Directive conformance statement ..................................................................244
Taiwanese Class A warning statement ..........................................................................................244
Chinese Class A warning statement ..............................................................................................244
Japanese Voluntary Control Council for Interference (VCCI) statement...............................................244
List of Figures
Figure 1-1. DIMM connectors ..........................................................................................................6
Figure 1-2. Blade server control panel buttons and LEDs .....................................................................7
Figure 1-3. System-board connectors ..............................................................................................12
Figure 1-4. DIMM connectors ........................................................................................................13
Figure 1-5. System-board LEDs.......................................................................................................14
Figure 2-1. Light path diagnostic LEDs...........................................................................................183
Figure 3-1. Parts illustration .........................................................................................................197
Figure 4-1. Removing the blade server from the Bull Blade Chassis - Enterprise ..................................203
Figure 4-2. Installing the blade server in a Bull Blade Chassis - Enterprise .........................................204
Figure 4-3. Removing the cover....................................................................................................206
Figure 4-4. Installing the cover .....................................................................................................208
Figure 4-5. Removing the bezel assembly......................................................................................209
Figure 4-6. Installing the bezel assembly .......................................................................................210
Figure 4-7. Removing a SAS hard disk..........................................................................................211
Figure 4-8. Installing a SAS hard disk ...........................................................................................212
Figure 4-9. DIMM connectors ......................................................................................................213
Figure 4-10. DIMM connectors ......................................................................................................214
Figure 4-11. Removing the management card..................................................................................216
Figure 4-12. Installing the management card ...................................................................................217
Figure 4-13. Removing a CIOv form-factor expansion card from the 1Xe connector.............................219
Figure 4-14. Installing a CIOv form-factor expansion card.................................................................220
Figure 4-15. Removing a combination-form-factor expansion card......................................................222
Figure 4-16. Installing a combination-form-factor expansion card.......................................................223
Figure 4-17. Removing the battery..................................................................................................224
Figure 4-18. Installing the battery ...................................................................................................225
Figure 4-19. Removing the hard disk drive tray................................................................................227
Figure 4-20. Installing the hard disk drive tray .................................................................................228
List of Tables
Table 1-1. Memory module combinations ........................................................................................ 5
Table 1-2. Connectors description................................................................................................. 12
Table 1-3. System-board LEDs locations ......................................................................................... 14
Table 2-1. Location code ............................................................................................................. 18
Table 2-2. Nine-word system reference code in the management-module event log............................. 20
Table 2-3. Management module reference code listing.................................................................... 20
Table 2-4. 1xxxyyyy SRCs ........................................................................................................... 22
Table 2-5. 6xxxyyyy SRCs ........................................................................................................... 26
Table 2-6. A1xxyyyy service processor SRCs.................................................................................. 27
Table 2-7. A200yyyy Logical partition SRCs .................................................................................. 27
Table 2-8. A700yyyy Licensed internal code SRCs.......................................................................... 28
Table 2-9. AA00E1A8 to AA260005 Partition firmware attention codes........................................... 28
Table 2-10. B181xxxx Service processor early termination SRCs........................................................ 30
Table 2-11. B200xxxx Logical partition SRCs................................................................................... 31
Table 2-12. B700xxxx Licensed internal code SRCs .......................................................................... 39
Table 2-13. BA000010 to BA400002 Partition firmware SRCs.......................................................... 45
Table 2-14. Management module reference code listing.................................................................... 71
Table 2-15. C1001F00 to C1645300 checkpoints........................................................................... 72
Table 2-16. C2001000 to C20082FF checkpoints ........................................................................... 79
Table 2-17. C700xxxx Server firmware IPL status checkpoints............................................................ 85
Table 2-18. CA000000 to CA2799FF checkpoints........................................................................... 85
Table 2-19. D1001xxx to D1xx3FFF dump codes........................................................................... 101
Table 2-20. D1xx3y01 to D1xx3yF2 checkpoints ........................................................................... 105
Table 2-21. D1xx900C to D1xxC003 checkpoints ......................................................................... 107
Table 2-22. 101-711 through FFC-725 SRNs................................................................................. 110
Table 2-23. Meaning of the last character (x) after the hyphen ......................................................... 127
Table 2-24. A00-FF0 through A24-xxx SRNs.................................................................................. 127
Table 2-25. ssss-102 through ssss-640 SRNs.................................................................................. 141
Table 2-26. Failing function codes 151 through 2D02 .................................................................... 144
Table 2-27. Nine-word system reference code in the management-module event log........................... 147
Table 2-28. Nine-word system reference code in the management-module event log........................... 164
Table 2-29. PCI expansion card problem isolation procedure........................................................... 164
Table 2-30. POWER® Hypervisor isolation procedures ................................................................... 167
Table 2-31. Light path diagnostic LED descriptions........................................................................ 184
Table 3-1. Parts table ................................................................................................................ 198
Table 4-1. Memory module combination...................................................................................... 214
Table 5-1. MAC addressing scheme for physical and logical host Ethernet adapters......................... 237
Safety
Safety statements
Important:
Each caution and danger statement in this documentation begins with a number. This
number is used to cross-reference an English-language caution or danger statement with
the translated versions of that statement in the Bull Safety Attention document.
For example, if a caution statement begins with the number 1, the translations of that caution
statement appear in the Bull Safety Attention document under statement 1.
Be sure to read all caution and danger statements in this documentation before performing
the instructions. Read any additional safety documentation that comes with your computer or
optional device before you install the device.
Guidelines for trained service technicians
Inspecting for unsafe conditions
Guidelines for servicing electrical equipment