Bull Escala Blade Server EL260B Troubleshooting guide

  • Hello! I am an AI chatbot trained to assist you with the Bull Escala Blade Server EL260B Troubleshooting guide. I’ve already reviewed the document and can help you find the information you need or explain it in simple terms. Just ask your questions, and providing more details will help me assist you more effectively!
Escala Blade
Server EL260B
Problem Determination and
Service Guide
ESCALA BLADE
SERVERS
REFERENCE
86 A1 36FA 00
ESCALA BLADE SERVERS
Escala Blade Server
EL260B
Problem Determination and Service
Guide
Hardware
July 2008
BULL CEDOC
357 AVENUE PATTON
B.P.20845
49008 ANGERS CEDEX 01
FRANCE
REFERENCE
86 A1 36FA 00
The following copyright notice protects this book under Copyright laws which prohibit such actions as, but not limited
to, copying, distributing, modifying, and making derivative works.
Copyright
Bull SAS 2008
Printed in France
Suggestions and criticisms concerning the form, content, and presentation of this
book are invited. A form is provided at the end of this book for this purpose.
To order additional copies of this book or other Bull Technical Publications, you
are invited to use the Ordering Form also provided at the end of this book.
Trademarks and Acknowledgements
We acknowledge the rights of the proprietors of the trademarks mentioned in this manual.
All brand names and software and hardware product names are subject to trademark and/or patent protection.
Quoting of brand and product names is for information purposes only and does not represent trademark misuse.
The information in this document is subject to change without notice. Bull will not be liable for errors
contained herein, or for incidental or consequential damages in connection with the use of this material.
Preface
i
Table of Contents
List of Figures........................................................................................................... v
List of Tables ........................................................................................................... vi
Safety ...................................................................................................................vii
Safety statements.......................................................................................................................... viii
Guidelines for trained service technicians........................................................................................ xiv
Inspecting for unsafe conditions ..................................................................................................... xiv
Guidelines for servicing electrical equipment .................................................................................... xv
Chapter 1. Introduction .......................................................................................1
1.1 Related documentation........................................................................................................ 1
1.2 Notices and statements in this documentation ........................................................................ 2
1.3 Features and specifications.................................................................................................. 3
1.4 Supported DIMMs .............................................................................................................. 5
1.5 Blade server control panel buttons and LEDs.......................................................................... 6
1.6 Turning on the blade server ................................................................................................. 8
1.7 Turning off the blade server................................................................................................. 9
1.8 System-board layouts........................................................................................................ 10
1.9 System-board connectors................................................................................................... 10
1.10 System-board LEDs ........................................................................................................... 11
Chapter 2. Diagnostics..................................................................................... 13
2.1 Diagnostic tools ............................................................................................................... 14
2.2 Collecting dump data ....................................................................................................... 15
2.3 Location codes................................................................................................................. 16
2.4 Reference codes............................................................................................................... 17
2.4.1 System reference codes (SRCs) ................................................................................. 18
2.4.2 POST progress codes (checkpoints) ........................................................................... 69
2.4.3 Service request numbers (SRNs).............................................................................. 107
2.5 Error logs ...................................................................................................................... 139
2.6 Checkout procedure ....................................................................................................... 140
2.6.1 About the checkout procedure ................................................................................ 140
2.6.2 Performing the checkout procedure ......................................................................... 141
ii Escala Blade EL260B - Problem Determination and Service Guide
2.7
Verifying the partition configuration..................................................................................144
2.8 Running the diagnostics program .....................................................................................144
2.8.1 Starting AIX concurrent diagnostics .........................................................................144
2.8.2 Starting stand-alone diagnostics from a CD ..............................................................144
2.8.3 Starting stand-alone diagnostics from a NIM server...................................................146
2.8.4 Using the diagnostics program ...............................................................................147
2.9 Boot problem resolution...................................................................................................148
2.10 Troubleshooting tables ....................................................................................................150
2.10.1 CD or DVD drive problems.....................................................................................150
2.10.2 Diskette drive problems..........................................................................................151
2.10.3 General problems .................................................................................................151
2.10.4 Hard disk drive problems .......................................................................................152
2.10.5 Intermittent problems..............................................................................................152
2.10.6 Keyboard problems...............................................................................................153
2.10.7 Management module service processor problems .....................................................153
2.10.8 Memory problems .................................................................................................154
2.10.9 Microprocessor problems .......................................................................................154
2.10.10 Monitor or video problems .....................................................................................155
2.10.11 Network connection problems ................................................................................156
2.10.12 PCI expansion card (PIOCARD) problem isolation procedure .....................................157
2.10.13 Optional device problems ......................................................................................158
2.10.14 Power problems ....................................................................................................158
2.10.15 POWER Hypervisor (PHYP) problems.......................................................................160
2.10.16 Service processor problems ....................................................................................162
2.10.17 Software problems ................................................................................................174
2.10.18 Universal Serial Bus (USB) port problems .................................................................175
2.11 Light path diagnostics .....................................................................................................175
2.11.1 Viewing the light path diagnostic LEDs.....................................................................175
2.11.2 Light path diagnostics LEDs.....................................................................................177
2.12 Isolating firmware problems.............................................................................................178
2.13 Recovering the system firmware .......................................................................................179
2.13.1 Starting the PERM image........................................................................................179
2.13.2 Starting the TEMP image ........................................................................................179
2.13.3 Recovering the TEMP image from the PERM image....................................................180
2.13.4 Verifying the system firmware levels ........................................................................180
2.13.5 Committing the TEMP system firmware image ...........................................................181
2.14 Solving shared Blade resource problems ...........................................................................181
2.14.1 Solving shared keyboard problems .........................................................................182
2.14.2 Solving shared media tray problems........................................................................183
2.14.3 Solving shared network connection problems ...........................................................185
2.14.4 Solving shared power problems..............................................................................186
2.14.5 Solving shared video problems ...............................................................................186
2.15 Solving undetermined problems .......................................................................................187
Chapter 3. Parts listing....................................................................................189
Preface
iii
Chapter 4. R
emoving and replacing blade server components ................................191
4.1 Installation guidelines ..................................................................................................... 191
4.1.1 System reliability guidelines ................................................................................... 192
4.1.2 Handling static-sensitive devices ............................................................................. 192
4.1.3 Returning a device or component............................................................................ 193
4.2 Removing the blade server from a Bull Blade Chassis ......................................................... 193
4.3 Installing the blade server in a Bull Blade Chassis .............................................................. 195
4.4 Removing and replacing Tier 1 CRUs ............................................................................... 197
4.4.1 Removing the blade server cover ............................................................................ 197
4.4.2 Installing and closing the blade server cover ............................................................ 198
4.4.3 Removing the bezel assembly................................................................................. 199
4.4.4 Installing the bezel assembly .................................................................................. 200
4.4.5 Removing a SAS hard disk drive............................................................................. 201
4.4.6 Installing a SAS hard disk drive .............................................................................. 202
4.4.7 Removing a memory module .................................................................................. 204
4.4.8 Installing a memory module.................................................................................... 205
4.4.9 Removing the management card............................................................................. 207
4.4.10 Installing the management card .............................................................................. 208
4.4.11 Entering vital product data ..................................................................................... 209
4.4.12 Removing and installing an I/O expansion card....................................................... 211
4.4.13 Removing the battery............................................................................................. 219
4.4.14 Installing the battery .............................................................................................. 220
4.4.15 Removing the hard disk drive tray........................................................................... 222
4.4.16 Installing the hard disk drive tray ............................................................................ 223
4.4.17 Removing the expansion bracket............................................................................. 224
4.4.18 Installing the expansion bracket.............................................................................. 225
4.5 Replacing the Tier 2 system-board and chassis assembly .................................................... 226
Chapter 5. C
onfiguring .................................................................................. 229
5.1 Updating the firmware.................................................................................................... 229
5.2 Configuring the blade server ........................................................................................... 230
5.3 Using the SMS utility....................................................................................................... 231
5.3.1 Starting the SMS utility........................................................................................... 231
5.3.2 SMS utility menu choices ....................................................................................... 231
5.4 Creating a CE login ....................................................................................................... 232
5.5 Configuring the Gigabit Ethernet controllers ...................................................................... 232
5.6 Blade server Ethernet controller enumeration ..................................................................... 233
5.7 MAC addresses for host Ethernet adapters ........................................................................ 234
Appendix A. Getting help and technical assistance.................................................... 237
Before you call........................................................................................................................... 237
iv Escala Blade EL260B - Problem Determination and Service Guide
Appendix B. Notices.............................................................................................239
Important Notes .........................................................................................................................239
Product recycling and disposal.....................................................................................................240
Electronic emission notices...........................................................................................................241
Industry Canada Class A emission compliance statement ................................................................241
Australia and New Zealand Class A statement ..............................................................................241
United Kingdom telecommunications safety requirement..................................................................241
European Union EMC Directive conformance statement ..................................................................242
Taiwanese Class A warning statement
...........................................................................................242
Chinese Class A warning statement ..............................................................................................242
Japanese Voluntary Control Council for Interference (VCCI) statement...............................................242
Preface
v
List of Figures
Figure 1-1.
Blade server control panel buttons and LEDs ..................................................................... 6
Figure 1-2. System-board connectors .............................................................................................. 10
Figure 1-3. System-board LEDs....................................................................................................... 11
Figure 2-1. Light path diagnostic LEDs .......................................................................................... 176
Figure 3-1. Parts illustration ......................................................................................................... 189
Figure 4-1. Removing the blade server from the Bull Blade Chassis .................................................. 193
Figure 4-2. Installing the blade server in a Bull Blade Chassis.......................................................... 195
Figure 4-3. Removing the cover ................................................................................................... 197
Figure 4-4. Installing the cover..................................................................................................... 198
Figure 4-5. Removing the bezel assembly ..................................................................................... 199
Figure 4-6. Installing the bezel assembly....................................................................................... 200
Figure 4-7. Removing a SAS hard disk ......................................................................................... 201
Figure 4-8. Installing a SAS hard disk........................................................................................... 202
Figure 4-9. Removing a memory module....................................................................................... 204
Figure 4-10. Installing a memory module ........................................................................................ 205
Figure 4-11. Removing the management card ................................................................................. 207
Figure 4-12. Installing the management card................................................................................... 208
Figure 4-13. Removing a small form factor (SFF) expansion card ....................................................... 212
Figure 4-14. Installing a small-form-factor expansion card................................................................. 213
Figure 4-15. Removing a standard-form-factor expansion card .......................................................... 214
Figure 4-16. Installing a standard-form-factor expansion card............................................................ 215
Figure 4-17. Removing a combination-form-factor expansion card ..................................................... 216
Figure 4-18. Installing a combination-form-factor expansion card....................................................... 217
Figure 4-19. Removing the battery ................................................................................................. 219
Figure 4-20. Installing the battery................................................................................................... 220
Figure 4-21. Removing the hard disk drive tray................................................................................ 222
Figure 4-22. Installing the hard disk drive tray................................................................................. 223
Figure 4-23. Removing the expansion bracket ................................................................................. 224
Figure 4-24. Installing the expansion bracket................................................................................... 225
vi Escala Blade EL260B - Problem Determination and Service Guide
List of Tables
Table 1-1.
Supported use of DIMMs ................................................................................................5
Table 2-1. Location code..............................................................................................................16
Table 2-2. Nine-word system reference code in the management-module event log .............................18
Table 2-3. Management module reference code listing ....................................................................18
Table 2-4. 1xxxyyyy SRCs............................................................................................................20
Table 2-5. 6xxxyyyy SRCs............................................................................................................24
Table 2-6. A1xxyyyy service processor SRCs..................................................................................25
Table 2-7. A200yyyy Logical partition SRCs...................................................................................25
Table 2-8. A700yyyy Licensed internal code SRCs..........................................................................26
Table 2-9. AA00E1A8 to AA260005 Partition firmware attention codes ...........................................26
Table 2-10. B181xxxx Service processor early termination SRCs ........................................................28
Table 2-11. B200xxxx Logical partition SRCs ...................................................................................29
Table 2-12. B700xxxx Licensed internal code SRCs ..........................................................................37
Table 2-13. BA000010 to BA400002 Partition firmware SRCs ..........................................................43
Table 2-14. Management module reference code listing ....................................................................69
Table 2-15. C1001F00 to C1645300 checkpoints...........................................................................70
Table 2-16. C2001000 to C20082FF checkpoints ...........................................................................77
Table 2-17. C700xxxx Server firmware IPL status checkpoints ............................................................83
Table 2-18. CA000000 to CA2799FF checkpoints........................................................................... 83
Table 2-19. D1001xxx to D1xx3FFF dump codes .............................................................................99
Table 2-20. D1xx3y01 to D1xx3yF2 checkpoints ...........................................................................104
Table 2-21. D1xx900C to D1xxC003 checkpoints..........................................................................106
Table 2-22. 101-711 through FFC-725 SRNs .................................................................................108
Table 2-23. Meaning of the last character (x) after the hyphen .........................................................120
Table 2-24. A00-FF0 through A24-xxx SRNs ..................................................................................120
Table 2-25. ssss-102 through ssss-640 SRNs ..................................................................................134
Table 2-26. Failing function codes 151 through 2D02.....................................................................137
Table 2-27. Nine-word system reference code in the management-module event log ...........................139
Table 2-28. Nine-word system reference code in the management-module event log ...........................157
Table 2-29. PCI expansion card problem isolation procedure...........................................................157
Table 2-30. POWER Hypervisor isolation procedures ......................................................................160
Table 2-31. . Light path diagnostic LED descriptions ........................................................................177
Table 3-1. Parts table.................................................................................................................190
Table 4-1. ESCALA EL260B vital product data..............................................................................210
Table 5-1. MAC addressing scheme for physical and logical host Ethernet adapters .........................234
Preface
vii
Safety
viii Escala Blade EL260B - Problem Determination and Service Guide
Safety statements
Important:
Each caution and danger statement in this documentation begins with a number. This
number is used to cross reference an English-language caution or danger statement with
translated versions of the caution or danger statement in the Bull Safety Attention document.
For example, if a caution statement begins with a number 1, translations for that caution
statement appear in the Bull Safety Attention document under statement 1.
Be sure to read all caution and danger statements in this documentation before performing
the instructions. Read any additional Safety Attention that comes with your computer or
optional device before you install the device.
Preface
ix
x Escala Blade EL260B - Problem Determination and Service Guide
Preface
xi
xii Escala Blade EL260B - Problem Determination and Service Guide
Preface
xiii
xiv Escala Blade EL260B - Problem Determination and Service Guide
Guidelines for trained service technicians
Inspecting for unsafe conditions
Preface
xv
Guidelines for servicing electrical equipment
xvi Escala Blade EL260B - Problem Determination and Service Guide
/