iv bullx cluster suite - Maintenance Guide
3.1.2 Using ipmi Tools .................................................................................................... 3-3
3.2 Managing Hardware ......................................................................3-4
3.2.1 Managing Nodes and CMC using nsctrl................................................................... 3-4
3.2.2 Managing PDUs using nsctrl or clmpdu..................................................................... 3-5
3.2.3 Using Remote Hardware Management CLI (BSM Commands)...................................... 3-7
3.2.4 Using nsfirm command ........................................................................................... 3-8
3.3 Using Argos to maintain the cluster....................................................3-9
3.4 Collecting Information for Resolving Problems ...................................3-10
Chapter 4. Managing System Logs.......................................................................... 4-1
4.1 Introduction to syslog-ng...................................................................4-1
4.2 Configuring syslog-ng......................................................................4-1
4.2.1 options Section ...................................................................................................... 4-2
4.2.2 source Section ....................................................................................................... 4-2
4.2.3 destination Section ................................................................................................. 4-3
4.2.4 filter Section .......................................................................................................... 4-4
4.2.5 log Section ............................................................................................................ 4-4
Chapter 5. Monitoring the System and Devices........................................................ 5-1
5.1 Monitoring the System .....................................................................5-1
5.1.1 Time ..................................................................................................................... 5-1
5.1.2 IOstat ................................................................................................................... 5-1
5.1.3 dstat ..................................................................................................................... 5-2
5.2 Getting Information about Storage Devices (lsiocfg) .............................5-3
5.2.1 lsiocfg Command Syntax......................................................................................... 5-3
5.2.2 HBA Inventory........................................................................................................ 5-4
5.2.3 Disks Inventory....................................................................................................... 5-4
5.2.4 Disk Usage and Partition Inventories......................................................................... 5-5
5.3 Checking Device Power State (pingcheck) ..........................................5-6
5.4 Setting Up Outlet Air Temperature .....................................................5-6
Chapter 6. Debugging Tools ................................................................................... 6-1
6.1 Modifying the Core Dump Size.........................................................6-1
6.2 Identifying InfiniBand Network Problems (ibtracert) ..............................6-1
6.3 Using dump tools with RHEL5 (crash, proc, kdump)..............................6-2