xiv HPC BAS4 - Administrator's Guide
List of Figures
Figure 1-1. A typical HPC architecture ........................................................................................... 1-2
Figure 1-2. Management Functions................................................................................................ 1-3
Figure 3-1. ClusterDB architecture ................................................................................................. 3-1
Figure 3-2. Cluster Network – diagram 1..................................................................................... 3-34
Figure 3-3. Cluster Network – diagram 2..................................................................................... 3-35
Figure 3-4. Storage physical view ............................................................................................... 3-42
Figure 3-5. Cluster Database – Machine view 1............................................................................ 3-49
Figure 3-6. Cluster Database – Machine view 2............................................................................ 3-50
Figure 3-7. HWManager view.................................................................................................... 3-55
Figure 3-8. Cluster Database – Complementary tables................................................................... 3-57
Figure 3-9. Cluster Database – NsDoctor view.............................................................................. 3-59
Figure 3-10. ClusterDB – Nagios View ........................................................................................... 3-61
Figure 3-11. Cluster Database – Lustre view ................................................................................... 3-62
Figure 4-1. NovaScale Master Map view..................................................................................... 4-39
Figure 4-2. NovaScale Nagios file system indicator ...................................................................... 4-41
Figure 4-3. Lustre Management Node web interface...................................................................... 4-42
Figure 4-4. Detailed view of Lustre file systems.............................................................................. 4-43
Figure 4-5. Group performance global view pop up window ......................................................... 4-44
Figure 4-6. Dispatched performance view pop up window............................................................. 4-44
Figure 4-7. Global performance view pop up window................................................................... 4-45
Figure 5-1. Main steps for deployment........................................................................................... 5-4
Figure 5-2. Image modification (workon, store, deploy, detach)........................................................ 5-6
Figure 5-3. Names of derived images or patches ............................................................................ 5-7
Figure 5-4. Primary system node partitions ................................................................................... 5-12
Figure 5-5. Secondary system node partitions............................................................................... 5-12
Figure 6-1. SLURM Simplified Architecture.................................................................................... 6-18
Figure 6-2. SLURM Architecture - Subsystems ................................................................................ 6-19
Figure 7-1. TORQUE Resource Manager Architecture ...................................................................... 7-3
Figure 7-2. TORQUE resource allocation........................................................................................ 7-7
Figure 8-1. NovaScale Master - HPC Edition opening view .............................................................. 8-5
Figure 8-2. Map button all status opening view............................................................................... 8-6
Figure 8-3. Rack view with the problems window at the bottom ........................................................ 8-7
Figure 8-4. Host Service details..................................................................................................... 8-8
Figure 8-5. Status overview screen ................................................................................................ 8-9
Figure 8-6. Alert Window showing the different alert states............................................................ 8-11
Figure 8-7. Hostgroups Reporting Notifications Window showing the Notification Levels................... 8-12
Figure 8-8. Status Monitoring Control window showing the links to add and delete comments............ 8-13
Figure 8-9. Monitoring Service Status window for a host................................................................ 8-14
Figure 8-10. Storage overview window ......................................................................................... 8-17
Figure 8-11. Group Performance view ........................................................................................... 8-19
Figure 8-12. Global overview for a host (top screen) ....................................................................... 8-20
Figure 8-13. Detailed monitoring view for a host (bottom half of screen displayed in Figure 8-12) ........ 8-21
Figure 8-14. Ethernet Switch services............................................................................................. 8-31
Figure 9-1. I/O Status – initial screen ............................................................................................ 9-4
Figure 9-2. NovaScale Master HPC Edition - I/O Status Details ........................................................ 9-5
Figure 9-3. NovaScale Master – HPC Edition – I/O Resources of a node............................................ 9-7
Figure 9-4. Detailed service status for a storage host ....................................................................... 9-9