Difference between revisions of "Hardware:Status"

From CAC Wiki
Jump to: navigation, search
Line 38: Line 38:
 
| Recovery from power outage
 
| Recovery from power outage
 
| Scheduler queues for SX (SNO, Linux) compute cluster re-opened. Cluster is up and running.  
 
| Scheduler queues for SX (SNO, Linux) compute cluster re-opened. Cluster is up and running.  
 +
| Yes
 +
|-
 +
| 3/27/2017 - 2:00 PM
 +
| File system (disk arrays 1 and 2) 
 +
| Trouble shooting on disk arrays
 +
| Replacing disks, rebooting head units; intermittent login and disk access issues to be expected.
 
| Yes
 
| Yes
 
|-|}
 
|-|}

Revision as of 19:43, 27 March 2017

This page shows all system status updates for the SW cluster. This page will be updated with additional information as new events arise.

System Status Messages
Date/Time Affected Systems Issue Details Resolved ?
3/21/2017 - 1:30 PM All Compute / Login Power blip / outage Shutdown of all compute clusters and login nodes. Yes
3/22/2017 - 10:30 AM All Compute / Login Recovery from power outage Login nodes, system, and data access restored. Compute cluster still down, scheduler queues disabled. Yes
3/24/2017 - 8:00 AM All Compute Recovery from power outage Compute cluster nodes cac013-cac099 up and running. Scheduler queues restricted/disabled. Yes
3/24/2017 - 2:00 PM All Compute Recovery from power outage Scheduler queues for SW (Linux) compute cluster re-opened. Cluster is up and running. SNO (SX) cluster queues still disabled. Yes
3/24/2017 - 3:00 PM All Compute Recovery from power outage Scheduler queues for SX (SNO, Linux) compute cluster re-opened. Cluster is up and running. Yes
3/27/2017 - 2:00 PM File system (disk arrays 1 and 2) Trouble shooting on disk arrays Replacing disks, rebooting head units; intermittent login and disk access issues to be expected. Yes