Difference between revisions of "Hardware:Status"

From CAC Wiki
Jump to: navigation, search
Line 11: Line 11:
 
| Yes
 
| Yes
 
|-
 
|-
| 7/13/2017 - 10:00 PM
+
| 11/21/2017 - 11:00 PM
| swlogin1
+
| Frontenac (all nodes)
| unreachable through ssh
+
| Temporary unmount of /global file system
| resolved
+
| re-mounted, file system accessible
| Yes
+
| yes
 
|-
 
|-
| 10/02/2017 - 8:00 AM
+
| 10/30/2017 - 8:00 AM
| head-6b
+
| multiple production nodes unreachable
| disk array full
+
| scheduler lost contact to production nodes
| partly resolved (freed 4 TB)
+
| nodes will be transfered to Frontenac
| Yes
+
|-
+
| 10/03/2017 - 8:00 AM
+
| head-6b
+
| disk array at near capacity
+
| working on reducing usage
+
 
| yes
 
| yes
 
|-
 
|-
Line 35: Line 29:
 
| yes
 
| yes
 
|-
 
|-
| 10/30/2017 - 8:00 AM
+
| 10/03/2017 - 8:00 AM
| multiple production nodes unreachable
+
| head-6b
| scheduler lost contact to production nodes
+
| disk array at near capacity
| nodes will be transfered to Frontenac
+
| working on reducing usage
 
| yes
 
| yes
 
|-
 
|-
| 11/21/2017 - 11:00 PM
+
| 10/02/2017 - 8:00 AM
| Frontenac (all nodes)
+
| head-6b
| Temporary unmount of /global file system
+
| disk array full
| re-mounted, file system accessible
+
| partly resolved (freed 4 TB)
| yes
+
| Yes
 +
|-
 +
| 7/13/2017 - 10:00 PM
 +
| swlogin1
 +
| unreachable through ssh
 +
| resolved
 +
| Yes
 
|-
 
|-
 
|}
 
|}

Revision as of 13:35, 23 November 2017

This page shows information about the status of systems at the Centre for Advanced Computing. It will be updated with additional information as new events arise.

System Status Messages
7/13/2017 - 10:00 PM all systems issues with Grid Engine qmaster resolved Yes
11/21/2017 - 11:00 PM Frontenac (all nodes) Temporary unmount of /global file system re-mounted, file system accessible yes
10/30/2017 - 8:00 AM multiple production nodes unreachable scheduler lost contact to production nodes nodes will be transfered to Frontenac yes
10/30/2017 - 8:00 AM swlogin1 (login node) No login possible login restored yes
10/03/2017 - 8:00 AM head-6b disk array at near capacity working on reducing usage yes
10/02/2017 - 8:00 AM head-6b disk array full partly resolved (freed 4 TB) Yes
7/13/2017 - 10:00 PM swlogin1 unreachable through ssh resolved Yes