Difference between revisions of "Hardware:Status"

From CAC Wiki
Jump to: navigation, search
Line 7: Line 7:
 
| '''01/28/2018 - 5:00 AM'''
 
| '''01/28/2018 - 5:00 AM'''
 
| '''Frontenac login node caclogin02'''
 
| '''Frontenac login node caclogin02'''
| '''Out-of-schedule shutdown (~45min)'''
+
| '''Node went down out of schedule (~45min)'''
| '''investigating causes'''
+
| '''Temporarily switched to redundancy node, investigating causes'''
| '''no'''
+
| '''partly'''
 
|-
 
|-
| 01/18/2018 - 11:30 AM'''
+
| 01/18/2018 - 11:30 AM
 
| Frontenac login node caclogin01
 
| Frontenac login node caclogin01
 
| Out-of-schedule shutdown / reboot (~45min)
 
| Out-of-schedule shutdown / reboot (~45min)
 
| updates / maintenance
 
| updates / maintenance
| no
+
| yes
 
|-
 
|-
 
| 11/21/2017 - 11:00 PM
 
| 11/21/2017 - 11:00 PM

Revision as of 13:38, 29 January 2018

This page shows information about the status of systems at the Centre for Advanced Computing. It will be updated with additional information as new events arise.

System Status Messages
01/28/2018 - 5:00 AM Frontenac login node caclogin02 Node went down out of schedule (~45min) Temporarily switched to redundancy node, investigating causes partly
01/18/2018 - 11:30 AM Frontenac login node caclogin01 Out-of-schedule shutdown / reboot (~45min) updates / maintenance yes
11/21/2017 - 11:00 PM Frontenac (all nodes) Temporary unmount of /global file system re-mounted, file system accessible yes
10/30/2017 - 8:00 AM multiple production nodes unreachable scheduler lost contact to production nodes nodes will be transfered to Frontenac yes
10/30/2017 - 8:00 AM swlogin1 (login node) No login possible login restored yes
10/03/2017 - 8:00 AM head-6b disk array at near capacity working on reducing usage yes
10/02/2017 - 8:00 AM head-6b disk array full partly resolved (freed 4 TB) yes
7/13/2017 - 10:00 AM swlogin1 unreachable through ssh resolved yes
7/13/2017 - 8:00 AM caclogin01 temporary maintenance shutdown back up yes