Difference between revisions of "Hardware:Status"
From CAC Wiki
Revision as of 16:06, 30 April 2018
This page shows information about the status of systems at the Centre for Advanced Computing. It will be updated with additional information as new events arise.
System Status Messages

| Date | Affected systems | Details/reason | Resolution |
|---|---|---|---|
| 05/01/2018 - 9:00 AM | Scheduler maintenance | Scheduled upgrade/downtime of scheduler | |
| 04/23/2018 - 7:00 AM | Frontenac login node | login issues, reboot | functional after reboot |
| 04/19/2018 - 3:30 PM | Frontenac login node | lost access to file system, reboot | resolved after reboot |
| 03/16/2018 - 11:00 AM | Scheduler upgrade | Scheduled upgrade/downtime of scheduler | Upgrade complete, working on x11 support |
| 01/28/2018 - 5:00 AM | Frontenac login node caclogin02 | Node went down out of schedule | login restored, investigating causes |
| 01/18/2018 - 11:30 AM | Frontenac login node caclogin01 | Out-of-schedule shutdown / reboot (~45min) | updates / maintenance |
| 11/21/2017 - 11:00 PM | Frontenac (all nodes) | Temporary unmount of /global file system | re-mounted, file system accessible |
| 10/30/2017 - 8:00 AM | multiple production nodes unreachable | scheduler lost contact with production nodes | nodes will be transferred to Frontenac |
| 10/30/2017 - 8:00 AM | swlogin1 (login node) | No login possible | login restored |
| 10/03/2017 - 8:00 AM | head-6b | disk array near capacity | working on reducing usage |
| 10/02/2017 - 8:00 AM | head-6b | disk array full | partly resolved (freed 4 TB) |
| 07/13/2017 - 10:00 AM | swlogin1 | unreachable through ssh | resolved |
| 07/13/2017 - 8:00 AM | caclogin01 | temporary maintenance shutdown | back up |