System Status

Follow @HPCNotices on X for real-time updates about the status of OSC's clusters and storage.

For information about the status of licenses for the software packages installed at OSC, visit the License Server Status Updates page.

View unresolved known issues and current system-related status messages below.

If you have encountered a problem not already listed below, please contact our client support team.

Scheduled OSC System Downtimes

OSC strives to minimize the number of system downtimes and provide clients with significant advance notice but reserves the right to change the date(s) of these downtimes and/or add additional downtimes to improve the performance of our services.

Tuesday, August 11, 2026
Thursday, January 7, 2027

Active System Messages

Rolling Reboot for Security Fix

A rolling reboot is in progress to address CVE-2026-23111 for all clusters, including Ascend, Cardinal, and Pitzer. Login nodes will be rebooted first and access will be temporarily disrupted. Compute nodes will reboot in a rolling fashion; running jobs will not be impacted. See: https://www.osc.edu/resources/technical_support/known_issues/rolling_reboot_for_security_fix

Known Issues (unresolved)

A list of all known issues, including those that have been resolved, can be found here.

Title Category Description Posted Updated
Rolling Reboot for Security Fix

A rolling reboot is in progress to address CVE-2026-23111 (nf_tables logic bug) for all clusters, including Ascend, Cardinal, and Pitzer. Login nodes will be rebooted first and access... Read more

2 days 19 hours ago 1 day 21 hours ago
Temporary Login Node Instability on Ascend Ascend

We are currently experiencing temporary instability on the Ascend login nodes, which may result in slow response times or unexpected session disconnects. Our team is actively... Read more

2 weeks 1 day ago 2 weeks 1 day ago
cuMemHostRegister Fails with CUDA_ERROR_INVALID_VALUE on RHEL 9.6 Ascend, Cardinal, GPU, system software

After upgrading the operating system to RHEL 9.6 during the scheduled downtime on May 12, 2026,  applications utilizing UCX (... Read more

2 weeks 2 days ago 2 weeks 2 days ago
STAR-CCM+ OpenMPI Job Failed due to Out-of-Memory Cardinal, Software

After the scheduled downtime on May 12, 2026, STAR-CCM+ encounters out-of-memory errors when running OpenMPI jobs. A message... Read more

2 weeks 2 days ago 2 weeks 2 days ago
ptrace Disabled Across OSC Systems

ptrace has been disabled globally across all OSC systems to mitigate a newly identified Linux kernel vulnerability. If this security mitigation impacts your active research... Read more

4 weeks 20 hours ago 4 weeks 20 hours ago