Owens

Switch failure on Nov 17 2018

At about 4:05 am on November 17th, OSC experienced a major switch failure which resulted in the home directory service and GPFS file systems being disrupted. Most services were back up around 10 am, but some users may still be seeing stale file handles on GPFS. We are still working on recovering GPFS clients. For more updates, see: https://bit.ly/2DIXr1G

Reboot of NetApp as part of an upgrade on November 19

We will have a reboot of the NetApp as part of an upgrade, starting from 9:30 AM on Monday, November 19, 2018, to address a bug that causes NetApp issues caused by the network switch outage we had on Nov 14, 2018. Any cluster nodes, OnDemand service, and all filesystems won't be impacted by the reboot. We also do not expect any disruptions to users' jobs due to this reboot.

Major network switch outage on November 14, 2018

At about 1:50 AM on November 14th, OSC experienced a major switch failure which resulted in the home directory service being disrupted. As a result, the home directories were offline and all logins were failing to all clusters. All user-facing issues have been resolved and the services are back. Running jobs may recover, but please look at job output to verify correctness. Some jobs experienced failures and will need to be resubmitted. For more information, see: https://bit.ly/2FlCZFD

Quantum Espresso 6.3 available on Owens

Date: 
Thursday, October 18, 2018 - 3:15pm
System(s): 

Quantum Espresso 6.3 has been installed on Owens; usage is via the module espresso/6.3. For information on available executables and installation details see the software page for Quantum Espresso or the output of the respective module help command, e.g.: module help espresso/6.3.  Note that packages WEST were not installed because they are not available with 6.3.

DOWNTIME FOR ALL CLUSTERS ON OCTOBER 23, 2018

A downtime for all HPC systems is scheduled from 7 a.m. to 10 p.m., Tuesday, Oct. 23, 2018. The downtime will affect the Oakley, Ruby and Owens Clusters, web portals and HPC file servers. Login services, including my.osc.edu, and access to storage will not be available during this time. In preparation for the downtime, the batch scheduler will begin holding jobs that cannot be completed before 7 a.m., Oct. 23, 2018.

Pages