Over the past two weeks we have experienced Oakely login node crashes potentially caused by a Lustre bug.

Known Issues

Titlesort ascending Cat. Res. Description Post Upd.
Unscheduled GPFS Outage filesystem Resolved

As of 11:30PM on June 16th, we have removed the GPFS filesystem from service due to a number of hardware failures. At this point, further hardware failures would put a large portion of the entire... (Read more)

2 months 2 weeks ago 2 months 2 weeks
System reboot due to security vulnerability Resolved

2015/02/17 UPDATE - Security Patch Succesfully Implimented

All systems have been updated with the secuirty patch.  


Starting Thursday, 4:00PM we will begin taking systems... (Read more)

7 months 1 week ago 6 months 2 weeks
System Downtime 9/29/13 Outage Resolved

OSC systems will be offline on September 29th, 2013 for maintenance. Please visit osc.edu/n for more information.

1 year 11 months ago 1 year 11 months
Statewide Intel compiler license checkout failures Licensing Resolved

This morning (9/10/14) we updated our Intel compiler licenses. We are seeing some unexpected license checkout failures in the logs (please click through to see details):

10:44:... (Read more)          
11 months 4 weeks ago 11 months 2 weeks
Scheduling suspended Batch Resolved

We have temporarily suspended scheduling due to some problems with the parallel scratch file system.

11 months 2 weeks ago 11 months 2 weeks
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

 

A rolling reboot is required on Ruby to update a critical... (Read more)

6 months 2 weeks ago 6 months 2 weeks
Ruby is offline Operations Resolved

The Ruby Transitional Cluster (only open to select research groups) is currently offline due to network problems. We expect it will return to service some time after the downtime.

1 year 11 months ago 1 year 6 months
Proj13 file system difficulties filesystem Resolved

We are currently experiencing difficulties with the servers for the filesystem mounted at /nfs/proj13.

1 year 10 months ago 1 year 10 months
Poor network performance on some filesystems filesystem Resolved

We are experiencing some network performance issues on a cluster of servers involved with providing GPFS and some project filesystems. GPFS appears to be functioning acceptably, but proj01, proj02... (Read more)

2 years 1 month ago 2 years 1 month
Password changes may be delayed Infrastructure Resolved

Due to an infrastructure problem, password changes via ARMSTRONG may be delayed until further notice.

1 year 8 months ago 1 year 8 months

Pages