Known issues

Unresolved known issues

A known issue with an Unresolved Resolution state is an active problem under investigation; a temporary workaround may be available.

Resolved known issues

A known issue with a Resolved (workaround) Resolution state is an ongoing problem; a permanent workaround is available, which may include using different software or hardware.

A known issue with a Resolved Resolution state has been corrected.

Known Issues

Title Category Resolution Description Posted Updated
Lustre bug causing Oakley login node crashes filesystem, login Resolved

Over the past two weeks we have experienced Oakley login node crashes potentially caused by a Lustre bug.  The bug (or other issue) seems to be triggered when a user performs operations on a... Read more

9 years 7 months ago 9 years 5 months ago
Downtime Update: All Major Services Online Resolved

Friday, Sept 25th, 12PM (noon):

  • Oakley is back online and has resumed running jobs.  
  • Ruby... Read more
9 years 6 months ago 9 years 6 months ago
Problems with Project Space (/nfs/gpfs) filesystem Resolved

(9/8/15 14:21 Eastern) Project space appears to be back to normal operation. We are running some tests to verify that the problem is fully resolved.


As of early afternoon, Sept. 8,... Read more

9 years 6 months ago 9 years 6 months ago
Unscheduled GPFS Outage filesystem Resolved

As of 11:30PM on June 16th, we have removed the GPFS filesystem from service due to a number of hardware failures. At this point, further hardware failures would put a large portion of the entire... Read more

9 years 9 months ago 9 years 9 months ago
Armstrong inaccessible Resolved

Update: 2PM March 12th: Armstrong is back up and running.  Please notify oschelp@osc.edu of any lingering issues.


As of 10AM Thursday March 12th... Read more

10 years 3 weeks ago 10 years 3 weeks ago
Intermittent DNS issues Resolved

3/9/15 Update: The DNS issues have been resolved.  In total, the following services may have been affected by the DNS issues:

10 years 4 weeks ago 10 years 3 weeks ago
System reboot due to security vulnerability Resolved

2015/02/17 UPDATE - Security Patch Successfully Implemented

All systems have been updated with the security patch.


Starting Thursday, 4:00PM we will begin taking systems... Read more

10 years 2 months ago 10 years 1 month ago
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today

 

A rolling reboot is required on Ruby to update a critical... Read more

10 years 1 month ago 10 years 1 month ago
MVAPICH broken on Ruby Ruby Resolved

Update Monday February 16th -- Ruby MVAPICH2 build fixed.

Ruby's MVAPICH2 build has been fixed.  Please email oschelp@osc.edu with any issues.

... Read more
10 years 1 month ago 10 years 1 month ago
OnDemand, Awesim, and DB Services down morning of Feb 12 Resolved

Update: Reboot was successful.  OnDemand, Awesim, and Database services are back online.  Report any issues to oschelp@osc.edu.


A short reboot... Read more

10 years 1 month ago 10 years 1 month ago
