Known Issues

Title Cat. Res.sort descending Description Post Upd.
Intermittent DNS issues Resolved

3/9/15 Update: The DNS issues have been resolved.  In total, the following services may have been affected by the DNS issues:

7 months 4 days ago 7 months 1 day
Ruby Rolling Reboot Resolved

2015/02/16 RUBY Rolling Reboot starting Today


A rolling reboot is required on Ruby to update a critical... (Read more)

7 months 3 weeks ago 7 months 3 weeks
MVAPICH broken on Ruby Ruby Resolved

Update Monday February 16th -- Ruby MVAPICH2 build fixed.

Ruby's MVAPICH2 build has been fixed.  Please email with any issues.

... (Read more)
7 months 3 weeks ago 7 months 3 weeks
OnDemand, Awesim, and DB Services down morning of Feb 12 Resolved

Update: Reboot was succesful.  OnDemand, Awesim, and Database services are back online.  Report any issues to

A short reboot... (Read more)

7 months 3 weeks ago 7 months 3 weeks
System reboot due to security vulnerability Resolved

2015/02/17 UPDATE - Security Patch Succesfully Implimented

All systems have been updated with the secuirty patch.  

Starting Thursday, 4:00PM we will begin taking systems... (Read more)

8 months 1 week ago 7 months 3 weeks
Oakley login node down Oakley Resolved

One of the Oakley login... (Read more)

8 months 2 weeks ago 8 months 6 days
Emergency InfiniBand Shutdown (All systems) Network Resolved

We have returned to service. It appears that we have resolved the networking issues enough to allow jobs to run safely. We will continue working with our vendors to fix any remaining hardware... (Read more)

1 year 2 months ago 1 year 2 months
Certain modules not accessible Software Resolved

Certain modules are not working for all clusters since the downtime.  We have reports specifically that Amber, Gaussian, and Turbomole are not working.  We are working to resolve the issue, but... (Read more)

1 year 2 months ago 1 year 2 months
Lustre is still offline. HPC systems back up Maintenance Resolved

Day One of the scheduled downtime has been completed, and HPC operations have resumed. As planned, Lustre work will extend into Day Two. Jobs using /fs/lustre or $PFSDIR cannot run until this work... (Read more)

1 year 3 months ago 1 year 2 months
Ruby is offline Operations Resolved

The Ruby Transitional Cluster (only open to select research groups) is currently offline due to network problems. We expect it will return to service some time after the downtime.

2 years 1 week ago 1 year 7 months