Known Issues

Titlesort descending Cat. Res. Description Post Upd.
Certain modules not accessible Software Resolved

Certain modules are not working for all clusters since the downtime.  We have reports specifically that Amber, Gaussian, and Turbomole are not working.  We are working to resolve the issue, but... (Read more)

1 year 3 months ago 1 year 3 months
Downtime Update: All Major Services Online Resolved

Friday, Sept 25th 12PM Noon:

  • Oakley is back online and has resumed running jobs.  
  • Ruby... (Read more)
3 weeks 2 days ago 1 week 6 days
Emergency InfiniBand Shutdown (All systems) Network Resolved

We have returned to service. It appears that we have resolved the networking issues enough to allow jobs to run safely. We will continue working with our vendors to fix any remaining hardware... (Read more)

1 year 2 months ago 1 year 2 months
February 11 2014 Scheduled Downtime Outage Resolved

HPC systems are offline today for scheduled quarterly maintenance activity. For details, please visit

1 year 7 months ago 1 year 7 months
Intermittent DNS issues Resolved

3/9/15 Update: The DNS issues have been resolved.  In total, the following services may have been affected by the DNS issues:

7 months 6 days ago 7 months 3 days
Login Shell Issues on Oakley Account/Shell Resolved

UPDATE: The shells have all been switched back for affected users, and you can submit jobs normally again.  Additionally, if you are still logged in and have the incorrect shell, logging back out... (Read more)

1 year 3 months ago 1 year 3 months
Lustre bug causing Oakley login node crashes filesystem, login, Oakley Resolved

Over the past two weeks we have experienced Oakely login node crashes potentially caused by a Lustre bug.  The bug (or issue otherwise) seems to be activated when a user does operations on a... (Read more)

1 month 2 weeks ago 2 days 14 hours
Lustre is still offline. HPC systems back up Maintenance Resolved

Day One of the scheduled downtime has been completed, and HPC operations have resumed. As planned, Lustre work will extend into Day Two. Jobs using /fs/lustre or $PFSDIR cannot run until this work... (Read more)

1 year 3 months ago 1 year 2 months
Lustre jobs suspended filesystem Resolved

The Lustre filesystem ($PFSDIR and /fs/lustre) has crashed several times Friday evening (8/15). We have degraded this service temporarily, while we work to isolate the actions that are triggering... (Read more)

1 year 1 month ago 1 year 1 month
Lustre Updates filesystem Resolved

9/10/14 - We have not seen any additional crashes of the Lustre servers since making this change.

- Lustre jobs are being accepted as of 10AM this... (Read more)

1 year 1 month ago 1 year 4 weeks