Over the past two weeks we have experienced Oakely login node crashes potentially caused by a Lustre bug.
|Armstrong offline until Noon||Armstrong||Resolved||
Armstrong will need to be taken down today until Noon. In the meantime, contact OSCHelp (OSCHelp@osc.edu) for account assistance.
|1 year 1 week ago||1 year 1 week|
|Backups of /nfs/gpfs||Backups||Resolved||
Changes to files on /nfs/gpfs may not be backed up during the following evening's backup, as would normally be expected. The backup software is attempting to recreate a full backup of the... (Read more)
|2 years 4 months ago||2 years 3 months|
|Brief disruption of GPFS on 8/28/2013||filesystem||Resolved||
On the morning August 28th, 2013 we will briefly disrupt the GPFS filesystem to reboot servers. This is necessary to upgrade the GPFS system. The in-place upgrade should only briefly interrupt... (Read more)
|2 years 4 days ago||2 years 3 days|
|Brief disruption on 8/1/2013 at 8AM||Network||Resolved||
At 8AM on the morning of 8/1/2013, we will be replacing some faulty hardware in our network infrastructure. Unfortunately, this work cannot be delayed until the next downtime, and the replacement... (Read more)
|2 years 1 month ago||2 years 3 weeks|
|Brief disruption to external network, 2013/11/27||Connectivity||Resolved||
This maintenence was cancelled, to be rescheduled at some undetermined point in the future.
|1 year 9 months ago||1 year 9 months|
|Brief disruption to external network, 2013/12/29||Connectivity||Resolved||
Between 5:00AM and 9:00AM EDT on Sunday,... (Read more)
|1 year 8 months ago||1 year 8 months|
|Brief interruption of batch services on 4/17||Batch||Resolved||
On April 17th 2013, at roughly 2PM, we will be rebooting the batch server on the Oakley cluster. Running jobs will not be affected, but there will be a brief disruption in scheduling, as well as... (Read more)
|2 years 4 months ago||2 years 4 months|
|Brief interruption of services for some users||filesystem||Resolved||
Today, May 14 2013, at 12:45PM we will be temporarily removing one of the home directory servers from service to address some reliability issues. Users with home... (Read more)
|2 years 3 months ago||2 years 3 months|
|Can not change GPU compute mode on Oakley||GPU||Resolved||
Update: The driver version has been updated and the issue has been fixed.
In updating the driver version for Oakley's NVIDIA GPUs the NVML libraries that are used in conjunction... (Read more)
|9 months 1 week ago||7 months 2 weeks|
|Cannot login to clusters||Resolved||
As of around 3PM today (Thursday 6/12), we have reports of users being unable to login in to the clusters. The error message given will make it sound like your password is incorrect, although it... (Read more)
|1 year 2 months ago||1 year 2 months|