Monitoring - Floating IP functionality has now been restored and all services are working.
Nov 21, 2024 - 10:37 NZDT
Identified - Part of the Flexible HPC control plane experienced an OS crash this morning at about 09:15 NZDT. We are working to restore service as soon as possible. This is currently impacting all connectivity to and from Floating IPs and interrupting access to the Dashboard and APIs.
Nov 21, 2024 - 09:26 NZDT
Identified - Hardware issues have been identified, and we are awaiting the arrival of parts and service from IBM.
Nov 20, 2024 - 09:55 NZDT
Investigating - We are currently investigating issues with the Greta Point tape library, which is not working at all. All Nearline functions are down, including retrieval of any Nearline data. Overnight backups will also not run until the tape library is fixed.
Nov 19, 2024 - 16:59 NZDT

About This Site

New Zealand eScience Infrastructure High Performance Compute and Storage Service Status

Apply for Access Operational
Data Transfer Operational
Submit new HPC Jobs Operational
Jobs running on HPC Operational
Jupyter on NeSI (beta) Operational
NeSI OnDemand Operational
99.94 % uptime over the past 90 days
HPC Storage Operational
Long-term Storage (Early Access) Operational
User Support System Operational
Flexible High Performance Cloud Operational
NeSI HPC Compute Infrastructure Operational
HPC Lander node Operational
HPC Login nodes - Māui Operational
HPC Login nodes - Mahuika Operational
HPC Compute nodes - Māui Operational
HPC Compute nodes - Mahuika Operational
Mahuika Extension nodes - Mahuika Operational
Māui Ancillary nodes Operational
Mahuika Ancillary nodes Operational
NeSI Storage Infrastructure Major Outage
HPC Shared Storage system Operational
Online storage Operational
Nearline storage Major Outage
Scratch storage Operational
NeSI Data Transfer Infrastructure Operational
NeSI HPC Facility (Greta Point, Wellington) DTN Operational
Flexible High Performance Cloud Services Operational
99.95 % uptime over the past 90 days
Virtual Compute Service Operational
Bare Metal Compute Service Operational
FlexiHPC Dashboard (web interface) Operational
99.95 % uptime over the past 90 days
FlexiHPC CLI interface Operational
99.97 % uptime over the past 90 days
Public API of the FlexiHPC Service Operational
99.95 % uptime over the past 90 days
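
For a sense of scale, the 90-day uptime percentages listed above can be converted into approximate downtime. The sketch below is a naive conversion that assumes every minute of non-uptime counts as a full outage; the status page may weight partial outages differently, so treat the results as rough estimates. The function name `downtime_minutes` is illustrative and not part of any NeSI tooling.

```python
def downtime_minutes(uptime_percent: float, window_days: int = 90) -> float:
    """Convert an uptime percentage over a window into minutes of downtime.

    Assumption: downtime is simply the share of the window that is not uptime.
    """
    total_minutes = window_days * 24 * 60  # 90 days = 129,600 minutes
    return total_minutes * (100.0 - uptime_percent) / 100.0


if __name__ == "__main__":
    # Percentages as reported on this page.
    for pct in (99.94, 99.95, 99.97):
        minutes = downtime_minutes(pct)
        print(f"{pct:.2f} % uptime over 90 days is roughly {minutes:.0f} minutes of downtime")
```

Under this assumption, 99.94 % uptime over 90 days corresponds to a little under 78 minutes of downtime.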
Scheduled Maintenance
We will be undergoing scheduled maintenance during this time.
Posted on Nov 11, 2024 - 13:26 NZDT
my.nesi.org.nz system update Nov 28, 2024 10:00-16:00 NZDT
We will be undergoing scheduled maintenance during this time.
Posted on Nov 14, 2024 - 11:18 NZDT
Past Incidents
Nov 21, 2024

Unresolved incident: Flexible HPC floating IP connectivity outage.

Nov 20, 2024

Unresolved incident: Mahuika tape library has a hardware error.

Nov 19, 2024
Resolved - This incident has been resolved.
Nov 19, 16:56 NZDT
Monitoring - A fix has been implemented and we are monitoring the results.
Nov 12, 09:36 NZDT
Identified - We have recently identified an issue that was preventing the regular operation of NeSI's nobackup filesystem autocleaner, which implements our inactive data management policy on the nobackup filesystem. As a result, the autocleaner has not run since mid-August and the nobackup filesystem is becoming critically full.

We have now resolved the issue and reinstated the autocleaning regime (https://docs.nesi.org.nz/Storage/File_Systems_and_Quotas/Automatic_cleaning_of_nobackup_file_system/). Users with data marked for cleaning should have received emails from the system this morning. The marked data will be deleted in two weeks.

Nov 5, 11:46 NZDT
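
Users who want a rough idea of which of their nobackup files look inactive could run something like the sketch below before the cleaning date. It is not the autocleaner itself: it assumes inactivity is judged by last access time, and both the `find_idle_files` helper and the `max_idle_days` threshold are hypothetical. The actual selection criteria are described in the documentation linked in the notice above.

```python
import os
import time
from pathlib import Path


def find_idle_files(root, max_idle_days, now=None):
    """Return files under `root` not accessed within the last `max_idle_days` days.

    Assumption: "inactive" means an old last-access time (st_atime).
    This only lists files; it never deletes anything.
    """
    now = time.time() if now is None else now
    cutoff = now - max_idle_days * 86400  # seconds per day
    idle = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            try:
                info = path.lstat()  # do not follow symlinks
            except OSError:
                continue  # file vanished or is unreadable; skip it
            if info.st_atime < cutoff:
                idle.append(path)
    return sorted(idle)
```

For example, `find_idle_files("/path/to/your/nobackup/dir", max_idle_days=100)` would list candidate files under a placeholder path using a placeholder threshold; substitute your own directory and the threshold from the policy documentation.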
Nov 18, 2024

No incidents reported.

Nov 17, 2024

No incidents reported.

Nov 16, 2024

No incidents reported.

Nov 15, 2024

No incidents reported.

Nov 14, 2024

No incidents reported.

Nov 13, 2024
Completed - The scheduled maintenance has been completed.
Nov 13, 16:38 NZDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Nov 13, 15:30 NZDT
Scheduled - We will be undergoing scheduled maintenance during this time.
Nov 11, 11:37 NZDT
Nov 12, 2024
Completed - The scheduled maintenance has been completed.
Nov 12, 18:39 NZDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Nov 12, 15:39 NZDT
Scheduled - The update is taking longer than expected, as we have run into some complications. We hope to have this resolved soon.
Nov 12, 15:38 NZDT
Completed - The scheduled maintenance has been completed.
Nov 12, 15:00 NZDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Nov 12, 13:00 NZDT
Scheduled - We will be upgrading the OnDemand cluster to allow for bigger nodes so we can support big memory workloads.

As a result, OnDemand Jupyter and RStudio sessions will not be able to launch during this time.

We apologize for any inconvenience caused during this time.

Nov 12, 09:11 NZDT
Nov 11, 2024

No incidents reported.

Nov 10, 2024

No incidents reported.

Nov 9, 2024

No incidents reported.

Nov 8, 2024

No incidents reported.

Nov 7, 2024
Completed - The scheduled maintenance has been completed.
Nov 7, 16:09 NZDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Nov 7, 15:30 NZDT
Scheduled - We will be undergoing scheduled maintenance during this time.
Nov 4, 09:07 NZDT