Previous incidents

May 2025 to Jul 2025

July 2025

No incidents reported

June 2025

Jun 09, 2025

1 incident

Juniper Core Switch Issue | Mitigation in progress

Degraded

Resolved Jun 09 at 05:45pm MSK

The software update resolved the issue, and the load on the switch software has now been reduced to a minimum.

However, we are still considering replacing the Juniper stack with a different one in the near future.


Jun 05, 2025

1 incident

Partial Outage - RX-Line Cluster Node (Ryzen 9 9950X)

Degraded

Resolved Jun 06 at 12:57am MSK

After a more thorough inspection, our engineer identified a faulty motherboard. It has been replaced, and the server is active again. No recurrence is expected.

The faulty motherboard will be sent to the vendor for further analysis to help prevent similar incidents in the future.


May 2025

May 19, 2025

1 incident

High w_await on RX nodes

Resolved May 19 at 12:05am MSK

Our monitoring systems recorded a sharp increase in the w_await metric on some servers of the RX cluster. This metric reflects the average time the NVMe drives take to complete write requests.
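As a rough illustration of where such a number comes from: on Linux, tools such as iostat derive w_await from the per-device counters in /proc/diskstats, dividing the time spent on writes by the number of writes completed during a sampling interval. The sketch below assumes that field layout. The function names and the two sample lines are hypothetical, and this is not the monitoring system's actual code.

```python
def parse_write_counters(diskstats_line):
    """Return (writes_completed, ms_spent_writing) from one /proc/diskstats line."""
    fields = diskstats_line.split()
    # Layout: major, minor, device name, four read counters, then write counters.
    return int(fields[7]), int(fields[10])


def w_await_ms(before_line, after_line):
    """Average milliseconds per completed write between two samples."""
    writes_before, ms_before = parse_write_counters(before_line)
    writes_after, ms_after = parse_write_counters(after_line)
    completed = writes_after - writes_before
    if completed == 0:
        return 0.0  # no writes finished during the interval
    return (ms_after - ms_before) / completed


# Hypothetical samples for one NVMe device, taken a few seconds apart.
before = "259 0 nvme0n1 1000 0 80000 500 2000 0 160000 400 0 900 900"
after = "259 0 nvme0n1 1000 0 80000 500 2500 0 200000 1400 0 1900 1900"
print(w_await_ms(before, after))
```

In this made-up sample, 500 writes completed and took 1000 ms in total, so the average is 2 ms per write. A sharp rise in this value means writes are waiting longer in the queue, on the device, or both.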

Because we use RAID1 on our servers to increase the reliability of data storage, the write speed of an array is limited by its slowest disk: each write has to complete on every disk in the mirror. If even one disk in the array starts misbehaving, the entire array is affected.
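The "slowest disk" effect can be sketched with a toy model, assuming a mirrored write is acknowledged only after every member disk has completed it. The function name and the latency figures are illustrative, not measurements from the RX cluster.

```python
def raid1_write_latency_ms(member_latencies_ms):
    """Toy model: a mirrored write finishes when the slowest member finishes."""
    if not member_latencies_ms:
        raise ValueError("a RAID1 array needs at least one member")
    return max(member_latencies_ms)


# Hypothetical per-disk write latencies in milliseconds.
healthy = raid1_write_latency_ms([0.05, 0.06])
degraded = raid1_write_latency_ms([0.05, 40.0])
print(healthy, degraded)
```

In this model one slow member sets the latency of the whole array, however fast the other disk is.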

During an internal investigation, we found that all the problematic drives...

May 02, 2025

1 incident

CloudFlare Network Unavailability

Degraded

Resolved May 03 at 07:03pm MSK

Since we did not receive a clear answer from CloudFlare about the exact cause of this incident, we decided to replace the affected IP addresses with others. The replacement was successful, the affected subnet has been decommissioned, and customers no longer have problems working with the CloudFlare network.
