Project

General

Profile

Actions

Incident #11870

closed

Two huge problems on the oVirt cluster

Added by Andrea Dell'Amico about 7 years ago. Updated about 7 years ago.

Status:
Closed
Priority:
Immediate
Category:
System Application
Start date:
Jun 01, 2018
Due date:
% Done:

100%

Estimated time:
Infrastructure:
Development, Pre-Production, Production

Description

Yesterday, a node upgrade + restart crashed the gluster file system. It happened because even if the restarted node did not have any non synchronized bricks, some other nodes had. oVirt does not alert in that situation, while it stops the procedure if the to-be-restarted node has not sync bricks itself.

The gluster failure caused the shutdown of all the VMs configured on oVirt: the DNS resolver, the authoritative DNS server, the SMTP relay, the VPN gateways.


Related issues

Related to D4Science Infrastructure - Task #11873: Fix the networking bug introduced by cloud-init on Ubuntu 16.04 oVirt guestsClosed_InfraScience Systems EngineerJun 04, 2018

Actions
Actions

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 8.91 MB)