Incident #12363

closed

Openaire, SmartArea, SoBigData: collector down

Added by Roberto Cirillo over 6 years ago. Updated over 6 years ago.

Status: Closed
Priority: Immediate
Category: Application
Target version:
Start date: Aug 31, 2018
Due date:
% Done: 100%
Estimated time:
Infrastructure: Production

Description

The IS-collector service instances of the VOs above have been down since last night.
I'm going to check and restart them.


Related issues

Related to D4Science Infrastructure - Task #12413: Nagios check: Provide a functional check for checking the IS-Collector instances (Closed, Andrea Dell'Amico, Sep 07, 2018)

Actions #1

Updated by Roberto Cirillo over 6 years ago

  • Status changed from New to In Progress
Actions #2

Updated by Roberto Cirillo over 6 years ago

  • % Done changed from 0 to 30

The OpenAIRE collector db was corrupted. I've run a restore procedure and am now checking that it works properly.

Actions #3

Updated by Roberto Cirillo over 6 years ago

  • % Done changed from 30 to 50

OpenAIRE is back

Actions #4

Updated by Roberto Cirillo over 6 years ago

  • % Done changed from 50 to 100

After a db restore, SmartArea is back.

Actions #5

Updated by Roberto Cirillo over 6 years ago

  • Status changed from In Progress to Closed
Actions #6

Updated by Tommaso Piccioli over 6 years ago

I fixed the timezone and clock sync on:

collector-sa-d4s.smart-applications.area.pi.cnr.it
registry-sa-d4s.smart-applications.area.pi.cnr.it
n039.smart-applications.area.pi.cnr.it

Actions #7

Updated by Roberto Cirillo over 6 years ago

  • Status changed from Closed to In Progress
  • % Done changed from 100 to 30

collector-sbd is down because of a "too many open files" exception. A db restore is needed.
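The "too many open files" condition reported above usually means a process has reached its file descriptor limit. As a rough illustration only, and not something taken from this ticket, the sketch below compares a process's open descriptor count with its soft limit on Linux. The function names, the `/proc/self/fd` default and the 0.8 threshold are all assumptions made for the example.

```python
import os
import resource


def fd_usage(fd_dir="/proc/self/fd"):
    """Return (open_fds, soft_limit) for the current process.

    fd_dir is an assumed Linux-specific path; the count is approximate
    because listing the directory briefly opens a descriptor itself.
    """
    soft_limit, _hard_limit = resource.getrlimit(resource.RLIMIT_NOFILE)
    open_fds = len(os.listdir(fd_dir))
    return open_fds, soft_limit


def near_limit(open_fds, soft_limit, threshold=0.8):
    """True when usage reaches the given fraction of the soft limit.

    The 0.8 default is an arbitrary illustrative threshold.
    """
    if soft_limit == resource.RLIM_INFINITY:
        return False
    return open_fds >= threshold * soft_limit
```

A monitoring script could call `near_limit(*fd_usage())` periodically and raise a warning before the service starts failing. Inspecting another process would mean reading `/proc/<pid>/fd` with sufficient permissions and taking that process's limit from `/proc/<pid>/limits`, since `getrlimit` reports only the calling process's own limit.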

Actions #8

Updated by Roberto Cirillo over 6 years ago

  • Subject changed from Openaire, SmartArea: collector down to Openaire, SmartArea, SoBigData: collector down
Actions #9

Updated by Roberto Cirillo over 6 years ago

  • Status changed from In Progress to Closed
  • % Done changed from 30 to 100

SoBigData collector restored successfully.

Actions #10

Updated by Roberto Cirillo over 6 years ago

  • Status changed from Closed to In Progress
  • % Done changed from 100 to 90

The OpenAIRE collector is down. A db restore is needed.

Actions #11

Updated by Roberto Cirillo over 6 years ago

  • Status changed from In Progress to Closed
  • % Done changed from 90 to 100

After the db restore, the OpenAIRE collector is up and running. The last two exist backups were not present in the local state. I'm going to investigate.

Actions #12

Updated by Roberto Cirillo over 6 years ago

  • Related to Task #12413: Nagios check: Provide a functional check for checking the IS-Collector instances added