Task #12413

closed

Nagios check: Provide a functional check for checking the IS-Collector instances

Added by Roberto Cirillo almost 7 years ago. Updated almost 7 years ago.

Status: Closed
Priority: Normal
Category: System Application
Start date: Sep 07, 2018
Due date:
% Done: 100%
Estimated time:
Infrastructure: Production

Description

The idea is to use the icproxy REST service to run a query against each IS-Collector instance deployed in the production environment. That way, if an IS-Collector service is up but its database is corrupted or not working properly, the new check should raise an alert.
At present we only have a container check on the IS-Collector service.


Related issues

Related to D4Science Infrastructure - Incident #12363: Openaire, SmartArea, SoBigData: collector down (Closed, Roberto Cirillo, Aug 31, 2018)

Actions #1

Updated by Roberto Cirillo almost 7 years ago

  • Related to Incident #12363: Openaire, SmartArea, SoBigData: collector down added
Actions #2

Updated by Roberto Cirillo almost 7 years ago

  • Status changed from New to In Progress

The query could target a mandatory resource that must be present in every VO. For example, we could check the StorageManager service endpoint:

<Category>DataStorage</Category>
<Name>StorageManager</Name>

The id is the same in all the production scopes:

<ID>0717b450-a698-11e2-900a-a46c6ff57f05</ID>

@andrea.dellamico@isti.cnr.it any opinion?

@lucio.lelii@isti.cnr.it what do you think? If this is possible, could you provide an example URL for checking the StorageManager resource in a given VO?

Actions #3

Updated by Roberto Cirillo almost 7 years ago

  • Status changed from In Progress to Feedback
  • Assignee changed from Roberto Cirillo to Lucio Lelii
Actions #4

Updated by Andrea Dell'Amico almost 7 years ago

It's fine by me.

Actions #5

Updated by Lucio Lelii almost 7 years ago

This is an example of how to retrieve a resource by ID using the ic-proxy in dev:

http://node10-d-d4s.d4science.org/icproxy/gcube/service/Resource/c0594b4e-107c-4c9f-bd00-818b68298aa4?gcube-scope=/gcube
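
For a check that has to cover several hosts and scopes, the URL pattern above could be generated rather than typed by hand. The following is a minimal Python sketch: the helper name `icproxy_resource_url` and its parameters are hypothetical, and the path layout is simply copied from the dev example in this comment, so it may differ in other deployments.

```python
from urllib.parse import quote, urlencode


def icproxy_resource_url(host, resource_id, scope, scheme="http"):
    """Build an ic-proxy URL that retrieves one resource by ID in a scope.

    The path layout is taken from the dev example above; it is an
    assumption that other ic-proxy hosts use the same layout.
    """
    path = "/icproxy/gcube/service/Resource/" + quote(resource_id, safe="")
    # Keep "/" unescaped so the scope reads as in the example (?gcube-scope=/gcube).
    query = urlencode({"gcube-scope": scope}, safe="/")
    return f"{scheme}://{host}{path}?{query}"


url = icproxy_resource_url(
    "node10-d-d4s.d4science.org",
    "c0594b4e-107c-4c9f-bd00-818b68298aa4",
    "/gcube",
)
print(url)
```

Passing `scheme="https"` would produce the HTTPS variant of the same URL.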

Actions #7

Updated by Lucio Lelii almost 7 years ago

yes

Actions #8

Updated by Roberto Cirillo almost 7 years ago

  • Assignee changed from Lucio Lelii to Andrea Dell'Amico
Actions #12

Updated by Andrea Dell'Amico almost 7 years ago

  • % Done changed from 0 to 100

The checks have been added. They fail when the string "Profile" is not found in the response.

I'm also adding the https checks.
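
The logic of such a check can be sketched in Python as follows. This is only an illustration, not the configuration actually deployed: the function names `evaluate` and `check_url` are hypothetical, treating any non-200 status as a failure is an assumption, and the exit codes follow the usual Nagios plugin convention (0 = OK, 2 = CRITICAL).

```python
import urllib.error
import urllib.request

OK, CRITICAL = 0, 2  # conventional Nagios plugin exit codes


def evaluate(status, body, needle="Profile"):
    """Map an HTTP status and response body to (exit_code, message)."""
    if status != 200:
        return CRITICAL, f"CRITICAL - HTTP status {status}"
    if needle not in body:
        return CRITICAL, f"CRITICAL - string '{needle}' not found in response"
    return OK, f"OK - string '{needle}' found in response"


def check_url(url, timeout=10.0):
    """Fetch the ic-proxy URL and evaluate the response (needs network access)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            body = resp.read().decode("utf-8", errors="replace")
            return evaluate(resp.status, body)
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status.
        return evaluate(exc.code, "")
    except OSError as exc:
        # Covers URLError, timeouts, and connection failures.
        return CRITICAL, f"CRITICAL - request failed: {exc}"


# Offline demonstration with canned responses (no network involved).
print(evaluate(200, "<Resource><Profile>...</Profile></Resource>")[1])
print(evaluate(200, "<Resource/>")[1])
```

A wrapper script would call `check_url(url)`, print the message, and exit with the returned code so that Nagios can pick up the state.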

Actions #13

Updated by Andrea Dell'Amico almost 7 years ago

  • Status changed from Feedback to Closed