Task #12413
closed
Nagios check: Provide a functional check for checking the IS-Collector instances
Added by Roberto Cirillo almost 7 years ago.
Updated almost 7 years ago.
Category:
System Application
Infrastructure:
Production
Description
The idea is to use the icproxy REST service in order to perform a query for each IS-Collector instance deployed in production environment. In this way if a IS-Collector service is up but the db is corrupted, or the db doesn't work properly, the new check should give an alert.
At this time we have only a container check on IS-Collector service
- Related to Incident #12363: Openaire, SmartArea, SoBigData: collector down added
- Status changed from New to In Progress
The query could be done on a mandatory resource that must be present on every VO. For example we could check the StorageManager service endpoint:
<Category>DataStorage</Category>
<Name>StorageManager</Name>
The id is the same in all the production scopes:
<ID>0717b450-a698-11e2-900a-a46c6ff57f05</ID>
@andrea.dellamico@isti.cnr.it any opinion?
@lucio.lelii@isti.cnr.it what do you think? If this is possible, could you provide an example url in order to check the StorageManger resouce in a given VO?
- Status changed from In Progress to Feedback
- Assignee changed from Roberto Cirillo to Lucio Lelii
- Assignee changed from Lucio Lelii to Andrea Dell'Amico
- % Done changed from 0 to 100
The checks have been added. They fail when the string Profile
is not found.
I'm also adding the https checks.
- Status changed from Feedback to Closed
Also available in: Atom
PDF