Support #238
closedThe nagios check 'gcube search' goes often in critical state
100%
Description
The "Gcube search" nagios check on portal.i-marine.d4science.org is often in critical state. The script source code is:
FILE_TMP=/tmp/xml-aquamaps # wget http://$1/images/report/xml -O $FILE_TMP > /dev/null 2>&1 # service.d4science.org "http://portal.i-marine.d4science.org/aslHttpInformationRetrieval/GenericSearch?responseType=xml&searchTerms="tuna"&allFields=false&count=1” -O $FILE_TMP > /dev/null 2>&1 nimages=`xmllint --xpath '//speciesCount/text()' $FILE_TMP` rm $FILE_TMP if [ "$nimages" -gt "0" ] then echo "Images cached : $nimages" exit $STATE_OK else echo "0 Images cached!" exit $STATE_CRITICAL fi
There was a proposal to dismiss the discovery service, but it still active. If the service cannot be dismissed, a fix is needed.
Files
Updated by Massimiliano Assante almost 10 years ago
I think this check is messy, it seems mixing the check on the Images Servlet serving the iOS and Android app AppliFish (in the first part) and the gCube Search where you indicate the URL portal,i-marine....
I'll try to explain in the following:
- The Images Servlet serving the iOS and Android app AppliFish is deployed on services.d4science.org only. (wget http://$1/images/report/xml ....)
- The cube search can be tested like you're doing through the ASLHttpblabla call which, in this very moment, is working -> http://www.i-marine.d4science.org/aslHttpInformationRetrieval/GenericSearch?responseType=xml&searchTerms=%22tuna%22&allFields=false&count=1
hope this helps
Updated by Massimiliano Assante almost 10 years ago
- Assignee changed from Massimiliano Assante to Andrea Dell'Amico
If my Analysis is right you may need to correct the Nagios check
Updated by Andrea Dell'Amico almost 10 years ago
Sorry, I just noted that the formatting lost a part of the script. The line
wget http://$1/images/report/xml -O $FILE_TMP > /dev/null 2>&1
is only used on the check dedicated to service.d4science.org.
The i-marine check uses the second URL only.
Updated by Andrea Dell'Amico almost 10 years ago
- File check_search.sh check_search.sh added
- File check_images.sh check_images.sh added
I attach the two scripts to clarify.
Updated by Massimiliano Assante almost 10 years ago
okay, clear now.
Is the second (search) failing now? It should not because if I click on http://portal.i-marine.d4science.org/aslHttpInformationRetrieval/GenericSearch?responseType=xml&searchTerms=%22tuna%22&allFields=false&count=1 i get an answer in 10 seconds (circa).
If it is failing now then there's something wrong in the search check. If it is not failing then I'm not sure what we can do as the problem resides on the QoS of the search implemented by NKUOA.
Updated by Andrea Dell'Amico almost 10 years ago
- File gcube_search_trends.png gcube_search_trends.png added
It's working correctly since the start of the month, but it flipped a lot in May.
When if fails it's always because the request times out, and the actual timeout is 240 seconds.
Updated by Massimiliano Assante almost 10 years ago
then the ticket can be closed for me.
Updated by Andrea Dell'Amico almost 10 years ago
- Status changed from New to Rejected