Project

General

Profile

Actions

VM Creation #9726

closed

Task #9703: wrong software and data distribution on node75.d4science.org

Provide new VM for hosting fhn-manager service

Added by Roberto Cirillo almost 8 years ago. Updated almost 8 years ago.

Status:
Closed
Priority:
Urgent
Target version:
Start date:
Sep 29, 2017
Due date:
% Done:

100%

Infrastructure:
Pre-Production
RAM:
3
CPU:
2
DiskSpace:
10
Smartgears:
Yes
Smartgears Version:
2.1.2-4.7.0-146408
Application Instructions:
Smartgears scopes:
Applications list and version:
Applications context path:
External storage data disk:
Smartgears auth connector:
data transfer enabled:
No
PostgreSQL database:
No
Number of VMs:
1

Description

The ansible role should be ready to use for preproduction environment. I've also update the fhn-manager version. We need only to substitute the host in the inventory.


Files

catalina.out (373 KB) catalina.out Roberto Cirillo, Sep 29, 2017 04:10 PM
catalina.out.1.gz (419 KB) catalina.out.1.gz Roberto Cirillo, Sep 29, 2017 04:10 PM

Subtasks


Related issues

Related to D4Science Infrastructure - Task #4718: installation of fhn-manager on gcube/preprodClosedRoberto CirilloJul 13, 2016

Actions
Actions #1

Updated by Roberto Cirillo almost 8 years ago

I've seen the following ticket for the last installation in preproduction environment #4718 . Please @nunzioandrea.galante@eng.it could you indicate the packages required by fhn-manager. This part is not provisioned.

Actions #2

Updated by Roberto Cirillo almost 8 years ago

  • Related to Task #4718: installation of fhn-manager on gcube/preprod added
Actions #3

Updated by Nunzio Andrea Galante almost 8 years ago

The documentation about the packages needed is available at the following link: https://wiki.egi.eu/wiki/Fedcloud-tf:CLI_Environment.

Actions #4

Updated by Andrea Dell'Amico almost 8 years ago

It's way too generic.

As I already wrote we have a couple of roles that cover the tools installation and we should have a valid configuration for the VO too.
But other parts need clarifications: what certificates have to be used, where to take them, where to install them so that the service works.
All the steps should avoid any manual intervention.

Actions #5

Updated by Nunzio Andrea Galante almost 8 years ago

I try to report what i remember cause i did these operations much time ago.
No certificate must be available on the node cause the service extracts it directly from the Information System and dinamically a short-time one needed to access to the federation of cloud is created.
For sure we need to install the script provided from EGI able to install both VOMS and OCCI-client (http://go.egi.eu/fedcloud.ui).
Once done this, it is necessary to edit the informations related both to the VOMS server and the VO d4science.org in /etc/grid-security/vomsdir/d4science.org and /etc/vomses.
If i do not get wrong, there should have to be a particular set of certificates to download from somewhere and to deploy in /etc/grid-security/certificates. It was provided to me by EGI.
I make myself fully available to cooperate with you in this task.

Actions #6

Updated by Tommaso Piccioli almost 8 years ago

  • Status changed from New to In Progress
  • % Done changed from 0 to 20

new host name will be fhn-manager-t.pre.d4science.org

Actions #7

Updated by Roberto Cirillo almost 8 years ago

  • Assignee changed from _InfraScience Systems Engineer to Roberto Cirillo

I'm going to check the configuration and install the new service instance

Actions #8

Updated by Roberto Cirillo almost 8 years ago

I'm not able to access to fhn-manager-t.pre.d4science.org Please @tommaso.piccioli@isti.cnr.it could you give me access when it is ready?

Actions #9

Updated by Andrea Dell'Amico almost 8 years ago

You have it now.

Actions #10

Updated by Roberto Cirillo almost 8 years ago

  • Status changed from In Progress to Feedback
  • Assignee changed from Roberto Cirillo to Nunzio Andrea Galante
  • % Done changed from 20 to 100

The fhn-manager-service is in place. Please @nunzioandrea.galante@eng.it could you check the configuration and give me a feedback?

Actions #11

Updated by Nunzio Andrea Galante almost 8 years ago

Dear @roberto.cirillo@isti.cnr.it , the configuration is ok.
Anyway, i realized that maybe there is a small bug in the service.
I try to explain: the service extracts the voms certificate from the Information System and create a new temporary certificate file to interact with the fedcloud infrastructures; until now we have considered the name of such file as static and univocal and everything was ok for all the environments in which the service has been deployed.
Unfortunately, this is not always true, cause the name of the certificate changes depending on the ID of the user; for instance, according to /etc/passwd file, the gcube user has id 1001 for the fhn-manager-t.pre.d4science.org and id 1000 for the node75.d4science.org.
So maybe a new version of the service containing these really minor changes is needed. I will test in dev and perform an etics build within tomorrow and later, if you believe is feasibile, we could deploy this new version.
What do you think?

Actions #12

Updated by Roberto Cirillo almost 8 years ago

Nunzio Andrea Galante wrote:

Dear @roberto.cirillo@isti.cnr.it , the configuration is ok.
Anyway, i realized that maybe there is a small bug in the service.
I try to explain: the service extracts the voms certificate from the Information System and create a new temporary certificate file to interact with the fedcloud infrastructures; until now we have considered the name of such file as static and univocal and everything was ok for all the environments in which the service has been deployed.
Unfortunately, this is not always true, cause the name of the certificate changes depending on the ID of the user; for instance, according to /etc/passwd file, the gcube user has id 1001 for the fhn-manager-t.pre.d4science.org and id 1000 for the node75.d4science.org.
So maybe a new version of the service containing these really minor changes is needed. I will test in dev and perform an etics build within tomorrow and later, if you believe is feasibile, we could deploy this new version.
What do you think?

I think it is reasonable. Please, fix the issue, perform a remote build and let me know the new service version. Thanks in advance.

Actions #13

Updated by Nunzio Andrea Galante almost 8 years ago

Please @roberto.cirillo@isti.cnr.it , according to http://paste.research-infrastructures.eu/index.php?id=20170928162015_21293 a library needed to the service seems to be missing.
Could you please check?

Has this exception been thrown by the service? if yes could you indicate the file that contains this exception, please?

Actions #14

Updated by Nunzio Andrea Galante almost 8 years ago

We have fixed the issues: the new version is the 1.2.4-4.7.0-154701 (actually in staging).
Such version will be released in 4.8.0 since it seems to be late to involve in 4.7.

Let me add that we are having some issues (in the recent past they were not) with the voms-client usage both with the dev and preprod environment.
Since these problems do not occur by going to perform some test locally, we are afraid that there could be some misscomunications with Ubuntu 14. By the way, we are investigating on it and we will let you know more as soon as possibile.

Actions #15

Updated by Roberto Cirillo almost 8 years ago

Please, could you specify what issue have you resolved? In addition, could you report the error/exception for the other issues, please?

Actions #16

Updated by Nunzio Andrea Galante almost 8 years ago

The solved issues are listed at post #12 of the current thread.
The exceptions are instead related to task #9863.

Actions #17

Updated by Roberto Cirillo almost 8 years ago

Nunzio Andrea Galante wrote:

Please @roberto.cirillo@isti.cnr.it , according to http://paste.research-infrastructures.eu/index.php?id=20170928162015_21293 a library needed to the service seems to be missing.
Could you please check?

Has this exception been thrown by the service? if yes could you indicate the file that contains this exception, please?

Actions #18

Updated by Roberto Cirillo almost 8 years ago

Nunzio Andrea Galante wrote:

Please @roberto.cirillo@isti.cnr.it , according to http://paste.research-infrastructures.eu/index.php?id=20170928162015_21293 a library needed to the service seems to be missing.
Could you please check?

Actions #19

Updated by Nunzio Andrea Galante almost 8 years ago

Yes, the service throws such exception when the voms-proxy-init is invoked.
It can be verified to /home/gcube/tomcat/logs/catalina.out
Actually on this new VM is still deployed the old version of the service.

Actions #20

Updated by Nunzio Andrea Galante almost 8 years ago

I have done a comparison between this VM and the one we have in dev.
It seems to be missing the library commons-io.jar in /var/lib/voms-clients3/lib.
This could solve this issue.

Actions #22

Updated by Roberto Cirillo almost 8 years ago

The exception that you are reporting is not throws by service. I've checked and attached the catalina.out files from 27 September to now. So this is not a service problem.

Actions #23

Updated by Nunzio Andrea Galante almost 8 years ago

Sorry, maybe there was a misunderstading. I mean, the service invokes the voms and the voms thorws the exception.
By the way, this exception should have to disappear once deployed the aforementioned library.

Actions #24

Updated by Andrea Dell'Amico almost 8 years ago

Nunzio Andrea Galante wrote:

I have done a comparison between this VM and the one we have in dev.
It seems to be missing the library commons-io.jar in /var/lib/voms-clients3/lib.
This could solve this issue.

The clients version is the same on every instance. Was that library added manually?

Actions #25

Updated by Nunzio Andrea Galante almost 8 years ago

As far as i remember, not.
I have verified also that node75 contains such library as well.
So this is an issue just related to this VM.

Actions #26

Updated by Andrea Dell'Amico almost 8 years ago

So, the same package version sometimes has a bug and sometimes not.

Actions #27

Updated by Nunzio Andrea Galante almost 8 years ago

Production VM node65 has no bug as well.

Actions #28

Updated by Andrea Dell'Amico almost 8 years ago

That doesn't help. All that hosts were manually configured and we do not know how. The new preproduction host is the first try at getting back in control of all the pieces.

Actions #29

Updated by Roberto Cirillo almost 8 years ago

  • Copied to VM Creation #9874: Provide new VM for hosting fhn-manager service in production environment added
Actions #30

Updated by Roberto Cirillo almost 8 years ago

  • Status changed from Feedback to Closed
Actions #31

Updated by Roberto Cirillo almost 8 years ago

  • Copied to deleted (VM Creation #9874: Provide new VM for hosting fhn-manager service in production environment)
Actions

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 8.91 MB)