Project

General

Profile

Actions

Task #10758

open

R Packages Lists for RStudio - Distinguish between Prototyping and Production Environments

Added by Paolo Scarponi over 7 years ago. Updated over 7 years ago.

Status:
New
Priority:
Normal
Category:
-
Start date:
Dec 19, 2017
Due date:
% Done:

0%

Estimated time:
Infrastructure:
Production

Description

We currently have to sets of R packages lists, one for the Prototyping Environments (the VREs with SAI) and one for the Production Environments (the VREs without SAI). The RStudio machines retrieve the packages lists from the production set, but this may cause some inconsistencies when a user is writing his code using RStudio in a Prototyping VRE and then publishes it onto DataMiner using SAI. In fact, he might be able to run his script inside RStudio without errors, while encountering issues once the algorithm is in DataMiner, because one or more of the packages he uses are not actually installed on the machines. In a perfect world the user wouldn't forget to include all the dependencies in SAI before the publication step so to have them automatically installed on the machines, but we do not live in a perfect world and if this happens we automatically have the aforementioned inconsistencies.

This ticket is meant to highlight such issue and find a solution together.

One possible way to tackle the problem consists in a distinction between RStudio machines, having the ones behind Prototyping Environments point to the Prototyping packages list, while the ones behind Production Environments points to the other set. Is it a viable escamotage? Do you have any other ideas?

Actions #1

Updated by Pasquale Pagano over 7 years ago

  • Priority changed from Low to Normal
Actions #2

Updated by Roberto Cirillo over 7 years ago

I think it is the right way for avoiding this kind of issue. The proto instances could be named with the "-proto" suffix in order to better understand if an instance is synchronized with the proto repository or the production repository. @andrea.dellamico@isti.cnr.it what do you think?

Actions #3

Updated by Andrea Dell'Amico over 7 years ago

It's doable, the R packages management is the same on both the dataminer and rstudio servers.
Renaming a node is a pain but we can add CNAMES. We need to know what VMs run as proto instances and create a group under the ansible playbook with the different configuration.

Actions #4

Updated by Roberto Cirillo over 7 years ago

It is not that simple, really. There are several rstudio instances that are shared with both proto and production VREs. We should first completely separate the two set of instances and after we can create a separated group under the ansible playbook. In order to separate the two set, I guess we should first transfer the user's home to the new production instances.

Actions #5

Updated by Andrea Dell'Amico over 7 years ago

I see. If we can wait, there's a chance that the GARR cloud people will enable CephFS for us in some weeks. With it we could mount the same home directory on every rstudio server that runs in the GARR cloud and then no more need to move home directories around.

Actions

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 8.91 MB)