Greetings!
I’m hoping for a little contextual insight into getting the OOD client to connect to a noVNC job. I’m still in the POC stage of development for our cluster. I have Job Composer and Jupyter Notebook running successfully, but noVNC-based images continue to be problematic. I’m trying to find that one “Rosetta Stone”-like Interactive App to make this all come together.
Important cluster-specific information:
Our cluster runs LSF10, where every job must run inside a Docker container, so any OOD requirements must exist within the image that the submitted job runs. I’m currently attempting to set up Relion3.1, as this is a supported application on our production cluster. If there’s a better application to start with, I’m all ears.
On our cluster, job submission from an SSH terminal on a client host looks like this (grossly simplified):
LSF_DOCKER_PORTS='8080:8080' LSF_DOCKER_NETWORK=host LSF_DOCKER_IPC=host bsub -Is -R 'select[port8080=1] rusage[mem=4GB]' -q general-interactive -G compute-ris -a 'docker(us.gcr.io/ris-appeng-shared-dev/relion31-ood)' supervisord -c /app/supervisord.conf
“relion31-ood” is an image I’ve created with the websockify and TurboVNC OOD requirements built in. On our cluster, once the job is up, the browser connects at:
https://<host_fqdn>:8080/vnc.html
I have this configured in /var/www/ood/apps/sys/relion3 so that the job submits via batch_connect: vnc and runs, but the port remains unavailable (according to the logs). I also notice that ood_core/batch_connect/templates/vnc.rb specifically starts vncserver on a different port, so maybe my use case is out of sync with how batch_connect expects to connect to the server?
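To make "port remains unavailable" concrete, here is a minimal sketch of the kind of reachability check that could be run from the OOD web node against the compute host. The function name port_open and the 3-second timeout are placeholders of mine, not anything from OOD or LSF, and it assumes bash and the coreutils timeout command are available where it runs:

```shell
# port_open HOST PORT
# Hypothetical helper (not part of OOD or LSF): exits 0 if a TCP connection
# to HOST:PORT can be opened within 3 seconds, non-zero otherwise.
# Relies on bash's /dev/tcp pseudo-device, so bash is invoked explicitly.
port_open() {
  timeout 3 bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$1" "$2" 2>/dev/null
}

# Example usage (substitute the execution host the job landed on):
#   port_open <host_fqdn> 8080 && echo "8080 reachable" || echo "8080 not reachable"
```

Running it once against 8080 and once against whatever port the OOD logs report would at least show which side of the mismatch the job is actually listening on.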
I’m probably just missing something obvious here in how OOD vnc expects to be able to connect to the host. Any help would be greatly appreciated.
Thanks!