We are a PBS shop and are starting to experiment with SLURM on our clusters. Everything works fine with one cluster, but now that we are adding a second one, I’m not sure what we need to do to get it working. The two clusters are set up independently, so each has its own slurmdbd, slurmctld, etc. We set up the clusters.d files to point to the appropriate slurm.conf files, but when we submit an interactive app, it reports an unrecognized cluster ID.
Do we need to set up SLURM in a multi-cluster configuration to get this to work, or is there another way to do it? We noticed a few other tickets discussing this, but they did not exactly match our issue. This is most likely due to our inexperience with SLURM, so any help would be appreciated.
Please let me know what files or information would help in answering this question, and I’ll add them.