'cylc gscan' showing someone else's suites

hi there

my cylc gscan output is currently showing someone else’s suite.

in the attached screenshot, one of them is mine (piClim-HC) but i have no idea who’s the other one is.

i vaguely remember this being an issue a while back on another platform (this is on www.jasmin.ac.uk) but can’t recall the details.

thanks for any tips!

Any chance you have a user account shared? Maybe someone else started that suite with your same user account?

1 Like

hey

thanks for that

no i am 99.9% sure that’s not the issue.

Hi Jonny,

You probably have a couple of suite “contact” files left over from unclean shutdowns, e.g. you (or root user) killed suites with kill -9 PID or the machine got rebooted. (If a Cylc scheduler shuts down cleanly, it removes its contact file on exit).

The contact file contains, among other things, the port that the suite daemon is listening on for commands etc. Your gscan GUI looks for these contact files in your cylc-run directory, assumes they belong to running suites, and connects to the ports it finds.

If your suites are actually no longer running though, the listed ports may have been used by someone else’s suites that started up since yours got killed. So gscan connects to those, and displays the summary data. This is annoying/confusing but safe - if you don’t own the suite you will only see the basic summary data and can’t execute commands (gscan can optionally show other users’ suites on purpose, with the right options).

Contact file locations: ~/cylc-run/<suite-name/.service/contact (note the dot in .service)

So get gscan to show port numbers, and grep for those in your contact files, confirm those suites really aren’t running and delete the contact files.

Or just look for any contact files present in the service directories of suites that are not running, and remove them.

Hilary

1 Like

perfect, thanks for the information and explanation!

i did this in ~/cylc-run and found two contact files, one of which was definitely not running…

> ls -l */.service/c*
cdds_UKESM1-0-LL_piClim-HC_r1i1p1f4_ap9/.service/contact
cdds_UKESM1-0-LL_piClim-N2O_r1i1p1f4_ap9/.service/contact

simply deleting it was all that needed to be done and it’s now gone from cylc gscan.

thanks again

jonny

1 Like