On our Cray-based systems the compute nodes see the login nodes with a different IP address than the one the login node self-identifies with. This was dealt with in Cylc v7 by using the hardwired option in [suite host self-identification]. It worked, but it was limiting because we could only launch Cylc from the single login node that we hardwired into the global.rc file. I never investigated whether we could add more hardwired options.
In Cylc 8 (specifically 8.3.6), I was hopeful that [platforms] would be a way around this, but I am having trouble understanding the relationship between [platforms] and [[host self-identification]].
For example, if I have a platform setup like this:
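Something roughly along these lines (the platform and host names here are just placeholders):

```
# global.cylc -- platform and host names are illustrative only
[platforms]
    [[hpc_cluster]]
        hosts = login1, login2
        job runner = pbs
```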
[scheduler][host self-identification] determines how a scheduler self-reports its own location (i.e., the host it is running on) to its jobs, so that those jobs can successfully communicate back to it.
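The hardwired method from Cylc 7 still exists under that section in the Cylc 8 global.cylc; a minimal sketch, with a made-up hostname, would be:

```
# global.cylc -- the hostname is a placeholder
[scheduler]
    [[host self-identification]]
        method = hardwired
        host = login01-external.example.com
```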
Platforms (a.k.a. “Job Platforms”) represent job hosts, so there’s not really any direct relationship between the two concepts.
However, if you want to launch schedulers on several hosts that see the same global.cylc file (on a shared filesystem), I think you could use Jinja2 to select the right hardwired hostname at runtime.
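For example, a sketch only (assuming Jinja2 processing of the global.cylc file, and with placeholder hostnames):

```
#!Jinja2
# global.cylc -- choose the externally visible name of whichever
# login node the scheduler is started on (hostnames are placeholders)
[scheduler]
    [[host self-identification]]
        method = hardwired
{% if environ['HOSTNAME'] == 'login01' %}
        host = login01-external.example.com
{% else %}
        host = login02-external.example.com
{% endif %}
```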
Thank you … this makes sense. I’m still wrapping my head around [platforms], but at least for now, where everything we do is on a shared filesystem, I think I understand it.
Follow-up question about the global.cylc file. Is that only read at play time, or does the workflow continue to read it for the duration of the workflow? Also, is it copied to the install directory and read from there or is it read in place?
I hear that quite a lot, which makes me wonder if we haven’t explained it well. If you find the documentation confusing, let us know and we’ll try to fix it.
Basically a job platform represents a cluster on a shared filesystem, with a job runner such as PBS. The platform “hosts” are the hosts that Cylc schedulers can use to interact with the job runner to submit, poll, and kill its jobs. Typically that might be e.g. the interactive or login nodes of the cluster. There can be more than one such host, which makes Cylc 8 job platforms more robust than the old singular job hosts.
The above seems pretty straightforward to me. Perhaps the potential for confusion comes in when your scheduler host also belongs to a job platform, and with the “install target” setting, which allows Cylc to avoid redundant or clashing installs to multiple hosts on the same filesystem? If so, feel free to ask more questions, and we can consider documentation tweaks.
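To illustrate the install target point: two platforms on the same shared filesystem can declare the same install target, so Cylc installs workflow files there only once (names below are made up):

```
# global.cylc -- platform, host, and install target names are placeholders
[platforms]
    [[cluster_pbs]]
        hosts = login1, login2
        job runner = pbs
        install target = cluster
    [[cluster_background]]
        hosts = login1, login2
        job runner = background
        install target = cluster
```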