Content-type: text/html Manpage of chosts

chosts

Section: User Commands (1)
Updated: SkyForm AIP Version 10.25.0 - April 2025
Index Return to Main Contents

 

NAME

chosts - displays hosts and their static and dynamic resources  

SYNOPSIS

chosts [-w | -l | -c | -S | -E] [-j] [-o "format"][-R "res_req"] [host_name | host_group] ...

chosts -s [shared_resource_name ...]

chosts [-h | -V]  

DESCRIPTION

Displays information about hosts.

By default, returns the following information about all hosts: host name, host status, job slot limits, and job state statistics.

The -s option displays information about the numeric shared resources and their associated hosts.  

OPTIONS

-w
Displays host information in wide format. Fields are displayed without truncation.

-l
Displays host information in a (long) multi-line format. In addition to the default fields, displays information about the CPU factor, the dispatch windows, the current load, and the load thresholds.

-j
Displays job ids that are running or suspsneded on each host. In short or wide format output, only job ids are display. In long format output, job information is in the format of jobid:status(number of slots), e.g. 1743:RUN(2)
-c
Displays only hosts that are closed by the AIP administrator together with the number of running jobs and the message posted by the administrator in the command "csadmin hclose".

-o "field_name ... [delimiter='character']"
Specifies custom output format by field names. Use delimiter= to set the delimiting character to display between different headers and fields. This must be a single character. By default, the delimiter is a space.
If a field has no value, the output is a dash (-).
Available field names are:
=====================================================================
host_name: The hostname
status: The host status. See OUTPUT
max: Maximum number of job slots on the host
njobs: Number of job slots occupied
run: Number of running job slots
ssusp: Number of job slots suspended by the scheduler
ususp: Number of job slots suspended by users
rsv: Number of job slots reserved by the scheduler
hgroups: Host groups the host belong to
queues: Queue names the host belong to
=====================================================================

-S
Displays summary of number of hosts in different status.

-E
Displays hostnames with status of "Unavail" and "Unreach".

-R "res_req"
Only displays information about hosts that satisfy the resource requirement expression. For more information about resource requirements. The size of the resource requirement string is limited to 512 bytes.

AIP supports ordering of resource requirements on all load indices, including external load indices, either static or dynamic.

host_name ... | host_group ...

Only displays information about the specified hosts or host groups. For host groups, the names of the hosts belonging to the group are displayed instead of the name of the host group. Do not use quotes when specifying multiple hosts or host groups.

host_name can be specified with the format of xxxx[001-100].

-s [shared_resource_name ...]

Displays information about the specified shared resources. The resources must have numeric values. Returns the following information: the resource names, the total and reserved amounts, and the resource locations. If no shared resources are specified, displays information about all numeric shared resources.

-h

Prints command usage to stderr and exits.

-V
Prints AIP release version to stderr and exits.

 

OUTPUT

 

Host-Based Default

Displays the following fields:

HOST_NAME
The name of the host. If a host has batch jobs running and the host is removed from the configuration, the host name will be displayed as lost_and_found.

STATUS
The current status of the host. Batch jobs can only be dispatched to hosts with an ok status. The possible values for host status are as follows:

ok
The host is available to accept batch jobs.

unavail
The host is down, or CBLS and CBJM on the host are unreachable.

unreach
CBLS on the host is running but CBJM is unreachable.

closed
The host is not allowed to accept any remote batch jobs. There are several reasons for the host to be closed (see Host-Based -l Options).

JL/U
The maximum number of job slots that the host can process on a per user basis.

These job slots are used by running jobs, as well as by suspended or pending jobs that have slots reserved for them.

MAX
The maximum number of job slots that the host can process. These job slots are used by running and suspended jobs on the host, and by pending jobs that have jobs slots reserved for them on the host.

NJOBS
The number of job slots used by started jobs on the host (including running, suspended).

RUN
The number of job slots used by running jobs on the host.

SSUSP
The number of job slots used by system suspended jobs on the host.

USUSP
The number of job slots used by user suspended jobs on the host. Jobs can be suspended by the user or by the AIP administrator.

RSV
The number of job slots used by pending jobs that have jobs slots reserved on the host.

 

Host-Based -l Option

In addition to the above fields, the -l option also displays the following:

STATUS

closed
The long format shown by the -l option gives the possible reasons for a host to be closed:

closed_Adm
The host is closed by the AIP administrator or root (see csadmin(8)). No job can be dispatched to the host, but jobs that are executing on the host will not be affected.

closed_Lock
The host is locked by the AIP administrator or root (see cadmin(8)). All batch jobs on the host are suspended by AIP.

closed_Wind
The host is closed by its dispatch windows, which are defined in the configuration file cb.yaml(5). All batch jobs on the host are suspended by the AIP system.

closed_Full
The configured maximum number of batch job slots on the host has been reached (see MAX field below).

closed_Excl
The host is currently running an exclusive job.

closed_Busy
The host is overloaded, because some load indices go beyond the configured thresholds (see cb.yaml(5)). The displayed thresholds that cause the host to be busy are preceded by an asterisk (*).

closed_LS
CBLS on the host is unreachable, but CBJM is ok.

closed_Power
The host is powered down by the administrator.

ok_Power
The host is powered down by the power saving scheduling policy. It can be powered on by pending jobs.

suspending
The host is in the process of powering down by either the administrator or the power saving policy.

resuming
The host is in the process of powering up by either the adminitrator or the power saving policy.

CPUF
Displays the CPU normalization factor of the host (see chinfo(1)).

DISPATCH_WINDOWS
Displays the dispatch windows for each host. The dispatch windows are the time windows during the week when batch jobs can be run on each host. Jobs already started are not affected by the dispatch windows. The default for the dispatch window is no restriction or always open (that is, twenty-four hours a day and seven days a week). For the dispatch window specification, see the description for the DISPATCH_WINDOWS keyword under the -l option in cqueues(1).

CURRENT LOAD
Displays the total and reserved host load.

Reserved
You specify reserved resources by using csub -R (see csub(1)). These resources are reserved by jobs running on the host.

Total
The total load has different meanings depending on whether the load index is increasing or decreasing.

For increasing load indices, such as run queue lengths, CPU utilization, paging activity, logins, and disk I/O, the total load is the consumed plus the reserved amount. The total load is calculated as the sum of the current load and the reserved load. The current load is the load seen by cload(1).

For decreasing load indices, such as available memory, idle time, available swap space, and available space in tmp, the total load is the available amount. The total load is the difference between the current load and the reserved load. This difference is the available resource as seen by cload(1).

LOAD THRESHOLD
Displays the scheduling threshold loadSched and the suspending threshold loadStop. Also displays the migration threshold if defined and the checkpoint support if the host supports checkpointing.

The format for the thresholds is the same as for batch job queues (see cqueues(1)) and cb.yaml(5)). For an explanation of the thresholds and load indices, see the description for the "QUEUE SCHEDULING PARAMETERS" keyword under the -l option in cqueues(1).

 

Resource-Based -s Option

The -s option displays the following: the amounts used for scheduling, the amounts reserved, and the associated hosts for the shared resources. Only shared resources with numeric values are displayed. See cb.yaml(5) on how to configure shared resources.

The following fields are displayed:

RESOURCE
The name of the resource.

TOTAL
The value of the shared resource used for scheduling. This is the sum of the current and the reserved load for the shared resource.

RESERVED
The amount reserved by jobs. You specify the reserved resource using csub -R (see csub(1)).

LOCATION
The hosts that are associated with the shared resource.

 

SEE ALSO

cb.yaml(5), cqueues(1), chinfo(1)


 

Index

NAME
SYNOPSIS
DESCRIPTION
OPTIONS
OUTPUT
Host-Based Default
Host-Based -l Option
Resource-Based -s Option
SEE ALSO

This document was created by man2html, using the manual pages.
Time: 18:57:47 GMT, April 23, 2025