Help:Cluster

From CECS wiki
Jump to navigation Jump to search

If you are using a cluster with an older configuration, you may want to see Help:Rocks cluster.

Newer clusters will be using different tools than what comes with Rocks, but some components are shared. The biggest change is the use of slurm instead of SGE, but also new configurations now have the choice of Ubuntu or CentOS (or a mix).


You can view the status of all known clusters in CECS at http://newton.i2lab.ucf.edu/ganglia2/

See also the general cluster notes.


Additional documentation[edit]

The cluster software is composed of various components, each of which has their own documentation.

See also Steve's cluster research page.

tutorial[edit]

This information is used for a 10 minute (or longer as necessary) short course on cluster use, which can be taylored to fit your needs and expected use.

general info[edit]

  • Monitor cluster status with ganglia
  • use ssh to enter the cluster
  • use ssh or rsh to get between nodes if necessary
  • use MPI to program and start your job if appropriate
  • use MobaXterm or similar to access the system from windows if you need graphics

specific info[edit]

(These should probably be expanded--ask if you need details.)

  • Use the batch queue systems appropriate for your cluster to schedule your jobs, such as slurm
  • Watch out for hidden performance penalties if hyperthreading is enabled.
  • your job can be optimized in various ways.
  • please use ganglia to make sure your job is not left running accidentally and does not interfere with others' jobs.
  • Be aware of disk quotas, move files to an appropriate storage location (or delete) when you are done with them.

External clusters and resources[edit]

Contact ARCC for access to the following external clusters:

Cluster programming guides[edit]