Chapter 02 LSF foundations

Info

Get an overview of IBM Spectrum LSF workload management concepts and operations for both users and administrators.

IBM Spectrum LSF user foundations

Get an overview of IBM Spectrum LSF workload management concepts and operations.

IBM Spectrum LSF overview

Learn how LSF takes your job requirements and finds the best resources to run the job.

Introduction to IBM Spectrum LSF

IBM Spectrum LSF ("LSF", short for load sharing facility) is enterprise-class workload management software. LSF distributes work across existing heterogeneous IT resources to create a shared, scalable, and fault-tolerant infrastructure that delivers faster, more reliable workload performance and reduces cost. LSF balances load, allocates resources, and provides access to those resources.

LSF cluster components

An LSF cluster manages resources, accepts and schedules workload, and monitors all events. Users and administrators can access LSF through a command-line interface, an API, or the IBM Spectrum LSF Application Center.

Inside an LSF cluster

Learn about the various daemon processes that run on LSF hosts, LSF cluster communications paths, and how LSF tolerates host failure in the cluster.

Inside workload management

Understand the LSF job lifecycle. Use the bsub command to submit jobs to a queue, and specify job submission options to modify the default job behavior. Submitted jobs wait in queues until they are scheduled and dispatched to a host for execution. At job dispatch, LSF checks which hosts are eligible to run the job.
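
As a rough illustration of how submission options modify a job, the following Python sketch assembles a bsub command line without running it. The helper name build_bsub_command, the queue name "normal", the job name "demo", and the output file name are assumptions made for this example; -q (queue), -J (job name), -n (number of slots), and -o (output file) are standard bsub options.

```python
import shlex


def build_bsub_command(command, queue=None, job_name=None, slots=None, output_file=None):
    """Return the argv list for a bsub submission. Nothing is executed here."""
    argv = ["bsub"]
    if queue is not None:
        argv += ["-q", queue]            # submit to a specific queue
    if job_name is not None:
        argv += ["-J", job_name]         # assign a job name
    if slots is not None:
        argv += ["-n", str(slots)]       # request a number of job slots
    if output_file is not None:
        argv += ["-o", output_file]      # write job output to this file
    argv += shlex.split(command)         # the command that the job runs
    return argv


if __name__ == "__main__":
    # Hypothetical submission: queue, job name, and file name are illustrative only.
    argv = build_bsub_command(
        "sleep 60",
        queue="normal",
        job_name="demo",
        slots=2,
        output_file="demo.%J.out",       # LSF replaces %J with the job ID
    )
    print(shlex.join(argv))
```

On a host where LSF is installed, the resulting list could be passed to subprocess.run to submit the job. Building the argument list separately keeps the example runnable on machines without LSF.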

LSF with EGO enabled

Enable the enterprise grid orchestrator (EGO) with LSF to provide a system infrastructure to control and manage cluster resources. Resources are physical and logical entities that are used by applications. LSF resources are shared as defined in the EGO resource distribution plan.

IBM Spectrum LSF administrator foundations

Get an administrator overview of how to manage various types of workload and cluster operations in IBM Spectrum LSF.

LSF cluster overview

Get an overview of your cluster and the location of important LSF directories and configuration files.

Work with LSF

Start and stop LSF daemons, and reconfigure cluster properties. Check LSF status and submit LSF jobs.
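
A status check can be scripted in a few lines. This is a minimal sketch, assuming the LSF commands lsid, lsload, bhosts, and bqueues are on the PATH of a configured LSF host; the helper name run_status_checks is hypothetical. Commands that are not found are reported as None rather than raising an error.

```python
import shutil
import subprocess

# Standard LSF status commands and what each one reports.
STATUS_COMMANDS = {
    "lsid": "LSF version, cluster name, and management host",
    "lsload": "load information for hosts",
    "bhosts": "batch host status",
    "bqueues": "queue status",
}


def run_status_checks():
    """Run each status command that is installed and collect its standard output."""
    results = {}
    for name in STATUS_COMMANDS:
        if shutil.which(name) is None:
            results[name] = None         # command not available on this machine
            continue
        completed = subprocess.run([name], capture_output=True, text=True, check=False)
        results[name] = completed.stdout
    return results


if __name__ == "__main__":
    for name, output in run_status_checks().items():
        print(f"== {name}: {STATUS_COMMANDS[name]}")
        print(output if output is not None else "(not found on PATH)")
```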

Troubleshooting LSF problems

Troubleshoot common LSF problems and understand LSF error messages. If you cannot find a solution to your problem here, contact IBM Support.