Hybrid Shared/Distributed Memory Parallelization (SPMD)

Single Program, Multiple Data (SPMD) is a parallelization technique in computing that is employed to achieve faster results by splitting the program into multiple subsets and running them simultaneously on multiple processors/machines.

SPMD typically implies running the same process or program on single or multiple machines (nodes) with different input data for each individual task. In this section, discussion about SPMD typically includes the application of Shared Memory Parallelization (SMP) in conjunction with the MPI-based parallelization. Typically, this combination can be termed as Hybrid Shared/Distributed Memory Parallelization, and it will henceforth be referred to as SPMD.

Supported Platforms

Supported platforms and MPI versions for Hybrid Shared/Distributed Memory Parallelization (SPMD).

Application	Version	Supported Platforms	MPI
Hybrid Shared/Distributed Memory Parallelization (SPMD)	2017	Linux (64-bit)	IBM Platform MPI - Version 9.1.2 (formerly HP-MPI) (or) Intel MPI - Version 5.1.3 (223) (Recommended)
Windows (64-bit)	IBM Platform MPI - Version 9.1.2 (formerly HP-MPI) (or) Intel MPI - Version 5.1.3 (Recommended)

Application

Version

Supported Platforms

MPI

Hybrid Shared/Distributed Memory Parallelization (SPMD)

2017

Linux (64-bit)

IBM Platform MPI - Version 9.1.2

(formerly HP-MPI)

(or)

Intel MPI - Version 5.1.3 (223)

(Recommended)

Windows (64-bit)

IBM Platform MPI - Version 9.1.2

(formerly HP-MPI)

(or)

Intel MPI - Version 5.1.3

(Recommended)

OptiStruct SPMD can be run on either a single node or multiple nodes in a cluster depending upon the program and hardware limitations/requirements. SPMD in OptiStruct is implemented by the following MPI-based functionalities:

Domain Decomposition Method

Domain Decomposition Method (DDM) is available for analysis and optimization. DDM is a parallelization option in OptiStruct that can help significantly reduce model runtime with improved scalability compared to legacy shared memory parallelization approaches, especially on machines with a high number of cores (for example, greater than 8).

DDM allows two main levels of parallelization depending on the attributes of the model.

DDM Level 1 – Task-based parallelization

The first level of parallelization is task-based parallelization. The following model attributes can be parallelized at the first level of distributing tasks to different MPI processes.

Table 1. Domain Decomposition Method (DDM) – Support for Task-based Distribution in DDM
Supported Solutions	Task-based Parallelizable Entities in DDM	Non-Parallelizable Tasks
Linear Static Analysis	√ Two or more static Boundary Conditions are parallelized (Matrix Factorization is the step that is parallelized since it is computationally intensive). √ Sensitivities are parallelized for Optimization.	√ Iterative Solution is not parallelized. (Only Direct Solution is parallelized).
Buckling Analysis	√ Two or more Buckling Subcases are parallelized. √ Sensitivities are parallelized for Optimization.
Direct Frequency Response Analysis	√ Loading Frequencies are parallelized. √ Optimization is not parallelized.
DGLOBAL Global Search Optimization with multiple starting points	√ Starting Points are parallelized.
Multi-Model Optimization	√ Models listed in the master file are parallelized depending on the number of specified `-np` overall and/or the number of `-np` defined for each model in the master file.	√ Task-based parallelization is not applicable within each MMO model. Only Geometric partitioning (DDM level 2) is conducted for each individual model in MMO.

A Task is a minimum distribution unit used in parallelization. Each buckling analysis subcase is one task. Each Left-Hand Side (LHS) of the static analysis subcases is one task. Typically, the static analysis subcases sharing the same SPC (Single Point Constraint) belong to one task. Not all tasks can be run in parallel at the same time (Example: A buckling subcase cannot start before the execution of its STATSUB subcase).

DDM Level 1 is purely task-based parallelization and you can enforce only Level 1 DDM by setting PARAM,DDMNGRPS,MAX. This maximizes the number of groups that can be assigned for the specified model, thereby leading to a pure Task-based parallelization.

Note: DDM Level 1 (Task-based parallelization) currently is not supported for Nonlinear Static or Nonlinear Transient Analysis. However, DDM Level 2 (geometric partitioning) is supported for Nonlinear Analysis.

DDM Level 2 – Parallelization of Geometric Partitions

The second level of parallelization occurs at the distributed task level. Each distributed task can be further parallelized via Geometric partitioning of the model. These geometric partitions are generated and assigned depending on the number of MPI groups and the number of MPI processes in each group.

The second level DDM process utilizes graph partition algorithms to automatically partition the geometric structure into multiple domains (equal to the number of MPI processes in the corresponding group). During FEA analysis/optimization, an individual domain/MPI process only processes its domain related calculations. Such procedures include element matrix assembly, linear solution, stress calculations, sensitivity calculations, and so on.

The platform dependent Message Passing Interface (MPI) handles the communication between various MPI processes.

The necessary communication across domains is accomplished by OptiStruct and is required to guarantee the accuracy of the final solution. When the solution is complete, result data is collected and output to a single copy of the .out file. From the user’s perspective in terms of results, there should be no difference between DDM and serial runs in this aspect.

The DDM functionality for level two parallelization is direct geometric partitioning of a model based on the number of MPI processes available in the MPI group that is assigned to this task.

DDM level 2 is purely geometric partitioning and you can enforce only level 2 DDM by setting PARAM,DDMNGRPS,MIN. This minimizes the number of groups that can be assigned for the specified model, thereby leading to a pure geometric partitioning parallelization.

Hybrid DDM parallelization using both Level 1 and Level 2 parallelization is possible by setting PARAM,DDMNGRPS,AUTO or PARAM,DDMNGRPS,#, where # is the number of MPI groups. In case of AUTO, which is also the default for any DDM run, OptiStruct heuristically assigns the number of MPI groups for a particular model, depending on model attributes and specified -np.

For a default DDM run (PARAM,DDMNGRPS,AUTO) or if PARAM,DDMNGRPS,# is specified to identify the number of MPI groups, hybrid DDM parallelization is typically active. Both level 1 and level 2 parallelization are performed, depending on the specified -np and the number of MPI groups. PARAM,DDMNGRPS,<ngrps> is an optional parameter and AUTO is always the default for any DDM run. The model is first divided into parallelizable tasks/starting points (refer to Table 1 for supported solution sequences for level 1). Each task/starting point is then solved sequentially by an MPI group. Each MPI group consists of one or more MPI Processes. If an MPI group consists of multiple MPI processes, then the task/starting point is solved by geometric partitioning in that MPI group (the number of MPI processes in a group is equal to the number of geometric partitions. Refer to Supported Solution Sequences for DDM Level 2 Parallelization (Geometric Partitioning) for solutions that support geometric partitioning level 2 DDM).

Note: If Global Search Option is used, then PARAM,DDMNGRPS (or default DDM run) activates parallelization of starting points in the initial parallelization level, then depending on the number of MPI processes in each MPI group, geometric partitioning is also done.

In the case of Global Search Option, the MPI process groups should not be confused with the number of groups (NGROUP) on the DGLOBAL entry. They are completely different entities and do not relate to one another.
In the case of Global Search Option, each MPI process prints the detailed report of the starting points processed and the summary is output in the .out file of the master process. The naming scheme of the generated folders is similar to the serial GSO approach (each folder ends with _SP#_UD#, identifying the Starting Point and Unique Design numbers).
PARAM,DDMNGRPS should not be used in conjunction with Multi-Model Optimization (MMO) runs. For MMO, DDM level 1 (task-based parallelization) is automatically activated when specified np is greater than number of models plus 1 (or if number of np for each model is explicitly defined in the master file). DDM Level 1 for MMO simply distributes the extra MPI processes to models evenly. DDM Level 2 parallelization is geometric partitioning of each model based on the number of MPI processes assigned to each one. For more information, refer to Multi-Model Optimization.

Table 2. Overview of Support for PARAM,`DDMNGRPS`
Solution	Activation	Level 1	Level 2	Grouping
Parallel Global Search Option (DDM)	Use `-np # -ddm` Default is Hybrid DDM with PARAM,`DDMNGRPS`,AUTO PARAM,`DDMNGRPS`,<ngrps> is optional	Starting points are parallelized (each starting point is solved sequentially in a MPI process, if multiple starting points are assigned to 1 MPI process).	A Starting point is geometrically partitioned and solved in parallel by an MPI group (if the MPI group contains multiple MPI processes).	Grouping via PARAM,`DDMNGRPS`,AUTO is the default. If you need to adjust number of MPI groups, use PARAM,`DDMNGRPS`,<ngrps> (if `np> ngrps` then this activates 2nd level parallelization wherein geometric partitioning of some/all starting point run(s) occurs when multiple MPI processes can operate on a single starting point).
General Analysis/Optimization runs (DDM)	Use `-np # -ddm` Default is Hybrid DDM with PARAM,`DDMNGRPS`,AUTO. PARAM,`DDMNGRPS`,<ngrps> is optional.	Loadcases, loading Frequencies (Task-based parallelization) Refer to Table 1 for supported solutions).	Each loadcase/loading frequency is geometrically partitioned and solved in parallel by an MPI group (if the MPI group contains multiple MPI processes).	Grouping via PARAM,`DDMNGRPS`,AUTO is the default. If you need to adjust number of MPI groups, use PARAM,`DDMNGRPS`,<ngrps> Like the above scenario, if `np > ngrps`, then this activates 2nd level parallelization wherein geometric partitioning of some (or all) tasks occur.
Multi-Model Optimization (MMO) with DDM	Use `-np # -mmo` PARAM,`DDMNGRPS` is not supported with MMO	Optimization models are parallelized wherein extra MPI processes (> no of models plus 1) are distributed evenly to models in master MMO file (or user- defined `-np` for each model in master file).	Any MMO optimization model is geometrically partitioned and solved in parallel (number of partitions depends on number of MPI processes/np assigned to it).	There is no grouping via PARAM,`DDMNGRPS` as it is not supported for MMO. However, grouping is indirectly accomplished by automatic distribution of np to the MMO models or by explicit user input via Master file.

Supported Solution Sequences for DDM Level 2 Parallelization (Geometric Partitioning)

Linear, Nonlinear Static Analysis/Optimization, Structural Direct Frequency Response Analysis (MUMPS is also available for SMP run via the SOLVTYP entry), Normal Modes Analysis, and Buckling Analysis solution sequences are generally supported. Preloaded Modal Frequency Response (with AMLS/AMSES) is supported. Direct Frequency Response with Fluid–Structure Interaction (Acoustic Analysis) is supported. Direct Frequency Response Analysis with MFLUID is also supported. NDirect and Modal Linear Transient Analysis, Nonlinear Direct Transient Analysis, and Structural Fluid-Structural Analysis (SFSI) are also supported. Fatigue Analysis (based on linear static or Modal Transient Analysis) is also supported. Periodic Boundary Conditions (PERBC) are also supported. Normal Mode and Buckling Analysis/Optimization are supported via the Lanczos eigensolver, additionally, the MUMPS solver can also be activated for the SMP run using the SOLVTYP entry. The Iterative solvers, however, are currently not supported in conjunction with DDM.

Note:

The -ddm run option can be used to activate DDM. Refer to How many MPI processes (-np) and threads (-nt) per MPI process should I use for DDM runs? for information on launching Domain Decomposition in OptiStruct.
In DDM mode, there is no distinction between MPI process types (for example, manager, master, slave, and so on). All MPI processes (domains) are considered as worker MPI processes. Additionally, hybrid DDM with multiple levels is the default for any DDM run. Therefore, if -np n is specified, OptiStruct first divides available -np into MPI groups heuristically (PARAM,DDMNGRPS,AUTO) and then within each MPI group, subsequent geometric partitions are conducted wherein the number of such partitions in each group are equal to the number of MPI processes available in that group. These MPI processes are then run on corresponding sockets/machines depending on availability.
DDM works by symmetric partitioning of the model, therefore, it is recommended to set -np (number of MPI processes spawned) such that the number of MPI processes assigned to each machine/socket is the same. For example, if the number of available machines/sockets that can support MPI processes is equal to k, then -np should be set equal to an integral multiple of k.
Hybrid computation is supported. -nt can be used to specify the number of threads (m) per MPI process in an SMP run. Sometimes, hybrid performance may be better than pure MPI or pure SMP mode, especially for blocky structures. It is also recommended that the total number of threads for all MPI processes (n x m) should not exceed the number of physical cores of the machine.

Launch OptiStruct DDM

There are several ways to launch parallel programs with OptiStruct DDM.

Remember to propagate environment variables when launching OptiStruct DDM, if needed (this is typically only required if running the OptiStruct executable directly, instead of the OptiStruct script/GUI). Refer to the respective MPI vendor’s manual for more details. In OptiStruct, commonly used MPI runtime software are automatically included as a part of the HyperWorks installation. The various MPI installations are located at $ALTAIR_HOME/mpi.

Linux Machines

Using OptiStruct Scripts

On a single host (for IBM Platform MPI (Formerly HP-MPI) using OptiStruct script

Domain Decomposition Method (DDM)

[optistruct@host1~]$ $ALTAIR_HOME/scripts/optistruct –mpi [MPI_TYPE] –ddm -np [n] -cores [c] [INPUTDECK] [OS_ARGS]

Where, -mpi: optional run option to select the MPI implementation used:

[MPI_TYPE] can be:

pl: For IBM Platform-MPI (Formerly HP-MPI)
i: For Intel MPI; (-mpi is optional, default MPI implementation on Linux machines is -mpi i); Refer to Run Options for further information.
-ddm: Optional parallelization type selector that activates DDM (DDM is the default)
-np: Optional run option to identify the total number of MPI processes for the run
[n]: Integer value specifying the number of total MPI processes (-np)
-cores: Optional run option to identify the number of total cores (-np*-nt) for the run
[c]: Integer value specifying the number of total cores (-cores)
[INPUTDECK]: Input deck file name
[OS_ARGS]: Lists the arguments to OptiStruct (Optional, refer to Run Options for further information).

Note:

Adding the command line option -testmpi, runs a small program which verifies whether your MPI installation, setup, library paths and so on are accurate.
OptiStruct DDM can also be launched using the Run Manager GUI. (Refer to HyperWorks Solver Run Manager).
Though it is not recommended, it is also possible to launch OptiStruct DDM without the GUI/ Solver Scripts. Refer to How many MPI processes (-np) and threads (-nt) per MPI process should I use for DDM runs? in the FAQ section.
Adding the optional command line option -mpipath PATH helps you find the MPI installation if it is not included in the current search path or when multiple MPI's are installed.
If an SPMD run option (-mmo/ -ddm /-fso) is not specified, DDM is run by default.

Windows Machines

Using OptiStruct Scripts

On a single host (for HP-MPI, Platform-MPI, Intel-MPI and MS-MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ $ALTAIR_HOME/hwsolvers/scripts/optistruct.bat –mpi [MPI_TYPE] –ddm -np [n] -cores [c] [INPUTDECK] [OS_ARGS]

Where, -mpi: optional run option to select the MPI implementation used.

[MPI_TYPE] can be:

pl: For IBM Platform-MPI (Formerly HP-MPI)
i: For Intel MPI; (-mpi is optional, default MPI implementation on Linux machines is -mpi i); Refer to Run Options for further information.
-ddm: Optional parallelization type selector that activates DDM (DDM is the default)
-np: Optional run option to identify the total number of MPI processes for the run
[n]: Integer value specifying the number of total MPI processes (-np)
-cores: Optional run option to identify the number of total cores (-np*-nt) for the run
[c]: Integer value specifying the number of total cores (-cores)
[INPUTDECK]: Input deck file name
[OS_ARGS]: Lists the arguments to OptiStruct (Optional, refer to Run Options for further information).

Note:

Adding the command line option -testmpi, runs a small program which verifies whether your MPI installation, setup, library paths and so on are accurate.
OptiStruct DDM can also be launched using the Run Manager GUI. (Refer to HyperWorks Solver Run Manager).
It is also possible to launch OptiStruct DDM without the GUI/ Solver Scripts. (Refer to How many MPI processes (-np) and threads (-nt) per MPI process should I use for DDM runs? in the FAQ section.
Adding the optional command line option -mpipath PATH helps you find the MPI installation if it is not included in the current search path or when multiple MPI's are installed.
If a SPMD run option (-mmo/ -ddm /-fso) is not specified, DDM is run by default.

Linux Machines using Direct Calls to Executables

It is recommended to use the OptiStruct script ($ALTAIR_HOME/scripts/optistruct) or the OptiStruct GUI (HWSolvers Run Manager) instead of directly running the OptiStruct executable. However, if directly using the executable is unavoidable for a particular run, then it is important to first define the corresponding environment variables, RADFLEX license environment variables, library path variables and so on before the run.

On a Single Host (for IBM Platform-MPI and Intel MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ mpirun -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] [OS_ARGS] -ddmmode

Multi-Model Optimization (MMO)

[optistruct@host1~]$ mpirun -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] [OS_ARGS] -mmomode

Failsafe Topology Optimization (FSO)

[optistruct@host1~]$ mpirun -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] [OS_ARGS] -fsomode

Where,

optistruct_spmd: The OptiStruct SPMD binary; Example: optistruct_2018_linux64_impi
[n]: Number of processors
[INPUTDECK]: Input deck file name
[OS_ARGS]: Lists the arguments to OptiStruct other than -ddmmode/-mmomode/-fsomode (Optional, refer to Run Options for further information).
Note: Running OptiStruct SPMD, using direct calls to the executable, requires an additional command-line option -ddmmode/-mmomode/-fsomode (as shown above). If one of these run options is not used, there will be no parallelization and the entire program will be run on each node.

On a Linux cluster (for IBM Platform-MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ mpirun –f [appfile]

Where, appfile contains:

optistruct@host1~]$ cat appfile

-h [host i] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] -ddmmode

Multi-Model Optimization (MMO)

[optistruct@host1~]$ mpirun –f [appfile]

Where, appfile contains:

-h [host i] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] -mmomode

Failsafe Topology Optimization (FSO)

[optistruct@host1~]$ mpirun –f [appfile]

Where, appfile contains:

-h [host i] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] -fsomode

Where,

[appfile]: Text file which contains process counts and a list of programs
optistruct_spmd: OptiStruct SPMD binary; Example: optistruct_2018_linux64_plmpi

Note: Running OptiStruct SPMD, using direct calls to the executable, requires an additional command-line option -ddmmode/-mmomode/-fsomode (as shown above). If one of these options is not used, there will be no parallelization and the entire program will be run on each node.

Example: 4 CPU job on 2 dual-CPU hosts (the two machines are named: c1 and c2)

[optistruct@host1~]$ cat appfile
-h c1 –np 2 $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd  [INPUTDECK] -ddmmode
-h c2 –np 2 $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd  [INPUTDECK] –ddmmode

On a Linux cluster (for Intel-MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ mpirun –f [hostfile] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] [OS_ARGS] -ddmmode

Multi-Model Optimization (MMO)

[optistruct@host1~]$ mpirun –f [hostfile] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] [OS_ARGS] -mmomode

Failsafe Topology Optimization (FSO)

[optistruct@host1~]$ mpirun –f [hostfile] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/linux64/optistruct_spmd [INPUTDECK] [OS_ARGS] -fsomode

Where,

optistruct_spmd: OptiStruct SPMD binary; Example: optistruct_2018_linux64_impi
[hostfile]: Text file which contains the host names

Line format is:

[host i]

Note:

One host requires only one line.
Running OptiStruct SPMD, using direct calls to the executable, requires an additional command-line option -ddmmode/-mmomode/-fsomode (as shown above). If one of these options is not used, there will be no parallelization and the entire program will be run on each node.

Example: 4 CPU job on 2 dual-CPU hosts (the two machines are named: c1 and c2:

[optistruct@host1~]$ cat hostfile
c1
c2

Windows Machines using Direct Calls to Executables

On a Single Host (for IBM Platform-MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ mpirun -np [n]

$ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] [OS_ARGS] -ddmmode

Multi-Model Optimization (MMO)

[optistruct@host1~]$ mpirun -np [n]

$ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] [OS_ARGS] -mmomode

Failsafe Topology Optimization (FSO)

[optistruct@host1~]$ mpirun -np [n]

$ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] [OS_ARGS] -fsomode

Where,

optistruct_spmd: OptiStruct SPMD binary; Example: optistruct_2018_linux64_plmpi
[n]: Number of processors
[INPUTDECK]: Input deck file name
[OS_ARGS]: Lists the arguments to OptiStruct other than -ddmmode/-mmomode/-fsomode (Optional, refer to Run Options for further information
Note: Running OptiStruct SPMD, using direct calls to the executable, requires an additional command-line option -ddmmode/-mmomode/-fsomode (as shown above). If one of these run options is not used, there will be no parallelization and the entire program will be run on each node.

On a Single Host (for Intel-MPI and MS-MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ mpiexec -np [n]

$ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] [OS_ARGS] -ddmmode

Multi-Model Optimization (MMO)

[optistruct@host1~]$ mpiexec -np [n]

$ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] [OS_ARGS] -mmomode

Failsafe Topology Optimization (FSO)

optistruct@host1~]$ mpiexec -np [n]

$ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] [OS_ARGS] -fsomode

Where,

optistruct_spmd: OptiStruct SPMD binary; Example: optistruct_2018_linux64_plmpi
[n]: Number of processors
[INPUTDECK]: Input deck file name
[OS_ARGS]: Lists the arguments to OptiStruct other than -ddmmode/-mmomode/-fsomode (Optional, refer to Run Options for further information).; Note: Running OptiStruct SPMD, using direct calls to the executable, requires an additional command-line option -ddmmode/-mmomode/-fsomode (as shown above). If one of these options is not used, there will be no parallelization and the entire program will be run on each node.

On Multiple Windows Hosts (for IBM Platform-MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ mpirun –f [hostfile]

Where, appfile contains:

optistruct@host1~]$ cat appfile

-h [host i] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] -ddmmode

Multi-Model Optimization (MMO)

[optistruct@host1~]$ mpirun –f [hostfile]

Where, appfile contains:

optistruct@host1~]$ cat appfile

-h [host i] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] -mmomode

Failsafe Topology Optimization (FSO)

[optistruct@host1~]$ mpirun –f [hostfile]

Where, appfile contains:

optistruct@host1~]$ cat appfile

-h [host i] -np [n] $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] -fsommode

Where,

[appfile]: Text file which contains process counts and a list of programs; Note: Running OptiStruct SPMD, using direct calls to the executable, requires an additional command-line option -ddmmode/-mmomode/-fsomode (as shown above). If one of these run options is not used, there will be no parallelization and the entire program will be run on each node.

Example: 4 CPU job on 2 dual-CPU hosts (the two machines are named: c1 and c2)

optistruct@host1~]$ cat hostfile
-host c1 –n 2 $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd  [INPUTDECK] -mpimode
-host c2 –n 2 $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd  [INPUTDECK] –mpimode

On Multiple Windows Hosts (for Intel-MPI and MS-MPI)

Domain Decomposition Method (DDM)

[optistruct@host1~]$ mpiexec –configfile [config_file]

Where, config_file contains:

optistruct@host1~]$ cat config_file

-host [host i] –n [np] $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] –ddmmode

Multi-Model Optimization (MMO)

[optistruct@host1~]$ mpiexec –configfile [config_file]

Where, config_file contains:

optistruct@host1~]$ cat config_file

-host [host i] –n [np] $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] -mmomode

Failsafe Topology Optimization (FSO)

[optistruct@host1~]$ mpiexec –configfile [config_file]

Where, config_file contains:

optistruct@host1~]$ cat config_file

-host [host i] –n [np] $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd [INPUTDECK] -fso

Where,

[config_file]: Text file which contains the command for each host

Note:

One host needs only one line.
Running OptiStruct SPMD, using direct calls to the executable, requires an additional command-line option -ddmmode/-mmomode/-fsomode (as shown above). If one of these options is not used, there will be no parallelization and the entire program will be run on each node.

Example: 4 CPU job on 2 dual-CPU hosts (the two machines are named: c1 and c2)

optistruct@host1~]$ cat config_file
-host c1 –n 2 $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd  [INPUTDECK] -ddmmode
-host c2 –n 2 $ALTAIR_HOME/hwsolvers/optistruct/bin/win64/optistruct_spmd  [INPUTDECK] –ddmmode