Index
Altix architecture overview
Altix 3000 series systems
Altix 3000 Series Systems
Altix 4000 series systems
Altix 4000 Series Systems
Amdahl's law
Understanding Parallel Speedup and Amdahl's Law
execution time given n and p
Predicting Execution Time with n CPUs
parallel fraction p
Understanding Amdahl's Law
parallel fraction p given speedup( n )
Calculating the Parallel Fraction of a Program
speedup( n ) given p
Understanding Amdahl's Law
superlinear speedup
Understanding Superlinear Speedup
analysis
system configuration
Determining System Configuration
application tuning process
Performance Analysis and Debugging
automatic parallelization
limitations
Use Compiler Options
cache bank conflicts
Tuning the Cache Performance
cache performance
Tuning the Cache Performance
cacheline traffic and CPU utilization
pmshub(1) Command
cluster environment
Scalable Computing
commands
dlook
Using the dlook Command
dplace
Using the dplace Command
topology
topology(1) Command
common compiler options
Compiler Overview
compiler command line
Compiler Overview
compiler libraries
C/C++
C/C++ Libraries
dynamic libraries
Dynamic Libraries
message passing
SHMEM Message Passing Libraries
overview
Library Overview
compiler libraries
static libraries
Static Libraries
compiler options
tracing and porting
Getting the Correct Results
compiler options for tuning
Using Compiler Options Where Possible
compiling environment
The SGI Compiling Environment
compiler overview
Compiler Overview
debugger overview
Other Compiling Environment Features
libraries
Library Overview
modules
Modules
CPU utilization
pmshub(1) Command
CPU-bound processes
Sources of Performance Problems
csrep command
histx Filters
data decomposition
Data Decomposition
data dependency
Identifying Parallel Opportunities in Existing Code
data parallelism
Data Decomposition
data placement tools
Data Placement Tools
cpusets
Data Placement Tools Overview
dplace
Data Placement Tools Overview
overview
Data Placement Tools Overview
taskset
Data Placement Tools Overview
debugger overview
Other Compiling Environment Features
debuggers
Debugging Tools
gdb
Other Compiling Environment Features
idb
Other Compiling Environment Features
TotalView
Other Compiling Environment Features
denormalized arithmetic
Compiler Overview
determining parallel code amount
Parallelizing Your Code
determining tuning needs
tools used
Determining Tuning Needs
dlook command
Using the dlook Command
dplace command
Using the dplace Command
dumppm
histx Filters
Electric Fence debugger
Managing Heap Corruption Problems
environment variables
Environment Variables for Performance Tuning
explicit data decomposition
Data Decomposition
false sharing
Fixing False Sharing
file limit resources
resetting
Resetting the File Limit Resource Default
Flexible File I/O (FFIO)
Multithreading Considerations
environment variables to set
Environment Variables
operation
FFIO Operation
overview
Flexible File I/O
simple examples
Simple Examples
floating-point programs
Floating-point Programs Performance
Floating-Point Software Assist
Floating-point Programs Performance
FPSWA
See Floating-Point Software Assist
functional parallelism
Data Decomposition
gdb tool
Debugging Tools
GNU debugger
Debugging Tools
gtopology command
gtopology(1) Command
GuideView tool
Using GuideView
histx
Using histx
histx data collection
histx Data Collection
histx filters
histx Filters
hwinfo command
hwinfo(1) Command
idb tool
Debugging Tools
implicit data decomposition
Data Decomposition
I/O-bound processes
Sources of Performance Problems
iostat command
System Usage Commands
iprep command
histx Filters
latency
Scalable Computing
limits
system
Resetting System Limits
linkstat command
linkstat(1) Command
lipfpm command
histx Data Collection
memory management
The Basics of Memory Management
Managing Memory
memory page
The Basics of Memory Management
memory strides
Tuning the Cache Performance
memory-bound processes
Sources of Performance Problems
Message Passing Toolkit
for parallelization
Use MPT
using profile.pl
profile.pl with MPI programs
modules
Modules
command examples
Modules
MPP definition
Scalable Computing
NUMA Tools
command
dlook
dlook Command
dplace
Using the dplace Command
installing
Installing NUMA Tools
OpenMP
Use OpenMP
environment variables
Environment Variables for Performance Tuning
Guide OpenMP Compiler
Other Performance Tools
parallel execution
Amdahl's law
Understanding Parallel Speedup and Amdahl's Law
parallel fraction p
Understanding Amdahl's Law
parallel speedup
Understanding Parallel Speedup
parallelization
automatic
Use Compiler Options
using MPI
Use MPT
using OpenMP
Use OpenMP
performance
Assure Thread Analyzer
Other Performance Tools
Guide OpenMP Compiler
Other Performance Tools
GuideView
Using GuideView
VTune
Using VTune for Remote Sampling
performance analysis
Performance Analysis and Debugging
Performance Co-Pilot monitoring tools
Performance Co-Pilot Monitoring Tools
linkstat
linkstat(1) Command
Other Performance Co-Pilot monitoring tools
Other Performance Co-Pilot Monitoring Tools
pmshub
pmshub(1) Command
shubstats
shubstats(1) Command
performance gains
types of
Performance Analysis and Debugging
performance problems
sources
Sources of Performance Problems
pfmon tool
Profiling with pfmon
pmshub command
pmshub(1) Command
process placement
Determining Process Placement
MPI and OpenMP
Combination Example (MPI and OpenMP)
set-up
Determining Process Placement
using OpenMP
Example Using OpenMP
using pthreads
Example Using pthreads
profile.pl script
Profiling with profile.pl
profiling
pfmon
Profiling with pfmon
profile.pl
Profiling with profile.pl
ps command
System Usage Commands
resetting default system stack size
Resetting the Default Stack Size
resetting file limit resources
Resetting the File Limit Resource Default
resetting system limit resources
Resetting System Limits
resident set size
The Basics of Memory Management
samppm command
histx Data Collection
sar command
System Usage Commands
scalable computing
Scalable Computing
SHMEM
SHMEM Message Passing Libraries
shortening execution time
Adding CPUs to Shorten Execution Time
shubstats command
shubstats(1) Command
SMP definition
Scalable Computing
stack size
resetting
Resetting the Default Stack Size
superlinear speedup
Understanding Superlinear Speedup
swap space
The Basics of Memory Management
system
overview
System Overview
system configuration
Determining System Configuration
system limit resources
resetting
Resetting System Limits
system limits
address space limit
Resetting System Limits
core file size
Resetting System Limits
CPU time
Resetting System Limits
data size
Resetting System Limits
file locks
Resetting System Limits
file size
Resetting System Limits
locked-in-memory address space
Resetting System Limits
number of logins
Resetting System Limits
number of open files
Resetting System Limits
number of processes
Resetting System Limits
priority of user process
Resetting System Limits
resetting
Resetting System Limits
resident set size
Resetting System Limits
stack size
Resetting System Limits
system monitoring tools
Monitoring Tools
command
hwinfo
hwinfo(1) Command
topology
topology(1) Command
system usage commands
System Usage Commands
iostat
System Usage Commands
ps
System Usage Commands
sar
System Usage Commands
top
System Usage Commands
uptime
System Usage Commands
vmstat
System Usage Commands
w
System Usage Commands
taskset command
taskset Command
tools
Assure Thread Analyzer
Other Performance Tools
Guide OpenMP Compiler
Other Performance Tools
GuideView
Using GuideView
pfmon
Profiling with pfmon
profile.pl
Profiling with profile.pl
profile.pl with MPI programs
VTune
Using VTune for Remote Sampling
top command
System Usage Commands
topology command
topology(1) Command
tuning
cache performance
Tuning the Cache Performance
debugging tools
Electric Fence
Managing Heap Corruption Problems
idb
Debugging Tools
dplace
Using dplace and taskset
Electric Fence
Managing Heap Corruption Problems
environment variables
Environment Variables for Performance Tuning
false sharing
Fixing False Sharing
heap corruption
Managing Heap Corruption Problems
managing memory
Managing Memory
multiprocessor code
Multiprocessor Code Tuning
parallelization
Parallelizing Your Code
profiling
GuideView
Using GuideView
histx command
Using histx
mpirun command
profile.pl with MPI programs
pfmon
Profiling with pfmon
profile.pl script
Profiling with profile.pl
VTune analyzer
Using VTune for Remote Sampling
single processor code
Single Processor Code Tuning
using compiler options
Using Compiler Options Where Possible
using dplace
Using dplace and taskset
using math functions
Using Tuned Code
using taskset
Using dplace and taskset
verifying correct results
Getting the Correct Results
uname command
Determining System Configuration
underflow arithmetic
effects of
Compiler Overview
uptime command
System Usage Commands
virtual addressing
The Basics of Memory Management
virtual memory
The Basics of Memory Management
vmstat command
System Usage Commands
VTune performance analyzer
Using VTune for Remote Sampling
w command
System Usage Commands