CS300 : Supercomputing for Engineering Applications 

________________________________________________________

OpenMP Information

________________________________________________________

 

2.1. An overview of OpenMP

2.1.1. Introduction  

Two commonly used parallel programming models are implicit and explicit. The most popular approach to implicit parallelism is the automatic parallelization of sequential programs by compilers: the compiler performs dependence analysis on the sequential source code and then uses a suite of program transformation techniques to convert it into parallel code. Such compilers reduce the burden on the programmer to explicitly parallelize the program. Explicit parallelism means that the programmer specifies parallelism in the source code using special language constructs, compiler directives, or library function calls; if the programmer instead lets the compiler and the run-time support system exploit parallelism automatically, we have implicit parallelism. Out of the many explicit parallel programming models, the dominant ones are the data-parallel, message-passing, and shared-variable models.

Shared Memory Programming 

Shared-memory systems typically provide both static and dynamic process creation: processes can be created at the beginning of program execution by a directive to the operating system, or they can be created during execution of the program. The best-known dynamic process creation function is fork. A typical implementation allows a process to start another, or child, process by a fork. Three mechanisms typically coordinate processes in shared-memory programs. The first is process creation and termination: the starting, or parent, process can wait for the termination of the child process by calling join. The second prevents processes from improperly accessing shared resources. The third provides a means for synchronizing the processes.
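For illustration only, the following minimal C sketch uses the POSIX fork and waitpid calls (not OpenMP) to show the fork/join pattern described above: the parent creates a child process and then waits for ("joins") it.

#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

int main(void)
{
    pid_t child = fork();              /* parent creates a child process    */
    if (child == 0) {                  /* only the child executes this part */
        printf("child: doing work\n");
        exit(0);
    }
    waitpid(child, NULL, 0);           /* parent "joins": waits for child   */
    printf("parent: child has terminated\n");
    return 0;
}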

The shared-memory model is similar to the data-parallel model in that it has a single address (global naming) space. It is similar to the message-passing model in that it is multithreaded and asynchronous. However, data reside in a single, shared address space and thus do not have to be explicitly allocated. The workload can be either explicitly or implicitly allocated. Communication is done implicitly through shared reads and writes of variables; synchronization, however, is explicit.

Shared-variable programs are multithreaded and asynchronous, and require explicit synchronization to maintain correct execution order among the processes. Parallel programming based on the shared-memory model has not progressed as much as message-passing parallel programming; an indicator is the lack of a widely accepted standard such as MPI or PVM for message passing. The current situation is that shared-memory programs are written in platform-specific languages for multiprocessors (mostly SMPs and PVPs). Such programs are not portable even among multiprocessors, not to mention multicomputers (MPPs and clusters). Three platform-independent shared-memory programming models are X3H5, Pthreads, and OpenMP.

OpenMP is an Application Program Interface (API) that may be used to explicitly direct multi-threaded, shared-memory parallelism. It is a specification for a set of compiler directives, library routines, and environment variables that can be used to specify shared-memory parallelism in Fortran and C/C++ programs. OpenMP is a shared-memory standard supported by a group of hardware and software vendors, such as DEC, Intel, IBM, Kuck & Associates, SGI, the Portland Group, the Numerical Algorithms Group, the U.S. DOE ASCI program, etc. It is comprised of three primary API components:

  • Compiler Directives 
  • Runtime Library Routines 
  • Environment Variables 

OpenMP is portable: the API is specified for C/C++ and Fortran, and it has been implemented on multiple platforms, including most Unix platforms and Windows NT. It is jointly defined and endorsed by a group of major computer hardware and software vendors, although it is not yet an ANSI standard.

Goals

Standardization: Provide a standard among a variety of shared memory architectures/platforms 

Lean and Mean: Establish a simple and limited set of directives for programming shared memory machines. Significant parallelism can be implemented by using just 3 or 4 directives. 

Ease of Use
 

  • Provide the capability to incrementally parallelize a serial program, unlike message-passing libraries, which typically require an all-or-nothing approach 
  • Provide the capability to implement both coarse-grain and fine-grain parallelism

Portability

  • Supports Fortran (77, 90, and 95), C, and C++
  • Public forum for API and membership 
 
2.1.2. History of OpenMP 

In the early 90's, vendors of shared-memory machines supplied similar, directive-based, Fortran programming extensions to make use of the architecture and its advantages:

  • The user would augment a serial Fortran program with directives specifying which loops were to be parallelized 
  • The compiler would be responsible for automatically parallelizing such loops across the SMP processors 

Implementations were all functionally similar, but were diverging (as usual) as different vendors used different methods based on their own architectures and platforms. 

The first attempt at a standard was the draft for ANSI X3H5 in 1994. It was never adopted, largely due to waning interest as distributed-memory machines became popular. The OpenMP standard specification started in the spring of 1997, taking over where ANSI X3H5 had left off, as newer shared-memory machine architectures started to become prevalent. 

The OpenMP group believes that Pthreads is not scalable, since it is targeted at low-end Unix SMPs rather than technical high-performance computing; it does not even have a Fortran binding. Pthreads is also low level, because it uses a library approach rather than compiler directives, and the library approach precludes compiler optimizations. Pthreads supports only thread parallelism, not fine-grain (loop-level) parallelism, whereas OpenMP is flexible enough to support both fine-grain and coarse-grain parallelism.

Pthreads also does not support incremental parallelism well. Given a sequential computing program, it is difficult for the user to parallelize it using Pthreads: the user has to worry about many low-level details, and Pthreads does not naturally support loop-level data parallelism.
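As an illustrative sketch of this point (the array and loop are invented for the example), a serial loop becomes a parallel one by adding a single OpenMP directive, whereas achieving the same effect with Pthreads would require explicit thread creation, index partitioning and joining.

#include <stdio.h>
#include <omp.h>

#define N 1000

int main(void)
{
    double a[N], b[N];
    int i;

    for (i = 0; i < N; i++)            /* serial initialization */
        b[i] = (double) i;

    /* the only change from the serial version is the directive below */
    #pragma omp parallel for
    for (i = 0; i < N; i++)
        a[i] = 2.0 * b[i];

    printf("a[N-1] = %f\n", a[N-1]);   /* expected: 2*(N-1) = 1998.0 */
    return 0;
}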

OpenMP is designed to alleviate the shortcomings discussed above. For wide acceptance of OpenMP, the key is to develop good compilers and run-time environments.

 

 
2.1.3. Why OpenMP? 

Shared-memory parallel programming directives have never been standardized in the industry. An earlier standardization effort, ANSI X3H5, was never formally adopted, so vendors have each provided a different set of directives, very similar in syntax and semantics, each using a unique comment or pragma notation for "portability". OpenMP consolidates these directive sets into a single syntax and semantics, and finally delivers the long-awaited promise of single-source portability for shared-memory parallelism.

OpenMP also addresses the inability of previous shared-memory directive sets to deal with coarse grain parallelism. In the past, limited support for coarse grain work has led to developers thinking that shared memory parallel programming was inherently limited to fine-grain parallelism. This is not the case with OpenMP. Orphaned directives in OpenMP offer the features necessary to represent coarse-grained algorithms.

2.1.4. Key Features of OpenMP

OpenMP incorporates the concept of orphan directives: directives that do not appear in the lexical extent of a parallel construct but lie in its dynamic (execution) extent. That is, they are not lexically enclosed in the parallel construct of the main program, but they are in its dynamic execution path.

The user parallelizes the main program using one or more parallel directives, and uses other directives to control execution in the parallel subroutines. In this way, major portions of the program can be executed in parallel with only small modifications. The concept also facilitates the development and reuse of modular parallel programs. X3H5 does not support orphan directives.
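A minimal C sketch of this idea (the routine and array names are illustrative): the work-sharing directive inside do_work is orphaned, since it is not lexically enclosed in a parallel construct, yet it binds to the parallel region opened in main when called from it.

#include <stdio.h>
#include <omp.h>

#define N 8

/* The "for" directive below is orphaned: it appears outside any
   lexically enclosing parallel construct, but lies in the dynamic
   extent of the parallel region opened in main().                 */
void do_work(int *data)
{
    int i;
    #pragma omp for
    for (i = 0; i < N; i++)
        data[i] = i * i;
}

int main(void)
{
    int data[N];
    int i;

    #pragma omp parallel               /* coarse-grain parallel region      */
    {
        do_work(data);                 /* iterations are shared by the team */
    }

    for (i = 0; i < N; i++)
        printf("data[%d] = %d\n", i, data[i]);
    return 0;
}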

Besides compiler directives, OpenMP provides a set of run-time library routines with associated environment variables. They are used to control and query the parallel execution environment, provide general-purpose lock functions, set the execution mode, and so on. For instance, OpenMP allows a throughput mode.

In throughput mode, the system dynamically sets the number of threads used to execute parallel regions. This can maximize the throughput performance of the system, possibly at the expense of prolonging the elapsed wall-clock time of an individual application. X3H5 and OpenMP have similar parallelism directives; only the notations differ slightly. OpenMP also adds a MASTER directive: the enclosed code is executed only by the master thread of the team.
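A short C sketch of the MASTER directive just described: only the thread with number 0 executes the enclosed block, and no implied barrier follows the construct.

#include <stdio.h>
#include <omp.h>

int main(void)
{
    #pragma omp parallel
    {
        /* every thread executes the parallel region ...                */
        #pragma omp master
        {
            /* ... but only the master thread (thread 0) executes this; */
            /* the other threads skip the block without synchronizing   */
            printf("team size = %d\n", omp_get_num_threads());
        }
    }
    return 0;
}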

OpenMP provides more flexible functionality to control the data environment than X3H5. For instance, OpenMP supports reduction by a REDUCTION(+ : sum) clause in a PARALLEL DO directive. A private copy of the reduction variable sum is created for each thread. The private copy is initialized to 0 according to the reduction operator +. Each thread computes a private result. At the end of PARALLEL DO, the reduction variable sum is updated to equal the result of combining the original value of the reduction variable sum with the private results using the operator +. Reduction operators other than + can be specified.
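The C/C++ form of the same reduction is sketched below (the array name and length are illustrative): each thread accumulates into its own private copy of sum, and the private copies are combined with + at the end of the loop.

#include <stdio.h>
#include <omp.h>

#define N 1000

int main(void)
{
    double a[N], sum = 0.0;
    int i;

    for (i = 0; i < N; i++)            /* fill the array serially */
        a[i] = 1.0;

    #pragma omp parallel for reduction(+:sum)
    for (i = 0; i < N; i++)            /* private partial sums are combined with + */
        sum += a[i];

    printf("sum = %f\n", sum);         /* expected value: 1000.0 */
    return 0;
}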

The clause DEFAULT(PRIVATE) directs all variables in a parallel region to be private, unless overridden by an explicit SHARED clause. There is also a DEFAULT(SHARED) clause to direct all variables to be shared. Default scoping makes it unnecessary to explicitly enumerate every variable, which saves the programmer's time and reduces errors.

OpenMP introduces an ATOMIC directive which allows the compiler to take advantage of the most efficient scheme available for implementing atomic updates to a variable. This can be more efficient than general mutual-exclusion constructs such as critical regions and locks.
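A minimal C sketch of the ATOMIC directive (the counter and loop bound are illustrative): the update of the shared variable is performed atomically, without the cost of a full critical region.

#include <stdio.h>
#include <omp.h>

int main(void)
{
    int hits = 0;
    int i;

    #pragma omp parallel for
    for (i = 0; i < 1000; i++) {
        #pragma omp atomic             /* protects only this single update */
        hits += 1;
    }

    printf("hits = %d\n", hits);       /* always 1000, regardless of thread count */
    return 0;
}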

2.1.5. Basic OpenMP Library calls

The most commonly used OpenMP run-time library calls in Fortran and C are explained below. For each routine, the Fortran syntax is shown first, followed by the C syntax.

OMP_SET_NUM_THREADS

SUBROUTINE OMP_SET_NUM_THREADS (scalar_integer_expression)
void omp_set_num_threads(int num_threads)

sets the number of threads to use in a team 

This subroutine sets the number of threads that will be used in the next parallel region. The dynamic threads mechanism modifies the effect of this routine. If enabled, specifies the maximum number of threads that can be used for any parallel region. If disabled, specifies exact number of threads to use until next call to this routine. This routine can only be called from the serial portions of the code. This call has precedence over the OMP_NUM_THREADS environment variable.
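As a hedged usage sketch (the thread count of 4 is arbitrary), the call is made from the serial part of the program, before the parallel region it is meant to affect.

#include <stdio.h>
#include <omp.h>

int main(void)
{
    omp_set_num_threads(4);            /* request 4 threads for the next region */

    #pragma omp parallel
    {
        printf("thread %d of %d\n",
               omp_get_thread_num(), omp_get_num_threads());
    }
    return 0;
}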

OMP_GET_NUM_THREADS

INTEGER FUNCTION OMP_GET_NUM_THREADS()
int omp_get_num_threads(void)

returns the number of threads in the currently executing parallel region.

This subroutine/function returns the number of threads that are currently in the team executing the parallel region from which it is called. If this call is made from a serial portion of the program, or a nested parallel region that is serialized, it will return 1. The default number of threads is implementation dependent.

OMP_GET_MAX_THREADS

INTEGER FUNCTION OMP_GET_MAX_THREADS()
int omp_get_max_threads(void)

returns the maximum value that can be returned by a call to the OMP_GET_NUM_THREADS function.

Generally reflects the number of threads as set by the OMP_NUM_THREADS environment variable or the OMP_SET_NUM_THREADS() library routine. This function can be called from both serial and parallel regions of code.   

OMP_GET_THREAD_NUM

INTEGER FUNCTION OMP_GET_THREAD_NUM()
int omp_get_thread_num(void)

returns the thread number within the team

This function returns the number of the calling thread within the team. The number will be between 0 and OMP_GET_NUM_THREADS()-1; the master thread of the team is thread 0. If called from a nested parallel region, or from a serial region, this function returns 0.   

OMP_GET_NUM_PROCS

INTEGER FUNCTION OMP_GET_NUM_PROCS()
int omp_get_num_procs(void)

returns the number of processors that are available to the program.  

OMP_IN_PARALLEL

LOGICAL FUNCTION OMP_IN_PARALLEL()
int omp_in_parallel(void)

returns .TRUE. for calls within a parallel region, .FALSE. otherwise.

This function/subroutine is called to determine if the section of code which is executing is parallel or not. For Fortran, this function returns .TRUE. if it is called from the dynamic extent of a region executing in parallel, and .FALSE. otherwise. For C/C++, it will return a non-zero integer if parallel, and zero otherwise.   
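The query routines described above can be combined as in the following short C sketch, which reports the execution environment from both the serial and the parallel parts of the program.

#include <stdio.h>
#include <omp.h>

int main(void)
{
    /* queries from the serial part of the program */
    printf("serial part  : in_parallel = %d, procs = %d, max threads = %d\n",
           omp_in_parallel(), omp_get_num_procs(), omp_get_max_threads());

    #pragma omp parallel
    {
        #pragma omp master
        {
            /* queries from inside a parallel region (printed once)   */
            printf("parallel part: in_parallel = %d, team size = %d\n",
                   omp_in_parallel(), omp_get_num_threads());
        }
    }
    return 0;
}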

OMP_SET_DYNAMIC

SUBROUTINE OMP_SET_DYNAMIC(scalar_logical_expression)
void omp_set_dynamic(int dynamic_threads)

controls the dynamic adjustment of the number of parallel threads.

This subroutine enables or disables dynamic adjustment (by the run time system) of the number of threads available for execution of parallel regions. For Fortran, if called with .TRUE. then the number of threads available for subsequent parallel regions can be adjusted automatically by the run-time environment. If called with .FALSE., dynamic adjustment is disabled. For C/C++, if dynamic_threads evaluates to non-zero, then the mechanism is enabled, otherwise it is disabled. The OMP_SET_DYNAMIC subroutine has precedence over the OMP_DYNAMIC environment variable. The default setting is implementation dependent. Must be called from a serial section of the program.   

OMP_GET_DYNAMIC

LOGICAL FUNCTION OMP_GET_DYNAMIC()
int omp_get_dynamic(void)

returns .TRUE. if dynamic threads is enabled, .FALSE. otherwise.

This function is used to determine if dynamic thread adjustment is enabled or not. For Fortran, this function returns .TRUE. if dynamic thread adjustment is enabled, and .FALSE. otherwise. For C/C++, non-zero will be returned if dynamic thread adjustment is enabled, and zero otherwise.   

OMP_SET_NESTED

SUBROUTINE OMP_SET_NESTED(scalar_logical_expression)
void omp_set_nested(int nested)

enables or disables nested parallelism.

This subroutine is used to enable or disable nested parallelism. For Fortran, calling this function with .FALSE. will disable nested parallelism, and calling with .TRUE. will enable it. For C/C++, if nested evaluates to non-zero, nested parallelism is enabled; otherwise it is disabled. The default is for nested parallelism to be disabled. This call has precedence over the OMP_NESTED environment variable.   

OMP_GET_NESTED

LOGICAL FUNCTION OMP_GET_NESTED()
int omp_get_nested(void)

returns .TRUE. if nested parallelism is enabled, .FALSE. otherwise.

This function/subroutine is used to determine if nested parallelism is enabled or not. For Fortran, this function returns .TRUE. if nested parallelism is enabled, and .FALSE. otherwise. For C/C++, non-zero will be returned if nested parallelism is enabled, and zero otherwise.   

OMP_INIT_LOCK

SUBROUTINE OMP_INIT_LOCK(var)
void omp_init_lock(omp_lock_t *lock)
void omp_init_nest_lock(omp_nest_lock_t *lock)

allocates and initializes the lock

This subroutine / function initializes a lock associated with the lock variable. The initial state is unlocked.   

OMP_DESTROY_LOCK

SUBROUTINE OMP_DESTROY_LOCK(var)
void omp_destroy_lock(omp_lock_t *lock)
void omp_destroy_nest_lock(omp_nest_lock_t *lock)

deallocates and frees the lock

This subroutine/function disassociates the given lock variable from any locks. It is illegal to call this routine with a lock variable that is not initialized.   

OMP_SET_LOCK

SUBROUTINE OMP_SET_LOCK(var)
void omp_set_lock(omp_lock_t *lock)
void omp_set_nest_lock(omp_nest_lock_t *lock)

acquires the lock, waiting until it becomes available if necessary.

This subroutine forces the executing thread to wait until the specified lock is available. A thread is granted ownership of a lock when it becomes available. It is illegal to call this routine with a lock variable that is not initialized.   

OMP_UNSET_LOCK

SUBROUTINE OMP_UNSET_LOCK(var)
void omp_unset_lock(omp_lock_t *lock)
void omp_unset_nest_lock(omp_nest_lock_t *lock)

releases the lock, resuming a waiting thread if any.

This subroutine releases the lock from the executing subroutine. It is illegal to call this routine with a lock variable that is not initialized.   

OMP_TEST_LOCK

LOGICAL FUNCTION OMP_TEST_LOCK(var)
int omp_test_lock(omp_lock_t *lock)
int omp_test_nest_lock(omp_nest_lock_t *lock)

tries to acquire the lock and returns success or failure

This subroutine attempts to set a lock, but does not block if the lock is unavailable. For Fortran, .TRUE. is returned if the lock was set successfully, otherwise .FALSE. is returned. For C/C++, non-zero is returned if the lock was set successfully, otherwise zero is returned. It is illegal to call this routine with a lock variable that is not initialized.
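The lock routines are typically used together, as in the following C sketch (the shared counter is illustrative), which serializes updates to a variable shared by the team.

#include <stdio.h>
#include <omp.h>

int main(void)
{
    omp_lock_t lock;
    int counter = 0;
    int i;

    omp_init_lock(&lock);              /* allocate and initialize the lock  */

    #pragma omp parallel for
    for (i = 0; i < 1000; i++) {
        omp_set_lock(&lock);           /* wait until the lock is available  */
        counter++;                     /* only one thread updates at a time */
        omp_unset_lock(&lock);         /* release the lock                  */
    }

    omp_destroy_lock(&lock);           /* deallocate the lock               */
    printf("counter = %d\n", counter); /* always 1000 */
    return 0;
}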

2.1.6. Compilation, Linking and Execution of OpenMP Program

Using command line arguments  

The compilation and execution details of an OpenMP program may vary on different computers. Depending upon your language preference, use a command of the following form to compile the program; with a Fortran 90 compiler, for example: 

# f90  <program name>  -openmp  -o  <name of executable> 

For example, to compile a simple Hello World program, the user can type 

# f90 Omp_Hello_World.f -openmp -o HelloWorld (Fortran codes) 

# cc Omp_Hello_World.c -xopenmp -o HelloWorld (C codes) 
 

Using Makefile 

For more control over the process of compiling and linking OpenMP programs, you should use a 'Makefile'. A Makefile is particularly useful for programs spread over a large number of files. The user has to specify the names of the program files and the appropriate paths to the libraries required for OpenMP in the Makefile.

To compile and link an OpenMP program in C or Fortran, you can use the command  make

For the hands-on exercises, the user can use Makefile_Fortran for Fortran programs and Makefile_C for C programs. 

 

Execution of program  

To execute an OpenMP program, simply type the name of the executable on the command line. 

< Name of executable> 

For example, to execute a simple Hello World program, the user can type  HelloWorld 

With three threads, the output should look similar to the following. The actual number of threads and the order of output may vary. 

Hello World from thread = 0 
Number of threads = 3 
Hello World from thread = 2 
Hello World from thread = 1 

Depending upon your shell, set the number of threads using the OMP_NUM_THREADS environment variable: 
  setenv OMP_NUM_THREADS  4      (csh/tcsh)   

   export OMP_NUM_THREADS=4      (bash/ksh) 

   

2.1.7. Compilation, Linking and Execution of OpenMP Programs on PARAM 10000  

OpenMP C and Fortran programs require SUN Workshop 6.2 for compilation and execution on one node, i.e. on the SUN UltraSPARC symmetric multiprocessing processors of a SUNFIRE 6800 node. An OpenMP program cannot be executed across multiple nodes of the SUNFIRE 6800.

To compile an OpenMP program, the user can type the following on the command line.

 For C programs

/usr/local/WS/6.2/SUNWspro/bin/cc -xopenmp -xO3 -o <executable name> <program name>

For FORTRAN programs

/usr/local/WS/6.2/SUNWspro/bin/f90 -openmp -O3 -o <executable name> <program name>

To execute an OpenMP program, simply type the name of the executable on the command line. 

 < Name of executable> 

For example, to execute a simple Hello World program, the user can type  

HelloWorld 

 

2.1.8. Example Program in Fortran/C language

          Simple OpenMP Program "Omp_Hello_World.f"

The simplest OpenMP program is the "Hello World" program, in which each thread simply prints the message "Hello World". In this example, the threads with identifiers 0, 1, 2, ......, p-1 will each print the message "Hello World".

The OpenMP program in Fortran in which each thread prints the "Hello World" message is explained below, fragment by fragment. 

The first few lines of the program contain variable definitions and constants, followed by declarations of the OpenMP library calls used in the program. The library call OMP_GET_THREAD_NUM returns ThreadID, the identifier of each thread, and the library call OMP_GET_NUM_THREADS returns NoofThreads, the total number of threads that the user has started for this program.

The following fragment of the program shows these declarations:

program HelloWorld
integer NoofThreads, ThreadID, OMP_GET_NUM_THREADS
integer OMP_GET_THREAD_NUM

ThreadID is the identifier of each thread and NoofThreads is the total number of threads used in the program. ThreadID and NoofThreads are private to each thread. Each thread obtains its own identifier and then prints the message "Hello World" in parallel.

The OpenMP PARALLEL directive and PRIVATE clause come next. The PARALLEL/END PARALLEL directive pair specifies that the enclosed block of the program, referred to as a parallel region, be executed in parallel by multiple threads. The PRIVATE clause is typically used to identify variables that are used as scratch storage within the parallel region; it provides a list of variables and specifies that each thread have its own private copy of those variables for the duration of the parallel region.

C$OMP  PARALLEL PRIVATE(NoofThreads, ThreadID)

Each thread gets its own copy of these variables, obtains its identifier, and prints it:

ThreadID = OMP_GET_THREAD_NUM()
print *, 'Hello World from thread = ', ThreadID

Only the master thread obtains NoofThreads, the total number of threads used in the program, and prints it:

if (ThreadID .EQ. 0) then
        NoofThreads = OMP_GET_NUM_THREADS()
        print *, 'Number of threads = ', NoofThreads
 end if

The END PARALLEL directive ends the parallel region: all threads join the master thread and disband.

C$OMP  END PARALLEL

stop
end

 

The equivalent OpenMP program in C, in which each thread prints the "Hello World" message, is given below.

#include <stdio.h>
#include <omp.h>

/* Main Program */
int main()
{

int ThreadID, NoofThreads;


/* OpenMP Parallel Directive */

#pragma omp parallel private(ThreadID)
{

ThreadID = omp_get_thread_num();
printf("\nHello World is being printed by the thread id %d\n", ThreadID);

/* Master Thread Has Its ThreadID 0 */

if (ThreadID == 0) {

printf("\nMaster prints Numof threads \n");
NoofThreads = omp_get_num_threads();
printf("Total number of threads are %d\n", NoofThreads);

}

} /* End Of Parallel Region */

return 0;
}

 

2.1.9. List of Tools available in OpenMP

Performance is a critical issue on current clusters of workstations. Performance evaluation and visualization are important and useful techniques that help the user understand the behavior of a parallel program and improve its performance. 

There are many problems associated with debugging an OpenMP program: it is difficult to follow the flow of the program's execution, and accessing variables is complicated by the distinction between private and shared variables in an OpenMP program.

There are many tools available for debugging and performance visualization of OpenMP programs. Some of them are listed below:

KAP/Pro Toolset for OpenMP

The KAP/Pro Toolset consists of

  • Guide: an OpenMP compiler;

  • GuideView: a performance analysis tool for OpenMP; and

  • Assure: an OpenMP analyzer, a tool for verifying the correctness of OpenMP applications.

The Guide OpenMP Compiler provides OpenMP directive-based application development for Fortran, C, and C++. The GuideView Performance Analyzer presents a window into the performance details of a program's parallel execution. The Assure OpenMP Analyzer is the industry's first parallel correctness verifier.

They are available at

http://developer.intel.com/software/products/threadtool.htm

With the KAP/Pro tools and OpenMP, the application developer can easily and quickly develop, debug and tune programs for Windows NT and Unix. The Toolset provides a major advance in usability over alternative systems.


Sun's Compilers and Tools for OpenMP

http://www.cs.uh.edu/wompat2000/SLIDES/itzkowitz.pdf

TotalView

TotalView is the debugger for complex code. TotalView is far and away the best choice for those working with parallelism or large amounts of data because it scales transparently to support the big code and data sets running on anywhere from one to thousands of processes or processors. It's been proven in the world's toughest debugging environments.

It is available at

http://www.etnus.com/Products/TotalView

TotalView's support for OpenMP debugging lets you view the state of your program as if it were a non-parallel code. With TotalView, you can

  • Debug threaded codes whether OpenMP directives are present or not.

  • Understand OpenMP code execution.

  • Access private and shared variables as well as threadprivate variables.


On SGI systems, OpenMP is supported by the standard development tools: ProDev Workshop (including ProMPF) for IRIX systems.

2.1.10. OpenMP Information on the Web

A large number of resources are available on the Internet. The following is just a pointer to a few of them.  

The official site for OpenMP - Simple, Portable, Scalable SMP Programming 

  http://www.openmp.org

The OpenMP Tutorial

  http://hpcf.nersc.gov/training/tutorials/openmp

Parallel Programming with OpenMP

  http://oscinfo.osc.edu/training/openmp/big/fsld.002.html

The Usenet Newsgroup for OpenMP from Google Web Site

  comp.parallel   comp.lang.fortran

2.1.11. Reference Books on OpenMP

[1]  Rohit Chandra, Leonardo Dagum, et al., Parallel Programming in OpenMP, Morgan Kaufmann Publishers, San Francisco, CA. 
            http://www.mkp.com 

[2] "OpenMP C and C++ Application Program Interface, Version 1.0". OpenMP Architecture Review Board. October 1998. 

[3] "OpenMP Fortran Application Program Interface, Version 1.0". OpenMP   Architecture Review Board. October 1997. 

[4]  "OpenMP". Workshop presentation. John Engle, Lawrence Livermore National   Laboratory. October, 1998. 

[5] "OpenMP". Alliance 98 Tutorial. Faisel Saied, NCSA 

[6] "Introduction to OpenMP Using the KAP/PRO Toolset". Kuck & Associates, Inc. 

[7] "Guide Reference Manual (C/C++ Edition, Version 3.6". Kuck & Associates, Inc. 

[8] "Guide Reference Manual (Fortran Edition, Version 3.6". Kuck & Associates,   Inc. 

2.1.12. OpenMP FAQ 

The OpenMP FAQ (frequently asked questions) list is available at  http://www.openmp.org

Q1: What is OpenMP?
Q2: What does the MP in OpenMP stand for?
Q3: How does OpenMP compare with ... ?
Q4: What languages does OpenMP work with?
Q5: Is OpenMP scalable?
Q6: Can I execute OpenMP program on 2 nodes of PARAM 10000?

Q1:What is OpenMP?
A1:OpenMP is a specification for a set of compiler directives, library routines, and environment variables that can be used to specify shared memory parallelism in Fortran and C/C++ programs.

Q2: What does the MP in OpenMP stand for?
A2: The MP in OpenMP stands for Multi Processing. The OpenMP partners provide open specifications for multiprocessing via collaborative work with interested parties from the hardware and software industry, government, and academia.

Q3:How does OpenMP compare with ... ?
A3: MPI? Message-passing has become accepted as a portable style of parallel programming, but has several significant weaknesses that limit its effectiveness and scalability. Message-passing in general is difficult to program and doesn't support incremental parallelization of an existing sequential program. Message-passing was initially defined for client/server applications running across a network, and so includes costly semantics (including message queuing and selection and the assumption of wholly separate memories) that are often not required by tightly-coded scientific applications running on modern scalable systems with globally addressable and cache coherent distributed memories.

HPF? HPF has never really gained wide acceptance among parallel application developers or hardware vendors. Some applications written in HPF perform well, but others find that limitations resulting from the HPF language itself or the compiler implementations lead to disappointing performance. HPF's focus on data parallelism has also limited its appeal.

Pthreads? Pthreads have never been targeted toward the technical/HPC market. This is reflected in the minimal Fortran support, and its lack of support for data parallelism. Even for C applications, pthreads requires programming at a level lower than most technical developers would prefer.

FORALL loops? FORALL loops are not rich or general enough to use as a complete parallel programming model. Their focus on loops and the rule that subroutines called by those loops can't have side effects effectively limit their scalability. FORALL loops are useful for providing information to automatic parallelizing compilers and preprocessors.

BSP or LINDA or SISAL or...? There are lots of parallel programming languages being researched or prototyped in the industry. These may be targeted towards a specific architecture, or focused on exploring one key requirement. If you have a question about how OpenMP compares with a specific language or model, we can help you figure this out.

Q4: What languages does OpenMP work with?
A4: OpenMP is designed for Fortran, C and C++, supporting whichever of these languages the underlying compiler supports. The OpenMP specification does not introduce any constructs that require specific Fortran 90 or C++ features. OpenMP cannot be supported by compilers that do not support one of Fortran 77, Fortran 90, ANSI C (C89) or ANSI C++.

Q5: Is OpenMP scalable?
A5: OpenMP can deliver scalability for applications using shared-memory parallel programming. Significant effort was spent to ensure that OpenMP can be used for scalable applications. Ultimately, scalability is a property of the application and the algorithms used. The parallel programming language can only support scalability by providing constructs that simplify the specification of the parallelism and that can be implemented with low overhead by compiler vendors. OpenMP certainly delivers these kinds of constructs.

Q6: Can I execute OpenMP program on 2 nodes of PARAM 10000?
A6: No, you cannot execute OpenMP programs on 2 nodes of PARAM 10000. OpenMP is an Application Program Interface (API) that may be used to explicitly direct multi-threaded, shared-memory parallelism within a single node; it is a specification for a set of compiler directives, library routines and environment variables that can be used to specify shared-memory parallelism.

 