Programming Language Concepts Using C and C++/Control Level Structure

In this chapter, we discuss implicit and explicit control structures, structures that affect the sequence in which the computer performs the set of operations that make up the program, at the expression level, the statement level, and the subprogram level.

This sequence will sometimes be implicitly specified by the rules of the language, while at times it will be imposed by the programmer explicitly by means of some construct found in the programming language. The former is referred to as an implicit control structure, whereas the latter is called an explicit control structure.

Control Over Operations

edit

Control at the expression level is concerned with the control over operations, the order of operations to be performed in computing the value of an expression.

Expressions

edit

An expression is a formula for computing a value, and it is represented as a formalized sequence of operators, operands, and parentheses. An expression results in a computed value that resides in a temporary storage location until it is used. So, an expression has an implicit type that is derived from its subexpressions.

An expression is written as a sequence of operators, operands, and parentheses. An operator is a primitive function represented by a single symbol or a string of two symbols. An operand, which represents a data value and is actually a means of accessing a value, may be a constant, a variable name, a function reference, an array reference, or even another expression.

Programming languages provide implicit control over expressions by means of precedence and associativity rules. Programmer can impose explicit control through the use of parentheses.

Expression evaluation order.
 9 + 7 * 3 is evaluated as 9 + (7 * 3) because * has higher precedence than +.
 9 + 7 + 3 is evaluated as (9 + 7) + 3 because + associates to left.
 (9 + 7) * 3 is evaluated differently than the first expression due to the use of parentheses.
Many PL’s do not specify the order in which the operands of an operator are evaluated. That is, in subexp1 op subexp2, the programmer cannot make the assumption of subexp1 being evaluated before subexp2; this is true for subprogram argument evaluation order, too. In both cases, compiler is free to choose the order of evaluation: while a compiler behaves in accordance with your expectations, another may choose to upset them. Best way to avoid this is to replace side-effect producing expressions with expressions without side-effects. For example, greater_than(i++, i); must be replaced with

k = i++; greater_than(k, i); One final note: programming languages may provide exceptions to this general rule. For instance, in C operands of &&, ||, ?:, and , are evaluated in well-defined order.

An operator may be classified on the basis of the number of operands it takes.

  • Unary: It takes only one operand. Examples are ++, -- operators in C-based programming languages.
  • Binary: The operator takes two operands. Typical examples are the fundamental arithmetic operators +, -, *, and /.
  • Ternary: There is only one operator that takes three operands, conditional expression: ?:
Example: C function that returns the maximum of two values.
int max(int i, int j) { return(i > j ? i : j); }

Representation of Expressions

edit

There are a variety of ways to represent expressions. These are:

  1. Functional form: Functional form represents all operations as functions; the operands are arguments to these functions. It may also be called applicative form or ordinary prefix form. Using this representation, 9 + 7 * 3 would be expressed as Add(9, Mult(7, 3)).
  2. Cambridge Polish: Being the type of functional form employed in LISP, the parentheses surround the operator and its associated operands. According to this representaiton, 9 + 7 * 3 is expressed as (Add 9 (Mult 7 3)).
  3. Infix: This is the traditional way of representing expressions where the operator appears in the middle of the operands.
  4. Prefix: In prefix notation, the operator appears before the operands. It is also referred to as the Polish notation. Using this notation, 9 + 7 * 3 is expressed as + * 7 3 9.
  5. Postfix: In postfix notation, operands appear before the operator. It is also known as the Reverse Polish notation. 9 + 7 * 3 is expressed as 7 3 * 9 +.

Note that the prefix and postfix notations are parenthesis-free. That is because we do not need to impose the order of evaluation explicitly; the sequence of operators and operands alone are sufficient to indicate the order of operation application.

Control Over Statements

edit

Control at the statement level is concerned with the ordering of the statements that make up the program–a composition that may be accomplished sequentially, by selection among alternatives, or by iteration.

Simple Sequence

edit

Simple sequence, or sequential flow, is the default control flow at the statement level. Execution of the statements follows their order in the program source. Often, simple sequence alone is insufficient for expressing the necessary computation.

Selection

edit

Selective composition is employed when we wish to choose among two or more alternative statements (or, groups of statements). The typical examples of selective composition are:

  • if: An if statement provides for the conditional execution of either a statement or statement block based on whether a specified expression is true. Generally, it takes the form

if (condition) statement; else statement;

Some programming languages, such as Scheme and Perl, offer a negated version of the if statement: unless.
  • case: Deeply nested if-else statements can often be correct syntactically and yet not express the intended logic of the programmer. Unintended else-if matchings, for example, are more likely to pass unnoticed. Modifications to the statements are also much harder to get right. That’s why, as an alternative method of choosing among a set of mutually exclusive choices, most programming languages provide a case statement.
One point to keep in mind: many programming languages constrain the type of selector expression to integral values. That is, the following fragment may be rejected.

switch (name) { case "Deniz": ... break; case "Defne": ... break; case "Mete": ... break; ... default: ... } /* end of switch(name) */

Example: A C++ program fragment that counts the number of occurrences of each vowel.

char ch; int aCnt = 0, eCnt = 0, iCnt = 0, oCnt = 0, uCnt = 0, consonantCnt = 0, othersCnt = 0;

// switch-case version while (cin >> ch) switch (ch) { case 'a': case 'A': ++aCnt; break; case 'e': case 'E': ++eCnt; break; case 'i': case 'I': ++iCnt; break; case 'o': case 'O': ++oCnt; break; case 'u': case 'U': ++uCnt; break; default: if (isalpha(ch)) ++consonantCnt; else ++othersCnt; } /* end of switch(ch) */

// if-else version while (cin >> ch) if (ch == a || ch == A) ++aCnt; else if (ch == e || ch == E) ++eCnt; else if (ch == i || ch == I) ++iCnt; else if (ch == o || ch == O) ++oCnt; else if (ch == u || ch == U) ++uCnt; else if (isalpha(ch)) ++consonantCnt; else ++othersCnt;

Iteration

edit

Iterative composition is employed when part of a program is to be executed for zero or more times. There are four basic iteration constructs.

Indexed Loop

edit

Also known as the deterministic loop, we use this structure when we know the number of times the loop will be executed. An example is the for loop in Pascal.

Example: A Pascal fragment that computes the summation of numbers from 1 to n.

sum := 0; for counter := 1 to n do sum := sum + counter;

Note that the for statement in C-based languages is a much more powerful, general construct and should be treated as a test-before loop. In addition to implementing nondeterministic loops, deterministic ones can easily be simulated. For example, the above fragment can be written as

for (sum = 0, counter = 1; counter <= n; counter++) sum += counter;

There is a variation on the for loop that iterates over a list of values. It takes each value in the list—instead of a number from a number range—and executes one or more commands on it.

Example: A csh fragment that checks whether each command-line argument is an option or not.

foreach arg ($argv) # Does it begin with ‘-‘? if ($arg =~ -*) then echo “An option” else echo “Not an option” endif end

Test-Before Loop

edit

Being one of the two nondeterministic loops, the condition is checked at the beginning of the loop. It means that the loop body may never get executed. A typical example is the while statement in ALGOL-based programming languages.

Example: A Modula-3 subprogram that computes the greatest common divisor of two numbers.

PROCEDURE gcd(x, y: CARDINAL): CARDINAL = BEGIN WHILE y # 0 DO VAR temp: CARDINAL := y; BEGIN y := x MOD y; x := temp; END; END; RETURN x; END gcd;

Example: A C fragment to count the number of characters on a single line.

for (n = 0; getchar() != ‘\n; ++n) /* no-op */ ;

Note that the closing parenthesis is followed by an isolated semicolon. This means that the for loop contains a null statement.

The general form of the for loop in C-based languages is given below. expression-1 is evaluated once before the loop starts. It serves to make initializations. expression-2 is evaluated before each iteration and as soon as it evaluates to zero (or false in safer programming languages like Java and C#) execution will continue at the line following the for loop. expression-3 is evaluated at the end of each iteration.

for (expression1; expression2; expression3) statement;

Any of these expressions can be omitted, although the semicolons must remain. If the first expression is omitted, the for loop simply does not perform any initializations. If the third expression is omitted, there won’t be any side effects taking place at the end of the iterations. As for the absence of the second expression, it will be taken to be true. That is,

for ( ; ; ) statement; while (any non-zero value) statement;

In Java and C#, it is equivalent to while (true) statement;

For more on the equivalence of for and while loops, see the Goto Statement section.

Test-After Loop

edit

In the case of a test after loop, the condition is checked at the end of the loop. So, the loop body is executed at least once. Typical examples are the repeat-until statement in Pascal-based programming languages and do-while statement in C-based programming languages.

Example: A C fragment that keeps reading from the input stream until 'a' is read.

do ch = getchar(); while (ch != a);

Unconditional Loop

edit

In addition to these three constructs, some programming languages provide an unconditional looping structure. This structure is typically used together with an exit statement. An example to this is the LOOP-END structure in Modula-3.

Example: A Modula-3 fragment that computes the summation of numbers from 1 to n.

sum := 0; LOOP IF counter > n THEN EXIT; END; /* end of IF */ sum := sum + counter; INC(counter, 1); END;

Goto Statement

edit

It has been shown that if constructs for selection and iteration are available, then any program can be written without the use of the goto statement. But it doesn’t mean that it is entirely useless. Different forms of iteration can also be written in terms of others. Does it mean that they are useless? No!

Expressing the C-style for-loop in terms of the while-loop.

for (expression1; expression2; expression3) statement;

can be rewritten using while loop as follows

expression1; while (expression2) { statement; expression3; }

It is granted that goto is a rather low-level control structure and it had better be avoided where possible. But there may be places where it turns out to be the best choice. For instance, what can we do when we have to exit a loop prematurely? Some programming languages, such as C, C++, Java[1] and Modula-3, as seen from the example of unconditional looping structure, provide control structures like break and continue; but there are programming languages that do not[2]. The only answer in such cases is either to complicate the test expression of the loop or use a goto with its target as the statement following the end of the loop. So, the corollary is: avoid its use but don’t ever say that it is entirely useless.

Control Over Subprograms

edit

Control at the subprogram level is concerned with subprogram invocation and the relationship between the calling module and the called module. What follows is a list of possible ways the caller and the callee are related.

Simple Call

edit

In simple call, the caller exerts total control over the callee. That is, there is a master/slave relationship between them.

 

When a subprogram is invoked, control of execution branches to the entry point of the subprogram, while the address of the next instruction, the one that will be processed after completion of the subprogram, is saved. When the exit point of the subprogram is encountered during execution, control returns to the caller and to the instruction corresponding to the address saved at the point of call.

The constraints inherent in this sort of relationship between the caller and the callee are:

  1. A subprogram cannot call itself.
  2. The subprogram is invoked by means of an explicit call statement within the sequence of statements that make up the calling program.
  3. Only a single subprogram has control of execution at any one time. Thus, two subprograms cannot be invoked to execute concurrently.
  4. And, of course, the calling program has total control over the subordinate subprogram, as a master to slave.

Recursion

edit

When we remove the first constraint in the above list we get recursion. A separate activation record (frame) is created for each invocation of the recursive subprogram. Considering the cost of a call instruction and space used in the run-time stack, it is fair to say that a recursive solution is likely to be less efficient than its iterative counterpart with regard to both time and space considerations. However, for the sake of fairness, it should be stressed that the cost is due to the call instruction, not recursion itself.

In functional programming languages, recursion, a looping structure in disguise, is preferred over iteration. For avoiding the performance penalty and the run-time stack overflow due to function calls, Scheme—a functional programming language—dictates that tail-recursive calls be transformed to goto statements by the compiler. For the same reasons, many compilers—not languages!—include tail-recursion removal in their bag of tricks.

Example: Tail recursion removal using gcc.

#include <stdio.h> int add_expensive(int a, int b) { if (b > 0) return add_expensive(a + 1, b - 1); else if (b < 0) return add_expensive(a 1, b + 1); else return a; } /* end of int add_expensive(int, int) */ int main(void) { int a, b; printf("Enter two integers: "); scanf("%i %i", &a, &b); printf("%i and %i added gives %i\n", a, b, add_expensive(a, b)); return 0; } /* end of int main(void) */

If you run the above program after compiling it without optimizations, for large enough numbers it will exit without writing any results. With optimizations turned on, however, output is guaranteed.

# Instead of -foptimize-sibling-calls you can use -O2, -O3, or -Os gcc -foptimize-sibling-calls -o Tail_Recursion.exe Tail_Recursion.c↵ # Using DJGPP-gcc Tail_Recursion↵ Enter two integers: 123456789 987654321↵ 123456789 and 987654321 added gives 1111111110 gcc -o Tail_Recursion.exe Tail_Recursion.c↵ Tail_Recursion↵ Enter two integers: 123456789 987654321↵

Some programming languages require a subprogram to be explicitly declared recursive. Examples are PL/I and FORTRAN 90. Pre-90 versions of FORTRAN and COBOL do not even allow recursive subprogram definitions. The reason why such a crucial problem solving tool is ruled out is not because the designers of these programming language could not foresee the use of recursion. These programming languages, when they came out, had to compete with assembly and machine code. For making this goal more attainable, their designers decided to make all data entities static, which automatically excluded the possibility of recursion.

Implicit Calls

edit

When we lift the requirement that subprograms must be called explicitly, we have the possibility of subprograms being called outside the control of the programmer. This happens in two ways:

  1. Exception Handling: An exception is an event that occurs unexpectedly, infrequently, and at random intervals. Examples are divide by zero, subscript out of range, and end-of-file. An exception handler is a subprogram written by the programmer, interfaces with the operating system, and is invoked only when a specified “exceptional condition” is encountered.
  1. Scheduled Call: A scheduled call is typical of the type of subprogram control used in event-oriented simulation programs. Control over some subprograms is managed by a timing mechanism, a subprogram that may be built into the programming language or coded by the programmer. For example, suppose subprogram A "calls"—that is, pre-schedules—subprogram B with a particular activation time.[3] An activation record is created for subprogram B and inserted into a queue maintained in ascending order by activation time. The timing routine periodically examines the first record in this queue; when its activation time matches the current simulated time, the record is removed from the queue, placed on the run-time stack, and the subprogram is invoked. Subprograms may also be scheduled to occur not at an explicit time, but "as soon as possible." This might occur when the subprogram depends on the availability of a particular resource or the completion of another subprogram.

Parallel Processing

edit

Parallel processing, also called concurrent processing, refers to the concurrent execution of two or more subprograms in order to solve a specific problem. Parallelism may be real, that is, executing simultaneously in real time on multiple processors, or virtual, simulated parallelism on a single central processor.

 
Example: Tasks in Ada83.
Suppose that the accounting office of a small firm is responsible for carrying out these functions (tasks): ordering supplies, paying bills, and preparing invoices. With one accountant, the process might be represented as:

PROCEDURE Accounting IS BEGIN Order_Supplies; Pay_Bills; Prepare_Invoices; END Accounting;

With three accountants, however, the tasks could be performed in parallel. A typical Ada representation of such a situation would be:

PROCEDURE Accounting IS TASK Order; TASK BODY Order IS BEGIN Order_Supplies; END Order; TASK Pay; TASK BODY Pay IS BEGIN Pay_Bills; END Pay; BEGIN Prepare_Invoices; END Accounting;

In designing programs with parallel control, a programmer must consider some important problems including synchronization, critical section, and deadlock.

Coroutines

edit

If we remove the requirement of caller having total control over callee, we get mutual control. Subprograms exerting mutual control over each other are called coroutines. Coroutines can each call the other. But, different from an ordinary call, a call by B to A actually transfers control to the point in subprogram A where A last called B. A different keyword, such as resume, may be used in place of call.

 

The way different processes in an operating system are related to each other is pretty much the same. When a process gives up the processor another process gets executed on the processor. Once that process completes its time slot, it yields the processor, probably back to the previous process. This process does not start execution from the first line but rather from where it left off before.

Notes

edit
  1. Java does not have a goto statement. But it is a reserved word
  2. Unlike Java, C and C++ don’t have labeled-breaks. In cases where we need to break out of deeply nested loops we may have to end up using a solution with a goto.
  3. In a simulation program, this usually refers to simulated time, not real time.