Theory of Computation

Mathematical Foundations of Computer Science

Introduction

The notes on Mathematical Foundations (or the Theory of Computation) presented below are mainly based on Cohen, Daniel (1997), Introduction to Computer Theory, 2nd Edition, John Wiley & Sons, and Hopcroft, J., Motwani, R. & Ullman, J. (2007), Introduction to Automata Theory, Languages, and Computation, 3rd Edition, Addison Wesley, and Sipser, M. (2012) Introduction to the Theory of Computation, Cengage Learning, 3rd edition. We will emphasize mathematical models of computation in this session. Chapter 1 of Cohen (1997) includes a brief history of the subject; read it and pay attention to the following questions:

1. What was the main theme of David Hilbert's speech in 1900 that turned out to be of major significance to the development of computer science?

2. What was the 10th problem on Hilbert's list?

3. What did Turing prove about his abstract machine?

4. How did World War II affect the development of the computer?

The way we shall be studying about computers is to build mathematical models, which we shall call machines, and then to study their limitations by analyzing the types of input on which they operate successfully. (Cohen 1997, page 3). The most important mathematical models are Finite Automata (FA), Pushdown Automata (PDA), and Turing Machines (TMs). The most powerful automata are the Turing Machines developed by Alan Turing (1936) because these machines can process (accept) all recursively enumerable sets (or computable languages) whereas Pushdown Automata can process a proper subset called Context-Free languages, and Finitie Automata can process a proper subset (Regular languages) of Context-Free languages. Our main goal is to understand the relationship among these languages and their corresponding automata (FA, PDA and TMs) as shown in the diagram given below.

Diagram 1: Elements of the Automata Theory

Please note that RG = Regular Grammar, CFG = Context-Free Grammar, UG = Unrestricted Grammar (or Unrestricted Phrase Structure Grammar).

Turing's goal was to describe precisely the boundary between what a computing machine could do and what it could not do; his conclusions apply not only to his abstract Turing Machines, but to today's real machines. (Hopcroft, Motwani & Ullman 2007, Page 1). Today's computers are Turing Machines. Pushdown Automata (PDA) are computationally less powerful than Turing Machines and Finite Automata (FA) are less powerful than PDA. One of the best ways to understand the computing power of automata is to examine the way in which various sets are processed by these automata. These sets are also called languages because these sets can be also defined by grammars following Noam Chomsky's suggestions. If you are not familiar with grammar formalisms, you can skip the grammatical aspects initially.

Overview

Automata theory provides theoretical (or mathematical) foundations to computer science. In this section we discuss some intuitive relationship among the primary automata classes: Finite Automata (FA), Pushdown Automata (PDA), and Turing Machines (TMs).

It is good to start with a Finite Automaton for processing a regular language (or set), because Finite Automata are easy to understand. Consider the following set or language: {a, ab, abb, abbb, abbbb, abbbbb, . . . }

The elements of the set are separated by commas; each element is a string (or token). There is a regular expression for every regular language (or set). In other words, a regular expression denotes a set. Thus, ab* is a regular expression that denotes the above set. The * at the end of ab* is known as Kleene-star. By ab* Kleene implied one a followed by zero or more b's. We can write ab* = {a, ab, abb, abbb, abbbb, abbbbb, . . . }.

The following finite automation accepts every string of the set denoted by ab*.

Diagram 2: A Finite Automaton for ab* (using Cohen's notation)
The above finite automaton accepts the string a by starting at the start state (marked by -) and taking the transition (arrow) marked by a it reaches the final state (or accept state). The same automaton accepts the string ab because it starts from the start state, and then takes transition (arrow) marked by a and then reaches the final state and then takes loop transition marked by b and comes back to the final state (or accept state). The accepting conditions for FA are: (1) the input must be finished (i.e., no more symbols to scan), (2) the automaton is in a final state. Compared to Finite Automata, Turing Machines (TMs) are more powerful because they can read, write, move the Tape-Head (Read-Write-Head) forward (Right), or backward (Left). A TM for the same set is given below: Turing_Machine_ab_star_

Diagram 3: A Turing Machine for ab* (using Cohen's notation)
As shown in Diagram 3, every transition (arrow) in a TM has three components (Read, Write, Move). A TM accepts an input string by reaching the Halt state. The execution of a TM halts at the entrance to the Halt state; the Halt state cannot have a out going transition. Finite Automata (FA) cannot accept non-regular languages; Turing Machines (TMs) can accept regular and non-regular languages (that is, all recursively enumerable languages or sets). Cohen (1997) presents a proof for showing that Read only Turing Machines are equivalent to Finite Automata. In many programming languages equal number of {'s and }'s must be matched which cannot be done with Finite Automata. The following Turing Machine accepts strings such as {}, {{}}, {{{}}}, {{{{}}}}, that characterize many programming languages; _Turing_Machine_Ob_n_Cb_n_

Diagram 4: A Turing Machine for { {ⁿ }ⁿ } with a trace for {{}}

Turing Machines (TMs) are also known as algorithm machines, because every algorithm can be expressed as a TM. TMs are the most powerful well-defined machines. Finite Automata (FA) are useful for regular languages which are relatively simple sets denoted by regular expressions; they are not good for non-regular languages. Pushdown Automata (PDA) are acceptors of Context-Free Languages(CFLs) that include programming languages, because syntax of programming languages can be defined by Context-Free Grammars (CFGs). PDA can process programming languages efficiently by using a stack. There are other aspects of theoretical foundations that have great implications for scope and limitations of computer science that we will study.

There are several reasons why the study of automata and complexity is an important part of the core of Computer Science. The principal motivation and some of the important applications are briefly mentioned below:

(1) Software for designing and checking the behavior of digital circuits is modeled with Finite Automata (FA).

(2) Finite Automata (FA) model and define the lexical analyzer of a typical compiler where the input text is broken into logical units such as identifiers, keywords, and punctuation.

(3) Software for searching large bodies of text is also modeled in Finite Automata (FA).

(4) The Syntax Analyzer of a typical compiler is modeled by Pushdown Automata (PDA).

(5) Turing Machines (TMs) define and model computability in the most elegant form. TMs also allow clear distinction between intractable and tractable computation. TMs define un-decidable problems in an elegant way.

FINITE AUTOMATA (See Hopcroft, Motwani & Ullman (2007) Chapter 2 & 3, Cohen (1997) Chapters 4-6, and Sipser (2012)Chapter 1)

This section introduces "Regular Expressions", "Regular Languages" and Finite Automata (FA). A finite automaton is an abstract machine which can process words or strings from a Regular Language or Regular Expressions. These terms will be defined after some intuitive introductions. An FA is a machine that reads an input string or word one character at a time from a Read Only Input Buffer.

Figure 1: A Finite Automaton (using Sipser's notation) for ab* = {a, ab, abb, abbb, abbbb, abbbbb, . . . }
Assume that abb is given as an input to this FA. Beginning in the start state, the machine reads the first letter which is a and takes the transition marked by a and goes to the state q₁. From the state q₁ the machine reads the next character which is a b and takes the loop transition marked by b . The machine then reads the next character which is a b and takes the same loop transition again. By now the machine has finished the input string and ends in the final state or accept state and therefore, it accepts the input. The same FA can be represented in a different notation as in Figure 2; this notation is popularized by Cohen (1997).

Figure 2: A Finite Automaton (FA) for ab* in the notation of Cohen (1997)
A type of dynamic visualization of the above FA is given at: http://www.asethome.org/fa/. For a demonstration, please click here.

For a definition of FA, please click here or visit http://www.asethome.org/fa/fa_regularExpression.html.

A non-deterministic finite Automaton (NFA) for ab* is given in Figure 3 which looks simpler than the deterministic FA (DFA) given in Figure 2. However, the DFA of Figure 2 and the NFA of Figure 3 are equivalent, because both accept every string from the set {a, ab, abb, abbb, abbbb, abbbbb, . . . } = ab*.

Figure 3: A Non-deterministic Finite Automaton (NFA) for ab*
The above NFA of Figure 3 is represented in Figure 4 in the notation of Cohen (1997).

Figure 4: A Non-deterministic Finite Automaton (NFA) for ab* in the notation of Cohen (1997)

We begin our study of computation considering the nature of input, that is, the strings that may be given as input to a machine or automata. In order to study the input strings or string patterns precisely, formal languages are proposed. By formal we don't mean dressed up, we mean languages with a form defined by rules that say which strings are in the language and which are not. We are only interested in syntax here, not semantics (meaning). We are interested in the form of a string of symbols, not the meaning of the string.

The finite set of symbols that we will use to build our strings is called the alphabet. We will have rules that tell us how to put symbols together to form valid strings called words or strings. The entire set of valid strings is called a language. In other words, a language is a set of well-formed strings.

The expressions we have been using are called regular expressions. The languages they define are called regular languages. For a detailed description of regular languages please visit this site

Here is a recursive definition of the set of regular expressions over the alphabet Σ

1. Every element of alphabet (Σ) is a regular expression; ^ (null) is a regular expression.
2. If x and y are regular expressions, then so are x+y (union), xy (concatenation), x* (Kleene star), and (x).
3. Nothing else is a regular expression.

According to rule 2, union, concatenation, Kleene Star and parentheses can be used to create new regular expressions from old ones. Suppose

= { a, b }, then a and b are regular expressions according to rule 1. According to rule 2, ab, a+b, a* and (a) are also regular expressions . Please note that a+b is often written as a U b (see Sipser, M. (2012) Introduction to the Theory of Computation , Course Technology.)

We use

(phi) to denote the regular expression that generates the empty language

(a+b)*a(a+b)* generates strings containing at least one a. The string aabab is such a string. How could we get that string using the regular expression (a+b)*a(a+b)*? In three different ways. The required a could be either the first, the second, or the third a in aabab.

The expressions b* + (a+b)*a(a+b)* and (a+b)* both generate the same language, the set of all strings over {a,b}. Why? The only strings not generated by (a+b)*a(a+b)* are strings made completely of b's. The expression b* + (a+b)*a(a+b)* generates the union of the two languages 1) strings made of 0 or more b's, and 2) strings containing at least one a. This union contains all strings over {a,b}.

How would we write a regular expression for strings that contain at least two a's? Note that it is not necessary that the two a's be contiguous. One such regular expression is (a+b)*a(a+b)*a(a+b)*. Another is b*ab*a(a+b)*. The difference in these two regular expressions becomes obvious when you attempt to generate a particular string using the regular expression. How would you generate babaaba? With the first expression there are six different ways, but with the second expression there is only one way. The required a's in the second expression must be the first two a's of the string.

Two regular expressions are equivalent if they generate the same set of strings. Here are some more regular expressions for the language of strings containing at least two a's: (a+b)*ab*ab* and b*a(a+b)*ab*. In the first one the required a's are the last two a's of the string and in the second one the required a's are the first and the last.

How would we write a regular expression for strings containing exactly two a's? b*ab*ab*; it allows exactly two a's and zero or more b's

How about at least one a and at least one b? Note that whatever regular expression we choose must be able to generate both ab and ba. An expression that does the job:

If a language is finite, a regular expression for that language is just an OR of all the words in the language. For example, if L = {ab, aba, bb} then a regular expression for L is ab + aba + bb. If the language includes

, we add it to the regular expression as in

+ ab + aba + bb.

We can concatenate two languages in a similar way to concatenating strings, except that we pair up and concatenate all possible strings. If S = {a, aa, ba} and T = {bab, bb}, then ST contains all strings that begin with an element of S and end with an element of T (with nothing in between.) ST = {abab, abb, aabab, aabb, babab, babb} (not listed in lexicographic order.) A regular expression for ST is (a + aa + ba)(bab + bb).

Every regular expression generates some language but not every language is generated by a regular expression. Every finite language can be generated by a regular expression but many infinite languages cannot.

Figuring out what language is generated by a regular expression can be tricky. Try these:

Please note that the Non-Deterministic Finite Automaton (NFA) for the same regular expression, ab*, is given below which looks simpler. Why? Because in order to meet the definition of FA there should be one transition for every element of

from each state, where as the NFA definition allows to have only a subset of them. .

The above notation for FA and NFA is given in Daniel I. Cohen, 1997, Introduction to Computer Theory , 2nd Edition. Finite automata can be drawn with different notations. Please note that some books show the start state with an incoming arrow that is not attached to any other state (instead of a � sign). Accept states or final states are often drawn as double concentric circles in some books (instead of a + sign). Thus, the NFA for ab* would look as given below:

A Finite Automaton for (a+b)* or (a U b)* or (a|b)* is given below in Cohen's Notaion.

The same Finite Automaton for (a+b)* or (a U b)* or (a|b)* can be drawn as shown below:

(a+b)* = { ^, a, b, aa, ab, ba, bb, aaa, aba, aab, abb, baa, bba, bab, bbb, . . . }. That is, (a+b)* denotes a set that includes every string that can be made out of a & b. In other words, all possible combinations of a's and b's are in (a+b)* or (a|b)* or (a U b)*.

We draw another finite automaton which accepts all strings from a*bb*aa*(ba*bb*aa*)*.

Please note that some books show the start state with an incoming arrow that is not attached to any other state (instead of a � sign). Accept states or final states are often drawn as double concentric circles in some books (instead of a + sign). States are given names (0,1,2) to make discussions of the behavior of the machine easier. Transitions are shown as labeled arcs from one state to another or from a state back to itself. In the above, if the machine is in the start state 0 and it reads an a, the transition leaves state 0 and reenters state 0. If it reads a b from state 0, the machine enters state 1.

It is common for each state to have a transition leaving it for each letter of the alphabet. The above machine, when given the string bbabba will end up in state 2 and accept the string. If it reads string abbbab it ends up in state 0 and does not accept the string. We say that the machine rejects abbbab.

Note that the author uses a slightly different schema for drawing finite automata. He puts a minus sign in the start state and plus signs in the accept states.

The set of strings accepted by a finite automaton is referred to as the language accepted by the finite automaton. We might describe a finite automaton as a language recognizer whereas a regular expression is a language generator.

For each finite automaton there is a regular expression that defines the same language. Later we will learn an algorithm for determining the regular expression, but sometimes we can figure it out using our common sense. Look back at the above finite automaton. A regular expression corresponding to that machine is a*bb*aa*(ba*bb*aa*)*. Note that the portion before the parentheses moves you from the start state to the accept state. The portion in the parentheses moves from the accept state back to the accept state and this expression is "starred" because you can repeat this cycle as many times as necessary.

Another way to represent a finite automaton is with a transition table. Here is the table for the above machine. The rows correspond to states, the columns correspond to characters from the alphabet, and the cell contents correspond to the transitions of the machine.

A drawing of a finite automaton (FA) is easier for a human to understand than a table, but implementing a machine with a computer program requires storing the finite automaton's transitions in a table. There is not just one FA, NFA or DFA for a given language; usually there are several alternatives. You may like to study Finite Automata and Regular Languages in details.

[1] Cohen, D. Introduction to Computer Theory. (2nd. Ed.). John Wiley, 1996.
[21 Hopcroft, J., Motwani, R., and Ullman, J. Introduction to Automate Theory, Languages, and Computation. (2nd. Ed) Pearson Education, 2007.
[3] Sipser, M. Introduction to the Theory of Computation, (3rd Ed.) Cengage Learning, 2012.
[4] Rodger, S, H,, . Bressler, B., Finley, T., and Reading, S. Turning automata theory into a hands-on course. In Thirty-seventh SIGCSE Technical Symposium on Computer Science Education, pp. 379-383. SIGCSE, March 2006.

Mathematical Foundations of Computer Science

Introduction

Overview

Examples

Recursive Definitions

More Examples

REGULAR EXPRESSIONS

State	a	b
0	0	1
1	2	1
2	2	0