| Universal Machine |
For Alan Turing's test devised to determine the quality of an artificial intelligence, see Turing test.

Turing machines are extremely basic symbol-manipulating devices which, despite their simplicity, can be adapted to simulate the logic of any computer that could possibly be constructed. They were described in 1936 by Alan Turing. Though they were intended to be technically feasible, Turing machines were not meant to be a practical computing technology, but a thought experiment about the limits of mechanical computation; thus they were not actually constructed. Studying their abstract properties yields many insights in computer science and complexity theory.

A Turing machine that is able to simulate any other Turing machine is called a universal Turing machine ('''UTM''', or simply a '''universal machine'''). A more mathematically oriented definition with a similar "universal" nature was introduced by Alonzo Church, whose work on lambda calculus intertwined with Turing's in a formal theory of computation known as the Church–Turing thesis. The thesis states that Turing machines indeed capture the informal notion of effective method in logic and mathematics, and provide a precise definition of an algorithm or 'mechanical procedure'.

SINGLE-TAPE MACHINES

Informal description

The concept of the Turing machine is based on the idea of a person executing a well-defined procedure by changing the contents of an unlimited paper tape, which is divided into squares that can each contain one of a finite set of symbols. The person needs to remember one of a finite set of states, and the procedure is formulated in very basic steps of the form "If your state is 42 and the symbol you see is a '0' then replace this with a '1', move one symbol to the right, and assume state 17 as your new state."

A Turing machine is equivalent to a pushdown automaton made more powerful by relaxing the last-in-first-out requirement of its stack.
(Interestingly, this seemingly minor relaxation enables the Turing machine to perform such a wide variety of computations that it can serve as a model for the computational capabilities of all modern computer software.)

More precisely, a Turing machine consists of:

# A ''tape'' which is divided into cells, one next to the other. Each cell contains a symbol from some finite alphabet. The alphabet contains a special ''blank'' symbol (here written as '0') and one or more other symbols. The tape is assumed to be arbitrarily extensible to the left and to the right, i.e., the Turing machine is always supplied with as much tape as it needs for its computation. Cells that have not been written to before are assumed to be filled with the blank symbol.
# A ''head'' that can read and write symbols on the tape and move left and right one (and only one) step at a time.
# A ''state register'' that stores the state of the Turing machine. The number of different states is always finite and there is one special ''start state'' with which the state register is initialized.
# An ''action table'' (or ''transition function'') that tells the machine what symbol to write, how to move the head ('L' for one step left, and 'R' for one step right) and what its new state will be, given the symbol it has just read on the tape and the state it is currently in. If there is no entry in the table for the current combination of symbol and state then the machine will halt.

Note that every part of the machine is finite; it is the potentially unlimited amount of tape that gives it an unbounded amount of storage space.

Example

The following Turing machine has an alphabet {'0', '1'}, with 0 being the blank symbol. It expects a series of 1s on the tape, with the head initially on the leftmost 1, and doubles the 1s with a 0 in between, i.e., "111" becomes "1110111". The set of states is {s1, s2, s3, s4, s5} and the start state is s1. The action table is as follows.

 Old state   Read symbol   Written symbol   Move   New state
 ---------   -----------   --------------   ----   ---------
 s1          1             0                R      s2
 s2          1             1                R      s2
 s2          0             0                R      s3
 s3          0             1                L      s4
 s3          1             1                R      s3
 s4          1             1                L      s4
 s4          0             0                L      s5
 s5          1             1                L      s5
 s5          0             1                R      s1

A computation of this Turing machine might for example be as follows (the position of the head is indicated by square brackets around the cell):

 Step   State   Tape
 ----   -----   ----
 01     s1      [1]1
 02     s2      0[1]
 03     s2      01[0]
 04     s3      010[0]
 05     s4      01[0]1
 06     s5      0[1]01
 07     s5      [0]101
 08     s1      1[1]01
 09     s2      10[0]1
 10     s3      100[1]
 11     s3      1001[0]
 12     s4      100[1]1
 13     s4      10[0]11
 14     s5      1[0]011
 15     s1      11[0]11
 -- halt --

The behavior of this machine can be described as a loop: it starts out in s1, replaces the first 1 with a 0, then uses s2 to move to the right, skipping over 1s and the first 0 encountered. s3 then skips over the next sequence of 1s (initially there are none) and replaces the first 0 it finds with a 1. s4 moves back to the left, skipping over 1s until it finds a 0 and switches to s5. s5 then moves to the left, skipping over 1s until it finds the 0 that was originally written by s1. It replaces that 0 with a 1, moves one position to the right and enters s1 again for another round of the loop. This continues until s1 finds a 0 (this is the 0 in the middle of the two strings of 1s) at which time the machine halts.

Formal definition

More formally, a (one-tape) Turing machine is usually defined as a 6-tuple, where
Definitions in the literature sometimes differ slightly, to make arguments or proofs easier or clearer, but this is always done in such a way that the resulting machine has the same computational power. For example, changing the set of head moves from {L, R} to {L, R, S}, where ''S'' would allow the machine to stay on the same tape cell instead of moving left or right, does not increase the machine's computational power.

MULTI-TAPE MACHINES

In practical analysis, various types of multi-tape Turing machines are often used. Multi-tape machines are similar to single-tape machines, but there is some constant number ''k'' of independent tapes. Each tape has a separate read/write head. The transition rules depend on the symbols underneath each of the ''k'' heads at once.

This model intuitively seems much more powerful than the single-tape model, but any multi-tape machine, no matter how large ''k'' is, can be simulated by a single-tape machine using only quadratically more computation time (Papadimitriou 1994, Thm. 2.1). Thus, multi-tape machines cannot calculate any more functions than single-tape machines, and none of the robust complexity classes (such as polynomial time) are affected by a change between single-tape and multi-tape machines.

Formal definition

A ''k''-tape Turing machine can be described as a 6-tuple, where
Machines with input and output

It is difficult to study sublinear space complexity on multi-tape machines with the traditional model, because an input of size ''n'' already takes up space ''n''. Thus, to study small DSPACE classes, we must use a different model. In some sense, if we never "write to" the input tape, we do not want to charge ourselves for this space. And if we never "read from" our output tape, we do not want to charge ourselves for this space either.

We solve this problem by introducing a ''k''-string Turing machine with input and output. This is the same as an ordinary ''k''-string Turing machine, except that the transition function is restricted so that the input tape can never be changed, and so that the output head can never move left. This model allows us to define deterministic space classes smaller than linear.

Turing machines with input and output also have the same time complexity as other Turing machines; in the words of Papadimitriou 1994 (Prop. 2.2):

:For any ''k''-string Turing machine ''M'' operating within time bound ''f(n)'' there is a ''(k+2)''-string Turing machine ''M''′ with input and output, which operates within time bound ''O(f(n))''.

''k''-string Turing machines with input and output are used in the formal definition of the complexity resource DSPACE in, for example, Papadimitriou 1994 (Def. 2.6).

DETERMINISTIC AND NON-DETERMINISTIC TURING MACHINES

If the action table has at most one entry for each combination of symbol and state then the machine is a deterministic Turing machine (DTM). If the action table contains multiple entries for a combination of symbol and state then the machine is a '''non-deterministic Turing machine''' (NDTM). The two are computationally equivalent, that is, it is possible to turn any NDTM into a DTM (and ''vice versa'').

UNIVERSAL TURING MACHINES

Every Turing machine computes a certain fixed partial computable function from the input strings over its alphabet.
In that sense it behaves like a computer with a fixed program. However, we can encode the action table of any Turing machine in a string. Thus we can construct a Turing machine that expects on its tape a string describing an action table followed by a string describing the input tape, and computes the tape that the encoded Turing machine would have computed. Turing described such a construction in some detail in his 1936 paper. In 1947, Turing said:

:It can be shown that a single special machine of that type can be made to do the work of all. It could in fact be made to work as a model of any other machine. The special machine may be called the universal machine.

This was, perhaps, the seminal theoretical idea behind the operating system, a program that runs other programs in a controlled way: it showed that such a program can exist, and so made it sensible to invest in building a practical one.

With this encoding of action tables as strings, it becomes possible in principle for Turing machines to answer questions about the behaviour of other Turing machines. Most of these questions, however, are undecidable, meaning that the function in question cannot be calculated mechanically. For instance, the problem of determining whether any particular Turing machine will halt on a particular input, or on all inputs, known as the halting problem, was shown to be, in general, undecidable in Turing's original paper. Rice's theorem shows that any non-trivial question about the behaviour or output of a Turing machine is undecidable.

If we define "universal Turing machine" to include any Turing machine that simulates some Turing-complete computational model, not just Turing machines that directly simulate other Turing machines, a universal Turing machine can be fairly simple, using just a few states and a few symbols. For example, only 2 states are needed, since a 2×18 (meaning 2 states, 18 symbols) universal Turing machine is known.
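The encoding idea in this section, an action table written out as a string and handed to one general-purpose simulator together with an input tape, can be illustrated with a short Python sketch. This is an ordinary program rather than a Turing machine, so it only mirrors the idea and is not Turing's construction. The text encoding (one rule per line, in the order old state, read symbol, written symbol, move, new state) and the names `parse_table`, `simulate` and `DOUBLER` are assumptions made for this illustration. The encoded table is the doubling machine from the single-tape example.

```python
from collections import defaultdict

BLANK = "0"  # blank symbol, as in the single-tape example

# Assumed encoding: "old_state read write move new_state", one rule per line.
DOUBLER = """
s1 1 0 R s2
s2 1 1 R s2
s2 0 0 R s3
s3 0 1 L s4
s3 1 1 R s3
s4 1 1 L s4
s4 0 0 L s5
s5 1 1 L s5
s5 0 1 R s1
"""


def parse_table(text):
    """Decode the string form into {(state, symbol): (write, move, new_state)}."""
    table = {}
    for line in text.strip().splitlines():
        state, read, write, move, new_state = line.split()
        table[(state, read)] = (write, move, new_state)
    return table


def simulate(encoded_table, tape_input, start="s1", max_steps=10_000):
    """Run the encoded machine on tape_input and return the final tape."""
    table = parse_table(encoded_table)
    # Cells never written to read as blank, so the tape is unbounded both ways.
    tape = defaultdict(lambda: BLANK, enumerate(tape_input))
    state, head = start, 0
    for _ in range(max_steps):
        key = (state, tape[head])
        if key not in table:  # no entry for this combination: the machine halts
            break
        write, move, state = table[key]
        tape[head] = write
        head += 1 if move == "R" else -1
    else:
        raise RuntimeError("step limit reached without halting")
    cells = range(min(tape), max(tape) + 1)
    # Trimming surrounding blanks is a display choice, not part of the model.
    return "".join(tape[i] for i in cells).strip(BLANK)


print(simulate(DOUBLER, "111"))
```

The step limit is a practical safeguard only: since halting is undecidable in general, the simulator cannot know in advance whether an arbitrary encoded machine will stop.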
For some time, the smallest known universal Turing machines were ones that simulated a computational model called a tag system. Stephen Wolfram reports in his book ''A New Kind of Science'' a smaller universal Turing machine with 2 states and just 5 symbols, which emulates a cellular automaton also known to be universal, making this the simplest known universal Turing machine.

A universal Turing machine is Turing-complete. It can calculate any recursive function, decide any recursive language, and accept any recursively enumerable language. According to the Church–Turing thesis, the problems solvable by a universal Turing machine are exactly those problems solvable by an ''algorithm'' or an ''effective method of computation'', for any reasonable definition of those terms.

An abstract version of the universal Turing machine is the universal function, a computable function which can be used to calculate any other computable function. The UTM theorem proves the existence of such a function.

COMPARISON WITH REAL MACHINES

It is often said that Turing machines, unlike simpler automata, are as powerful as real machines, and are able to execute any operation that a real program can. What is missed in this statement is that almost any particular program running on ''a particular machine'' is in fact nothing but a deterministic finite automaton, since the machine it runs on can only be in finitely many ''configurations''. Turing machines would actually only be equivalent to a machine that had an unlimited amount of storage space. We might ask, then, why Turing machines are useful models of real computers. There are a number of ways to answer this:

# Anything a real computer can compute, a Turing machine can also compute. Thus, a statement about the limitations of Turing machines will also apply to real computers.
# The difference lies only with the ability of a Turing machine to manipulate an unbounded amount of data. However, given a finite amount of time, a Turing machine (like a real machine) can only manipulate a finite amount of data.
# Like a Turing machine, a real machine can have its storage space enlarged as needed, by acquiring more disks or other storage media. If the supply of these runs short, the Turing machine may become less useful as a model. But the fact is that neither Turing machines nor real machines need astronomical amounts of storage space in order to perform useful computation. The processing time required is usually much more of a problem.
# Real machines are much more complex than a Turing machine. For example, a Turing machine describing an algorithm may have a few hundred states, while the equivalent deterministic finite automaton on a given real machine has quadrillions.
# Turing machines describe algorithms independently of how much memory they use. There is a maximum to the amount of memory that any machine which we know of has, but this limit can rise arbitrarily in time. Turing machines allow us to make statements about algorithms which will (theoretically) hold forever, regardless of advances in ''conventional'' computing machine architecture.
# Turing machines simplify the statement of algorithms. Algorithms running on Turing-equivalent abstract machines are usually more general than their counterparts running on real machines, because they have arbitrary-precision data types available and never have to deal with unexpected conditions (including, but not limited to, running out of memory).

One way in which Turing machines are a poor model for programs is that many real programs, such as operating systems and word processors, are written to receive unbounded input over time, and therefore do not halt. Turing machines do not model such ongoing computation well (but can still model portions of it, such as individual procedures).

Another limitation of Turing machines is that they do not model the strengths of a particular arrangement well.
For instance, modern computers are actually instances of a more specific form of computing machine, known as the random access machine. The primary difference between this machine and the Turing machine is that the Turing machine uses an infinite tape, while the random access machine uses a numerically indexed sequence (typically an integer field). The upshot of this distinction is that there are computational optimizations that can be performed based on the memory indices, which are not possible in a general Turing machine; thus when Turing machines are used as the basis for bounding running times, a 'false lower bound' can be proven on certain algorithms' running times (due to the false simplifying assumption of a Turing machine). An example of this is counting sort, which seemingly violates the Ω(n log n) lower bound on sorting algorithms (a bound that strictly applies only to comparison-based sorts).

MODELS EQUIVALENT TO THE TURING MACHINE MODEL

Many machines that might be thought to have more computational capability than a simple universal Turing machine can be shown to have no more power (Hopcroft and Ullman p. 159, cf. Minsky). They might compute faster, perhaps, or use less memory, or their instruction set might be smaller, but they cannot compute more powerfully (i.e. more mathematical functions). (Recall that the Church–Turing thesis ''hypothesizes'' this to be true: that anything that can be "computed" can be computed by some Turing machine.)

At the other extreme, some very simple models turn out to be Turing-equivalent, i.e. to have the same computational power as the Turing machine model. A prime example (see Post-Turing machine) is the model introduced by Emil Post in a paper received for publication in October 1936. (This was several months after Turing's paper was received in May 1936, although the latter was not published until January 1937.)
Post described his system, called ''Formulation 1'', as a process in which there is a given two-way infinite sequence of boxes, each of which is either marked or unmarked (with finitely many initially marked), and a worker follows a finite set of numbered instructions to move among and mark or unmark the boxes. The worker is to start at a specified box and follow the instructions (which are numbered consecutively as 1, 2, 3, ...) starting with instruction 1, the ''i''th instruction being one of the following types:
This extremely simple model can emulate any Turing machine, and although ''Formulation 1'' does not use the word "program" or "machine", it is effectively a formulation of a very primitive programmable computer and associated programming language, with the boxes acting as an unbounded bitstring memory, and the set of instructions constituting a program.

Some alternative constructions include those described by Hopcroft and Ullman, Chapters 7.5 and 7.8:
Minsky (cf. Chapters 11 and 14) describes a computational model "similar to computers" that has been shown to be equivalent in computational power to a Turing machine. This "Minsky machine" has only two instructions:

:(1) Add 1 to the number in register a, and go to the next instruction,
:(2) If the number in a is not zero then subtract 1 from a and go to the next instruction, otherwise go to the ''n''th instruction.

Minsky then shows that the following are equivalent to Turing machines:
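As an illustration of the two-instruction register model itself, here is a minimal Python sketch of an interpreter for it. The tuple encoding of instructions, the 0-based instruction numbering, the use of named registers, and the convention that the program halts when control runs past the last instruction are all assumptions made for this sketch, not details taken from Minsky.

```python
def run_minsky(program, registers, max_steps=100_000):
    """Interpret a two-instruction register program (hypothetical encoding).

    ("inc", r)     add 1 to register r, then go to the next instruction
    ("dec", r, n)  if r is non-zero, subtract 1 and go to the next
                   instruction; otherwise jump to instruction n
    Instructions are numbered from 0; running past the end halts.
    """
    regs = dict(registers)  # work on a copy; missing registers count as 0
    pc = 0
    for _ in range(max_steps):
        if pc >= len(program):
            return regs
        op = program[pc]
        if op[0] == "inc":
            regs[op[1]] = regs.get(op[1], 0) + 1
            pc += 1
        elif op[0] == "dec":
            _, reg, target = op
            if regs.get(reg, 0) != 0:
                regs[reg] -= 1
                pc += 1
            else:
                pc = target
        else:
            raise ValueError(f"unknown instruction {op!r}")
    raise RuntimeError("step limit reached without halting")


# Example program: move the contents of register a into register b.
# Register z is never incremented, so ("dec", "z", 0) acts as "go to 0".
ADD = [
    ("dec", "a", 3),  # 0: if a == 0, jump past the end (halt)
    ("inc", "b"),     # 1: otherwise one unit has been taken from a; add it to b
    ("dec", "z", 0),  # 2: unconditional jump back to instruction 0
]

print(run_minsky(ADD, {"a": 2, "b": 3}))
```

The example shows how an unconditional jump, which the model does not provide directly, can be obtained by testing a register that always holds zero.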