Control table: Difference between revisions

Content deleted Content added
One-dimensional tables: Get to the point; accuracy
top: remove uncited and arguable claim
Tags: Mobile edit Mobile app edit Android app edit App section source
 
(46 intermediate revisions by 2 users not shown)
Line 1:
{{Short description |Data table used to control program flow}}
{{Refimprove|date=February 2009}}
[[File:Control table.png|thumb|220px|ThisControl simpletable control tablethat directs program flow according to the value of thea single input variablevalue. Each table entry holds a possible input value to be tested for equality (implied) and a relevant subroutinereference to performan inassociated the action columnfunction. The name of the subroutine could be replaced by a relative subroutine number if pointers are not supported]]
A '''control table''' is a table [[data structure]] (i.e. [[Array (data structure)|array]] of [[Record (computer science)|record]]s) used to direct the [[control flow]] of a [[computer program]]. [[Software]] that uses a control table is said to be ''table-driven''.<ref>''Programs from decision tables'', Humby, E., 2007,Macdonald, 1973 ... Biggerstaff, Ted J. Englewood Cliffs, NJ : Prentice-Hall {{ISBN|0-444-19569-6}}</ref><ref>{{Cite web |url=http://www.dkl.com/wp-content/uploads/2016/05/DataKinetics-Table-Driven-Design.pdf |title=Archived copy |access-date=17 May 2016 |archive-date=10 June 2016 |archive-url=https://web.archive.org/web/20160610160908/http://www.dkl.com/wp-content/uploads/2016/05/DataKinetics-Table-Driven-Design.pdf |url-status=dead }}</ref> InA somecontrol cases,table controlencodes tablesboth canthe be[[Parameter specific(computer implementationsprogramming)|parameters]] ofto a [[finite-stateconditional machine(programming)|finite-state-machineconditional expression]]-based and a [[automata-basedFunction (computer programming)|function]]. If[[reference there(computer arescience)|reference]]. severalAn hierarchical[[interpreter levels(computing)|interpreter]] ofprocesses controla table theyby mayevaluating behavethe inconditional expression for input data and invoking the selected function. Using a mannercontrol equivalenttable tocan [[hierarchicalreduce statethe machine|UMLneed statefor machine]]srepetitive code that implements the same logic.
 
In general, the mapping of input parameters can be via any data structure. A common data structure is the [[lookup table |lookup]] which provides relatively high performance but at a relatively high memory footprint. An [[associative array]] can minimize memory use at the cost of more lookup time.
Control tables often have the equivalent of [[conditional (programming)|conditional expressions]] or [[subroutine|function]] [[reference (computer science)|references]] embedded in them, usually implied by their relative column position in the [[association list]]. Control tables reduce the need for programming similar [[data structures|structures]] or program statements over and over again. The two-dimensional nature of most tables makes them easier to view and update than the one-dimensional nature of program code.
 
How the associated behavior is referenced varies. Some languages provide a direct function reference (i.e. [[pointer (computer programming)|pointer]]) that can be used to invoke a function directly, but some languages do not. Some languages provide for [[goto |jumping]] to a ___location (i.e.[[label (programming language)|label]]). As a fallback, any language allows for mapping input to an index that can then be used to branch to a particular part of the code.
In some cases, non-programmers can be assigned to maintain the content of control tables. For example, if a user-entered search phrase contains a certain phrase, a URL (web address) can be assigned in a table that controls where the search user is taken. If the phrase contains "skirt", then the table can route the user to "www.shopping.example/catalogs/skirts", which is the skirts product catalog page. (The example URL doesn't work in practice). Marketing personnel may manage such a table instead of programmers.
 
A control table is often used as part of a higher-level algorithm. It can control the [[main loop]] of an [[event-driven programming |event-driven program]]. A relatively advanced use is instructions for a [[virtual machine]] {{endash}} similar to [[bytecode]] but usually with operations implied by the table structure itself insteaad of encoded in the table data.
==Use==
Typical uses include:
* Transform an input value to an [[associative array |index]], for branching or [[pointer (computer programming)|pointer]] [[lookup table |lookup]]
* Transform an input value to a program name, relative [[subroutine]] number, [[label (programming language)|program label]] or program [[offset (computer science)|offset]], to alter [[control flow]]
* Control a [[main loop]] in [[event-driven programming]] using a [[control variable (programming)|control variable]] for [[state transition]]s
* Control the program cycle for [[online transaction processing]] applications
 
==Data structure==
A relatively advanced use as instructions for a [[virtual machine]] processed by an interpreter {{endash}} similar to [[bytecode]] but usually with operations implied by the table structure itself.
A table can be structured in a variety of ways. It may have one or multiple dimensions and be of fixed or [[variable length code |variable length]]. The structure of the table may be similar to a [[multimap (data structure)|multimap]] [[associative array]], where a data value (or combination of data values) may be mapped to one or more functions to be performed. Often, the structure allows tabular data is [[software portability |portable]] between [[computer platform]]s as long as a compatible interpreter exists on each platform.
 
===Trivial hash function===
==Table structure==
A relatively simple implementation consists of a [[sparse file |sparse]], one-dimensional array of values. The index space is fully covered by the array such that lookup involves indexing into the array by the input value, and a value is found even for an index that is not intended to be used; preventing an error that might otherwise occur for an unused index value. The lookup is achieved in [[constant time]]; without searching. In most [[computer architecture |architecture]]s, this can be accomplished in two or three [[machine instruction]]s. The technique is known as a "[[trivial hash function]]" or, when used specifically for branch tables, "[[double dispatch]]".
A table can be structured in a variety of ways. It may one or multiple dimensions and be of fixed or [[variable length code |variable length]]. The structure of the table may be similar to a [[multimap (data structure)|multimap]] [[associative array]], where a data value (or combination of data values) may be mapped to one or more functions to be performed. Often, the structure allows tabular data is [[software portability |portable]] between [[computer platform]]s as long as a compatible interpreter exists on each platform.
 
To be feasible, the range of index values should be relatively small. In the example below, the control table is indexed by ASCII value so it has 256 entries; an entry for each ASCII value; omitted entries are shown as '...'. Only the values for the letter A, D, M, and S are important. All other values are uninteresting and set to 0. A two-byte index would require a minimum of 65,536 entries to handle all input possibilities {{endash}} which might consume more memory than is considered worth the value it provides.
===Sparse one-dimensional array===
 
{| class="wikitable floatright" style="text-align:center; "
{| class="wikitable" style="text-align:center; "
|+ Table that map each ASCII value (A,D,M,S) to a subroutine index (1,4,3,2) via a one-dimensional array<br/>{{nobold|(omitted values are shown as '...')}}
|+
|- style="vertical-align:bottom;"
! index <br>(ASCII) !! Array
|-
| 0 ||style="background:lightblue;"| 0
|-
| ... ||style="background:lightblue;"| 0
|-
| ...{{hex2dec|41}} (A) ||style="background:lightblue;"| 001
|-
| {{hex2dec|41}} (A)... ||style="background:lightblue;"| '''01'''0
|-
| ...{{hex2dec|44}} (D) ||style="background:lightblue;"| 004
|-
| {{hex2dec|44}} (D)... ||style="background:lightblue;"| '''04'''0
|-
| ...{{hex2dec|4D}} (M) ||style="background:lightblue;"| 003
|-
| {{hex2dec|4D}} (M)... ||style="background:lightblue;"| '''03'''0
|-
| ...{{hex2dec|53}} (S) ||style="background:lightblue;"| 002
|-
| {{hex2dec|53}} (S)... ||style="background:lightblue;"| '''02'''0
|-
| ...255 ||style="background:lightblue;"| 000
|}
A relatively simple implementation consists of a sparse, one-dimensional array of values. The index space is fully covered by the array such that lookup involves indexing into the array by the input value, and a value is found even for an index that is not intended to be used; preventing an error that might otherwise occur for an unused index value. The lookup is achieved in [[constant time]]; without searching. In most [[computer architecture |architecture]]s, this can be accomplished in two or three [[machine instruction]]s. The technique is known as a "[[trivial hash function]]" or, when used specifically for branch tables, "[[double dispatch]]". To be feasible, the range of index values should be relatively small. In the example, the table is indexed by ASCII character value so it has 256 entries; an entry for each ASCII value. Only the values for the letter A, D, M, and S are important. All other values are uninteresting and set to 0.
 
In automata-based programming and [[pseudoconversational transaction]] processing, if the number of distinct program states is small, a "dense sequence" control variable can be used to efficiently dictate the entire flow of the main program loop.
 
===Branch table===
A two byte raw data value would require a ''minimum'' table size of 65,536 bytes – to handle all input possibilities – whilst allowing just 256 different output values. However, this direct translation technique provides an extremely fast [[data validation|validation]] & conversion to a (relative) subroutine pointer if the [[heuristic]]s, together with sufficient fast access memory, permits its use.
A [[branch table]] is a one-dimensional array of [[machine code]] [[branch (computer science)|branch/jump]] instructions to effect a [[multiway branch]] to a program label when branched into by an immediately preceding, and indexed branch. It is sometimes generated by an [[optimizing compiler]] to execute a [[switch statement]] – provided that the input range is small and dense, with few gaps.<ref>http://www.netrino.com/node/137</ref> (as created by the previous array example)
 
Although quite compact – compared to the multiple equivalent <code>If</code> statements – the branch instructions still carry some redundancy, since the branch [[opcode]] and condition code mask are repeated alongside the branch offsets. Control tables containing only the offsets to the program labels can be constructed to overcome this redundancy (at least in assembly languages) and yet requiring only minor execution time [[computational overhead |overhead]] compared to a conventional branch table.
===Branch tables===
{{Main|Branch table}}
A [[branch table]] is a one-dimensional 'array' of contiguous [[machine code]] [[branch (computer science)|branch/jump]] instructions to effect a [[multiway branch]] to a program label when branched into by an immediately preceding, and indexed branch. It is sometimes generated by an [[optimizing compiler]] to execute a [[switch statement]] – provided that the input range is small and dense, with few gaps.<ref>http://www.netrino.com/node/137</ref> (as created by the previous array example)
 
===Decision table===
Although quite compact – compared to the multiple equivalent <code>If</code> statements – the branch instructions still carry some redundancy, since the branch [[opcode]] and condition code mask are repeated alongside the branch offsets. Control tables containing only the offsets to the program labels can be constructed to overcome this redundancy (at least in assembly languages) and yet requiring only minor execution time [[computational overhead|overhead]] compared to a conventional branch table.
Often, a control table implements a [[decision table]] which like a control table contains (often implied) [[propositional formula |propositions]], and one or more associated actions.
 
A control table can act like a [[switch statement]] or more generally as a nested [[if-then-else]] construct that includes [[logical predicate]]s (using [[Boolean algebra (logic)|boolean]] style [[logical conjunction |AND]]/[[logical disjunction |OR]] conditions) for each case. Such as control table provides for a language-independent implementation of what otherwise is a language-dependent construct. The table embodies the [[essence]] of a program; stripped of programming language syntax and platform dependent aspects; condensed to data and implied logic. The meaning of the table includes implied operations instead of being explicit as in a more typical a [[programming paradigm]].
===Multi-dimensional tables===
More usually, a control table can be thought of as a [[truth table]] or as an executable ("binary") implementation of a printed [[decision table]] (or a [[tree (data structure)|tree]] of decision tables, at several levels). They contain (often implied) [[propositional formula|propositions]], together with one or more associated 'actions'. These actions are usually performed by generic or custom-built [[subroutine]]s that are called by an "[[interpreter (computing)|interpreter]]" program. The interpreter in this instance effectively functions as a [[virtual machine]], that 'executes' the control table entries and thus provides a higher level of [[abstraction (computer science)|abstraction]] than the underlying code of the interpreter.
 
Typically, a two-dimensional control table contains value/action pairs and may additionally contain operators and [[type system |type]] information such as the ___location, size and format of input or output data, whether [[data conversion]] is required. The table may contain [[array index |indexes]] or relative or absolute [[pointer (computer programming)|pointer]]s to generic or customized primitives or functions to be executed depending upon other values in the row.
A control table can be constructed along similar lines to a language dependent [[switch statement]] but with the added possibility of testing for combinations of input values (using [[boolean algebra (logic)|boolean]] style [[logical conjunction|AND]]/[[logical disjunction|OR]] conditions) and potentially calling multiple [[subroutine]]s (instead of just a single set of values and 'branch to' program labels). (The switch statement construct in any case may not be available, or has confusingly differing implementations in [[high-level programming language|high level languages]] (HLL). The control table concept, by comparison, has no intrinsic language dependencies, but might nevertheless be ''implemented'' differently according to the available data definition features of the chosen programming language.)
 
The type of values used to in a control table depends on the [[computer language]] used for the interpreter. [[Assembly language]] provides the widest scope for [[data types]] including [[machine code]] for lookup values. Typically, a control table contains values for each possible matching class of input together with a corresponding pointer to an action function. For a language without [[pointer (computer programming)|pointer]] support, the table can encode an index which can imply an offset value that can be used to accomplish the same as a pointer.
===Table content===
A control table essentially embodies the '[[essence]]' of a conventional program, stripped of its programming language syntax and platform dependent components (e.g. {{mono|IF/THEN DO.., FOR.., DO WHILE.., SWITCH, GOTO, CALL}}) and 'condensed' to its variables (e.g. input1), values (e.g. 'A','S','M' and 'D'), and subroutine identities (e.g. 'Add','subtract,..' or #1, #2,..). The structure of the table itself typically ''implies'' the (default) logical operations involved – such as 'testing for equality', performing a subroutine and 'next operation' or following the default sequence (rather than these being explicitly stated within program statements – as required in other [[programming paradigm]]s).
 
==Storage==
A multi-dimensional control table will normally, as a minimum, contain value/action pairs and may additionally contain operators and [[type system|type]] information such as, the ___location, size and format of input or output data, whether [[data conversion]] (or other [[run time (program lifecycle phase)|run-time]] processing nuances) is required before or after processing (if not already implicit in the function itself). The table may or may not contain [[array index|indexes]] or relative or absolute [[pointer (computer programming)|pointer]]s to generic or customized primitives or [[subroutine]]s to be executed depending upon other values in the "row".
A control table can be stored a variety of ways. It can be stored as part of a program; for example as a [[static variable]]. It can stored in the [[file system]]; for example as [[flat file]] or in a [[database]]. It can be built dynamically by the program. For efficiency, the table should be resident in program memory when the interpreter uses it.
 
==Interpreter==
The table illustrated below applies only to 'input1' since no specific input is specified in the table.
A control table interpreter executes the operations as selected by input parameters. The table and resulting runtime behavior can be modified without modifying the interpreter. An interpreter can require [[software maintenance |maintenance]] to change its behavior, but hopefully the interpreter is designed to support future functionality via table changes instead.
 
Handler functions often are coded in the same language as the interpreter, but could be in another language provided a suitable inter-language call-linkage mechanisms exists. The choice of language for the interpreter and handler functions usually depends on how portable it needs to be across various [[Platform (computing)|platform]]s. There may be several versions of the interpreter to support [[Porting |portability]]. A subordinate control table pointer may optionally substitute for a function pointer in the action column if the interpreter supports this construct; representing a conditional drop to a lower logical level, mimicking [[structured programming]].
{| class="wikitable"
|+ conditions and actions implied by structure
! (implied) IF = || (implied) perform
|-
| value|| action
|-
| value|| action
|}
(This side-by-side pairing of value and action has similarities to constructs in [[event-driven programming]], namely 'event-detection' and 'event-handling' but without (necessarily) the [[asynchronous system|asynchronous]] nature of the event itself)
 
The variety of values that can be [[encoded]] within a control table is largely dependent upon the [[computer language]] used. [[Assembly language]] provides the widest scope for [[data types]] including (for the actions), the option of directly executable [[machine code]]. Typically a control table will contain values for each possible matching class of input together with a corresponding pointer to an action subroutine. Some languages claim not to support [[pointer (computer programming)|pointer]]s (directly) but nevertheless can instead support an index which can be used to represent a 'relative subroutine number' to perform conditional execution, controlled by the value in the table entry (e.g. for use in an optimized [[switch statement|SWITCH]] statement – designed with zero gaps i.e. a [[multiway branch]]).
 
Comments positioned above each column (or even embedded textual documentation) can render a decision table 'human readable' even after 'condensing down' (encoding) to its essentials (and still broadly in-line with the original program specification – especially if a printed decision table, [[enumeration|enumerating]] each unique action, is created before coding begins).
The table entries can also optionally contain counters to collect run-time statistics for 'in-flight' or later optimization
 
==Table ___location==
Control tables can reside in [[static variables|static]] storage, on [[auxiliary storage]], such as a [[flat file]] or on a [[database]] or may alternatively be partially or entirely built dynamically at program [[booting|initialization]] time from parameters (which themselves may reside in a table). For optimum efficiency, the table should be memory resident when the interpreter begins to use it.
 
==The interpreter and subroutines==
The interpreter can be written in any suitable programming language including a [[high level language]]. A suitably designed [[generic programming|generic]] interpreter, together with a well chosen set of generic subroutines (able to process the most commonly occurring primitives), would require additional conventional coding only for new custom subroutines (in addition to specifying the control table itself). The interpreter, optionally, may only apply to some well-defined sections of a complete application program (such as the [[main loop|main control loop]]) and not other, 'less conditional', sections (such as program initialization, termination and so on).
 
The interpreter does not need to be unduly complex, or produced by a programmer with the advanced knowledge of a compiler writer, and can be written just as any other application program – except that it is usually designed with efficiency in mind. Its primary function is to "execute" the table entries as a set of "instructions". There need be no requirement for parsing of control table entries and these should therefore be designed, as far as possible, to be 'execution ready', requiring only the "plugging in" of variables from the appropriate columns to the already compiled generic code of the interpreter. The [[instruction (computer science)|program instructions]] are, in theory, infinitely [[extensible]] and constitute (possibly arbitrary) values within the table that are meaningful only to the interpreter. The [[control flow]] of the interpreter is normally by sequential processing of each table row but may be modified by specific actions in the table entries.
 
These arbitrary values can thus be designed with [[algorithmic efficiency|efficiency]] in mind – by selecting values that can be used as direct indexes to data or [[function pointers]]. For particular platforms/[[computer language|language]], they can be specifically designed to minimize [[instruction path length]]s using [[branch table]] values or even, in some cases such as in [[Just-in-time compilation|JIT]] compilers, consist of directly executable [[machine code]] "[[Snippet (programming)|snippets]]" (or pointers to them).
 
The subroutines may be coded either in the same language as the interpreter itself or any other supported program language (provided that suitable inter-language 'Call' linkage mechanisms exist). The choice of language for the interpreter and/or subroutines will usually depend upon how portable it needs to be across various [[Platform (computing)|platform]]s. There may be several versions of the interpreter to enhance the [[Porting|portability]] of a control table. A subordinate control table pointer may optionally substitute for a subroutine pointer in the 'action' columns if the interpreter supports this construct, representing a conditional 'drop' to a lower logical level, mimicking a conventional [[Structured programming|structured program]] structure.
 
==Performance considerations==
AtEven firstthough sight,using the use ofa control tablestable wouldentails appearthe todevelopment addcost quiteand adeployment lotsize to a program's [[Computational overhead|overhead]], requiring, as it does,of an interpreter, process before the 'native' programming language statements are executed. This however is not always the case. Byby separating (or 'encapsulating') the executable coding from the logic, as expressed in the table, it can be more readily targeted to perform its function most efficiently. This may be experienced most obviously in a [[spreadsheet]] application where the underlying spreadsheet software transparently converts complex logical 'formulae' in the most efficient manner it is able, in order to display its results.
 
The examples below have been chosen partly to illustrate potential performance gains that may not only ''compensate'' significantly for the additional tier of abstraction, but also ''improve'' upon – what otherwise might have been – less efficient, less maintainable and lengthier code. Although the examples given are for a 'low level' assembly language and for the [[C (language)|C language]], it can be seen, in both cases, that very few lines of code are required to implement the control table approach and yet can achieve very significant [[constant time]] performance improvements, reduce repetitive source coding and aid clarity, as compared with [[verbose]] conventional program language constructs. See also the [[Control table#Quotations|quotations]] by [[Donald Knuth]], concerning tables and the efficiency of [[multiway branch]]ing in this article.
 
The examples below have been chosen partly to illustrate potential performance gains that may not only ''compensate'' significantly for the additional tier of abstraction, but also ''improve'' upon – what otherwise might have been – less efficient, less maintainable and lengthier code. Although the examples given are for a 'low level' assembly language and for the [[C (language)|C language]], it can be seen, in both cases, that very few lines of code are required to implement the control table approach and yet can achieve very significant [[constant time]] performance improvements, reduce repetitive source coding and aid clarity, as compared with [[verbose]] conventional program language constructs.
==Examples of control tables==
The following examples are [[arbitrary]] (and based upon just a single input for simplicity), however the intention is merely to demonstrate how control flow can be effected via the use of tables instead of regular program statements. It should be clear that this technique can easily be extended to deal with multiple inputs, either by increasing the number of columns or utilizing multiple table entries (with optional and/or operator). Similarly, by using (hierarchical) 'linked' control tables, [[structured programming]] can be accomplished (optionally using indentation to help highlight subordinate control tables).
 
==Examples==
"CT1" is an example of a control table that is a simple [[lookup table]]. The first column represents the input value to be tested (by an implied 'IF input1 = x') and, if TRUE, the corresponding 2nd column (the 'action') contains a subroutine address to perform by a [[System call|call]] (or [[goto|jump]] to – similar to a [[Switch statement|SWITCH]] statement). It is, in effect, a [[multiway branch]] with return (a form of "[[dynamic dispatch]]"). The last entry is the default case where no match is found.
===General===
CT1 is a control table that is a simple [[lookup table]]. The first column represents the input value to be tested (by an implied 'IF input = x') and, if so, the action in the corresponding 2nd column is invoked. It is like a [[multiway branch]] with return; a form of [[dynamic dispatch]].
 
:{| class="wikitable"
|+ CT1
! input !! action
! input 1!! !! [[pointer (computer programming)|pointer]]
|-
| {{mono|A}} || &rarr; ||Add
|-
| {{mono|S}} || &rarr; ||Subtract
|-
| {{mono|M}} || &rarr; ||Multiply
|-
| {{mono|D}} || &rarr; ||Divide
|-
| {{mono|?}} || &rarr; ||Default
|}
For programming languages that support pointers within [[data structure]]s alongside other data values, the above table (CT1) can be used to direct [[control flow]] to an appropriate [[subroutine]]s according to matching value from the table (without a column to indicate otherwise, equality is assumed in this simple case).
 
The next example illustrates how a similar effect can be achieved in a language that does not support pointer definitions in data structures but does support indexed branching to a function – contained within a ([[zero-based numbering|0-based]]) array of function pointers. The table (CT2) is used to extract the index (from 2nd column) to the pointer array (CT2P). If pointer arrays are not supported, a SWITCH statement or equivalent can be used to alter the control flow to one of a sequence of program labels (e.g.: case0, case1, case2, case3, case4) which then either process the input directly, or else perform a call (with return) to the appropriate function.
=== Assembly language ===
No attempt is made to optimize the lookup in coding for this first example (for [[IBM/360]] maximum 16Mb address range or [[Z/Architecture]]), and it uses instead a simple [[linear search]] technique – purely to illustrate the concept and demonstrate fewer source lines. To handle all 256 different input values, approximately 265 lines of source code would be required (mainly single line table entries) whereas multiple 'compare and branch' would have normally required around 512 source lines (the size of the [[binary file|binary]] is also approximately halved, each table entry requiring only 4 bytes instead of approximately 8 bytes for a series of 'compare immediate'/branch instructions (For larger input variables, the saving is even greater).
 
* ------------------ interpreter --------------------------------------------*
LM R14,R0,=A(4,CT1,N) Set R14=4, R15 --> table, and R0 =no. of entries in table (N)
TRY CLC INPUT1,0(R15) ********* Found value in table entry ?
BE ACTION * loop * YES, Load register pointer to sub-routine from table
AR R15,R14 * * NO, Point to next entry in CT1 by adding R14 (=4)
BCT R0,TRY ********* Back until count exhausted, then drop through
. default action ... none of the values in table match, do something else
LA R15,4(R15) point to default entry (beyond table end)
ACTION L R15,0(R15) get pointer into R15,from where R15 points
BALR R14,R15 Perform the sub-routine ("CALL" and return)
B END go terminate this program
* ------------------ control table -----------------------------------------*
* | this column of allowable EBCDIC or ASCII values is tested '=' against variable 'input1'
* | | this column is the 3-byte address of the appropriate subroutine
* v v
'''CT1''' DC C'A',AL3(ADD) START of Control Table (4 byte entry length)
DC C'S',AL3(SUBTRACT)
DC C'M',AL3(MULTIPLY)
DC C'D',AL3(DIVIDE)
N EQU (*-CT1)/4 number of valid entries in table (total length / entry length)
DC C'?',AL3(DEFAULT) default entry – used on drop through to catch all
INPUT1 DS C input variable is in this variable
* ------------------ sub-routines ------------------------------------------*
ADD CSECT sub-routine #1 (shown as separate CSECT here but might
. alternatively be in-line code)
. instruction(s) to add
BR R14 return
SUBTRACT CSECT sub-routine #2
. instruction(s) to subtract
BR R14 return
. etc..
 
'''improving the performance of the interpreter in above example '''
 
:To make a selection in the example above, the average [[instruction path length]] (excluding the subroutine code) is '4n/2 +3', but can easily be reduced, where n = 1 to 64, to a [[Linear time|constant time]] <math>O(1)\,</math> with a path length of '5' with ''zero comparisons'', if a 256 byte translate table is first utilized to create a ''direct'' index to CT1 from the raw EBCDIC data. Where n = 6, this would then be equivalent to just 3 sequential compare & branch instructions. However, where n<=64, on average it would need approximately 13 ''times'' less instructions than using multiple compares. Where n=1 to 256, on average it would use approximately 42 ''times'' less instructions – since, in this case, one additional instruction would be required (to multiply the index by 4).
 
'''Improved interpreter''' (up to '''26 times less executed instructions''' than the above example on average, where n= 1 to 64 and up to 13 times less than would be needed using multiple comparisons).
 
To handle 64 different input values, approximately 85 lines of source code (or less) are required (mainly single line table entries) whereas multiple 'compare and branch' would require around 128 lines (the size of the [[binary file|binary]] is also almost halved – despite the additional 256 byte table required to extract the 2nd index).
 
* ------------------ interpreter --------------------------------------------*
SR R14,R14 ********* Set R14=0
CALC IC R14,INPUT1 * calc * put EBCDIC byte into lo order bits (24–31) of R14
IC R14,CT1X(R14) * * use EBCDIC value as index on table 'CT1X' to get new index
FOUND L R15,CT1(R14) ********* get pointer to subroutine using index (0,4, 8 etc.)
BALR R14,R15 Perform the sub-routine ("CALL" and return or Default)
B END go terminate this program
* --------------- additional translate table (EBCDIC --> pointer table INDEX) 256 bytes----*
CT1X DC 12AL1(00,00,00,00,00,00,00,00,00,00,00,00,00,00,00,00) 12 identical sets of 16 bytes of x'00
* representing X'00 – x'BF'
DC AL1(00,'''04''',00,00,'''16''',00,00,00,00,00,00,00,00,00,00,00) ..x'C0' – X'CF'
DC AL1(00,00,00,00,'''12''',00,00,00,00,00,00,00,00,00,00,00) ..x'D0' – X'DF'
DC AL1(00,00,'''08''',00,00,00,00,00,00,00,00,00,00,00,00,00) ..x'E0' – X'EF'
DC AL1(00,00,00,00,00,00,00,00,00,00,00,00,00,00,00,00) ..x'F0' – X'FF'
* the assembler can be used to automatically calculate the index values and make the values more user friendly
* (for e.g. '04' could be replaced with the symbolic expression 'PADD-CT1' in table CT1X above)
* modified CT1 (added a default action when index = 00, single dimension, full 31 bit address)
'''CT1''' DC A(DEFAULT) index =00 START of Control Table (4 byte address constants)
PADD DC A(ADD) =04
PSUB DC A(SUBTRACT) =08
PMUL DC A(MULTIPLY) =12
PDIV DC A(DIVIDE) =16
* the rest of the code remains the same as first example
 
'''Further improved interpreter''' (up to '''21 times less executed instructions (where n>=64)''' than the first example on average and up to 42 ''times'' less than would be needed using multiple comparisons).
 
To handle 256 different input values, approximately 280 lines of source code or less, would be required (mainly single line table entries), whereas multiple 'compare and branch' would require around 512 lines (the size of the [[binary file|binary]] is also almost halved once more).
 
* ------------------ interpreter --------------------------------------------*
SR R14,R14 ********* Set R14=0
CALC IC R14,INPUT1 * calc * put EBCDIC byte into lo order bits (24–31) of R14
IC R14,CT1X(R14) * * use EBCDIC value as index on table 'CT1X' to get new index
SLL R14,2 * * '''multiply index by 4 (additional instruction)'''
FOUND L R15,CT1(R14) ********* get pointer to subroutine using index (0,4, 8 etc.)
BALR R14,R15 Perform the sub-routine ("CALL" and return or Default)
B END go terminate this program
* --------------- additional translate table (EBCDIC --> pointer table INDEX) 256 bytes----*
CT1X DC 12AL1(00,00,00,00,00,00,00,00,00,00,00,00,00,00,00,00) 12 identical sets of 16 bytes of x'00'
* representing X'00 – x'BF'
DC AL1(00,'''01''',00,00,'''04''',00,00,00,00,00,00,00,00,00,00,00) ..x'C0' – X'CF'
DC AL1(00,00,00,00,'''03''',00,00,00,00,00,00,00,00,00,00,00) ..x'D0' – X'DF'
DC AL1(00,00,'''02''',00,00,00,00,00,00,00,00,00,00,00,00,00) ..x'E0' – X'EF'
DC AL1(00,00,00,00,00,00,00,00,00,00,00,00,00,00,00,00) ..x'F0' – X'FF'
* the assembler can be used to automatically calculate the index values and make the values more user friendly
* (for e.g. '01' could be replaced with the symbolic expression 'PADD-CT1/4' in table CT1X above)
* modified CT1 (index now based on 0,1,2,3,4 not 0,4,8,12,16 to allow all 256 variations)
'''CT1''' DC A(DEFAULT) index =00 START of Control Table (4 byte address constants)
PADD DC A(ADD) =01
PSUB DC A(SUBTRACT) =02
PMUL DC A(MULTIPLY) =03
PDIV DC A(DIVIDE) =04
* the rest of the code remains the same as the 2nd example
 
=== C language ===
This example in [[C (programming language)|C]] uses two tables, the first (CT1) is a simple [[linear search]] one-dimensional lookup table – to obtain an index by matching the input (x), and the second, associated table (CT1p), is a table of addresses of labels to jump to.
<syntaxhighlight lang="c" line style="font-size: 90%;">
static const char CT1[] = { "A", "S", "M", "D" }; /* permitted input values */
static const void *CT1p[] = { &&Add, &&Subtract, &&Multiply, &&Divide, &&Default}; /* labels to goto & default*/
for (int i = 0; i < sizeof(CT1); i++) { /* loop thru ASCII values */
if (x==CT1[i]) goto *CT1p[i]; /* found --> appropriate label */
}
goto *CT1p[i+1]; /* not found --> default label */
</syntaxhighlight>
This can be made more efficient if a 256 byte table is used to translate the raw ASCII value (x) directly to a dense sequential index value for use in directly locating the branch address from CT1p (i.e. "[[index mapping]]" with a byte-wide array). It will then execute in [[constant time]] for all possible values of x (If CT1p contained the names of functions instead of labels, the jump could be replaced with a dynamic function call, eliminating the switch-like goto – but decreasing performance by the additional cost of function [[Housekeeping (computing)|housekeeping]]).
<syntaxhighlight lang="c" line style="font-size: 80%;">
static const void *CT1p[] = {&&Default, &&Add, &&Subtract, &&Multiply, &&Divide};
/* the 256 byte table, below, holds values (1,2,3,4), in corresponding ASCII positions (A,S,M,D), all others set to 0x00 */
static const char CT1x[]={
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x01', '\x00', '\x00', '\x04', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x03', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x02', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x03', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00',
'\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00', '\x00'};
/* the following code will execute in constant time, irrespective of the value of the input character (x) */
i = CT1x(x); /* extract the correct subroutine index from table CT1x using its ASCII value as an index initially */
goto *CT1p[i]; /* goto (Switch to) the label corresponding to the index (0=default, 1=Add, 2=Subtract,.) - see CT1p */
</syntaxhighlight>
 
The next example below illustrates how a similar effect can be achieved in languages that do not support pointer definitions in data structures but do support indexed branching to a subroutine – contained within a ([[zero-based numbering|0-based]]) array of subroutine pointers. The table (CT2) is used to extract the index (from 2nd column) to the pointer array (CT2P). If pointer arrays are not supported, a SWITCH statement or equivalent can be used to alter the control flow to one of a sequence of program labels (e.g.: case0, case1, case2, case3, case4) which then either process the input directly, or else perform a call (with return) to the appropriate subroutine (default, Add, Subtract, Multiply or Divide,..) to deal with it.
 
:{| class="wikitable"
|+ CT2
! input 1!! {{mono|subr #}}
|-
| {{mono|A}} || {{mono|1}}
|-
| {{mono|S}}A || {{mono|2}}1
|-
| {{mono|M}}S || {{mono|3}}2
|-
| {{mono|D}}M || {{mono|4}}3
|-
| {{mono|?}}D || {{mono|0}}4
|}
As in above examples, it is possible to very efficiently translate the potential [[ASCII]] input values (A, S, M, D or unknown) into a pointer array index without actually using a table lookup, but is shown here as a table for consistency with the first example.
 
:{| class="wikitable"
|+ CT2P
|+ CT2P<br/>{{nobold|pointer array}}
! index || array
! !! [[pointer (computer programming)|pointer]] [[Array data structure|array]]
|-
| &rarr;1 || {{mono|default}}Add
|-
| &rarr;2 || {{mono|Add}}Subtract
|-
| &rarr;3 || {{mono|Subtract}}Multiply
|-
| &rarr;4 || {{mono|Multiply}}Divide
|-
| &rarr; || {{mono|Divide}}
|-
| &rarr; || {{mono|?other}}
|}
 
MultiA two-dimensional control tablestable cancould be constructedused (i.e.to customized)support that can be 'more complex' than the above examples that might test fortesting multiple conditions on multiple inputs or performperforming more than one 'action', based on some matching criteria. An 'action' can include a pointer to another subordinate control table. The simple example below has had an ''implicit'' 'OR' condition incorporated as an extra column (to handle lower case input, however in this instance, this could equally have been handled simply by having an extra entry for each of the lower case characters specifying the same subroutinefunction identifier as the upper case characters). An extra column to count the actual run-time events for each input as they occur is also included.
 
:{| class="wikitable"
|+ CT3
! input 1!!alternate!! {{mono|subr #}} !! {{mono|count}}
|-
| {{mono|A}} ||{{mono| a}} || {{mono|1}} || {{mono|0}}
|-
| {{mono|S}} ||{{mono| s}} || {{mono|2}} || {{mono|0}}
|-
| {{mono|M}} ||{{mono| m}} || {{mono|3}} || {{mono|0}}
|-
| {{mono|D}} ||{{mono| d}} || {{mono|4}} || {{mono|0}}
|-
| {{mono|?}} ||{{mono|?}}|| {{mono|0}}|| {{mono|0}}
|}
 
The control table entries are then much more similar to conditional statements in [[procedural language]]s but, crucially, without the actual (language dependent) conditional statements (i.e. instructions) being present (the generic code is ''physically'' in the interpreter that processes the table entries, not in the table itself – which simply embodies the program logic via its structure and values).
 
In tables such as these, where a series of similar table entries defines the entire logic, a table entry number or pointer may effectively take the place of a [[program counter]] in more conventional programs and may be reset in an 'action', also specified in the table entry. The example below (CT4) shows how extending the earlier table, to include a 'next' entry (and/or including an 'alter flow' ([[branch (computer science)|jump]]) subroutinefunction) can create a [[program loop|loop]] (This example is actually not the most efficient way to construct such a control table but, by demonstrating a gradual 'evolution' from the first examples above, shows how additional columns can be used to modify behaviour.) The fifth column demonstrates that more than one action can be initiated with a single table entry – in this case an action to be performed ''after'' the normal processing of each entry ('-' values mean 'no conditions' or 'no action').
 
[[Structured programming]] or [[structured programming|"Goto-less" code]], (incorporating the equivalent of '[[do while loop|DO WHILE]]' or '[[for loop]]' constructs), can also be accommodated with suitably designed and 'indented' control table structures.
Line 313 ⟶ 145:
|
{| class="wikitable"
|+ CT4
|+ CT4<br/>{{nobold|(a complete 'program' to read input1 and process, repeating until 'E' encountered)}}
! input 1!!alternate!! subr # !! count !! jump
|-
| {{sdash}} || {{sdash}} || {{mono|5}}|| {{mono|0}} || {{sdash}}
|-
| {{mono|E}} ||{{mono| e}} || {{mono|7}} || {{mono|0}} || {{sdash}}
|-
| {{mono|A}} ||{{mono| a}} || {{mono|1}} || {{mono|0}} || {{sdash}}
|-
| {{mono|S}} ||{{mono| s}} || {{mono|2}} || {{mono|0}} || {{sdash}}
|-
| {{mono|M}} ||{{mono| m}} || {{mono|3}} || {{mono|0}} || {{sdash}}
|-
| {{mono|D}} ||{{mono| d}} || {{mono|4}} || {{mono|0}} || {{sdash}}
|-
| {{mono|?sdash}} || {{mono|?sdash}} || {{mono|0}}6 || {{mono|0}} || {{sdash}}1
|-
| {{sdash}} || {{sdash}} || {{mono|6}}|| {{mono|0}} || {{mono|1}}
|}
|
{| class="wikitable"
|+ CT4P {{nobold|pointer array}}
! index || pointer
! !! [[pointer (computer programming)|pointer]] [[array data structure|array]]
|-
| &rarr;0 || {{mono|Default}}
|-
| &rarr;1 || {{mono|Add}}
|-
| &rarr;2 || {{mono|Subtract}}
|-
| &rarr;3 || {{mono|Multiply}}
|-
| &rarr;4 || {{mono|Divide}}
|-
| &rarr;5 || {{mono|Read Input1}}
|-
| &rarr;6 || {{mono|Alter flow}}
|-
| &rarr;7 || {{mono|End}}
|}
|}
 
=== C language ===
The following example in [[C (programming language)|C]], uses two arrays that together form a table data structure. The {{code|ops}} array is a simple [[linear search]], one-dimensional array {{endash}} for obtaining the index of the item that matches the input, {{code |op}}. The {{code|handler_labels}} array contains addresses of labels to jump to. The loop tests each {{code |ops}} item against {{code |op}} and if they match, invokes {{code |goto}} for the label at the same index as the found {{code |ops}} item.
 
<syntaxhighlight lang="c" line style="font-size: 90%;">
static const char ops[] = { "A", "S", "M", "D" };
static const void *handler_labels[] = { &&Add, &&Subtract, &&Multiply, &&Divide };
for (int i = 0; i < sizeof(ops); i++) {
if (op == ops[i]) goto *handler_labels[i];
}
</syntaxhighlight>
 
This would execute faster if a 256 byte table is used to translate the ASCII input ({{code|op}}) to an index without searching. It would execute in [[constant time]] for all values of {{code |op}}. If a {{code |handler_labels}} item contained the name of a function instead of a label, the jump could be replaced with a dynamic function call, eliminating the switch-like {{code|goto}} but decreasing runtime performance.
 
<syntaxhighlight lang="c" line style="font-size: 80%;">
static const void *handler_labels[] = {&&Default, &&Add, &&Subtract, &&Multiply, &&Divide};
static const char handler_label_index_by_op[] = {
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 1, 0, 0, 4, 0, 0, 0, 0, 0, 0, 0, 0, 3, 0, 0,
0, 0, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0};
size_t i = handler_label_index_by_op(x);
goto *handler_labels[i];
</syntaxhighlight>
 
===Table-driven rating===
Line 362 ⟶ 228:
 
===Spreadsheets===
A [[spreadsheet]] datacould sheet can be thought ofused as a two -dimensional control table, with the non -empty cells representing data to the underlying spreadsheet program (the interpreter). The cells containing formula are usually prefixed with an equals sign and simply designate a special type of data input that dictates the processing of other referenced cells – by altering the control flow within the interpreter. It is the externalization of formulae from the underlying interpreter that clearly identifies both spreadsheets, and the above cited "rule based rating" example as readily identifiable instances of the use of control tables by non -programmers.
 
==Programming paradigm==
If the control tables technique could be said to belong to any particular [[programming paradigm]], the closest analogy might be automata-based programming or [[reflection (computer science)|"reflective"]] (a form of [[metaprogramming]] – since the table entries could be said to 'modify' the behaviour of the interpreter). The interpreter itself however, and the subroutinesfunctions, can be programmed using any one of the available paradigms or even a mixture. The table itself can be essentially a collection of "[[raw data]]" values that do not even need to be compiled and could be read in from an external source (except in specific, platform dependent, implementations using memory pointers directly for greater efficiency).
 
==Analogy to bytecode / virtualVirtual machine instruction set==
A multitwo-dimensional control table hasis somesimilar conceptualto similaritiesintermediate to[[executable code]] (i.e. [[bytecode]]) operatingrunning on a [[virtual machine]], in that a [[platform dependent |platform-dependent]] [[Interpreter (computing)|"interpreter"]] program is usually required to perform the actual execution (that is largely conditionally determined by the tablestable content). There are also some conceptual similarities to the recent [[Common Intermediate Language]] (CIL) in the aim of creating a common intermediate 'instruction set' that is independent of platform (but unlike CIL, no pretensions to be used as a common resource for other languages). [[p-code machine |P-code]] can also be considered a similar but earlier implementation with origins as far back as 1966.
 
==Instruction fetch==
When a multitwo-dimensional control table is used to determine program flow, the normal "hardware" [[program counter]] function is effectively simulated with either a [[Pointer (computer programming)|pointer]] to the first (or next) table entry or else an [[array index |index]] to it. "Fetching" the instruction involves decoding the ''data'' in that table entry – without necessarily copying all or some of the data within the entry first. Programming languages that are able to use [[Pointer (computer programming)|pointer]]s have the dual advantage that less [[Computational overhead|overhead]] is involved, both in accessing the contents and also advancing the counter to point to the next table entry after execution. Calculating the next 'instruction' address (i.e. table entry) can even be performed as an optional additional action of every individual table entry allowing [[Program loops|loops]] and or [[Branch (computer science)|jump]] instructions at any stage.
 
==Monitoring control table execution==
Line 377 ⟶ 243:
 
==Advantages==
*; clarity –Clarity: [[table (information)|information tables]] are [[ubiquitous computing|ubiquitous]] and mostly [[inherently]] [[understanding|understood]] even by the [[general public]] (especially [[fault diagnosis|fault diagnostic]] tables in [[user guide|product guides]])
 
* portability – can be designed to be 100% language independent (and platform independent – except for the interpreter)
; Portability: can be designed to be language and platform independent {{endash}} except for the interpreter
* flexibility – ability to execute either [[language primitive|primitives]] or [[subroutine]]s transparently and be custom designed to suit the problem
 
* compactness – table usually shows condition/action pairing side-by-side (without the usual platform/language implementation dependencies), often also resulting in
; Flexibility: ability to execute either [[language primitive |primitives]] or functions transparently and be custom designed to suit the problem
** [[binary file]] – reduced in size through less duplication of instructions
 
** [[Source code|source]] file – reduced in size through elimination of multiple conditional statements
; Compactness: table usually shows condition/action pairing side-by-side (without the usual platform/language implementation dependencies), often also resulting in reduced binary file size due to less duplication of instructions, reduced source code size due to eliminating conditional statements and reduced program load (or download) speeds
** improved program load (or download) speeds
 
* maintainability – tables often reduce the number of source lines needed to be maintained v. multiple compares
; Maintainability: tables often reduce the number of source lines needed to be maintained v. multiple compares
* locality of reference – compact tables structures result in tables remaining in [[cache (computing)|cache]]
 
* code re-use – the "interpreter" is usually reusable. Frequently it can be easily adapted to new programming tasks using precisely the same technique and can grow 'organically' becoming, in effect, a [[standard library]] of tried and tested [[subroutines]], controlled by the table definitions.
; Locality of reference: compact tables structures result in tables remaining in [[cache (computing)|cache]]
* efficiency – systemwide optimization possible. Any performance improvement to the interpreter usually improves ''all'' applications using it (see examples in 'CT1' above).
 
* extensible – new 'instructions' can be added – simply by extending the interpreter
; Code re-use: the interpreter is usually reusable. Frequently it can be adapted to new programming tasks using the same technique and can grow organically, becoming, in effect, a [[standard library]] of tried and tested functions, controlled by the table definitions.
* interpreter can be written like an application program
 
Optionally:-
; Efficiency: system wide optimization possible. Any performance improvement to the interpreter usually improves ''all'' applications using it (see examples in 'CT1' above).
* the interpreter can be [[introspection|introspective]] and "self [[optimization (computer science)|optimize]]" using runtime [[software metric|metrics]] collected within the table itself (see CT3 and CT4 – with entries that could be periodically sorted by descending count). The interpreter can also optionally choose the most efficient lookup technique dynamically from metrics gathered at run-time (e.g. size of array, range of values, sorted or unsorted)
 
* [[dynamic dispatch]] – common functions can be pre-loaded and less common functions fetched only on first encounter to reduce [[memory]] usage. In-table [[memoization]] can be employed to achieve this.
; Extensible: new instructions can be added by extending the interpreter
 
Optionally:
 
* Interpreter can be [[introspection |introspective]] and "self [[optimization (computer science)|optimize]]" using runtime [[software metric |metrics]] collected within the table itself (see CT3 and CT4 – with entries that could be periodically sorted by descending count). The interpreter can also optionally choose the most efficient lookup technique dynamically from metrics gathered at run-time (e.g. size of array, range of values, sorted or unsorted)
 
* [[Dynamic dispatch]]: common functions can be pre-loaded and less common functions fetched only on first encounter to reduce [[memory]] usage. In-table [[memoization]] can be employed to achieve this.
 
* The interpreter can have debugging, trace and monitor features built-in – that can then be switched on or off at will according to test or 'live' mode
 
* control tables can be built 'on-the-fly' (according to some user input or from parameters) and then executed by the interpreter (without building code literally).
* Control tables can be built 'on-the-fly' (according to some user input or from parameters) and then executed by the interpreter (without building code literally).
 
==Disadvantages==
*; training requirement –Training: application programmers are not usually trained to produce generic solutions
 
The following mainly apply to their use in multi-dimensional tables, not the one-dimensional tables discussed earlier.
*; [[Computational overhead|overhead]]: some increase because of extra level of [[indirection (programming)|indirection]] caused by virtual instructions having to be 'interpreted' (this however can usually be more than offset by a well designed generic interpreter taking full advantage of efficient direct translate, search and conditional testing techniques that may not otherwise have been utilized)
 
* Complex [[expression (programming)|expression]]s cannot always be used ''directly'' in data table entries for comparison purposes
; Complexity: Complex [[expression (theseprogramming)|expression]]s cannot always be used ''directly'' in data table entries for comparison purposes. These intermediate values' can however be calculated beforehand instead within a subroutinefunction and their values referred to in the conditional table entries. Alternatively, a subroutinefunction can perform the complete complex conditional test (as an unconditional 'action') and, by setting a [[truth bit |truth flag]] as its result, it can then be tested in the next table entry. See [[Structured program theorem]])
 
==Quotations==
Line 417 ⟶ 292:
 
==See also==
*[[ {{Annotated link |Database-centric architecture]]}}
*[[ {{Annotated link |Data-driven testing]]}}
*[[ {{Annotated link |Keyword-driven testing]]}}
*[[ {{Annotated link |Threaded code]]}}
* {{Annotated link |Truth table}}
*[[Threaded code#Token threading|Token threading]]
 
==Notes==