Content deleted Content added
"fallthrough" statement in go |
m unpiped links using script |
||
(432 intermediate revisions by more than 100 users not shown) | |||
Line 1:
{{short description|Type of selection control mechanism in computer programming languages}}
{{more citations needed|date=April 2013}}
{{Use American English|date=December 2024}}
In [[computer programming language]]s, a '''switch statement''' is a type of selection control mechanism used to allow the value of a [[variable (programming)|variable]] or expression to change the [[control flow]] of program execution via search and map.
Switch statements function somewhat similarly to the <code>if</code> statement used in programming languages like [[C (programming language)|C]]/[[C++]], [[C Sharp (programming language)|C#]], [[Visual Basic .NET]], [[Java (programming language)|Java]] and exist in most high-level [[imperative programming]] languages such as [[Pascal (programming language)|Pascal]], [[Ada (programming language)|Ada]], [[C (programming language)|C]]/[[C++]], [[C Sharp (programming language)|C#]],<ref name=Skeet>{{cite book |last=Skeet|first=Jon|title= C# in Depth |date=23 March 2019 |publisher= Manning |isbn= 978-1617294532}}</ref>{{rp|374–375}} [[Visual Basic .NET]], [[Java (programming language)|Java]],<ref name=Bloch>{{cite book | title= "Effective Java: Programming Language Guide" |last=Bloch| first=Joshua| publisher=Addison-Wesley | edition=third | isbn=978-0134685991| year=2018}}</ref>{{rp|157–167}} and in many other types of language, using such [[keyword (computer programming)|keyword]]s as <code>switch</code>, <code>case</code>, <code>select</code>, or <code>inspect</code>.
Switch statements come in two main variants: a structured switch, as in Pascal, which takes exactly one branch, and an unstructured switch, as in C, which functions as a type of [[goto]]. The main reasons for using a switch include improving clarity, by reducing otherwise repetitive coding, and (if the [[heuristic]]s permit) also offering the potential for faster execution through easier [[compiler optimization]] in many cases.
{|align="center"
|+ '''An example of a switch statement in C'''
|<syntaxhighlight lang="c">
switch (age) {
case 1: printf("You're one."); break;
case 2: printf("You're two."); break;
case 3: printf("You're three.");
case 4: printf("You're three or four."); break;
default: printf("You're not 1, 2, 3 or 4!");
}
</syntaxhighlight>
|}
==
In his 1952 text ''Introduction to Metamathematics'', [[Stephen Kleene]] formally proved that the CASE function (the IF-THEN-ELSE function being its simplest form) is a [[primitive recursive function]], where he defines the notion "definition by cases" in the following manner:
{{blockquote|text=
"#F. The function φ defined thus
: φ(x<sub>1</sub> , ... , x<sub>n</sub> ) =
::*φ<sub>1</sub>(x<sub>1</sub> , ... , x<sub>n</sub> ) if Q<sub>1</sub>(x<sub>1</sub> , ... , x<sub>n</sub> ),
::* . . . . . . . . . . . .
::*φ<sub>m</sub>(x<sub>1</sub> , ... , x<sub>n</sub> ) if Q<sub>m</sub>(x<sub>1</sub> , ... , x<sub>n</sub> ),
::*φ<sub>m+1</sub>(x<sub>1</sub> , ... , x<sub>n</sub> ) otherwise,
where Q<sub>1</sub> , ... , Q<sub>m</sub> are mutually exclusive predicates (or φ(x<sub>1</sub> , ... , x<sub>n</sub>) shall have the value given by the first clause which applies) is primitive recursive in φ<sub>1</sub>, ..., φ<sub>m+1</sub>, Q<sub>1</sub>, ..., Q<sub>m+1</sub>.|author=Stephen Kleene|source=<ref>"Definition by cases", Kleene 1952:229</ref>}}
Kleene provides a proof of this in terms of the Boolean-like recursive functions "sign-of" sg( ) and "not sign of" ~sg( ) (Kleene 1952:222-223); the first returns 1 if its input is positive and −1 if its input is negative.
Boolos-Burgess-Jeffrey make the additional observation that "definition by cases" must be both [[mutually exclusive]] and [[collectively exhaustive]]. They too offer a proof of the primitive recursiveness of this function (Boolos-Burgess-Jeffrey 2002:74-75).
The IF-THEN-ELSE is the basis of the [[McCarthy formalism]]: its usage replaces both primitive recursion and the [[mu-operator]].
The earliest [[Fortran]] compilers supported the [[Goto#Computed_GOTO_and_Assigned_GOTO|computed GOTO]] statement for multi-way branching. Early [[ALGOL]] compilers supported a SWITCH data type which contains a list of "designational expressions". A GOTO statement could reference a switch variable and, by providing an index, branch to the desired destination. With experience it was realized that a more formal multi-way construct, with single point of entrance and exit, was needed. Languages such as [[BCPL]], [[ALGOL-W]], and [[ALGOL-68]] introduced forms of this construct which have survived through modern languages.
== Typical syntax ==
In most languages, programmers write a switch statement across many individual lines using one or two keywords. A typical syntax involves:
* the first <code>select</code>, followed by an expression which is often referred to as the ''control expression'' or ''control variable'' of the switch statement
* subsequent lines defining the actual cases (the values), with corresponding sequences of statements for execution when a match occurs
* In languages with fallthrough behavior, a <code>break</code> statement typically follows a <code>case</code> statement to end said statement. [Wells]
* In some languages, e.g., [[PL/I]], the control expression is optional; if there is no control expression then each alternative begins with a <code>WHEN</code> clause containing a Boolean expression and a match occurs for the first case for which that expression evaluates to true. This usage is similar to the if/then/elseif/else structures in some other languages, e.g., [[Perl]].
* In some languages, e.g., [[Rexx]], no control expression is allowed and each alternative begins with a <code>WHEN</code> clause containing a Boolean expression and a match occurs for the first case for which that expression evaluates to true.
Each alternative begins with the particular value, or list of values (see below), that the control variable may match and which will cause the control to [[goto]] the corresponding sequence of statements. The value (or list/range of values) is usually separated from the corresponding statement sequence by a colon or by an implication arrow. In many languages, every case must also be preceded by a keyword such as <code>case</code> or <code>when</code>.
An optional default case is typically also allowed, specified by a <code>default</code>, <code>otherwise</code>, or <code>else</code> keyword. This executes when none of the other cases match the control expression. In some languages, such as C, if no case matches and the <code>default</code> is omitted the <code>switch</code> statement simply does nothing. In others, like PL/I, an error is raised.
== Semantics ==
Semantically, there are two main forms of switch statements.
The first form are structured switches, as in Pascal, where exactly one branch is taken, and the cases are treated as separate, exclusive blocks. This functions as a generalized if–then–else [[Conditional (computer programming)|conditional]], here with any number of branches, not just two.
The second form are unstructured switches, as in C, where the cases are treated as labels within a single block, and the switch functions as a generalized goto. This distinction is referred to as the treatment of fallthrough, which is elaborated below.
=== Fallthrough ===
In many languages, only the matching block is executed, and then execution continues at the end of the switch statement. These include the [[Pascal (programming language)|Pascal]] family (Object Pascal, Modula, Oberon, Ada, etc.) as well as [[PL/I]], modern forms of [[Fortran]] and [[BASIC]] dialects influenced by Pascal, most functional languages, and many others. To allow multiple values to execute the same code (and avoid needing to [[duplicate code]]), Pascal-type languages permit any number of values per case, given as a comma-separated list, as a range, or as a combination.
Languages derived from C language, and more generally those influenced by Fortran's [[computed GOTO]], instead feature fallthrough, where control moves to the matching case, and then execution continues ("falls through") to the statements associated with the ''next'' case in the source text. This also allows multiple values to match the same point without any special syntax: they are just listed with empty bodies. Values can be [[special condition]]ed with code in the case body. In practice, fallthrough is usually prevented with a <code>break</code> keyword at the end of the matching body, which exits execution of the switch block, but this can cause bugs due to unintentional fallthrough if the programmer forgets to insert the <code>break</code> statement. This is thus seen by many<ref>van der Linden, Peter (1994). ''Expert C Programming: Deep C Secrets'', p. 38. Prentice Hall, Eaglewood Cliffs. {{ISBN|0131774298}}.</ref> as a language wart, and warned against in some [[lint tool]]s. Syntactically, the cases are interpreted as labels, not blocks, and the switch and break statements explicitly change control flow. Some languages influenced by C, such as [[JavaScript]], retain default fallthrough, while others remove fallthrough, or only allow it in special circumstances. Notable variations on this in the C-family include [[C Sharp (programming language)|C#]], in which all blocks must be terminated with a <code>break</code> or <code>return</code> unless the block is empty (i.e. fallthrough is used as a way to specify multiple values).
In some cases languages provide optional fallthrough. For example, [[Perl]] does not fall through by default, but a case may explicitly do so using a <code>continue</code> keyword. This prevents unintentional fallthrough but allows it when desired. Similarly, [[Bash (Unix shell)|Bash]] defaults to not falling through when terminated with <code>;;</code>, but allows fallthrough<ref>since [http://git.savannah.gnu.org/cgit/bash.git/tree/NEWS?id=3185942a5234e26ab13fa02f9c51d340cec514f8#n111 version 4.0], released in 2009.</ref> with <code>;&</code> or <code>;;&</code> instead.
An example of a switch statement that relies on fallthrough is [[Duff's device]].
== Compilation ==
[[Optimizing compiler]]s such as [[GNU Compiler Collection|GCC]] or [[Clang]] may compile a switch statement into either a [[branch table]] or a [[binary search]] through the values in the cases.<ref>Vlad Lazarenko. [https://web.archive.org/web/20220313040503/http://lazarenko.me/switch/ From Switch Statement Down to Machine Code]</ref>
A branch table allows the switch statement to determine with a small, constant number of instructions which branch to execute without having to go through a list of comparisons, while a binary search takes only a logarithmic number of comparisons, measured in the number of cases in the switch statement.
Normally, the only method of finding out if this optimization has occurred is by actually looking at the resultant [[Assembly language|assembly]] or [[machine code]] output that has been generated by the compiler.
== Advantages and disadvantages ==
In some languages and programming environments, the use of a <code>case</code> or <code>switch</code> statement is considered superior to an equivalent series of ''if [[else if]]'' statements because it is:
* Easier to debug (e.g. setting breakpoints on code vs. a call table, if the debugger has no conditional breakpoint capability)
* Easier for a person to read
* Easier to understand, and consequently easier to maintain
* Fixed depth: a sequence of "if else if" statements may yield deep nesting, making compilation more difficult (especially in automatically generated code)
* Easier to verify that all values are handled. Compilers can issue a warning if some enum values are not handled.
Additionally, an [[optimization (computer science)|optimized]] implementation may execute much faster than the alternative, because it is often implemented by using an indexed [[branch table]].<ref>{{cite book |last1=Guntheroth |first1=Kurt |title=Optimized C++ |date=April 27, 2016 |publisher=O'Reilly Media |isbn=9781491922033 |page=182}}</ref> For example, deciding program flow based on a single character's value, if correctly implemented, is vastly more efficient than the alternative, reducing [[instruction path length]]s considerably. When implemented as such, a switch statement essentially becomes a [[perfect hash]].
In terms of the [[control-flow graph]], a switch statement consists of two nodes (entrance and exit), plus one edge between them for each option. By contrast, a sequence of "if...else if...else if" statements has an additional node for every case other than the first and last, together with a corresponding edge. The resulting control-flow graph for the sequences of "if"s thus has many more nodes and almost twice as many edges, with these not adding any useful information. However, the simple branches in the if statements are individually conceptually easier than the complex branch of a switch statement. In terms of [[cyclomatic complexity]], both of these options increase it by ''k''−1 if given ''k'' cases.
== Case and Switch expressions==
''Case expressions'' are supported by languages dating at least as far back as [[ALGOL-W]].<ref name="Wirth">{{cite journal |last1=Wirth |first1=Niklaus |author1-link=Niklaus Wirth |last2=Hoare |first2=C. A. R. |author2-link=Tony Hoare |date=June 1966 |title=A contribution to the development of ALGOL |url=https://dl.acm.org/doi/10.1145/365696.365702 |journal=Communications of the ACM |volume=9 |issue=6 |pages=413–432 |doi=10.1145/365696.365702 |s2cid=11901135 |via=[[Association for Computing Machinery]] |access-date=2020-10-07|doi-access=free }}</ref> In ALGOL-W, an integer expression was evaluated, which then evaluated the desired expression from a list of expressions:
<syntaxhighlight lang="pascal">
J :=
A := case DECODE(C)-128 of ("A", "B", "C", "D", "E", "F");
</syntaxhighlight>
Other languages supporting case expressions include [[SQL]], [[Standard ML]], [[Haskell]], and [[Common LISP]].
''Switch expressions'' are introduced in [[Java version history#Java SE 12|Java SE 12]], 19 March 2019, as a preview feature. Here a whole switch expression can be used to return a value. There is also a new form of case label, {{code|case L-> }} where the right-hand-side is a single expression. This also prevents fall through and requires that cases are exhaustive. In Java SE 13 the <code>yield</code> statement is introduced, and in Java SE 14 switch expressions become a standard language feature.<ref>{{cite web|access-date=2021-04-28|title=JEP 325: Switch Expressions (Preview)|url=https://openjdk.java.net/jeps/325|website=openjdk.java.net}}</ref><ref>{{cite web|access-date=2021-04-28|title=JEP 354: Switch Expressions (Second Preview)|url=https://openjdk.java.net/jeps/354|website=openjdk.java.net}}</ref><ref>{{cite web|access-date=2021-04-28|title=JEP 361: Switch Expressions|url=https://openjdk.java.net/jeps/361|website=openjdk.java.net}}</ref> For example:
<syntaxhighlight lang="java">
int ndays = switch (month) {
case JAN, MAR, MAY, JUL, AUG, OCT, DEC -> 31;
case APR, JUN, SEP, NOV -> 30;
case FEB -> {
if (year % 400 == 0) yield 29;
else if (year % 100 == 0) yield 28;
else if (year % 4 == 0) yield 29;
else yield 28; }
};
</syntaxhighlight>
== Alternative uses ==
Many languages evaluate expressions inside <code>switch</code> blocks at runtime, allowing a number of less obvious uses for the construction. This prohibits certain compiler optimizations, so is more common in dynamic and scripting languages where the enhanced flexibility is more important than the performance overhead.
=== PHP ===
For example, in [[PHP]], a constant can be used as the "variable" to check against, and the first case statement which evaluates to that constant will be executed:
<
switch (true) {
case ($x == 'hello'):
foo();
break;
case ($z == 'howdy'): break;
}
switch (5) {
case $x: break;
case $y: break;
}
</syntaxhighlight>
This feature is also useful for checking multiple variables against one value rather than one variable against many values.
=== Ruby ===
In [[Ruby (programming language)|Ruby]], due to its handling of <code>===</code> equality, the statement can be used to test for variable’s class:
<
case input
when Array
when Hash
end
</syntaxhighlight>
Ruby also returns a value that can be assigned to a variable, and doesn’t actually require the <code>case</code> to have any parameters (acting a bit like an <code>else if</code> statement):
<
catfood =
case junior senior else
end</syntaxhighlight>
=== Assembler ===
A switch statement in [[assembly language]]:
<syntaxhighlight lang="nasm">
switch:
cmp ah, 00h
je a
cmp ah, 01h
je b
jmp swtend ; No cases match or "default" code here
a:
push ah
mov al, 'a'
mov ah, 0Eh
mov bh, 00h
int 10h
pop ah
jmp swtend ; Equivalent to "break"
b:
push ah
mov al, 'b'
mov ah, 0Eh
mov bh, 00h
int 10h
pop ah
jmp swtend ; Equivalent to "break"
...
swtend:
</syntaxhighlight>
=== Python ===
For Python 3.10.6, [[Python Enhancement Proposal|PEPs]] 634-636 were accepted, which added {{code|lang=python|match}} and {{code|lang=python|case}} keywords.<ref>{{Cite web |last=Galindo Salgado |first=Pablo |title=What's New In Python 3.10 |url=https://docs.python.org/3/whatsnew/3.10.html |access-date=2022-08-19 |website=Python 3.10.6 documentation}}</ref><ref>{{Cite web |last1=Bucher |first1=Brandt |last2=van Rossum |first2=Guido |author-link2=Guido van Rossum |date=2020-09-12 |title=PEP 634 – Structural Pattern Matching: Specification |url=https://peps.python.org/pep-0634/ |access-date=2022-08-19 |website=Python Enhancement Proposals}}</ref><ref>{{Cite web |last1=Kohn |first1=Tobias |author-link=Guido van Rossum |last2=van Rossum |first2=Guido |date=2020-09-12 |title=PEP 635 – Structural Pattern Matching: Motivation and Rationale |url=https://peps.python.org/pep-0635/ |access-date=2022-08-19 |website=Python Enhancement Proposals}}</ref><ref>{{Cite web |last=Moisset |first=Daniel F |title=PEP 636 – Structural Pattern Matching: Tutorial |url=https://peps.python.org/pep-0636/ |access-date=2022-08-19 |website=Python Enhancement Proposals}}</ref> Unlike other languages, Python notably doesn't exhibit fallthrough behavior.
<syntaxhighlight lang="python" line>
letter = input("Put in a single letter: ").strip()[0].casefold() # First non-whitespace character of the input, lowercase
match letter:
case "a" | "e" | "i" | "o" | "u": # Unlike conditions in if statements, the `or` keyword cannot be used here to differentiate between cases
print(f"Letter {letter} is a vowel!")
case "y":
print(f"Letter {letter} may be a vowel.")
case _: # `case _` is equivalent to `default` from C and others
print(f"Letter {letter} is not a vowel!")
</syntaxhighlight>
=== Exception handling ===
A number of languages implement a form of switch statement in [[exception handling]], where if an exception is raised in a block, a separate branch is chosen, depending on the exception. In some cases a default branch, if no exception is raised, is also present. An early example is [[Modula-3]], which use the <code>TRY</code>...<code>EXCEPT</code> syntax, where each <code>EXCEPT</code> defines a case. This is also found in [[Delphi (programming language)|Delphi]], [[Scala (programming language)|Scala]], and [[Visual Basic .NET]].
== Alternatives ==
Some alternatives to switch statements can be:
* A series of ''if-else'' [[Conditional (programming)|conditionals]] that examine the target one value at a time. Fallthrough behavior can be achieved with a sequence of ''if'' conditionals each without the ''else'' clause.
* A [[lookup table]], which contains, as keys, the <code>case</code> values and, as values, the part under the <code>case</code> statement.
::(In some languages, only actual data types are allowed as values in the lookup table. In other languages, it is also possible to assign [[Function (computer science)|functions]] as lookup table values, gaining the same flexibility as a real <code>switch</code> statement. See [[Control table]] article for more detail on this).
::[[Lua]] does not support case/switch statements.<ref name="lua"/> This lookup technique is one way to implement <code>switch</code> statements in the Lua language, which has no built-in <code>switch</code>.<ref name="lua">[http://lua-users.org/wiki/SwitchStatement Switch statement in Lua]</ref>
::In some cases, lookup tables are more efficient than non-[[Program optimization|optimized]] <code>switch</code> statements since many languages can optimize table lookups, whereas switch statements are not optimized unless the range of values is small with few gaps. A non-optimized, non-[[binary search]] lookup, however, will almost certainly be slower than either a non-optimized switch or the equivalent multiple ''if-else'' statements.{{citation needed|date=October 2011}}
* A [[control table]] (that may be implemented as a simple lookup table) can also be customized to accommodate multiple conditions on multiple inputs if required and usually exhibits greater 'visual compactness' than an equivalent switch (that can occupy many statements).
* [[Pattern matching]], which is used to implement switch-like functionality in many [[functional programming|functional]] languages.
== See also ==
* [[Algorithmic efficiency]]
* [[Branch table]]
* [[Control table]]
* [[Duff's device]]
* [[Index mapping]]
== References ==
{{reflist}}
== Further reading ==
* [[Stephen Kleene]], 1952 (10th reprint 1991), ''Introduction to Metamathematics'', North-Holland Publishing Company, Amsterdam NL, {{ISBN|0-7204-2103-9}}
* [[George Boolos]], [[John P. Burgess|John Burgess]], and [[Richard Jeffrey]], 2002, ''Computability and Logic: Fourth Edition'', Cambridge University Press, Cambridge UK, {{ISBN|0-521-00758-5}} paperback. cf page 74-75.
[[Category:Conditional constructs]]
[[ru:Оператор ветвления#Переключатель (оператор множественного выбора)]]
|