{{Orphan|date=February 2009}}
In software development, a '''Schlemiel the Painter'''['s]<!-- Spolsky uses it both with and without the possessive 's -->''' algorithm''' denotes any methodology that is inefficient because the programmer has overlooked some fundamental issues at the very [[High and low level|lowest levels]] of [[software design]]. The term was coined by software engineer and essayist [[Joel Spolsky]].
__TOC__
==Spolsky's analogy==
Spolsky used a [[Yiddish]] joke to illustrate a certain poor programming practice. In the joke, Schlemiel (also rendered Shlemiel) has a job painting the dotted lines down the middle of a road. Each day, Schlemiel paints less than he painted the day before. When he is asked why, Schlemiel complains that it is because each day he gets farther away from the paint can.<ref name="basics">{{citation|last=Spolsky|first=Joel|title=Back to Basics|date=December 11, 2001|series=Joel on Software|url=http://www.joelonsoftware.com/articles/fog0000000319.html|publisher=joelonsoftware.com}}.</ref>
The inefficiency to which Spolsky was drawing an analogy is the poor programming practice of repeated [[concatenation]] of [[C (programming language)|C]]-style [[Null character|null]]-terminated character arrays (in general computing parlance, "[[String (computer science)|strings]]"), in which the position of the end of the destination string has to be recomputed from the beginning of the string each time because it is not carried over from the previous concatenation.
Spolsky condemned such inefficiencies as typical of programmers who had not been taught basic programming techniques before they began programming in higher-level languages: "Generations of graduates are descending on us and creating ''Shlemiel The Painter algorithms'' right and left and they don't even realize it, since they fundamentally have no idea that strings are, at a very deep level, difficult."<ref name="basics" />
Coined in 2001, the term has since become part of the vernacular to denote inefficient programming techniques.<sup>''cf.'' </sup><ref>{{citation|title=Programming interview questions|last=Cox|first=William|date=November 19, 2005|url=http://discuss.techinterview.org/default.asp?interview.11.246942.7|publisher=techinterview.org}}.</ref>
<ref>{{citation|last=Atwood|first=Jeff|date=September 19, 2007|title=Everything Is Fast For Small n|url=http://www.codinghorror.com/blog/archives/000957.html|publisher=codinghorror.com}}.</ref> Spolsky's essays have been cited as examples of good writing "about their insular world in a way that wins the respect of their colleagues and the attention of outsiders."<ref>{{citation|last=Rosenberg|first=Scott|title=The Shlemiel way of software|date=December 9, 2004|url=http://dir.salon.com/story/tech/feature/2004/12/09/spolsky/|publisher=salon.com}}.</ref>
==Spolsky's example==
The programming practice that Spolsky used to make his point was repeated concatenation of null-terminated character arrays ("strings").<ref name="basics" />
The first step in a typical implementation of <code>strcat()</code>, the [[C standard library|standard C library]] function for concatenating strings, is to find the end of the string being appended to by checking each character in the array, starting from the beginning, to see if it is the terminating [[Null character|null character]]. The second string is then copied to the end of the first string, thereby concatenating the two. When the function returns to the calling code, the position of the end of the combined string is discarded.
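The two phases described above can be sketched as follows. This is only an illustrative stand-in: <code>naive_strcat</code> is a hypothetical name, and real library implementations are usually more heavily optimized, although they still have to locate the end of the destination first.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for strcat(), written for illustration only. */
char *naive_strcat(char *dest, const char *src)
{
    assert(dest != NULL && src != NULL);

    char *end = dest;

    /* Phase 1: walk from the start of dest to its terminating null. */
    while (*end != '\0') {
        end++;
    }

    /* Phase 2: copy src, including its terminating null, to that position. */
    while ((*end++ = *src++) != '\0') {
    }

    /* Like strcat(), return the start of dest; the end position is lost. */
    return dest;
}
```

The caller must ensure that the destination array is large enough to hold the combined string, exactly as with <code>strcat()</code> itself.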
In Spolsky's example, the "Schlemiels" occur when multiple strings are being concatenated together:
# <code>[[strcat]]( buffer, "John" ); </code>/* Here, the string "John" is appended to the buffer */
# <code>strcat( buffer, "Paul" ); </code>/* Now the string "Paul" is appended to ''that'' */
# <code>strcat( buffer, "George" ); </code>/* ... and the string "George" is appended to ''that'' */
# <code>strcat( buffer, "Ringo" ); </code>/* ... and the string "Ringo" is appended to ''that'' */
After "Paul" has been appended to "John", the length of "JohnPaul" (or, more precisely, the position of the terminating null character) is known within the [[Scope (programming)|scope]] of <code>strcat()</code>, but it is discarded when the function returns, before the call that appends "George". When <code>strcat()</code> is then told to append "George" to "JohnPaul", it starts at the very first character of the array (which is 'J') all over again just to find the terminating null character. Each subsequent call to <code>strcat()</code> has to find the end of the string again before concatenating another name to the <code>buffer</code>.
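To make the repeated work visible, the four calls can be run through an instrumented concatenation that counts how many characters are skipped while searching for the end of the buffer. The names <code>counting_strcat</code> and <code>skipped_for_beatles</code> are hypothetical and exist only for this sketch, which assumes the buffer starts out empty.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical instrumented concatenation: behaves like strcat() but adds
   the number of characters skipped while searching for the end of dest
   to *skipped. */
char *counting_strcat(char *dest, const char *src, size_t *skipped)
{
    assert(dest != NULL && src != NULL && skipped != NULL);

    char *end = dest;
    while (*end != '\0') {
        end++;
        (*skipped)++;
    }
    while ((*end++ = *src++) != '\0') {
    }
    return dest;
}

/* Runs the four calls from the example; buffer needs at least 20 bytes. */
size_t skipped_for_beatles(char *buffer)
{
    size_t skipped = 0;

    buffer[0] = '\0';                            /* start with an empty string */
    counting_strcat(buffer, "John", &skipped);   /* skips 0 characters  */
    counting_strcat(buffer, "Paul", &skipped);   /* skips 4 characters  */
    counting_strcat(buffer, "George", &skipped); /* skips 8 characters  */
    counting_strcat(buffer, "Ringo", &skipped);  /* skips 14 characters */
    return skipped;                              /* 0 + 4 + 8 + 14 = 26 */
}
```

The final string is only 19 characters long, yet 26 characters are skipped along the way. For ''n'' strings of equal length ''k'', the same pattern skips ''k''·''n''(''n''−1)/2 characters in total, so the cost grows quadratically with the number of strings.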
Just as Schlemiel does not carry the paint can with him, the calling code does not carry the string's length with it, so every subsequent <code>strcat()</code> has to "walk" the length of the string again to determine where the second string should be copied. As more data is added to <code>buffer</code>, the terminating null character gets farther away from the beginning with each call to <code>strcat()</code>, so more checks must be made to find it and each call is slower than the one before, just as Schlemiel's walk back to his paint can keeps getting longer.
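One common remedy is to "carry the paint can": keep a pointer to the end of the string and hand it to the next append, so that nothing is rescanned. The following is a minimal sketch under that assumption; <code>append_at</code> and <code>join_names</code> are hypothetical helpers, not standard library functions.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: copies src to the position `end`, which must point at
   the terminating null of the destination, and returns the new end. */
char *append_at(char *end, const char *src)
{
    assert(end != NULL && src != NULL && *end == '\0');

    while ((*end = *src) != '\0') {
        end++;
        src++;
    }
    return end; /* points at the new terminating null */
}

/* Builds "JohnPaulGeorgeRingo" in buffer, which must hold at least 20 bytes.
   Returns a pointer to the terminating null of the result. */
char *join_names(char *buffer)
{
    char *end = buffer;

    *end = '\0';                     /* start with an empty string */
    end = append_at(end, "John");    /* no rescan: end is carried forward */
    end = append_at(end, "Paul");
    end = append_at(end, "George");
    end = append_at(end, "Ringo");
    return end;
}
```

With this approach each character is written once and never revisited, so the total work is proportional to the length of the final string rather than growing quadratically.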
The problems illustrated by Spolsky's example can go unnoticed by a programmer who is using a high-level language and has little or no knowledge of its underlying principles and functions. "Some of the biggest mistakes people make even at the highest architectural levels come from having a weak or broken understanding of a few simple things at the very lowest levels."<ref name="basics" />
== References ==
{{reflist}}
[[Category:Software engineering terminology]]
[[Category:Software development philosophies]]