Extensible programming: Difference between revisions

Content deleted Content added
updated link (mps)
 
(132 intermediate revisions by 65 users not shown)
Line 1:
{{short description|Style of computer programming}}
'''Note''': this article is currently a work in progress.
In [[computer science]], '''extensible programming''' is a style of computer programming that focuses on mechanisms to extend the [[programming language]], [[compiler]], and [[runtime system]] (environment). Extensible programming languages, supporting this style of programming, were an active area of work in the 1960s, but the movement was marginalized in the 1970s.<ref name="Standish1975"/> Extensible programming has become a topic of renewed interest in the 21st century.<ref name="Wilson2005">Gregory V. Wilson, "[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.453.3676&rep=rep1&type=pdf Extensible Programming for the 21st Century]", ''ACM Queue'' 2 no. 9 (Dec/Jan 2004–2005).</ref>
 
== Historical movement ==
'''Extensible programming''' is a term used in [[computer science]] to describe a style of computer programming that focuses on mechanisms to extend the [[programming language]](s), [[compiler]] and [[runtime]] environment. A system that supports extensible programming will provide ''all'' of the features described below.
The first paper usually<ref name="Standish1975">Standish, Thomas A., "[https://pdfs.semanticscholar.org/7f11/082b409647e8d50dadd3a369a10278b5890f.pdf Extensibility in Programming Language Design]", ''SIGPLAN Notices'' 10 no. 7 (July 1975), pp. 18–21.</ref><ref name="Sammet1969">Sammet, Jean E., ''Programming Languages: History and Fundamentals'', Prentice-Hall, 1969, section III.7.2</ref> associated with the extensible programming language movement is M. [[Douglas McIlroy]]'s 1960 paper on [[Macro (computer science)|macros]] for [[high-level programming language]]s.<ref name="McIlroy1960">McIlroy, M.D., "[https://dl.acm.org/citation.cfm?id=367223 Macro Instruction Extensions of Compiler Languages]", ''Communications of the ACM'' 3 no. 4 (April 1960), pp. 214–220.</ref> Another early description of the principle of extensibility occurs in Brooker and Morris's 1960 paper on the [[compiler-compiler]].<ref name="Brooker&Morris1962">Brooker, R.A. and Morris, D., "[https://dl.acm.org/citation.cfm?id=321106 A General Translation Program for Phrase Structure Languages]", ''Journal of the ACM'' 9 no. 1 (January 1962), pp. 1–10. The paper was received in 1960.</ref> The peak of the movement was marked by two academic symposia, in 1969 and 1971.<ref name="Christensen&Shaw1969">Christensen, C. and Shaw, C.J., eds., Proceedings of the Extensible Languages Symposium, ''SIGPLAN Notices'' 4 no. 8 (August 1969).</ref><ref name="Schuman1971">Schuman, S.A., ed., Proceedings of the International Symposium on Extensible Languages, ''SIGPLAN Notices'' 6 no. 12 (December 1971).</ref> By 1975, a survey article on the movement by Thomas A. Standish<ref name="Standish1975"/> was essentially a post mortem. The [[Forth (programming language)|Forth]] was an exception, but it went essentially unnoticed.
 
=== Character of the historical movement ===
== Extensible Syntax ==
As typically envisioned, an extensible language consisted of a base language providing elementary computing facilities, and a [[metalanguage]] able to modify the base language. A program then consisted of metalanguage modifications and code in the modified base language.
This simply means that the source language(s) to be compiled must not be closed, fixed or static. It must be possible to add new keywords, concepts and structures to the source language(s). While it is acceptable for some fundamental and intrinsic language features to be immutable, the system must not rely solely on those language features. It must be possible to add new ones.
 
The most prominent language-extension technique used in the movement was macro definition. Grammar modification was also closely associated with the movement, resulting in the eventual development of [[adaptive grammar]] [[Formalism (philosophy of mathematics)|formalisms]]. The [[Lisp (programming language)|Lisp]] language community remained separate from the extensible language community, apparently because, as one researcher observed,
== Extensible Compiler ==
In extensible programming, a compiler is not a monolithic program that converts source code input into binary executable output. The compiler itself must be extensible to the point that it is really a collection of plugins that assist with the translation of source language input into ''anything''. For example, an extensible compiler will support the generation of object code, code documentation, re-formatted source code, or any other desired output. The architecture of the compiler must permit its users to "get inside" the compilation process and provide alternative processing tasks at every reasonable step in the compilation process.
 
<blockquote>any programming language in which programs and data are essentially interchangeable can be regarded as an extendible [sic] language. ... this can be seen very easily from the fact that Lisp has been used as an extendible language for years.<ref name="Harrison1969">Harrison, M.C., in "Panel on the Concept of Extensibility", pp. 53–54 of the 1969 symposium.</ref></blockquote>
For just the task of translating soure code into something that can be executed on a computer, an extensible compiler should:
 
* utilize a plug-in or component architecture for nearly every aspect of its function
At the 1969 conference, [[Simula]] was presented as an extensible language.
 
Standish described three classes of language extension, which he named ''[[paraphrase]]'', ''orthophrase'', and ''metaphrase'' (otherwise paraphrase and metaphrase being [[translation]] terms).
 
* [[Paraphrase]] defines a facility by showing how to exchange it for something formerly defined (or to be defined). As examples, he mentions macro definitions, ordinary procedure definitions, grammatical extensions, data definitions, operator definitions, and control structure extensions.
* Orthophrase adds features to a language that could not be achieved using the base language, such as adding an [[input/output]] (I/O) system to a base language formerly with no I/O primitives. Extensions must be understood as orthophrase ''relative'' to some given base language, since a feature not defined in terms of the base language must be defined in terms of some other language. This corresponds to the modern notion of [[Plug-in (computing)|plug-ins]].
* Metaphrase modifies the interpretation rules used for pre-existing expressions. This corresponds to the modern notion of [[reflective programming]] (reflection).
 
=== Death of the historical movement ===
Standish attributed the failure of the extensibility movement to the difficulty of programming successive extensions. A programmer might build a first shell of macros around a base language. Then, if a second shell of macros is built around that, any subsequent programmer must be intimately familiar with both the base language, and the first shell. A third shell would require familiarity with the base and both the first and second shells, and so on. Shielding a programmer from lower-level details is the intent of the [[Abstraction (computer science)|abstraction]] movement that supplanted the extensibility movement.
 
Despite the earlier presentation of Simula as extensible, by 1975, Standish's survey does not seem in practice to have included the newer abstraction-based technologies (though he used a very general definition of extensibility that technically could have included them). A 1978 history of programming abstraction from the invention of the computer until then, made no mention of macros, and gave no hint that the extensible languages movement had ever occurred.<ref name="Guarino1978">Guarino, L.R., "[https://cds.cern.ch/record/119689 The Evolution of Abstraction in Programming Languages]", ''CMU-CS-78-120'', Department of Computer Science, Carnegie-Mellon University, Pennsylvania, 22 May 1978.</ref> Macros were tentatively admitted into the abstraction movement by the late 1980s (perhaps due to the advent of [[hygienic macros]]), by being granted the pseudonym ''syntactic abstractions''.<ref name="Gabriel1989">Gabriel, Richard P., ed., "[https://dl.acm.org/citation.cfm?id=66092 Draft Report on Requirements for a Common Prototyping System]", ''SIGPLAN Notices'' 24 no. 3 (March 1989), pp. 93ff.</ref>
 
== Modern movement ==
In the modern sense, a system that supports extensible programming will provide ''all'' of the features described below{{Citation needed|reason=Reliable source needed for this definition|date=October 2017}}.
 
=== Extensible Syntaxsyntax ===
{{category see also|Extensible syntax programming languages}}
This simply means that the source language(s) to be compiled must not be closed, fixed, or static. It must be possible to add new keywords, concepts, and structures to the source language(s). Languages which allow the addition of constructs with user defined syntax include [[Coq (software)|Coq]],<ref>{{Cite web |title=Syntax extensions and notation scopes – Coq 8.17.0 documentation |url=https://coq.inria.fr/refman/user-extensions/syntax-extensions.html |access-date=2023-05-25 |website=coq.inria.fr}}</ref> [[Racket (programming language)|Racket]], [[Camlp4]], [[OpenC++ (software tool)|OpenC++]], [[Seed7]],<ref name="Zingaro2007">Zingaro, Daniel, "[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.69.2848&rep=rep1&type=pdf Modern Extensible Languages]", SQRL Report 47 McMaster University (October 2007), page 16.</ref> [[Red (programming language)|Red]], [[Rebol]], and [[Felix (programming language)|Felix]]. While it is acceptable for some fundamental and intrinsic language features to be immutable, the system must not rely solely on those language features. It must be possible to add new ones.
 
=== Extensible Compilercompiler ===
In extensible programming, a compiler is not a monolithic program that converts source code input into binary executable output. The compiler itself must be extensible to the point that it is really a collection of plugins that assist with the translation of source language input into ''anything''. For example, an extensible compiler will support the generation of object code, code documentation, re-formatted source code, or any other desired output. The architecture of the compiler must permit its users to "get inside" the compilation process and provide alternative processing tasks at every reasonable step in the compilation process.
 
For just the task of translating souresource code into something that can be executed on a computer, an extensible compiler should:
* utilizeuse a plug-in or component architecture for nearly every aspect of its function
* determine which language or language variant is being compiled and locate the appropriate plug-in to recognize and validate that language
* use formal language specifications to syntactically and structurally validate arbitrary source languages
* assist with the semantic validation of arbitrary source languages by invoking an appropriate validation plug-in
* allow users to select from different kinds of code generators so that the resulting executable can be targeted for different processors, operating systems, virtual machines, or other execution environment.
Line 19 ⟶ 45:
* allow new values in nodes of the AST,
* allow new kinds of edges between nodes,
* support the transformation of the input AST, or portions thereof, by some external "pass"
* support the translation of the input AST, or portions thereof, into another form by some external "pass"
* assist with the flow of information between internal and external passes as they both transform and translate the AST into new ASTs or other representations
 
=== Extensible Runtimeruntime ===
At runtime, extensible programming systems must permit languages to extend the set of operations that it permits. For example, if the system uses a bytecode[[byte-code]] interpreter, it must allow new bytecodebyte-code values to be defined. As with extensible syntax, it is acceptable for there to be some (smallish) set of fundamental or intrinsic operations that are immutable. However, it must be possible to overload or augment those intrinsic operations so that new or additional behavior can be supported.
 
=== Content Separatedseparated Fromfrom Formform ===
Extensible programming systems should regard programs as data to be processed. Those programs should be completely devoid of any kind of formatting information. The visual display and editing of programs to users should be a translation function, supported by the extensible compiler, that translates the program data into forms more suitable for viewing or editing. Naturally, this should be a two-way translation. This is important because it must be possible to easily process extensible programs in a ''variety'' of ways. It is unacceptable for the only uses of source language input to be editing, viewing and translation to machine code. The arbitrary processing of programs is facilitated by de-coupling the source input from specifications of how it should be processed (formatted, stored, displayed, edited, etc.).
 
=== Source Languagelanguage Debuggingdebugging Supportsupport ===
Extensible programming systems must support the debugging of programs using the constructs of the original source language regardless of the extensions or transformation the program has undergone in order to make it executable. Most notably, it cannot be assumed that the only way to display runtime data is in ''structures'' or ''arrays''. The debugger, or more correctly 'program inspector', must permit the display of runtime data in forms suitable to the source language. For example, if the language supports a data structure for a [[business process]] or [[work flow]], it must be possible for the debugger to display that data structure as a [[fishbone chart]] or other form provided by a plugin.
 
== Related Links Examples==
* [[Camlp4]]
=== External Links ===
* Felix
# [http://acmqueue.com/modules.php?name=Content&pa=showpage&pid=247&page=1 Greg Wilson's Article in ACM Queue]
* [[Nemerle]]
# [http://pyre.third-bit.com/mailman/listinfo/extprog Extensible Programming mailing list]
* [[Seed7]]
* [[Rebol]]
** [[Red (programming language)|Red]]
* [[Ruby (programming language)|Ruby]] ([[metaprogramming]])
* [[IMP (programming language)|IMP]]
* OpenC++
* [[XL (programming language)|XL]]
* [[XML]]
* [[Forth (programming language)|Forth]]
* [[Factor (programming language)|Factor]]
* [[Lisp (programming language)|Lisp]]
** [[Racket (programming language)|Racket]]
** [[Scheme (programming language)|Scheme]]
* [[Lua (programming language)|Lua]]
* [[PL/I]]
* [[Smalltalk]]
 
== See also ==
* [[Adaptive grammar]]
* [[Concept programming]]
* [[Dialecting]]
* [[Grammar-oriented programming]]
* [[Language-oriented programming]]
* [[Homoiconicity]]
 
== References ==
{{Reflist}}
 
=== External Linkslinks ===
 
=== General ===
# [https://web.archive.org/web/20050209071400/http://www.acmqueue.com/modules.php?name=Content&pa=showpage&pid=247&page=1 Greg Wilson's Article in ACM Queue]
# [http://developers.slashdot.org/article.pl?sid=05/01/18/2157249&from=rss Slashdot Discussion]
# [http://www.cas.mcmaster.ca/sqrl/papers/SQRLreport47.pdf Modern Extensible Languages] {{Webarchive|url=https://web.archive.org/web/20110612014339/http://www.cas.mcmaster.ca/sqrl/papers/SQRLreport47.pdf |date=2011-06-12}} – A paper from [[Daniel Zingaro]]
# [http://www.vbxml.com/XPL/spec_draft.asp An extensible programming language specification]
 
=== Tools ===
# [http://www.meta-language.net/ MetaL] - [https://web.archive.org/web/20050220100913/http://pyre.third-bit.com/pipermail/extprog/2005-January/000019.html an extensible programming compiler engine implementation]
# [httphttps://x-p-ssourceforge.net/projects/extprosys/ XPS] - eXtensible programmingProgramming systemSystem (in development)
# [http://www.jetbrains.com/mps/ MPS] - JetBrains Metaprogramming system
 
=== Languages with extensible syntax ===
# [https://openzz.sourceforge.net/ OpenZz]
# [http://cs.nyu.edu/rgrimm/xtc/ xtc – eXTensible C]
# [https://github.com/pannous/english-script English-script]
# [https://web.archive.org/web/20050622032429/http://nemerle.org/Macros Nemerle Macros]
# [https://web.archive.org/web/20050817205802/http://boo.codehaus.org/Syntactic+Macros Boo Syntactic Macros]
# [https://web.archive.org/web/20061022071450/http://suif.stanford.edu/ Stanford University Intermediate Format compiler]
# [https://seed7.sourceforge.net/ Seed7 – The extensible programming language]
# [https://github.com/chrisseaton/katahdin Katahdin] – a language with syntax and semantics that are mutable at runtime
# [http://www.pi-programming.org/What.html π] – a language with extensible syntax, implemented using an [[Earley parser]]
 
{{Programming paradigms navbox}}
{{Types of programming languages}}
 
{{DEFAULTSORT:Extensible Programming}}
[[Category:Extensible syntax programming languages]]
[[Category:Programming paradigms]]