Static program analysis

This is an old revision of this page, as edited by 155.99.194.94 (talk) at 19:12, 29 July 2004 (Tools). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Static code analysis is a set of methods for analysing software source code or object code in an effort to gain understanding of what the software does. Applications include targeting areas for review and/or rewrite.

Schematically, there exist several types of static analysis (which may be used in combination, even inside the same programming tool):

  • tools such as lint essentially look for constructs that "look dangerous" from an informal point of view;
  • formal methods consider mathematical definition of the behaviors of programs, known as semantics:

Formal methods

Static analysis is a family of formal methods for automatically deriving information about the behavior of computer software (and also hardware). One possible application of static analysis is automated debugging aid, especially the finding of run-time errors -- roughly speaking, events causing program crashes.

Briefly, program analysis — including finding possible run-time errors -- is undecidable: there's is no mechanical method that can always answer truthfully whether programs may or not exhibit runtime errors. This is a mathematical resulting dating from the works of Church, Gödel and Turing in the 1930s (see halting problem and Rice's theorem).

There exist two main families of formal static analysis:

Interest in the development of static analysis tools, especially for use on safety-critical computer systems, was renewed after the high profile disaster of Ariane 5 Flight 501, when a space rocket exploded shortly after launch due to a computer bug, surely one of the most expensive computer bugs in history.

See also

Software metrics

This family of methods aims at identifying possible problems in software using some numerical measurements over the source code. The number of metrics (measures) that can be applied are numerous. One of the crudest measurements, for example, is the size of code, usually expressed in kLOC's (1000 lines of code). Defining a consistent definition of a line of code has long been an item for debate.

One common metric in static analysis is McCabes Cyclomatic Complexity Metric which measures the number of choices a function makes. Functions with a high number of conditional statements (if's, while etc.) will have a high complexity. Such functions therefore can be considered more likely to contain bugs and be harder to maintain.

Tools

  1. AntiC
  2. BLAST
  3. BOON
  4. Broadway
  5. C++Test
  6. Checkstyle
  7. Cqual
  8. CodeWizard
  9. ESC/Java
  10. FindBugs
  11. Flawfinder
  12. GCC Introspector
  13. JDepend
  14. Jlint
  15. Jtest
  16. Kaylo
  17. MOPS
  18. OpenC++
  19. PAG and PAG/WWW
  20. PMD
  21. PREfast, PREfix
  22. SLAM
  23. Smatch
  24. StackAnalyzer
  25. Stacktool
  26. SPlint

See also