Content deleted Content added
→Other models: removed reference to deleted article Concept-oriented model |
m link mass surveillance |
||
(74 intermediate revisions by 45 users not shown) | |||
Line 1:
{{Short description|1=Overview of and topical guide to databases}}
part of the set of 830+ outlines listed at
▲<!-- Attention: NEW OUTLINE
a special type of list article.
They make up one of Wikipedia's
content navigation systems
[[Wikipedia:WikiProject Outlines]] for more details.
Further improvements▼
...-->▼
'''[[Database]]''' – organized collection of
▲ [[Portal:Contents/Outlines]].
▲ Further improvements
▲ are on the way.
▲-->
▲The following outline is provided as an overview of and topical guide to databases:
▲'''[[Database]]''' – organized collection of [[data]], today typically in digital form. The data are typically organized to model relevant aspects of reality (for example, the availability of rooms in hotels), in a way that supports processes requiring this information (for example, finding a hotel with vacancies).
== What type of things are databases? ==
Databases can be described as all of the following:
* [[Information]] – sequence of symbols that can be interpreted as a message. Information can be recorded as signs, or transmitted as signals.
== Types of databases ==
* [[Active database]] – includes an event driven architecture (often in the form of ECA rules) which can respond to conditions both inside and outside the database.
* [[Animation database]] – stores fragments of animations or human movements and which can be accessed, analyzed and queried to develop and assemble new animations.
* [[Back-end database]] – accessed by users indirectly through an external application rather than by application programming stored within the database itself or by low level manipulation of the data (e.g. through [[SQL]] commands).
* [[Bibliographic database]] – database of bibliographic records, an organized digital collection of references to published literature, including journal and newspaper articles, conference proceedings, reports, government and legal publications, patents, books, etc.
* [[Centralized database]] – database located and maintained in one ___location, unlike a distributed database.
Line 47 ⟶ 45:
* [[Endgame tablebase]] – computerized database that contains precalculated exhaustive analysis of a chess endgame position.
* [[Food composition database]] (FCDB) – provides detailed information on the nutritional composition of foods.
* [[Full
* [[
* [[Graph database]] – uses graph structures with nodes, edges, and properties to represent and store data.
* [[Knowledge base]] – special kind of database for knowledge management. A knowledge base provides a means for information to be collected, organised, shared, searched and utilised.
Line 60 ⟶ 57:
* [[Probabilistic database]] – uncertain database in which the possible worlds have associated probabilities.
* [[Real-time database]] – processing system designed to handle workloads whose state is constantly changing (Buchmann).
* [[Relational database]] – collection of data items organized as a set of formally
* [[Spatial database]] – database that is optimized to store and query data that is related to objects in space, including points, lines and polygons.
* [[Temporal database]] – database with built-in time aspects, for example a temporal data model and a temporal version of Structured Query Language (SQL).
Line 83 ⟶ 80:
== History of databases ==
* [[Database#History|History of databases]] –
* [[Database management system#History|History of database management systems]] –:
==Database use==
Line 109 ⟶ 105:
* [[Log shipping]] – process of automating the backup of a database and transaction log files on a primary (production) database server, and then restoring them onto a standby server.
=== Database languages ===
[[Database#Database languages|Database languages]] –
* [[Data definition language]] –
Line 115 ⟶ 111:
* [[Query language]] –
** [[Information retrieval query language]] – query language used to make queries into database, where the semantics of the query are defined not by a precise rendering of a formal syntax, but by an interpretation of the most suitable results of the query.
** [[SQL]] (Structured Query Language) – special-purpose programming language designed for managing data held in a relational database management system (RDBMS), or for stream processing in a relational data stream management system (RDSMS).
** [[XQuery]] – a query and functional programming language that queries and transforms collections of structured and unstructured data.
=== Database security ===
[[Database security]] –
* [[Database activity monitoring]] (DAM) – database security technology for monitoring and analyzing database activity that operates independently of the database management system (DBMS) and does not rely on any form of native (DBMS-resident) auditing or native logs such as trace or transaction logs.
Line 123 ⟶ 121:
* [[Negative database]] – credit card terminology for a list of credit card owners who chargeback a lot.
=== Database design ===
[[Database design]] –
* [[
* [[Database normalization]] – process of organizing the fields and tables of a relational database to minimize redundancy and dependency.
* [[Database refactoring]] – simple change to a database schema that improves its design while retaining both its behavioral and informational semantics.
=== Database programming ===
* [[Database abstraction layer]] – application programming interface which unifies the communication between a computer application and databases such as SQL Server, DB2, MySQL, PostgreSQL, Oracle or SQLite.
* [[
=== Database management ===
* [[Database virtualization]] – it is the decoupling of the database layer, which lies between the storage and application layers within the application stack.
* [[Database tuning]] – describes a group of activities used to optimize and homogenize the performance of a database.
** [[Database caching]] – effective approach to achieve high scalability and performance.
*
* [[Database preservation]] – usually involves converting the information stored in a database, without losing the characteristics (Context, Content, Structure, Appearance and Behaviour) of the data, to a format which can be used in the long term, even if the technology and daily life knowledge changes.
* [[Database integrity]] – ensures that data entered into the database is accurate, valid, and consistent.
Line 143 ⟶ 141:
== Database management systems ==
[[Database management system]] –
* [[Database model]] –
* [[Database normalization]] – organizing tables based on their attributes so that the data presented can avoid having redundancy and dependency.
*
* [[Distributed database management system]] –
* [[Federated database system]] – type of meta-database management system (DBMS), which transparently maps multiple autonomous database systems into a single federated database.
Line 151 ⟶ 149:
* [[Relational algebra]] – offshoot of first-order logic (and of algebra of sets), deals with a set of finitary relations (see also relation (database)) that is closed under certain operators.
* [[Relational calculus]] – consists of two calculi, the tuple relational calculus and the ___domain relational calculus, that are part of the relational model for databases and provide a declarative way to specify database queries.
* [[Relational database]] – collection of data items organized as a set of formally
* [[Relational database management system]] (RDBMS) – database management system (DBMS) that is based on the relational model as introduced by E.
* [[Relational model]] – for database management is a database model based on first-order predicate logic, first formulated and proposed in 1969 by Edgar F.
* [[
* [[Transaction processing]] –
Line 161 ⟶ 159:
* [[ACID]] – (''atomicity, consistency, isolation, durability'') is a set of properties that guarantee that database transactions are processed reliably.
* [[Create, read, update and delete]] (CRUD) – are the four basic functions of persistent storage.
* [[Null (SQL)|Null]] –
* [[Candidate key]] – minimal superkey for a relation.
* [[Foreign key]] – referential constraint between two tables.
* [[Primary key]] –
* [[Superkey]] – set of attributes of a relation variable for which it holds that in all relations assigned to that variable, there are no two distinct tuples (rows) that have the same values for the attributes in this set.
* [[Surrogate key]] – unique identifier in a
* [[Armstrong's axioms]] – set of axioms (or, more precisely, inference rules) used to infer all the functional dependencies on a relational database.
* [[NoSQL]] – class of database management system identified by its non-adherence to the widely
=== Objects ===
* [[Relation (database)|Relation]] –
** [[Table (database)|Table]] –
** [[Column (database)|Column]] –
** [[Row (database)|Row]] –
* [[View (database)|View]] –
* [[Database transaction]] –
* [[Transaction log]] – history of actions executed by a database management system to guarantee ACID properties over crashes or hardware failures. Also called "transaction journal", "database log" or "binary log".
* [[Database trigger]] – procedural code that is automatically executed in response to certain events on a particular table or view in a database.
* [[Index (database)|Index]] –
* [[Stored procedure]] – subroutine available to applications that access a relational database system.
* [[Cursor (databases)|Cursor]] –
* [[Partition (database)|Partition]] –
=== Components ===
Line 188 ⟶ 186:
* [[Data dictionary]] – as defined in the ''IBM Dictionary of Computing'', is a "centralized repository of information about data such as meaning, relationships to other data, origin, usage, and format." Also called a "metadata repository".
* [[Java Database Connectivity]] – .
* [[Open Database Connectivity]] –
* [[Query language]] –
* [[Query optimizer]] – component of a database management system that attempts to determine the most efficient way to execute a query.
* [[Query plan]] – ordered set of steps used to access or modify information in a SQL relational database management system. Also called a "query execution plan".
=== Functions ===
* [[Database administration
* [[Query optimization]] – function of many relational database management systems in which multiple query plans for satisfying a query are examined and a good query plan is identified.
* [[Database replication]] –
Line 200 ⟶ 198:
=== Database products ===
* [[List of object-oriented database management systems]] –
** [[Comparison of
* [[List of relational database management systems]] –
** [[Comparison of relational database management systems]] –
** [[MySQL]] ([[Outline of MySQL|outline]]) –
* [[Document-oriented database]] – computer program designed for storing, retrieving, and managing document-oriented, or Semi-structured model|semi structured data, information.
Line 211 ⟶ 210:
* [[Flat file database]] – various means to encode a database model (most commonly a table) as a single file.
* [[Hierarchical database model]] – data model in which the data is organized into a tree-like structure.
*
* [[Network model]] – database model conceived as a flexible way of representing objects and their relationships.
* [[Relational model]] –
* [[
* [[Graph database]] – uses graph structures with nodes, edges, and properties to represent and store data.
* [[Object database]] – database management system in which information is represented in the form of objects as used in object-oriented programming. Also called an "object-oriented database management system".
* [[
=== Other models ===
▲*[[Online analytical processing#Multidimensional databases]] –
* [[Semantic data model]] –
* [[Star schema]] – is the simplest style of data warehouse schema. Also called "star-join schema", "[[data cube]]", or "multi-dimensional schema".
* [[XML database]] –
=== Implementations ===
* [[Flat file database]] –
* [[Deductive database]] – database system that can make deductions.
* [[Document-oriented database]] –
* [[
* [[Temporal database]] – database with built-in time aspects, for example a temporal data model and a temporal version of Structured Query Language (SQL).
* [[XML database]] –
* [[Triplestore]] – purpose-built database for the storage and retrieval of triples, a triple being a data entity composed of subject-predicate-object, like "Bob is 35" or "Bob knows Fred".
Line 239 ⟶ 237:
=== Creating the data warehouse ===
==== Concepts ====
* [[Dimension (data warehouse)|Dimension]] –
* [[Dimensional modeling]] (DM) – is the name of a set of techniques and concepts used in data warehouse design.
* [[Fact (data warehouse)|Fact]] –
* [[Online analytical processing]] (OLAP) – or is an approach to swiftly answer multi-dimensional analytical (multi-dimensional analytical|MDA) queries.
* [[Star schema]] –
* [[Aggregate (Data Warehouse)|Aggregate]] –
==== Variants ====
Line 257 ⟶ 256:
==== Elements ====
* [[Data dictionary]] – /[[Metadata]] –
* [[Data mart]] – access layer of the data warehouse environment that is used to get data out to the users.
* [[Sixth normal form]] (6NF) – term in relational database theory, used in two different ways.
* [[Surrogate key]] –
==== Fact ====
* [[Fact table]] – consists of the measurements, metrics or facts of a business process.
* [[Early-arriving fact]] –
* [[Measure (data warehouse)|Measure]] –
==== Dimension ====
Line 273 ⟶ 272:
==== Filling ====
* [[Extract, transform, load|Extract-Transform-Load (ETL)]] –
* [[Data extraction]] – act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage (data migration).
* [[Data transformation]] – converts data from a source data format into destination data.
* [[Data loading]] –
=== Using the data warehouse ===
==== Concepts ====
* [[Business intelligence]] (BI) – is defined as the ability for an organization to take all its capabilities and convert them into knowledge, ultimately, getting the right information to the right people, at the right time, via the right channel.
* [[Dashboard (business)|Dashboard]] –
* [[Data mining]] – is the process that results in the discovery of new patterns in large data sets. It is the analysis step of the "Knowledge Discovery in Databases" process, or KDD.
* [[Decision support system|Decision support system (DSS)]] –
* [[OLAP cube]] – set of data, organized in a way that facilitates non-predetermined queries for aggregated information, or in other words, online analytical processing.
==== Languages ====
* [[Data Mining Extensions]] (DMX) –
* [[MultiDimensional eXpressions]] (MDX) –
* [[XML for Analysis]] (XMLA) –
==== Tools ====
* [[Business intelligence tools]] –
* [[List of reporting software]] –
* [[Spreadsheets]] –
=== People ===
* [[Edgar F. Codd]] – English Computer scientist who introduced the relational database model
* [[Bill Inmon]] –
* [[Ralph Kimball]] (Born 1944) – author on the subject of data warehousing and business intelligence.
=== Products ===
* [[Comparison of OLAP
== Database-related organizations ==
* ''[[Database (journal)|Database]]''
* ''[[Journal of Database Management]]''
== Database-related publications ==
* [[Ling Liu (computer scientist)|Ling Liu]] and Tamer M. Özsu (Eds.) (2009). "[
* Beynon-Davies, P. (2004). Database Systems. 3rd Edition. Palgrave, Houndmills, Basingstoke.
* Connolly, Thomas and Carolyn Begg. ''Database Systems.'' New York: Harlow, 2002.
* {{cite book|last=Date |first=C. J. |authorlink=Christopher J. Date |title=An Introduction to Database Systems, Fifth Edition |publisher=Addison Wesley |year= 2003 |isbn=0-201-51381-1
* Gray, J. and Reuter, A. ''Transaction Processing: Concepts and Techniques'', 1st edition, Morgan Kaufmann Publishers, 1992.
* Kroenke, David M. and David J. Auer. ''Database Concepts.'' 3rd ed. New York: Prentice, 2007.
* {{cite book|last1=Lightstone |first1=S. |first2=T. |last2=Teorey |first3=T. |last3=Nadeau |title=Physical Database Design: the database professional's guide to exploiting indexes, views, storage, and more |publisher=Morgan Kaufmann Press |year=2007 |isbn=978-0-12-369389-
* Teorey, T.; Lightstone, S. and Nadeau, T. ''Database Modeling & Design: Logical Design'', 4th edition, Morgan Kaufmann Press, 2005. {{ISBN
== Database scholars ==
* [[Serge Abiteboul]]
* [[David Maier]]
* [[Ralph Kimball]]
* [[Kamran Parsaye]]
* [[C. J. Date]]
* [[Peter Chen]]
== See also ==
* [[Outline of computing]]
** [[Outline of software]]
* [[Databank]]
* [[Database model]]
* [[Database schema]]
* [[Record (database)]]
* [[Key (database)]]
* [[Internet research]]
* [[List of academic databases and search engines]]
* [[List of biodiversity databases]]
* [[List of Internet forums]]
* [[List of online databases]]
{{Clear}}
Line 328 ⟶ 346:
== External links ==
{{
* [http://www.fileextension.org/DB DB File extension] – information about files with the DB extension
{{Outline footer}}
[[Category:Databases|
[[Category:Outlines of computing and engineering|Databases]]
[[Category:Outlines|Databases]]
|