== Types of databases ==
* [[Active database]] – database that includes an event driven architecture (often in the form of ECA rules) which can respond to conditions both inside and outside the database.
* [[Animation database]] – database which stores fragments of animations or human movements and which can be accessed, analyzed and queried to develop and assemble new animations.
* [[Back-end database]] – database that is accessed by users indirectly through an external application rather than by application programming stored within the database itself or by low level manipulation of the data (e.
* [[Bibliographic database]] – database of bibliographic records, an organized digital collection of references to published literature, including journal and newspaper articles, conference proceedings, reports, government and legal publications, patents, books, etc.
* [[Cloud database]] – database that typically runs on a cloud computing platform, such as Amazon EC2, GoGrid and Rackspace.
* [[Collective Optimization Database]] – open repository that enables the community to share benchmarks, data sets and optimization cases, and provides web services and [[Plug-in (computing)|plugins]] to analyze optimization data and predict program transformations or better hardware designs for multi-objective optimizations, based on statistical and machine learning techniques, provided enough information has been collected in the repository from multiple users.
* [[Column-oriented DBMS]] – database management system (DBMS) that stores data tables as sections of columns of data rather than as rows of data, as most relational DBMSs do.
* [[Configuration management database]] –
* [[Cooperative database]] – holds information on customers and their transactions.
* [[Correlation database]] – database management system (DBMS) that is data model independent and designed to efficiently handle unplanned, ad hoc queries in an analytical system environment.
* [[Current database]] – conventional database that stores data that is valid now.
* [[Document-oriented database]] – computer program designed for storing, retrieving, and managing document-oriented information, also known as [[Semi-structured model|semi-structured data]].
* [[EDA database]] – database specialized for the purpose of electronic design automation.
* [[Endgame tablebase]] – computerized database that contains precalculated exhaustive analysis of a chess endgame position.
* [[Full text database]] – database that contains the complete text of books, dissertations, journals, magazines, newspapers or other kinds of textual documents. Also called a '''complete text database'''.
* [[Gellish database]] – database that does not need to be extended when its scope increases.
* [[Government database]] –
* [[Graph database]] – uses graph structures with nodes, edges, and properties to represent and store data.
* [[Mobile database]] – database that can be connected to by a mobile computing device over a mobile network.
* [[Navigational database]] – database in which objects (or records) are found primarily by following references from other objects.
* [[Online database]] – database accessible from a network, including from the Internet.
* [[Operational database]] – database accessed by an operational system to carry out the regular operations of an organization.
* [[Pakistan MNP Database (Guarantee) Limited]] –
* [[Parallel database]] –
* [[Probabilistic database]] – an uncertain database in which the possible worlds have associated probabilities.
* [[Project-Level Aid Database]] –
* [[Real-time database]] – processing system designed to handle workloads whose state is constantly changing (Buchmann).
* [[Relational database]] – collection of data items organized as a set of formally described tables from which data can be accessed easily.
* [[Simple Sloppy Semantic Database]] –
* [[Spatial database]] – database that is optimized to store and query data that is related to objects in space, including points, lines and polygons.
* [[Suppliers and Parts database]] –
* [[Temporal database]] – database with built-in time aspects, for example a temporal data model and a temporal version of Structured Query Language (SQL).
* [[Time series database]] –
* [[Triplestore]] – purpose-built database for the storage and retrieval of triples, a triple being a data entity composed of subject-predicate-object, like "Bob is 35" or "Bob knows Fred".
* [[Very large database]] – database that contains an extremely high number of tuples (database rows), or occupies an extremely large physical filesystem storage space. Abbreviated '''VLDB'''.
* [[Virtual private database]] – masks data in a larger database so that security allows only the use of apparently private data. Abbreviated '''VPD'''.
* [[Vulnerability database]] – platform aimed at collecting, maintaining, and disseminating information about discovered vulnerabilities targeting real computer systems.
* [[XLDB]] – '''eXtremely Large Data Bases'''.
* [[XML database]] –
<!-- Following are specific databases that need to be placed somewhere else.
* [[Oxford English Corpus]] – text corpus of English language used by the makers of the Oxford English Dictionary and by Oxford University Press's language research programme.
* [[RecordSetter]] – formerly known as the Universal Record Database (URDB), is a social media website containing a collection of world records created by the public and evaluated by staff and visitors to the site. -->
== History of databases ==
[[History of databases]] –
==Database use==
* [[Database#Major database usage requirements|Database usage requirements]] –
* [[Database theory]] – encapsulates a broad range of topics related to the study and research of the theoretical realm of databases and database management systems.
* [[Database model]] – theoretical foundation of a database and fundamentally determines in which manner data can be stored, organized, and manipulated in a database system.
* [[Database application]] – computer program whose primary purpose is entering and retrieving information from a computer-managed database.
* [[Database connection]] – facility in computer science that allows client software to communicate with database server software, whether on the same machine or not.
** [[datasource]] –
** [[Data Source Name]] (DSN) – data structure used to describe a connection to a data source. Sometimes called a "database source name", though data sources are not limited to databases.
* [[Database administrator]] (DBA) –
* [[Lock (database)|Lock]] –
* [[Comparison of database tools]] –
* [[Database-centric architecture]] – software architectures in which databases play a crucial role. Also called "data-centric architecture".
* [[Intelligent database]] – system, as originally put forward, that manages information (rather than data) in a way that appears natural to users and that goes beyond simple record keeping.
* [[Two-phase locking]] (2PL) –
* [[Locks with ordered sharing]] – comprises several variants of the ''Two phase locking'' (2PL) concurrency control protocol generated by changing the blocking semantics of locks upon conflicts.
* [[Load file]] – in the litigation community, the file used to import data (coded, captured or extracted data from ESI processing) into a database, or the file used to link images.
* [[Database publishing]] – area of automated media production in which specialized techniques are used to generate paginated documents from source data residing in traditional databases.
* [[Halloween Problem]] –
* [[Log shipping]] – process of automating the backup of a database and transaction log files on a primary (production) database server, and then restoring them onto a standby server.
===Database languages ===
[[Database#Database languages|Database languages]] –
* [[Data definition language]] –
* [[Data manipulation language]] –
* [[Query language]] –
** [[Information retrieval query language]] – query language used to make queries into a database, where the semantics of the query are defined not by a precise rendering of a formal syntax, but by an interpretation of the most suitable results of the query.
===Database security===
[[Database security]] –
* [[Database activity monitoring]] (DAM) – database security technology for monitoring and analyzing database activity that operates independently of the database management system (DBMS) and does not rely on any form of native (DBMS-resident) auditing or native logs such as trace or transaction logs.
* [[Database audit]] –
* [[Database forensics]] – branch of digital forensic science relating to the forensic study of databases and their related metadata.
* [[Negative database]] – credit card terminology for a list of cardholders with frequent chargebacks.
===Database design===
[[Database design]] –
* [[Entity-relationship model]] (ER model) – abstract and conceptual representation of data.
* [[Database normalization]] – process of organizing the fields and tables of a relational database to minimize redundancy and dependency.
* [[Database tuning]] – describes a group of activities used to optimize and homogenize the performance of a database.
** [[Database caching]] – effective approach to achieve high scalability and performance.
* [[Data migration#Database migration]] –
* [[Database preservation]] – usually involves converting the information stored in a database, without losing the characteristics (context, content, structure, appearance and behaviour) of the data, to a format that can be used in the long term, even if technology and everyday knowledge change.
* [[Database integrity]] – ensures that data entered into the database is accurate, valid, and consistent.
== Database management systems ==
[[Database management system]] –
*[[Database model]] –
*[[Database normalization]] –
* [[Database storage structures]] –
* [[Distributed database management system]] –
* [[Federated database system]] – type of meta-database management system (DBMS), which transparently maps multiple autonomous database systems into a single federated database.
* [[Referential integrity]] –
* [[Relational model]] – database model based on [[first-order logic|first-order predicate logic]], first formulated and proposed in 1969 by Edgar F.
* [[Object-relational database]] (ORD) – database management system (DBMS) similar to a relational database, but with an object-oriented database model: objects, classes and inheritance are directly supported in database schemas and in the query language. Also called object-relational database management system (ORDBMS).
* [[Transaction processing]] –
=== Concepts ===
* [[ACID]] – set of properties (''atomicity, consistency, isolation, durability'') that guarantee that database transactions are processed reliably.
* [[Create, read, update and delete]] (CRUD) – the four basic functions of persistent storage.
*[[Null (SQL)|Null]] –
* [[Candidate key]] – minimal superkey for a relation.
* [[Foreign key]] – referential constraint between two tables.
*[[Primary key]] –
* [[Superkey]] – set of attributes of a relation variable for which it holds that in all relations assigned to that variable, there are no two distinct tuples (rows) that have the same values for the attributes in this set.
* [[Surrogate key]] – unique identifier in a database for either an ''entity'' in the modeled world or an ''object'' in the database.
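
The superkey and candidate key definitions above can be checked mechanically against a sample relation. The following is a minimal sketch, not taken from any of the linked articles: the `employees` rows and the function names are hypothetical, and the rows are assumed to be distinct. A key is a constraint over every legal state of a relation, so testing a single instance can only show that a set of attributes is consistent with being a key.

```python
from itertools import combinations


def is_superkey(rows, attrs):
    """True if no two rows share the same values for every attribute in attrs."""
    projections = [tuple(row[a] for a in attrs) for row in rows]
    return len(set(projections)) == len(projections)


def candidate_keys(rows, attributes):
    """Minimal superkeys of this relation instance, smallest first."""
    keys = []
    for size in range(1, len(attributes) + 1):
        for attrs in combinations(attributes, size):
            # A superset of a key already found cannot be minimal.
            if any(set(key) <= set(attrs) for key in keys):
                continue
            if is_superkey(rows, attrs):
                keys.append(attrs)
    return keys


# Hypothetical sample relation used only for illustration.
employees = [
    {"emp_id": 1, "email": "ann@example.org", "dept": "R&D"},
    {"emp_id": 2, "email": "bob@example.org", "dept": "R&D"},
    {"emp_id": 3, "email": "cy@example.org", "dept": "Ops"},
]
```

Here `("emp_id", "dept")` is a superkey but not a candidate key, because `("emp_id",)` alone already identifies every row.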
=== Objects ===
*[[Relation (database)|Relation]] –
**[[Table (database)|Table]] –
**[[Column (database)|Column]] –
**[[Row (database)|Row]] –
*[[View (database)|View]] –
* [[Database transaction]] –
* [[Transaction log]] – history of actions executed by a database management system to guarantee ACID properties over crashes or hardware failures. Also called "transaction journal", "database log" or "binary log".
* [[Database trigger]] – procedural code that is automatically executed in response to certain events on a particular table or view in a database.
*[[Index (database)|Index]] –
* [[Stored procedure]] – subroutine available to applications that access a relational database system.
*[[Cursor (databases)|Cursor]] –
*[[Partition (database)|Partition]] –
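
As an illustration of a database transaction, the sketch below uses Python's standard `sqlite3` module to show atomicity: either both updates of a transfer are applied or neither is. The `account` table, its `CHECK` constraint and the helper names are assumptions made up for this example.

```python
import sqlite3


def make_bank():
    """Create an in-memory database with two hypothetical accounts."""
    conn = sqlite3.connect(":memory:")
    with conn:  # commits the setup statements on success
        conn.execute(
            "CREATE TABLE account ("
            "name TEXT PRIMARY KEY, "
            "balance INTEGER NOT NULL CHECK (balance >= 0))"
        )
        conn.executemany(
            "INSERT INTO account VALUES (?, ?)", [("alice", 100), ("bob", 50)]
        )
    return conn


def transfer(conn, src, dst, amount):
    """Move amount from src to dst as one transaction; return True on success."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            # Violates the CHECK constraint if src cannot cover the amount.
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn):
    """Return {name: balance} for all accounts."""
    return dict(conn.execute("SELECT name, balance FROM account ORDER BY name"))
```

A transfer that would overdraw the source account fails on the second statement, and the rollback also undoes the credit that had already been applied to the destination.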
=== Components ===
* [[Data dictionary]] – defined in the ''IBM Dictionary of Computing'' as a "centralized repository of information about data such as meaning, relationships to other data, origin, usage, and format." Also called a "metadata repository".
* [[Java Database Connectivity]] –
*[[Open Database Connectivity]] –
*[[Query language]] –
* [[Query optimizer]] – component of a database management system that attempts to determine the most efficient way to execute a query.
* [[Query plan]] – ordered set of steps used to access or modify information in a SQL relational database management system. Also called a "query execution plan".
=== Functions ===
* [[Database administration and automation]] –
* [[Query optimization]] – function of many relational database management systems in which multiple query plans for satisfying a query are examined and a good query plan is identified.
* [[Database replication]] –
=== Database products ===
* [[List of object-oriented database management systems]] –
** [[Comparison of object-relational database management systems]] –
* [[List of relational database management systems]] –
** [[Comparison of relational database management systems]] –
* [[Document-oriented database]] – computer program designed for storing, retrieving, and managing document-oriented information, also known as [[Semi-structured model|semi-structured data]].
== Database models ==
* [[Database models]] –
=== Models ===
* [[Flat file database]] – various means to encode a database model (most commonly a table) as a single file.
* [[Hierarchical database model]] – data model in which the data is organized into a tree-like structure.
*[[Database model#Dimensional model]] –
* [[Network model]] – database model conceived as a flexible way of representing objects and their relationships.
*[[Relational model]] –
*[[Entity-relationship model]] –
* [[Graph database]] – uses graph structures with nodes, edges, and properties to represent and store data.
* [[Object database]] – database management system in which information is represented in the form of objects as used in object-oriented programming. Also called an "object-oriented database management system".
*[[Entity-attribute-value model]] –
===Other models===
* [[Associative model of data]] – alternative data model for database systems.
* [[Concept-oriented model]] –
*[[Online analytical processing#Multidimensional databases]] –
* [[Semantic data model]] –
* [[Star schema]] – simplest style of data warehouse schema. Also called "star-join schema", "data cube", or "multi-dimensional schema".
* [[XML database]] –
===Implementations===
*[[Flat file database]] –
* [[Deductive database]] – database system that can make deductions.
*[[Document-oriented database]] –
*[[Object-relational database]] –
* [[Temporal database]] – database with built-in time aspects, for example a temporal data model and a temporal version of Structured Query Language (SQL).
*[[XML database]] –
* [[Triplestore]] – purpose-built database for the storage and retrieval of triples, a triple being a data entity composed of subject-predicate-object, like "Bob is 35" or "Bob knows Fred".
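
The subject-predicate-object structure described for a triplestore can be sketched in a few lines. This is an in-memory toy under stated assumptions, not how any particular triplestore is implemented: the `TripleStore` class, its `match` method and the use of `None` as a wildcard are all hypothetical choices for this example.

```python
class TripleStore:
    """Toy in-memory store of (subject, predicate, object) triples."""

    def __init__(self):
        # A dict used as an insertion-ordered set of triples.
        self._triples = {}

    def add(self, subject, predicate, obj):
        self._triples[(subject, predicate, obj)] = None

    def match(self, subject=None, predicate=None, obj=None):
        """Return stored triples matching the pattern; None matches anything."""
        pattern = (subject, predicate, obj)
        return [
            triple
            for triple in self._triples
            if all(p is None or p == v for p, v in zip(pattern, triple))
        ]


store = TripleStore()
store.add("Bob", "age", 35)          # "Bob is 35"
store.add("Bob", "knows", "Fred")    # "Bob knows Fred"
store.add("Fred", "knows", "Alice")  # extra triple added for illustration
```

Calling `store.match(predicate="knows")` returns every "knows" triple, while `store.match(subject="Bob")` returns everything recorded about Bob.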
== Data warehouse ==
[[Data warehouse]] –
=== Creating the data warehouse ===
==== Concepts ====
*[[Dimension (data warehouse)|Dimension]] –
* [[Dimensional modeling]] (DM) – set of techniques and concepts used in data warehouse design.
*[[Fact (data warehouse)|Fact]] –
* [[Online analytical processing]] (OLAP) – approach to swiftly answering multi-dimensional analytical (MDA) queries.
*[[Star schema]] –
*[[Aggregate (Data Warehouse)|Aggregate]] –
==== Variants ====
* [[Anchor Modeling]] –
* [[Column-oriented DBMS]] – database management system (DBMS) that stores data tables as sections of columns of data rather than as rows of data, as most relational DBMSs do.
* [[Data Vault Modeling]] –
* [[HOLAP]] –
* [[MOLAP]] – stands for Multidimensional Online Analytical Processing.
* [[ROLAP]] – stands for Relational Online Analytical Processing.
==== Elements ====
*[[Data dictionary]]/[[Metadata]] –
* [[Data mart]] – access layer of the data warehouse environment that is used to get data out to the users.
* [[Sixth normal form]] (6NF) – term in relational database theory, used in two different ways.
*[[Surrogate key]] –
==== Fact ====
* [[Fact table]] – consists of the measurements, metrics or facts of a business process.
* [[Early-arriving fact]] –
*[[Measure (data warehouse)|Measure]] –
==== Dimension ====
* [[Dimension table]] – one of the set of companion tables to a fact table.
* [[Degenerate dimension]] – dimension key in the fact table that does not have its own dimension table, because all the interesting attributes have been placed in analytic dimensions.
* [[Slowly changing dimension]] –
==== Filling ====
*[[Extract, transform, load|Extract-Transform-Load (ETL)]] –
* [[Data extraction]] – act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage (data migration).
* [[Data transformation]] – converts data from a source data format into destination data.
*[[Data loading]] –
=== Using the data warehouse ===
==== Concepts ====
* [[Business intelligence]] (BI) – ability of an organization to take all its capabilities and convert them into knowledge, ultimately getting the right information to the right people, at the right time, via the right channel.
*[[Dashboard (business)|Dashboard]] –
* [[Data mining]] – process that results in the discovery of new patterns in large data sets. It is the analysis step of the "Knowledge Discovery in Databases" (KDD) process.
*[[Decision support system|Decision support system (DSS)]] –
* [[OLAP cube]] – set of data, organized in a way that facilitates non-predetermined queries for aggregated information, or in other words, online analytical processing.
==== Languages ====
*[[Data Mining Extensions]] (DMX) –
*[[MultiDimensional eXpressions]] (MDX) –
*[[XML for Analysis]] (XMLA) –
==== Tools ====
* [[Business intelligence tools]] –
* [[List of reporting software]] –
*[[Spreadsheets]] –
=== People ===
* [[Bill Inmon]] –
* [[Ralph Kimball]] (born 1944) – author on the subject of data warehousing and business intelligence.
=== Products ===
* [[Comparison of OLAP Servers]] –
== Database-related organizations ==