OpenDocument technical specification: Difference between revisions

Encryption: dash created confusion that rest of text was an explanation of text before it. Semicolon clearer that they are two separate statements.
Rescuing 1 sources and tagging 0 as dead.) #IABot (v2.0.9.5
 
(16 intermediate revisions by 11 users not shown)
Line 8:
* As a '''single XML document''' – also known as ''Flat XML'' or ''Uncompressed XML Files''. Single OpenDocument XML files are not widely used,{{citation needed|date=December 2014}} and they are unsupported by some office software that claims to support ODF (Microsoft Office versions 2007 to 2013 do not recognize them). Filename extensions for single OpenDocument XML documents are not defined in the OpenDocument technical specification, but commonly used ones include <code>.xml</code>, <code>.fodt</code>,<ref>{{cite web |url=http://www.ensode.net:80/roller/dheffelfinger/entry/openoffice_documents_version_control_with |archive-url=https://web.archive.org/web/20171125144435/http://www.ensode.net/roller/dheffelfinger/entry/openoffice_documents_version_control_with |url-status=dead |archive-date=2017-11-25 |title=OpenOffice.org Document Version Control With Mercurial |accessdate=2010-06-07 }}</ref> and <code>.fods</code>.
 
The recommended filename extensions and [[MIME]] types are included in the official standard (OASIS, May 1, 2005, and its later revisions or versions). The MIME types and extensions contained in the ODF specification are applicable only to office documents that are contained in a package. Office documents that conform to the OpenDocument specification but are not contained in a package should use the MIME type text/xml.
 
The MIME type is also used in the <code>office:mimetype</code> attribute. It is very important to use this attribute in flat XML files/single XML documents, where this is the only way the type of the document can be detected (in a package, the MIME type is also present in a separate file ''mimetype''). Its values are the MIME types that are used for the packaged variant of office documents.
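
As a minimal sketch of that detection, the following Python snippet reads the <code>office:mimetype</code> attribute from the root element of a flat XML document using only the standard library. The function name <code>flat_odf_mimetype</code> and the sample document are illustrative assumptions and do not come from the specification.

```python
import xml.etree.ElementTree as ET

# Namespace URI bound to the "office:" prefix in OpenDocument files.
OFFICE_NS = "urn:oasis:names:tc:opendocument:xmlns:office:1.0"


def flat_odf_mimetype(xml_text):
    """Return the office:mimetype of a flat ODF document, or None.

    None is returned when the root element is not <office:document>
    or when the attribute is missing.
    """
    root = ET.fromstring(xml_text)
    if root.tag != f"{{{OFFICE_NS}}}document":
        return None
    return root.get(f"{{{OFFICE_NS}}}mimetype")


# Hypothetical, heavily shortened flat XML text document.
sample = (
    f'<office:document xmlns:office="{OFFICE_NS}" '
    'office:mimetype="application/vnd.oasis.opendocument.text"/>'
)
print(flat_odf_mimetype(sample))
```

A consumer that receives a <code>.xml</code> file could use such a check to decide which application component should open it, since the extension alone does not identify the document type.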
Line 17:
 
{| class="wikitable"
|- bgcolor="#cccccc"
! File type
! Extension
Line 63 ⟶ 62:
|1.0
|-
|rowspan=3|Database
|rowspan=3|.odb
|application/vnd.sun.xml.base<ref>{{citation |url=http://dlc.sun.com/osol/jds/downloads/sources/defaults.list |title=MIME types - OpenSolaris Default Applications |accessdate=2010-06-06 |url-status=dead |archiveurl=https://web.archive.org/web/20110716163416/http://dlc.sun.com/osol/jds/downloads/sources/defaults.list |archivedate=2011-07-16 }}</ref><ref>{{citation |url=http://extension.nirsoft.net/odb |title=.odb Extension - List of programs that can open .odb files |accessdate=2010-06-06}}</ref>
|not defined in ODF 1.0/1.1 specifications;<br>used in OpenOffice.org 2.x
|-
|application/vnd.oasis.opendocument.base
|ODF 1.2;<br>used in OpenOffice.org 3.x
|-
|application/vnd.oasis.opendocument.database
|defined in [https://www.iana.org/assignments/media-types/application/vnd.oasis.opendocument.database IANA registration]
Line 89 ⟶ 84:
 
{| class="wikitable"
|- bgcolor="#cccccc"
! File type
! Extension
Line 173 ⟶ 167:
The OpenDocument format implements spreadsheets as sets of tables. Thus it features extensive capabilities for formatting the display of tables and spreadsheets. OpenDocument also supports database ranges, filters, and "data pilots" (known in Microsoft Excel contexts as "[[pivot table]]s"). Change tracking is available for spreadsheets as well.
 
The graphics format supports a vector graphic representation, in which a set of layers and the contents [https://web.archive.org/web/20051212150956/http://www.auton.nl/software/apps/adms/en/adms.html] of each layer are defined. Available drawing shapes include Rectangle, Line, Polyline, Polygon, Regular Polygon, Path, Circle, Ellipse, and Connector. 3D shapes are also available; the format includes information about the Scene, Light, Cube, Sphere, Extrude, and Rotate (this is intended for office data exchange and is not sufficient to represent videos or other extensive 3D scenes). Custom shapes can also be defined.
 
Presentations are supported. Users can include animations in presentations, with control over the sound, showing a shape or text, hiding a shape or text, or dimming something, and these can be grouped. In OpenDocument, many of the format capabilities are reused from the text format, simplifying implementations. However, tables are not supported within OpenDocument as drawing objects, so they may only be included in presentations as embedded tables.
Line 191 ⟶ 185:
** Drawings
** Presentations
* Objects that do not have an XML representation and only have a binary representation. Examples of these kinds of objects are [[raster images]], audio and video recordings, and [[Object Linking and Embedding|OLE]] objects.<ref>According to the OpenDocument 1.0 specification, OLE is defined in ''Kraig Brockschmidt, Inside OLE, Microsoft Press, 1995, {{ISBN|1-55615-843-2}}''.</ref>
 
Use of Microsoft Object Linking and Embedding (OLE) objects (e.g. to embed other files inside a file, such as tables or charts from a spreadsheet application in a text document or presentation file) limits interoperability, because these objects are not widely supported in programs for viewing or editing files.<ref>{{cite web |url=https://www.linux.com/news/foss-word-processors-compared-ooo-writer-abiword-and-kword/ |title=FOSS word processors compared: OOo Writer, AbiWord, and KWord |author=Bruce Byfield |date=2005-08-23 |accessdate=2010-04-06}} </ref><ref>{{cite web |url=http://www.linux.com/archive/feed/46599 |title=Sharing files between OpenOffice.org and Microsoft Office |date=2005-07-28 |accessdate=2010-04-06 |url-status=dead |archiveurl=https://web.archive.org/web/20100204134005/http://www.linux.com/archive/feed/46599 |archivedate=2010-02-04 }}</ref><ref>{{cite web |url=https://www.linux.com/news/softmaker-office-2008-focuses-compatibility-microsoft-office/ |title=SoftMaker Office 2008 focuses on compatibility with Microsoft Office |date=2008-11-20 |accessdate=2010-04-06}}</ref><ref>{{cite web |url=https://www.linux.com/news/softmaker-office-2006-beta-not-killer-app/ |title=SoftMaker Office 2006 beta: Not a killer app |date=2006-11-21 |accessdate=2010-04-06 }} </ref><ref>{{citation |url=http://pacsec.jp/psj06/psj06lagadec-e.pdf |format=PDF |title=OpenOffice / OpenDocument and Microsoft Office 2007 / Open XML security |author=Philippe Lagadec |date=2006-11-30 |accessdate=2010-04-06 |archive-date=2011-07-22 |archive-url=https://web.archive.org/web/20110722122720/http://pacsec.jp/psj06/psj06lagadec-e.pdf |url-status=dead }}</ref> If software that understands an OLE object is not available, the object is usually replaced by a picture (a bitmap representation of the object) or not displayed at all.<ref>{{cite web |url=http://www.keyongtech.com/2560234-ole-object-bitmap-representation |title=OLE object - bitmap representation? |accessdate=2010-04-06 |url-status=dead |archiveurl=https://web.archive.org/web/20110724111954/http://www.keyongtech.com/2560234-ole-object-bitmap-representation |archivedate=2011-07-24 }}</ref><ref>{{cite web |url=http://www.codeproject.com/KB/edit/COleRichEditCtrl.aspx |title=A Rich Edit Control That Displays Bitmaps and Other OLE Objects |accessdate=2010-04-06}}</ref><ref>{{cite web |url=http://support.microsoft.com/kb/123151 |archive-url=https://web.archive.org/web/20091213042624/http://support.microsoft.com/kb/123151 |url-status=dead |archive-date=2009-12-13 |title=ACC: Why OLE Objects Cause Databases to Grow |date=2007-01-19 |accessdate=2010-04-29}}</ref>
 
===Formatting===
Line 211 ⟶ 205:
 
===Encryption===
When an OpenDocument file is password-protected, the file structure of the bundle remains the same, but the contents of XML files in the package are encrypted using the following algorithm:
 
# The file contents are compressed with the [[DEFLATE]] algorithm.
# A checksum of a portion of the compressed file is computed (SHA-1 of the file contents, or SHA-1 of the first 1024 bytes of the file, or SHA-256 of the first 1024 bytes of the file) and stored, so password correctness can be verified when decrypting.
# A digest (hash) of the user-entered password in UTF-8 encoding is created and passed to the package component. ODF versions 1.0 and 1.1 only mandate support for the SHA-1 digest here, while version 1.2 recommends SHA-256.
# This digest is used to produce a derived key by undergoing [[key stretching]] with [[PBKDF2]] using HMAC-SHA-1, a salt generated by the random number generator (of arbitrary length in ODF 1.2; 16 bytes in ODF 1.1 and below), and an arbitrary iteration count (1024 by default in ODF 1.2).
# The random number generator is used to generate a random initialization vector for each file.
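
The steps listed above can be sketched with the Python standard library. This is an illustrative outline, not a conforming implementation: the function names, the 32-byte key size, the 16-byte initialization vector and the sample password are assumptions, and the cipher step itself is not shown.

```python
import hashlib
import os
import zlib


def deflate(data: bytes) -> bytes:
    """Step 1: compress with raw DEFLATE (no zlib header or trailer)."""
    compressor = zlib.compressobj(9, zlib.DEFLATED, -15)
    return compressor.compress(data) + compressor.flush()


def checksum_first_1k(compressed: bytes, algorithm: str = "sha256") -> bytes:
    """Step 2: checksum over the first 1024 bytes (one of the listed variants)."""
    return hashlib.new(algorithm, compressed[:1024]).digest()


def derive_key(password: str, salt: bytes, iterations: int = 1024,
               key_size: int = 32, start_digest: str = "sha256") -> bytes:
    """Steps 3 and 4: digest the UTF-8 password, then stretch it with
    PBKDF2 using HMAC-SHA-1. key_size=32 is an assumption of this sketch."""
    start_key = hashlib.new(start_digest, password.encode("utf-8")).digest()
    return hashlib.pbkdf2_hmac("sha1", start_key, salt, iterations, dklen=key_size)


# Hypothetical usage with random per-file values.
salt = os.urandom(16)            # salt from the random number generator
iv = os.urandom(16)              # step 5: per-file IV (length assumed)
compressed = deflate(b"<office:document-content/>" * 50)
checksum = checksum_first_1k(compressed)
key = derive_key("correct horse", salt)
```

Passing <code>start_digest="sha1"</code> would correspond to the digest that ODF 1.0 and 1.1 mandate for the password hash.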
Line 221 ⟶ 215:
 
==Format internals==
An OpenDocument file commonly consists of a standard [[ZIP (file format)|ZIP]] archive ([[JAR (file format)|JAR]] archive<ref>{{cite web |last=Sobhi |first=Ali |date=2007-08-14 |orig-date=August 2006 |title=Introduction to OpenDocument Format |publication-place=Armonk, New York, United States |publisher=[[IBM]] |url=http://www-03.ibm.com/able/resources/odfintro.html |archive-url=https://web.archive.org/web/20080602152400/http://www-03.ibm.com/able/resources/odfintro.html |url-status=dead |archive-date=2008-06-02 |title=Web resources & interesting links - easy and simple introduction to OpenDocument Format (ODF) |accessdate=2010-06-07}}</ref>) containing a number of files and directories; an OpenDocument file can also consist of only a single XML document. An OpenDocument file is commonly a collection of several subdocuments within a (ZIP) ''package''. An OpenDocument file as a ''single XML'' document is not widely used.
According to the OpenDocument 1.0 specification, the ZIP file specification is defined in ''Info-ZIP Application Note 970311, 1997''.<ref>{{cite web |url=http://tools.oasis-open.org/issues/browse/OFFICE-2205 |title=NEEDS-DISCUSSION: ZIP reference - N 1309 |access-date=2010-06-07}}</ref><ref>{{cite web |url=http://tools.oasis-open.org/issues/browse/OFFICE-2082 |title= Zip reference is neither public nor authoritative |date=2009-10-11 |access-date=2010-06-07}}</ref>
The simple compression mechanism used for a package normally makes OpenDocument files significantly smaller than equivalent Microsoft "<code>.doc</code>" or "<code>.ppt</code>" files. This smaller size is important for organizations that store a vast number of documents for long periods of time, and for organizations that must exchange documents over low-bandwidth connections. Once uncompressed, most data is contained in simple text-based XML files, so the uncompressed contents have the typical ease of modification and processing of XML files. The standard also allows for the creation of a single XML document, which uses ''<office:document>'' as the root element, for use in document processing.
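
Because a package is an ordinary ZIP archive, its subdocuments can be inspected with generic ZIP tooling. The sketch below builds a tiny stand-in package in memory and lists its entries with Python's <code>zipfile</code> module; the helper names and the placeholder XML payload are assumptions made for illustration, and the result is not a complete, valid OpenDocument file.

```python
import io
import zipfile


def build_demo_package() -> bytes:
    """Create a tiny ZIP that mimics the layout of an ODF package."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as pkg:
        # ZipInfo entries default to ZIP_STORED, i.e. no compression.
        pkg.writestr(zipfile.ZipInfo("mimetype"),
                     "application/vnd.oasis.opendocument.text")
        # Placeholder payload; real content.xml files are far richer.
        body = "<doc>" + "<p>Hello OpenDocument</p>" * 200 + "</doc>"
        pkg.writestr("content.xml", body, compress_type=zipfile.ZIP_DEFLATED)
    return buffer.getvalue()


def list_entries(package: bytes):
    """Return (name, uncompressed size, compressed size) for every entry."""
    with zipfile.ZipFile(io.BytesIO(package)) as pkg:
        return [(info.filename, info.file_size, info.compress_size)
                for info in pkg.infolist()]


for name, size, packed in list_entries(build_demo_package()):
    print(f"{name}: {size} bytes, {packed} bytes in the archive")
```

The repetitive XML in <code>content.xml</code> shrinks considerably under DEFLATE, which illustrates why packaged files tend to be small.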
 
Line 291 ⟶ 285:
 
===mimetype (file)===
'''mimetype''' is a one-line file containing the MIME type of the document. One implication is that the file extension is immaterial to the format; it is only there for the benefit of the user. This special file is always the first entry in the ZIP archive and is stored uncompressed. Because the ZIP header uses fields with fixed lengths, this allows direct identification of the different OpenDocument formats without decompressing the content (e.g. with magic bytes).
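
As a rough illustration of this kind of identification, the following Python sketch parses the 30-byte ZIP local file header at the start of a package and returns the stored MIME type without using a ZIP library for reading. The function name <code>sniff_mimetype</code> is an assumption of this example; the header layout follows the ZIP local file header format.

```python
import io
import struct
import zipfile

# ZIP local file header: signature, version, flags, method, time, date,
# CRC-32, compressed size, uncompressed size, name length, extra length.
LOCAL_HEADER = struct.Struct("<4sHHHHHIIIHH")  # 30 bytes, little-endian


def sniff_mimetype(data: bytes):
    """Return the MIME type stored in the first ZIP entry, or None."""
    if len(data) < LOCAL_HEADER.size:
        return None
    (signature, _version, _flags, method, _time, _date, _crc,
     compressed_size, _size, name_length, extra_length) = LOCAL_HEADER.unpack_from(data)
    if signature != b"PK\x03\x04" or method != 0:  # method 0 means "stored"
        return None
    name_start = LOCAL_HEADER.size
    if data[name_start:name_start + name_length] != b"mimetype":
        return None
    body_start = name_start + name_length + extra_length
    return data[body_start:body_start + compressed_size].decode("ascii")


# Hypothetical package containing only the mimetype entry.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as pkg:
    pkg.writestr(zipfile.ZipInfo("mimetype"),
                 "application/vnd.oasis.opendocument.spreadsheet")
demo = buffer.getvalue()
print(sniff_mimetype(demo))
```

When the first entry has no extra field, the MIME type string begins at byte offset 38 (30 header bytes plus the 8-byte name <code>mimetype</code>), which is what makes simple magic-byte rules possible.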
 
===Thumbnails (directory)===
'''Thumbnails''' is a separate folder for a document thumbnail. The thumbnail must be saved as “thumbnail.png”. A thumbnail representation of a document should be generated by default when the file is saved. It should be a representation of the first page, first sheet, etc. of the document. The required size for the thumbnail is 128×128 pixels. In order to conform to the [https://specifications.freedesktop.org/thumbnail-spec/thumbnail-spec-latest/x142.html#AEN144 Thumbnail Managing Standard] (TMS) at www.freedesktop.org, thumbnails must be saved as 8-bit, non-interlaced PNG images with full alpha transparency.
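
These requirements can be checked from the PNG header alone. The sketch below reads the IHDR chunk of a PNG byte string and tests the properties named above; the helper names are hypothetical, and treating "full alpha transparency" as PNG color type 6 (truecolor with alpha) is an assumption of this example.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_ihdr(png: bytes):
    """Return the IHDR fields of a PNG as a dict, or None if not a PNG."""
    # 8-byte signature, 4-byte chunk length, 4-byte chunk type, 13 data bytes.
    if len(png) < 29 or png[:8] != PNG_SIGNATURE or png[12:16] != b"IHDR":
        return None
    width, height, bit_depth, color_type, _compression, _filter, interlace = \
        struct.unpack(">IIBBBBB", png[16:29])
    return {"width": width, "height": height, "bit_depth": bit_depth,
            "color_type": color_type, "interlace": interlace}


def looks_like_odf_thumbnail(png: bytes) -> bool:
    """Check size, bit depth, alpha channel and interlacing."""
    header = read_ihdr(png)
    return (header is not None
            and header["width"] == 128 and header["height"] == 128
            and header["bit_depth"] == 8
            and header["color_type"] == 6   # assumed: truecolor with alpha
            and header["interlace"] == 0)   # 0 means non-interlaced


def fake_png_header(width: int, height: int) -> bytes:
    """Build only the signature and IHDR data, enough for the check above."""
    ihdr = struct.pack(">IIBBBBB", width, height, 8, 6, 0, 0, 0)
    return PNG_SIGNATURE + struct.pack(">I", len(ihdr)) + b"IHDR" + ihdr


print(looks_like_odf_thumbnail(fake_png_header(128, 128)))
```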
 
===META-INF (directory)===
Line 312 ⟶ 306:
 
==History==
* Version 1.0 became an OASIS Standard on 2005-05-01;<ref>
{{cite web
| url = http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=office
| title = OASIS Open Document Format for Office Applications (OpenDocument) TC
| work = OASIS website
| publisher = OASIS
| accessdate = 2024-11-26
| quote = Open Document = The OpenDocumentFormat v1.0 specification was approved as an OASIS Standard on 1 May 2005.
}}
</ref>
* Version 1.1 became an OASIS Standard on 2007-02-07;<ref>
{{cite web
| url = http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=office#odf11
| title = OASIS Open Document Format for Office Applications (OpenDocument) TC
| work = OASIS website
| publisher = OASIS
| accessdate = 2024-11-26
| quote = Open Document = The OpenDocumentFormat v1.1 specification was approved as an OASIS Standard on 2 February 2007.
}}
</ref>
* Version 1.2 became an OASIS Standard on 2011-09-29;<ref name="odf12">
{{cite web
| url = http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=office
| title = OASIS Open Document Format for Office Applications (OpenDocument) TC
| work = OASIS website
| publisher = OASIS
| accessdate = 2024-11-26
| quote = Open Document Format v1.2 was approved as a OASIS Standard on 29 September 2011.
}}
</ref>
* Version 1.3 became an OASIS Standard on 2021-04-27.<ref>
{{cite web
| url = http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=office
| title = OASIS Open Document Format for Office Applications (OpenDocument) TC
| work = OASIS website
| publisher = OASIS
| accessdate = 2024-11-26
| quote = Open Document Format for Office Applications (OpenDocument) Version 1.3 OASIS Standard was approved by the members of OASIS on 27 April 2021.
}}
</ref>
 
==Versions detection==
To indicate which version of the OpenDocument specification a file complies with, all root elements take an <code>office:version</code> attribute (in the format ''revision.version'', such as <code>office:version="1.1"</code>), which identifies the version of the ODF specification that defined the associated element, its schema, its complete content, and its interpretation.
 
===ODF 1.0/1.1===
It is not mandatory to use the <code>office:version</code> attribute in ODF 1.0 and ODF 1.1 files, so when an element has <code>office:version</code> omitted, the element is based on ODF 1.0 or 1.1. If the file has a version known to an XML processor, the processor may validate the document. Otherwise, validation is optional, but the document must be well formed.
 
===ODF 1.2 and newer===
When an element has office:version="1.1" the element and its content are based on the OpenDocument v1.1 specification. For office:version="1.0" the element and its content are based on the OpenDocument v1.0 specification. When an element has office:version omitted, the element is based on a version of the OpenDocument specification earlier than v1.2. In these cases and in the case of values other than "1.2", the elements do not comprise an OpenDocument 1.2 document.<ref>{{cite web |url=http://docs.oasis-open.org/office/v1.2/os/OpenDocument-v1.2-os-part1.html#__RefHeading__1417682_253892949 |title=office:version attribute - OpenDocument Version 1.2, Part 1, 29 September 2011 |accessdate=2012-12-05}}</ref>
The <code>office:version</code> attribute shall be present in each and every <office:document>, <office:document-content>, <office:document-styles>, <office:document-meta>, and <office:document-settings> element in the XML documents that comprise an OpenDocument 1.2 or newer document. The value of the office:version attribute shall reflect the OpenDocument version.<ref>{{cite web |url=https://docs.oasis-open.org/office/v1.2/os/OpenDocument-v1.2-os-part1.html#__RefHeading__1417682_253892949 |title=office:version attribute - OpenDocument Version 1.2, Part 1, 29 September 2011 |accessdate=2024-11-26}}</ref>
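
The rules in this section can be sketched as a small Python function that reads <code>office:version</code> from a root element. The function name <code>odf_version</code> and the sample documents are assumptions for illustration; a return value of <code>None</code> stands for an omitted attribute, which per the text above means the element is based on a version earlier than 1.2.

```python
import xml.etree.ElementTree as ET

# Namespace URI bound to the "office:" prefix in OpenDocument files.
OFFICE_NS = "urn:oasis:names:tc:opendocument:xmlns:office:1.0"
ROOT_NAMES = {"document", "document-content", "document-styles",
              "document-meta", "document-settings"}


def odf_version(xml_text):
    """Return the office:version of an ODF root element, or None if omitted.

    Raises ValueError when the root is not one of the ODF root elements.
    """
    root = ET.fromstring(xml_text)
    prefix = f"{{{OFFICE_NS}}}"
    if not root.tag.startswith(prefix) or root.tag[len(prefix):] not in ROOT_NAMES:
        raise ValueError("not an OpenDocument root element")
    return root.get(prefix + "version")


# Hypothetical, heavily shortened root elements.
with_version = (f'<office:document-content xmlns:office="{OFFICE_NS}" '
                'office:version="1.2"/>')
without_version = f'<office:document-content xmlns:office="{OFFICE_NS}"/>'

print(odf_version(with_version))
print(odf_version(without_version))
```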
 
==Conformance==
Line 367 ⟶ 378:
==References==
{{Refbegin}}
* [https://docs.oasis-open.org/office/OpenDocument/v1.4/ OpenDocument 1.4 specification] in ODT, HTML and PDF formats
* [https://docs.oasis-open.org/office/OpenDocument/v1.3/ OpenDocument 1.3 specification] in ODT, HTML and PDF formats
* [https://docs.oasis-open.org/office/v1.2/ OpenDocument 1.2 specification] in ODT, HTML and PDF formats