Alvestrand (talk | contribs) →Foreign resources: Moving 3 more sections over from main OOXML article |
m →Office MathML (OMML): HTTP to HTTPS for Blogspot |
(82 intermediate revisions by 66 users not shown) | |||
Line 1:
{{Short description|Family of file formats}}
{{Main|Office Open XML}}
Line 4 ⟶ 5:
{{Infobox file format
| name = Office Open XML Document
| icon =
| logo =
| screenshot =
| caption =
| extension =
| mime = application/vnd.<br />openxmlformats-officedocument.<br />wordprocessingml.<br />document<ref name="mimetype">{{ cite web | url =
| type code =
| uniform type =
| magic =
| owner = [[Microsoft]], [[Ecma International|Ecma]], [[International Organization for Standardization|ISO]]/[[International Electrotechnical Commission|IEC]]
| released = {{Start date and age|2006}}
| latest release version =
| latest release date =
Line 23 ⟶ 24:
| extended to =
| standard = ECMA-376, ISO/IEC 29500
| url = [http://www.ecma-international.org/publications-and-standards/standards/
{{Infobox file format
| name = Office Open XML Presentation
| icon =
| logo =
| screenshot =
| caption =
| extension =
| mime = application/vnd.<br />openxmlformats-officedocument.<br />presentationml.<br />presentation<ref name="mimetype">{{ cite web | url = https://technet.microsoft.com/en-us/library/cc179224.aspx | title = Register file extensions on third party servers | author = Microsoft | date = 26 February 2008 | access-date = 2009-09-04 | publisher = microsoft.com }}</ref>
| type code =
| uniform type =
Line 45 ⟶ 46:
| extended to =
| standard = ECMA-376, ISO/IEC 29500
| url = [http://www.ecma-international.org/publications-and-standards/standards/
{{Infobox file format
| name = Office Open XML Workbook
| icon =
| logo =
| screenshot =
| caption =
| extension =
| mime = application/vnd.<br />openxmlformats-officedocument.<br />spreadsheetml.<br />sheet<ref name="mimetype">{{ cite web | url = https://technet.microsoft.com/en-us/library/cc179224.aspx | title = Register file extensions on third party servers | author = Microsoft | date = 26 February 2008 | access-date = 2009-09-04 | publisher = microsoft.com }}</ref>
| type code =
| uniform type =
Line 67 ⟶ 68:
| extended to =
| standard = ECMA-376, ISO/IEC 29500
| url = [http://www.ecma-international.org/publications-and-standards/standards/
}}
The '''Office Open XML file formats''' are a set of [[file format]]s that can be used to represent electronic [[Office suite|office]] documents. There are formats for [[word processing]] documents, [[spreadsheets]] and [[presentations]] as well as specific formats for material such as mathematical
The formats were developed by [[Microsoft]] and first appeared in
== Container ==
Line 78 ⟶ 79:
[[Image:Open Packaging Convention.png|220px|thumb|right|Container structure of Part 2 of the Ecma Office Open XML standard, ECMA-376]]
Office Open XML documents are stored in [[Open Packaging
A basic package contains an XML file called ''[Content_Types].xml'' at the root, along with three directories: ''_rels'', ''docProps'', and a directory specific to the document type (for example, a .docx word processing package has a ''word'' directory). The ''word'' directory contains the ''document.xml'' file, which holds the core content of the document.
; [Content_Types].xml: This file provides MIME type information for parts of the package, using defaults for certain file extensions and overrides for parts
; _rels: This directory contains relationships for the files within the package. To find the relationships for a specific file, look for the ''_rels'' directory that is a sibling of the file, and then for a file that has the original file name with a ''.rels'' appended to it. For example, if the content types file had any relationships, there would be a file called ''[Content_Types].xml.rels'' inside the ''_rels'' directory.
; _rels/.
; docProps/core.xml: This file contains the core properties for any Office Open XML document.
; word/document.xml: This file is the main part for any Word document.
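
The lookup rules described above can be sketched in Python. This is a minimal illustration, not a full Open Packaging Conventions implementation: the function names and the sample <code>[Content_Types].xml</code> content are hypothetical, and the sketch assumes that an override for a part name takes precedence over a default for its file extension.

```python
import posixpath
import xml.etree.ElementTree as ET

# Namespace of the content types part in an OPC package.
CT_NS = "http://schemas.openxmlformats.org/package/2006/content-types"

# Hypothetical, minimal [Content_Types].xml used only for illustration.
SAMPLE_CONTENT_TYPES = """<Types xmlns="http://schemas.openxmlformats.org/package/2006/content-types">
  <Default Extension="rels" ContentType="application/vnd.openxmlformats-package.relationships+xml"/>
  <Default Extension="xml" ContentType="application/xml"/>
  <Override PartName="/word/document.xml"
            ContentType="application/vnd.openxmlformats-officedocument.wordprocessingml.document.main+xml"/>
</Types>"""


def content_type_for(part_name, types_xml):
    """Return the content type of a part: an Override wins, else the extension Default."""
    root = ET.fromstring(types_xml)
    for override in root.findall(f"{{{CT_NS}}}Override"):
        if override.get("PartName") == part_name:
            return override.get("ContentType")
    extension = posixpath.splitext(part_name)[1].lstrip(".").lower()
    for default in root.findall(f"{{{CT_NS}}}Default"):
        if default.get("Extension", "").lower() == extension:
            return default.get("ContentType")
    return None


def rels_path_for(part_name):
    """Return the relationships file for a part: sibling _rels directory, name + '.rels'."""
    directory, filename = posixpath.split(part_name)
    return posixpath.join(directory, "_rels", filename + ".rels")


print(content_type_for("/docProps/core.xml", SAMPLE_CONTENT_TYPES))
print(rels_path_for("word/document.xml"))
```

Using <code>posixpath</code> rather than <code>os.path</code> keeps the forward-slash separators used inside the ZIP package regardless of the host operating system.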
Line 92 ⟶ 93:
An example relationship file (''word/_rels/document.xml.rels'') is:
<syntaxhighlight lang="xml">
<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<Relationships
Line 105 ⟶ 106:
TargetMode="External" />
</Relationships>
</syntaxhighlight>
As such, images referenced in the document can be found in the relationship file by looking for all relationships of type <code><nowiki>http://schemas.microsoft.com/office/2006/relationships/image</nowiki></code>. To change the image used, edit the relationship.
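
Filtering a relationship file by type can be sketched as follows. The sample relationship data, the target names and the function name are hypothetical; the image type URI is the one quoted in the text above, and a real package may use a different URI, in which case the value found in the relationship file should be passed instead.

```python
import xml.etree.ElementTree as ET

# Namespace of relationship parts in an OPC package.
RELS_NS = "http://schemas.openxmlformats.org/package/2006/relationships"

# Type URI as quoted in the text above (an assumption for real-world files).
IMAGE_TYPE = "http://schemas.microsoft.com/office/2006/relationships/image"

# Hypothetical relationship file used only for illustration.
SAMPLE_RELS = """<Relationships xmlns="http://schemas.openxmlformats.org/package/2006/relationships">
  <Relationship Id="rId1"
                Type="http://schemas.microsoft.com/office/2006/relationships/image"
                Target="media/image1.png"/>
  <Relationship Id="rId2"
                Type="http://schemas.openxmlformats.org/officeDocument/2006/relationships/hyperlink"
                Target="https://example.org/" TargetMode="External"/>
</Relationships>"""


def relationships_of_type(rels_xml, rel_type):
    """Return (Id, Target) pairs for every relationship whose Type equals rel_type."""
    root = ET.fromstring(rels_xml)
    return [
        (rel.get("Id"), rel.get("Target"))
        for rel in root.findall(f"{{{RELS_NS}}}Relationship")
        if rel.get("Type") == rel_type
    ]


for rel_id, target in relationships_of_type(SAMPLE_RELS, IMAGE_TYPE):
    print(rel_id, target)
```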
Line 111 ⟶ 112:
The following code shows an example of inline markup for a [[hyperlink]]:
<syntaxhighlight lang="xml">
<w:hyperlink r:id="rId2" w:history="1"
xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships"
xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">
</syntaxhighlight>
In this example, the [[Uniform Resource Locator]] (URL) is
Pictures can be embedded or linked using a tag:
<syntaxhighlight lang="xml">
<v:imagedata w:rel="rId1" o:title="example" />
</syntaxhighlight>
This is the reference to the image file. All references are managed via relationships. For example, document.xml has a relationship to the image. There is a _rels directory in the same directory as document.xml; inside _rels is a file called document.xml.rels. This file contains a relationship definition with a type, an ID and a location. The ID is the identifier referenced in the XML document. The type is a reference to a schema definition for the media type, and the location is either an internal location within the ZIP package or an external location defined by a URL.
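
Resolving a relationship ID to its location can be sketched as below. The sample data and the function name are hypothetical, and the sketch assumes that a relative internal target is interpreted relative to the directory of the source part (here ''word/document.xml''), while a target marked <code>TargetMode="External"</code> is returned unchanged.

```python
import posixpath
import xml.etree.ElementTree as ET

RELS_NS = "http://schemas.openxmlformats.org/package/2006/relationships"

# Hypothetical contents of word/_rels/document.xml.rels, for illustration only.
SAMPLE_DOCUMENT_RELS = """<Relationships xmlns="http://schemas.openxmlformats.org/package/2006/relationships">
  <Relationship Id="rId1" Type="urn:example:image" Target="media/image1.png"/>
  <Relationship Id="rId2" Type="urn:example:hyperlink"
                Target="https://example.org/" TargetMode="External"/>
</Relationships>"""


def resolve_relationship(rels_xml, rel_id, source_part="word/document.xml"):
    """Map a relationship ID to ("internal", package path) or ("external", URL)."""
    root = ET.fromstring(rels_xml)
    for rel in root.findall(f"{{{RELS_NS}}}Relationship"):
        if rel.get("Id") != rel_id:
            continue
        target = rel.get("Target")
        if rel.get("TargetMode") == "External":
            return ("external", target)
        # Internal targets are resolved against the directory of the source part.
        base_dir = posixpath.dirname(source_part)
        return ("internal", posixpath.normpath(posixpath.join(base_dir, target)))
    return None  # No relationship with this ID.


print(resolve_relationship(SAMPLE_DOCUMENT_RELS, "rId1"))
print(resolve_relationship(SAMPLE_DOCUMENT_RELS, "rId2"))
```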
Line 133 ⟶ 134:
An example document properties file (''docProps/core.xml'') that uses Dublin Core metadata is:
<syntaxhighlight lang="xml">
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<cp:coreProperties xmlns:cp="http://schemas.openxmlformats.org/package/2006/metadata/core-properties"
Line 150 ⟶ 151:
<cp:contentStatus>Final</cp:contentStatus>
</cp:coreProperties>
</syntaxhighlight>
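
A core properties part of this kind can be read with a general-purpose XML parser. The following Python sketch is illustrative only: the sample document, its values and the function name are hypothetical, and it simply maps each child element's local name to its text.

```python
import xml.etree.ElementTree as ET

# Hypothetical docProps/core.xml content, reduced to three properties.
SAMPLE_CORE = """<cp:coreProperties
    xmlns:cp="http://schemas.openxmlformats.org/package/2006/metadata/core-properties"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Example title</dc:title>
  <dc:creator>Example author</dc:creator>
  <cp:contentStatus>Final</cp:contentStatus>
</cp:coreProperties>"""


def read_core_properties(core_xml):
    """Return a dict of local element name -> text for each core property."""
    root = ET.fromstring(core_xml)
    properties = {}
    for child in root:
        # ElementTree tags look like "{namespace}local"; keep only the local name.
        local_name = child.tag.rsplit("}", 1)[-1]
        properties[local_name] = (child.text or "").strip()
    return properties


print(read_core_properties(SAMPLE_CORE))
```

Dropping the namespace is a simplification; two properties with the same local name in different namespaces would collide in this dictionary.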
== Document markup languages ==
Line 163 ⟶ 164:
Shared markup language materials include:
* Office Math Markup Language (OMML)
* DrawingML, used for vector drawing, charts and, for example, text art (additionally, though deprecated, [[Vector Markup Language|VML]] is supported for drawing)
* Extended properties
* Custom properties
Line 174 ⟶ 175:
=== Design approach ===
Patrick Durusau, the editor of [[OpenDocument|ODF]], has viewed the markup style of OOXML and ODF as representing two sides of a debate: the "element side" and the "attribute side". He notes that OOXML represents "the element side of this approach" and singles out the <code>KeepNext</code> element as an example:
<syntaxhighlight lang="xml">
<w:pPr>
<w:keepNext/>
…
</w:pPr>
</syntaxhighlight>
In contrast, he notes ODF would use the single attribute <code>fo:keep-next</code>, rather than an element, for the same semantic.<ref>{{ cite web
Line 189 ⟶ 190:
| date=21 October 2008 }}</ref>
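
In practice, the element-based style means that a consumer tests for the presence of a child element rather than reading an attribute value. A minimal Python sketch, using a hypothetical paragraph and function name, and ignoring any attributes the <code>keepNext</code> element itself might carry:

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace, as used in the hyperlink example above.
W_NS = {"w": "http://schemas.openxmlformats.org/wordprocessingml/2006/main"}

# Hypothetical paragraphs used only for illustration.
PARAGRAPH_WITH_KEEP_NEXT = """<w:p xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">
  <w:pPr><w:keepNext/></w:pPr>
  <w:r><w:t>Heading</w:t></w:r>
</w:p>"""

PLAIN_PARAGRAPH = """<w:p xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">
  <w:r><w:t>Body text</w:t></w:r>
</w:p>"""


def keeps_with_next(paragraph_xml):
    """True if the paragraph properties contain a keepNext element.

    Assumption: presence alone is treated as "on"; attributes are not inspected.
    """
    paragraph = ET.fromstring(paragraph_xml)
    return paragraph.find("w:pPr/w:keepNext", W_NS) is not None


print(keeps_with_next(PARAGRAPH_WITH_KEEP_NEXT))
print(keeps_with_next(PLAIN_PARAGRAPH))
```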
The [[W3C XML Schema|XML Schema]] of Office Open XML emphasizes reducing load time and improving [[parsing]] speed.<ref>{{Cite web| title=Software Developer uses Office Open XML to Minimize File Space, Increase Interoperability| url=http://www.openxmlcommunity.org/documents/casestudies/Intellisafe_OpenXML_Final.pdf
The naming of elements and attributes within the text
Some have argued the design is based too closely on Microsoft applications.
In August 2007, the [[Linux Foundation]] published a blog post calling upon ISO National Bodies to vote "No, with comments" during the International Standardization of OOXML. It said, "OOXML is a direct port of a single vendor's binary document formats. It avoids the re-use of relevant existing international standards (e.g. several cryptographic algorithms, VML, etc.). There are literally hundreds of technical flaws that should be addressed before standardizing OOXML including continued use of binary code tied to platform specific features, propagating bugs in MS-Office into the standard, proprietary units, references to proprietary/confidential tags, unclear [[Intellectual property|IP]] and patent rights, and much more".<ref>{{
The version of the standard submitted to [[ISO/IEC
=== WordprocessingML (WML) ===
Word processing documents use the XML vocabulary known as WordprocessingML normatively defined by the schema <
=== SpreadsheetML (SML) ===
Spreadsheet documents use the XML vocabulary known as SpreadsheetML normatively defined by the schema <
Each worksheet in a spreadsheet is represented by an XML document with a root element named
The representation of date and time values in SpreadsheetML has attracted some criticism. ECMA-376 1st edition does not conform to ISO 8601:2004 "Representation of Dates and Times". It requires that implementations replicate a [[Lotus 1-2-3]]
=== Office MathML (OMML) ===
Office Math Markup Language is a mathematical markup language which can be embedded in WordprocessingML, with intrinsic support for including word processing markup like revision markings,<ref>{{cite web|url = http://idippedut.dk/post/Do-your-math-OOXML-and-OMML|title = Do your math - OOXML and OMML (Updated 2008-02-12)|author = Jesper Lund Stocholm|publisher = A Mooh Point blog|date = 2008-02-12|access-date = 2015-11-18|archive-date = 2016-03-26|archive-url = https://web.archive.org/web/20160326225935/http://idippedut.dk/post/Do-your-math-OOXML-and-OMML|url-status = dead}}</ref> footnotes, comments, images and elaborate formatting and styles.<ref>{{cite web| url=http://blogs.msdn.com/murrays/archive/2007/06/05/science-and-nature-have-difficulties-with-word-2007-mathematics.aspx| title=Science and Nature have difficulties with Word 2007 mathematics| author=Murray Sargent| publisher=MSDN blogs| date=2007-06-05| access-date=2007-07-31}}</ref>
The following Office MathML example defines the [[fraction (mathematics)|fraction]]: <math>\frac{\pi}{2}</math>
<syntaxhighlight lang="xml">
<m:oMathPara><!-- mathematical block container used as a paragraph -->
  <m:oMath><!-- mathematical inline formula -->
    <m:f><!-- a fraction -->
      <m:num><m:r><m:t>π</m:t></m:r></m:num><!-- numerator containing a single run of text -->
      <m:den><m:r><m:t>2</m:t></m:r></m:den><!-- denominator containing a single run of text -->
    </m:f>
  </m:oMath>
</m:oMathPara>
</syntaxhighlight>
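
The structure of such a fraction can be inspected programmatically. The Python sketch below is a hedged illustration: the function name is hypothetical, the sample repeats the fraction above with an explicit namespace declaration, and only the first fraction found is examined.

```python
import xml.etree.ElementTree as ET

# Office math namespace bound to the "m" prefix.
M_URI = "http://schemas.openxmlformats.org/officeDocument/2006/math"
M_NS = {"m": M_URI}

SAMPLE_OMML = """<m:oMathPara xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math">
  <m:oMath>
    <m:f>
      <m:num><m:r><m:t>π</m:t></m:r></m:num>
      <m:den><m:r><m:t>2</m:t></m:r></m:den>
    </m:f>
  </m:oMath>
</m:oMathPara>"""


def fraction_parts(omml_xml):
    """Return (numerator text, denominator text) of the first m:f element, or None."""
    root = ET.fromstring(omml_xml)
    fraction = root.find(".//m:f", M_NS)
    if fraction is None:
        return None

    def text_of(child_name):
        node = fraction.find(f"m:{child_name}", M_NS)
        if node is None:
            return ""
        # Concatenate the text of every m:t run inside the numerator or denominator.
        return "".join(t.text or "" for t in node.iter(f"{{{M_URI}}}t"))

    return text_of("num"), text_of("den")


print(fraction_parts(SAMPLE_OMML))
```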
Some have questioned the need for Office MathML (OMML), advocating instead the use of [[MathML]], a [[World Wide Web Consortium|W3C]] recommendation for the "inclusion of mathematical expressions in Web pages" and "machine to machine communication".<ref>{{
=== DrawingML ===<!--
[[Image:
DrawingML is the [[vector graphics markup language]] used in Office Open XML documents. Its major features are the graphics rendering of text elements, graphical vector-based shape elements, graphical tables and charts.
Line 249 ⟶ 238:
DrawingML contains graphics effects (like shadows and reflection) that can be used on the different graphical elements that are used in DrawingML.
In DrawingML you can also create 3D effects, for instance to show the different graphical elements through a flexible camera viewpoint.
It is possible to create separate DrawingML theme parts in an Office Open XML package. These themes can then be applied to graphical elements throughout the Office Open XML package.<ref>{{
DrawingML is unrelated to the other [[vector graphics]] formats such as [[Scalable Vector Graphics|SVG]]. These can be converted to DrawingML to include natively in an Office Open XML document. This is a different approach to that of the [[OpenDocument]] format, which uses a subset of SVG, and includes vector graphics as separate files.
A DrawingML graphic's dimensions are specified in
Some have criticised the use of DrawingML (and the transitional-use-only [[Vector Markup Language|VML]]) instead of [[World Wide Web Consortium|W3C]] recommendation [[Scalable Vector Graphics|SVG]].<ref>{{ cite web | url=http://reddevnews.com/features/article.aspx?editorialsid=2356 | title= The X Factor | publisher=reddevnews.com |
== Foreign resources ==
=== Non-XML content ===
Line 273 ⟶ 263:
These are defined in clause 17.5 of Part 1.
== Compatibility settings ==
Line 282 ⟶ 268:
Versions of Office Open XML contain what are termed "compatibility settings". These are contained in Part 4 ("Markup Language Reference") of ECMA-376 1st Edition, but during standardization were moved to become a new part (also called Part 4) of ISO/IEC 29500:2008 ("Transitional Migration Features").
These settings (including elements with names such as ''autoSpaceLikeWord95'', ''footnoteLayoutLikeWW8'', ''lineWrapLikeWord6'', ''mwSmallCaps'', ''shapeLayoutLikeWW8'', ''suppressTopSpacingWP'', ''truncateFontHeightsLikeWP6'', ''uiCompat97To2003'', ''useWord2002TableStyleRules'', ''useWord97LineBreakRules'', ''wpJustification'' and ''wpSpaceWidth'') were the focus of some controversy during the standardisation of DIS 29500.<ref>{{ cite web | url=http://www.freesoftwaremagazine.com/articles/odf_ooxml_technical_white_paper?page=0%2C9 | title=ODF/OOXML technical white paper — A white paper based on a technical comparison between the ODF and OOXML formats | publisher=Free Software Magazine}}</ref> As a result, new text was added to ISO/IEC 29500 to document them.<ref>{{ cite web | url = http://www.ecma-international.org/publications/standards/Ecma-376.htm | title = ECMA-376 2nd edition Part 4 (paragraph 9.7.3) | publisher = Ecma-international.org | access-date
An article in ''[[Free Software Magazine]]'' has criticized the markup used for these settings. Office Open XML uses distinctly named elements for each compatibility setting, each of which is declared in the schema. The repertoire of settings is thus limited — for new compatibility settings to be added, new elements may need to be declared, "potentially creating thousands of them, each having nothing to do with interoperability".<ref>{{ cite web | quote="... OOXML chose this route. Rather than create an application-definable configuration tag there is a unique tag for each setting ... Currently, the only application's unique settings that are catered for are the applications that the standard's authors have decided to include, ... For other applications to be added, further tag names would need to be defined in the specification, potentially creating thousands of them, each having nothing to do with interoperability ..". | url=http://www.freesoftwaremagazine.com/articles/odf_ooxml_technical_white_paper?page=0%2C7 | title=ODF/OOXML technical white paper — A white paper based on a technical comparison between the ODF and OOXML formats | publisher=Free Software Magazine }}</ref>
== Extensibility ==
Line 292 ⟶ 278:
==References==
{{reflist|colwidth=30em}}
{{ISO standards}}
{{Ecma International Standards}}
{{Microsoft Office}}
[[Category:Office Open XML|*]]
[[Category:Computer-related introductions in 2006]]
[[Category:XML]]
[[Category:Document-centric XML-based standards]]
[[Category:Markup languages]]
[[Category:Open formats]]
[[Category:Computer file formats]]
[[Category:Microsoft Office]]