Content deleted Content added
→Relationships: clarified URL reference |
m →Office MathML (OMML): HTTP to HTTPS for Blogspot |
||
(40 intermediate revisions by 33 users not shown) | |||
Line 1:
{{Short description|Family of file formats}}
{{Main|Office Open XML}}
Line 4 ⟶ 5:
{{Infobox file format
| name = Office Open XML Document
| icon =
| logo sg
| screenshot =
| caption =
| extension = .docx, .docm
| mime = application/vnd
| type code =
| uniform type =
| magic =
| owner = [[Microsoft]], [[Ecma International|Ecma]], [[International Organization for Standardization|ISO]]/[[International Electrotechnical Commission|IEC]]
| released = {{Start date and age|2006}}
| latest release version =
| latest release date =
Line 23 ⟶ 24:
| extended to =
| standard = ECMA-376, ISO/IEC 29500
| url = [http://www.ecma-international.org/publications-and-standards/standards/
{{Infobox file format
| name = Office Open XML Presentation
| icon =
| logo =
| screenshot =
| caption =
| extension = .pptx, .pptm
| mime = application/vnd
| type code =
| uniform type =
Line 45 ⟶ 46:
| extended to =
| standard = ECMA-376, ISO/IEC 29500
| url = [http://www.ecma-international.org/publications-and-standards/standards/
{{Infobox file format
| name = Office Open XML Workbook
| icon =
| logo =
| screenshot =
| caption =
| extension = .xlsx, .xlsm
| mime = application/vnd
| type code =
| uniform type =
Line 67 ⟶ 68:
| extended to =
| standard = ECMA-376, ISO/IEC 29500
| url = [http://www.ecma-international.org/publications/standards-and-standards/
}}
The '''Office Open XML file formats''' are a set of [[file format]]s that can be used to represent electronic [[Office suite|office]] documents. There are formats for [[word processing]] documents, [[spreadsheets]] and [[presentations]] as well as specific formats for material such as mathematical
The formats were developed by [[Microsoft]] and first appeared in [[Microsoft Office 2007]]. They were standardized between December 2006 and November 2008, first by the [[Ecma International]] consortium, where they became ECMA-376, and subsequently, after a [[Standardization of Office Open XML|contentious standardization process]], by the ISO/IEC's Joint Technical Committee 1, where they became ISO/IEC 29500:2008.
Line 78 ⟶ 79:
[[Image:Open Packaging Convention.png|220px|thumb|right|Container structure of Part 2 of the Ecma Office Open XML standard, ECMA-376]]
Office Open XML documents are stored in [[Open Packaging
A basic package contains an XML file called ''[Content_Types].xml'' at the root, along with three directories: ''_rels'', ''docProps'', and a directory specific for the document type (for example, in a .docx word processing package, there would be a ''word'' directory). The ''word'' directory contains the ''document.xml'' file which is the core content of the document.
Line 92 ⟶ 93:
An example relationship file (''word/_rels/document.xml.rels''), is:
<
<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<Relationships
Line 105 ⟶ 106:
TargetMode="External" />
</Relationships>
</syntaxhighlight>
As such, images referenced in the document can be found in the relationship file by looking for all relationships that are of type <code><nowiki>http://schemas.microsoft.com/office/2006/relationships/image</nowiki></code>. To change the used image, edit the relationship.
Line 111 ⟶ 112:
The following code shows an example of inline markup for a [[hyperlink]]:
<
<w:hyperlink r:id="rId2" w:history="1"
xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships"
xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">
</syntaxhighlight>
In this example, the [[Uniform Resource Locator]] (URL) is in the Target attribute of the Relationship referenced through the relationship Id, "rId2" in this case. Linked images, templates, and other items are referenced in the same way.
Line 121 ⟶ 122:
Pictures can be embedded or linked using a tag:
<
<v:imagedata w:rel="rId1" o:title="example" />
</syntaxhighlight>
This is the reference to the image file. All references are managed via relationships. For example, a document.xml has a relationship to the image. There is a _rels directory in the same directory as document.xml, inside _rels is a file called document.xml.rels. In this file there will be a relationship definition that contains type, ID and ___location. The ID is the referenced ID used in the XML document. The type will be a reference schema definition for the media type and the ___location will be an internal ___location within the ZIP package or an external ___location defined with a URL.
Line 133 ⟶ 134:
An example document properties file (''docProps/core.xml'') that uses Dublin Core metadata, is:
<
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<cp:coreProperties xmlns:cp="http://schemas.openxmlformats.org/package/2006/metadata/core-properties"
Line 150 ⟶ 151:
<cp:contentStatus>Final</cp:contentStatus>
</cp:coreProperties>
</syntaxhighlight>
== Document markup languages ==
Line 163 ⟶ 164:
Shared markup language materials include:
* Office Math Markup Language (OMML)
* DrawingML used for vector drawing, charts, and for example, text art (additionally, though deprecated, [[Vector Markup Language|VML]] is supported for drawing)
* Extended properties
* Custom properties
Line 176 ⟶ 177:
Patrick Durusau, the editor of [[OpenDocument|ODF]], has viewed the markup style of OOXML and ODF as representing two sides of a debate: the "element side" and the "attribute side". He notes that OOXML represents "the element side of this approach" and singles out the <code>KeepNext</code> element as an example:
<
<w:pPr>
<w:keepNext/>
…
</w:pPr>
</syntaxhighlight>
In contrast, he notes ODF would use the single attribute <code>fo:keep-next</code>, rather than an element, for the same semantic.<ref>{{ cite web
Line 189 ⟶ 190:
| date=21 October 2008 }}</ref>
The [[W3C XML Schema|XML Schema]] of Office Open XML emphasizes reducing load time and improving [[parsing]] speed.<ref>{{Cite web| title=Software Developer uses Office Open XML to Minimize File Space, Increase Interoperability| url=http://www.openxmlcommunity.org/documents/casestudies/Intellisafe_OpenXML_Final.pdf
The naming of elements and attributes within the text has attracted some criticism. There are three different syntaxes in OOXML (ECMA-376) for specifying the color and alignment of text depending on whether the document is a text, spreadsheet, or presentation. Rob Weir (an [[IBM]] employee and co-chair of the [[OASIS (organization)|OASIS]] [[OpenDocument Format]] TC) asks "What is the engineering justification for this horror?". He contrasts with [[OpenDocument]]: "ODF uses the W3C's XSL-FO vocabulary for text styling, and uses this vocabulary consistently".<ref>{{ cite web | url=http://www.robweir.com/blog/2008/03/disharmony-of-ooxml.html | title= Disharmony of OOXML | author=Rob Weir | date=14 March 2008}}</ref>
Some have argued the design is based too closely on Microsoft applications.
In August 2007, the [[Linux Foundation]] published a blog post calling upon ISO National Bodies to vote "No, with comments" during the International Standardization of OOXML. It said, "OOXML is a direct port of a single vendor's binary document formats. It avoids the re-use of relevant existing international standards (e.g. several cryptographic algorithms, VML, etc.). There are literally hundreds of technical flaws that should be addressed before standardizing OOXML including continued use of binary code tied to platform specific features, propagating bugs in MS-Office into the standard, proprietary units, references to proprietary/confidential tags, unclear [[Intellectual property|IP]] and patent rights, and much more".<ref>{{
The version of the standard submitted to [[ISO/IEC
=== WordprocessingML (WML) ===
Word processing documents use the XML vocabulary known as WordprocessingML normatively defined by the schema <
=== SpreadsheetML (SML) ===
Spreadsheet documents use the XML vocabulary known as SpreadsheetML normatively defined by the schema <
Each worksheet in a spreadsheet is represented by an XML document with a root element named {{tag|worksheet}} in the {{LinkBlue|<nowiki>http://schemas.openxmlformats.org/spreadsheetml/2006/main</nowiki>}} Namespace.
The representation of date and time values in SpreadsheetML has attracted some criticism. ECMA-376 1st edition does not conform to ISO 8601:2004 "Representation of Dates and Times". It requires that implementations replicate a [[Lotus 1-2-3]]
=== Office MathML (OMML) ===
Office Math Markup Language is a mathematical markup language which can be embedded in WordprocessingML, with intrinsic support for including word processing markup like revision markings,<ref>{{cite web|url = http://idippedut.dk/post/Do-your-math-OOXML-and-OMML|title = Do your math - OOXML and OMML (Updated 2008-02-12)|author = Jesper Lund Stocholm|publisher = A Mooh Point blog|date = 2008-02-12|
The OMML format is different from the [[World Wide Web Consortium]] (W3C) [[MathML]] recommendation that does not support those office features, but is partially compatible<ref>{{cite web| url=
The following Office MathML example defines the [[fraction (mathematics)|fraction]]: <math>\frac{\pi}{2}</math>
<
<m:oMathPara><!-- mathematical block container used as a paragraph -->
<m:oMath><!-- mathematical inline formula -->
Line 225 ⟶ 226:
</m:oMath>
</m:oMathPara>
</syntaxhighlight>
Some have queried the need for Office MathML (OMML) instead advocating the use of [[MathML]], a [[World Wide Web Consortium|W3C]] recommendation for the "inclusion of mathematical expressions in Web pages" and "machine to machine communication".<ref>{{cite web| url=
=== DrawingML ===<!-- English Metric Unit links to here -->
Line 237 ⟶ 238:
DrawingML contains graphics effects (like shadows and reflection) that can be used on the different graphical elements that are used in DrawingML.
In DrawingML you can also create 3d effects, for instance to show the different graphical elements through a flexible camera viewpoint.
It is possible to create separate DrawingML theme parts in an Office Open XML package. These themes can then be applied to graphical elements throughout the Office Open XML package.<ref>{{
DrawingML is unrelated to the other [[vector graphics]] formats such as [[Scalable Vector Graphics|SVG]]. These can be converted to DrawingML to include natively in an Office Open XML document. This is a different approach to that of the [[OpenDocument]] format, which uses a subset of SVG, and includes vector graphics as separate files.
A DrawingML graphic's dimensions are specified in ''English Metric Units'' (EMUs). It is so called because it allows an exact common representation of dimensions originally in either [[English units|English]] or [[Metric
Some have criticised the use of DrawingML (and the transitional-use-only [[Vector Markup Language|VML]]) instead of [[World Wide Web Consortium|W3C]] recommendation [[Scalable Vector Graphics|SVG]].<ref>{{ cite web | url=http://reddevnews.com/features/article.aspx?editorialsid=2356 | title= The X Factor | publisher=reddevnews.com |date=October 2007 }}</ref> VML did not become a W3C recommendation.<ref>{{ cite web | url = http://www.w3.org/TR/NOTE-VML | title = VML — the Vector Markup Language | publisher = W3.org | date = 1998-05-13 |
== Foreign resources ==
Line 267 ⟶ 268:
Versions of Office Open XML contain what are termed "compatibility settings". These are contained in Part 4 ("Markup Language Reference") of ECMA-376 1st Edition, but during standardization were moved to become a new part (also called Part 4) of ISO/IEC 29500:2008 ("Transitional Migration Features").
These settings (including element with names such as ''autoSpaceLikeWord95'', ''footnoteLayoutLikeWW8'', ''lineWrapLikeWord6'', ''mwSmallCaps'', ''shapeLayoutLikeWW8'', ''suppressTopSpacingWP'', ''truncateFontHeightsLikeWP6'', ''uiCompat97To2003'', ''useWord2002TableStyleRules'', ''useWord97LineBreakRules'', ''wpJustification'' and ''wpSpaceWidth'') were the focus of some controversy during the standardisation of DIS 29500.<ref>{{ cite web | url=http://www.freesoftwaremagazine.com/articles/odf_ooxml_technical_white_paper?page=0%2C9 | title=ODF/OOXML technical white paper — A white paper based on a technical comparison between the ODF and OOXML formats | publisher=Free Software Magazine}}</ref> As a result, new text was added to ISO/IEC 29500 to document them.<ref>{{ cite web | url = http://www.ecma-international.org/publications/standards/Ecma-376.htm | title = ECMA-376 2nd edition Part 4 (paragraph 9.7.3) | publisher = Ecma-international.org | access-date
An article in ''[[Free Software Magazine]]'' has criticized the markup used for these settings. Office Open XML uses distinctly named elements for each compatibility setting, each of which is declared in the schema. The repertoire of settings is thus limited — for new compatibility settings to be added, new elements may need to be declared, "potentially creating thousands of them, each having nothing to do with interoperability".<ref>{{ cite web | quote="... OOXML chose this route. Rather than create an application-definable configuration tag there is a unique tag for each setting ... Currently, the only application's unique settings that are catered for are the applications that the standard's authors have decided to include, ... For other applications to be added, further tag names would need to be defined in the specification, potentially creating thousands of them, each having nothing to do with interoperability ..". | url=http://www.freesoftwaremagazine.com/articles/odf_ooxml_technical_white_paper?page=0%2C7 | title=ODF/OOXML technical white paper — A white paper based on a technical comparison between the ODF and OOXML formats | publisher=Free Software Magazine }}</ref>
== Extensibility ==
Line 284 ⟶ 285:
[[Category:Office Open XML|*]]
[[Category:Computer-related introductions in 2006]]
[[Category:XML]]
[[Category:Document-centric XML-based standards]]
|