Office Open XML file formats: Difference between revisions

Content deleted Content added
mNo edit summary
Rescuing 2 sources and tagging 0 as dead.) #IABot (v2.0.9.5) (Eastmain - 17674
Line 190:
| date=21 October 2008 }}</ref>
 
The [[W3C XML Schema|XML Schema]] of Office Open XML emphasizes reducing load time and improving [[parsing]] speed.<ref>{{Cite web| title=Software Developer uses Office Open XML to Minimize File Space, Increase Interoperability| url=http://www.openxmlcommunity.org/documents/casestudies/Intellisafe_OpenXML_Final.pdf | author=Intellisafe Technologies}}</ref> In a test with applications current in April 2007, XML-based office documents were slower to load than binary formats.<ref>{{ cite web | url=http://blogs.zdnet.com/Ou/?p=480 | title=MS Office 2007 versus Open Office 2.2 shootout | author=George Ou | date=2007-04-27 | access-date=2007-04-27 | publisher=ZDnet.com | archive-date=2009-03-26 | archive-url=https://web.archive.org/web/20090326081043/http://blogs.zdnet.com/Ou/?p=480 | url-status=dead }}</ref> To enhance performance, Office Open XML uses very short element names for common elements and spreadsheets save dates as index numbers (starting from 1900 or from 1904).<ref>{{ cite web | url=https://support.microsoft.com/en-us/kb/214330 | title=Differences between the 1900 and the 1904 date system in Excel | date=2013-03-05 | access-date=2016-08-23 | publisher=Microsoft}}</ref> In order to be systematic and generic, Office Open XML typically uses separate child elements for data and metadata (element names ending in ''Pr'' for ''properties'') rather than using multiple attributes, which allows structured properties. Office Open XML does not use mixed content but uses elements to put a series of text runs (element name ''r'') into paragraphs (element name ''p''). The result is terse{{Citation needed|date=October 2009}} and highly nested in contrast to [[HTML]], for example, which is fairly flat, designed for humans to write in [[text editors]] and is more congenial for humans to read.
 
The naming of elements and attributes within the text has attracted some criticism. There are three different syntaxes in OOXML (ECMA-376) for specifying the color and alignment of text depending on whether the document is a text, spreadsheet, or presentation. Rob Weir (an [[IBM]] employee and co-chair of the [[OASIS (organization)|OASIS]] [[OpenDocument Format]] TC) asks "What is the engineering justification for this horror?". He contrasts with [[OpenDocument]]: "ODF uses the W3C's XSL-FO vocabulary for text styling, and uses this vocabulary consistently".<ref>{{ cite web | url=http://www.robweir.com/blog/2008/03/disharmony-of-ooxml.html | title= Disharmony of OOXML | author=Rob Weir | date=14 March 2008}}</ref>
Line 212:
 
=== Office MathML (OMML) ===
Office Math Markup Language is a mathematical markup language which can be embedded in WordprocessingML, with intrinsic support for including word processing markup like revision markings,<ref>{{cite web|url = http://idippedut.dk/post/Do-your-math-OOXML-and-OMML|title = Do your math - OOXML and OMML (Updated 2008-02-12)|author = Jesper Lund Stocholm|publisher = A Mooh Point blog|date = 2008-02-12|access-date = 2015-11-18|archive-date = 2016-03-26|archive-url = https://web.archive.org/web/20160326225935/http://idippedut.dk/post/Do-your-math-OOXML-and-OMML|url-status = dead}}</ref> footnotes, comments, images and elaborate formatting and styles.<ref>{{cite web| url=http://blogs.msdn.com/murrays/archive/2007/06/05/science-and-nature-have-difficulties-with-word-2007-mathematics.aspx| title=Science and Nature have difficulties with Word 2007 mathematics| author=Murray Sargent| publisher=MSDN blogs| date=2007-06-05| access-date=2007-07-31}}</ref>
The OMML format is different from the [[World Wide Web Consortium]] (W3C) [[MathML]] recommendation that does not support those office features, but is partially compatible<ref>{{cite web| url=http://dpcarlisle.blogspot.com/2007/04/xhtml-and-mathml-from-office-20007.html| title=XHTML and MathML from Office 2007| author=David Carlisle| publisher=David Carlisle| date=2007-05-09| access-date=2007-09-20}}</ref> through [[XSL Transformations]]; tools are provided with office suite and are automatically used via clipboard transformations.<ref>{{Cite web|url=http://blogs.msdn.com/b/murrays/archive/2007/06/05/science-and-nature-have-difficulties-with-word-2007-mathematics.aspx|title = DevBlogs}}</ref>