in070132



From: Kevin Wright [kwright68@gmail.com]
Sent: Tuesday, January 23, 2007 5:07 PM
To: Rajchel, Lisa; Garner, Jennifer; Desautels, Sara
Cc: bryden@iso.org; kwright68@gmail.com



Subject:  Kevin Wright Contribution in Response to JTC 1 N 8455 - 30 Day Review for Fast Track Ballot ECMA-376 | ISO/IEC DIS 29500 Office Open XML File Formats




Dear Sir or Madam,

I am sure that you will receive many quasi-form letters expressing concerns with the Ecma 376 Office Open XML standard, so let me differentiate my letter with some personal comments.

I work as a statistician for a Fortune 500 company.  From time to time, I am asked to speak on issues of Data Quality.  In my presentations I always include examples that illustrate how Excel is a poor choice for long-term data storage and for interoperability with other software.  For example, I use SAS, S-Plus, R, Minitab and other statistical software programs to access data from Excel.  Every single one of these software packages is unable to 100% reliably import data from Excel.

One trivial example: in Excel, the data type for a column is determined by the first 16 rows of data.  If the column contains mixed data (that is, some numeric and some character) the column may either be identified as numeric or as character.  Importantly, if the rows are re-ordered and imported again into the statistics software, there is no guarantee the data will be imported the same way.

I readily admit that I have not used Excel 2007, which may correct problems associated with my example, but Ecma 376 is full of specifications that exist solely for backward compatibility with earlier versions of Microsoft Office files (such as the infamous 1900 leap-year bug).

I have other examples of data-quality problems associated with Microsoft Excel and wish to re-iterate that it (as currently
documented) is NOT an acceptable long-term data storage format.

I hope you will agree that Ecma 376 is not ready for "fast-track"
approval.  At a minimum, Ecma 376 should remove all references to undocumented features of previous versions of Microsoft Office file formats.  There is simply no place in an international standard for such language.

Sincerely,

K. Wright