Appendix E. KWord Technical details

KWord file format

Mike McBride

KWord uses two open source, independently developed standards for its file format. The combination was chosen for its balance between convenience and open development models.

First, note that every KWord file is a collection of XML files that have been combined into a single compressed file to reduce its size.

The details depend on the KWord version you are interested in:

KWord 1.1 and earlier

The XML files are combined into a single archive with tar and compressed with gzip (hence the z option in the command below).

You can uncompress the files with the following command:

% tar -xzvf filename

This will expand the KWord document file into its component files.
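
The same inspection can be done from a script. The following is a minimal Python sketch, assuming the document is a gzip-compressed tar archive as the tar -xzvf command above implies. The function names list_components and read_component are hypothetical helpers, not part of KWord or KOffice.

```python
import tarfile


def list_components(path):
    """Return the names of the regular files stored in the archive."""
    # "r:gz" insists on gzip compression, like the z option of tar -xzvf.
    with tarfile.open(path, "r:gz") as archive:
        return [member.name for member in archive.getmembers() if member.isfile()]


def read_component(path, name):
    """Return the raw bytes of one component file, e.g. "maindoc.xml"."""
    with tarfile.open(path, "r:gz") as archive:
        handle = archive.extractfile(name)  # raises KeyError if name is absent
        if handle is None:
            # extractfile returns None for members that are not regular files.
            raise KeyError(name)
        return handle.read()
```

Calling list_components on a KWord 1.1 document should report at least maindoc.xml and documentinfo.xml, the two files described below.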

The text portion of every KWord file is stored as XML (eXtensible Markup Language).

Note

For more information on XML documents, processors and technology, please visit

World Wide Web Consortium XML pages
XML.org Resource Guide
The XML FAQ

All KWord documents consist of at least two XML files:

maindoc.xml

This file contains the bulk of the KWord text, tables and formula information. It is marked with XML tags according to the official DTD. A copy of the KWord 1.1 DTD is located at: https://www.koffice.org/DTD/kword-1.1.dtd.

documentinfo.xml

This file contains the document information, that is, the information entered in the dialog box opened by selecting File -> Document Information from the menubar. It is useful for tracking authors, contact information, etc.

The DTD for KOffice 1.1 is located at: https://www.koffice.org/DTD/document-info-1.1.dtd.

In addition, there may be other files included in the KWord document file. Pictures, embedded documents and other binary information are stored within the KWord document as separate files.

For more specific information on KWord file storage or other internal information, please see The KOffice API and the General KDE developer information pages.

KWord 1.2

The XML files are compressed into a single file in the zip format. This change from the tar format was made because zip is widely used by other open source office suites and offers better performance with lower memory requirements.
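
Because the container differs between KWord 1.1 and later versions, a script that handles documents of unknown age may want to check the container type first. The following is a minimal Python sketch using only the standard library. The function name container_format and its return labels are hypothetical.

```python
import tarfile
import zipfile


def container_format(path):
    """Return "zip", "tar", or "unknown" for the file at path."""
    # Check zip first: this is the container used by KWord 1.2 and later.
    if zipfile.is_zipfile(path):
        return "zip"
    # is_tarfile also recognizes gzip-compressed tar archives,
    # the container used by KWord 1.1 and earlier.
    if tarfile.is_tarfile(path):
        return "tar"
    return "unknown"
```

The result only identifies the container. It does not prove that the file is a KWord document.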

You can uncompress the files with the following command:

% unzip filename

This will expand the KWord document file into its component files.
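
For scripted use, the unzip step can be reproduced with Python's standard zipfile module. This is a minimal sketch under the assumption that the document is an ordinary zip archive. The function name unzip_kword and the destination directory are hypothetical.

```python
import zipfile


def unzip_kword(path, dest):
    """Extract all component files into dest and return their names."""
    with zipfile.ZipFile(path) as archive:
        # extractall creates dest if it does not exist yet.
        archive.extractall(dest)
        return archive.namelist()
```

For example, unzip_kword("filename", "unpacked") would place the component files in a directory named unpacked.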

The text portion of every KWord file is stored as XML (eXtensible Markup Language).

Note

For more information on XML documents, processing and technology, please visit

World Wide Web Consortium XML pages
XML.org Resource Guide
The XML FAQ

All KWord documents consist of at least three files:

maindoc.xml

This file contains the bulk of the KWord text, tables and formula information. It is marked with XML tags according to the official DTD.

A copy of the KWord 1.2 DTD is located at: https://www.koffice.org/DTD/kword-1.2.dtd.

documentinfo.xml

This file contains the document information, that is, the information entered in the dialog box opened by selecting File -> Document Information from the menubar. It is useful for tracking authors, contact information, etc.

The DTD for KOffice 1.2 is located at: https://koffice.kde.org/DTD/document-info-1.2.dtd.

mimetype

This file contains the mimetype for KWord files. This information is used by KDE to determine that this is a KWord file.

This file always contains: application/x-kword
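
A program other than KDE could apply the same test. The sketch below, in Python, reads the mimetype entry from the zip container and compares it with the value given above. The function name has_kword_mimetype is hypothetical, and stripping surrounding whitespace before comparing is an assumption made for robustness, not documented behaviour.

```python
import zipfile

KWORD_MIMETYPE = "application/x-kword"


def has_kword_mimetype(path):
    """Return True if path is a zip archive whose mimetype entry names KWord."""
    if not zipfile.is_zipfile(path):
        return False
    with zipfile.ZipFile(path) as archive:
        try:
            raw = archive.read("mimetype")
        except KeyError:
            # No mimetype entry, so this is not a KWord 1.2 or later document.
            return False
    # Assumption: tolerate a trailing newline or spaces around the value.
    return raw.decode("ascii", errors="replace").strip() == KWORD_MIMETYPE
```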

In addition, there may be other files included in the KWord document file. Pictures, embedded documents and other binary information are stored within the KWord document as separate files.

For more specific information on KWord file storage or other internal information, please see The KOffice API and the General KDE developer information pages.

KWord 1.3 (current version)

As in KWord 1.2, the XML files are compressed into a single file in the zip format. This format was chosen because it is widely used by other open source office suites and offers better performance with lower memory requirements.

You can uncompress the files with the following command:

% unzip filename

This will expand the KWord document file into its component files.

The text portion of every KWord file is stored as XML (eXtensible Markup Language).

Note

For more information on XML documents, processing and technology, please visit

World Wide Web Consortium XML pages
XML.org Resource Guide
The XML FAQ

All KWord documents consist of at least three files:

maindoc.xml

This file contains the bulk of the KWord text, tables and formula information. It is marked with XML tags according to the official DTD.

A copy of the KWord 1.3 DTD is located at: https://www.koffice.org/DTD/kword-1.3.dtd.
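
Since maindoc.xml is ordinary XML, it can be read straight from the zip container with a generic XML parser. The Python sketch below makes no assumptions about the element names defined in the DTD: it only reports the root element and how often each tag occurs. The function name summarize_maindoc is hypothetical, and the parser used here does not validate the file against the DTD.

```python
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter


def summarize_maindoc(path):
    """Return (root tag, {tag: count}) for maindoc.xml inside the document."""
    with zipfile.ZipFile(path) as archive:
        # Parse the bytes directly; nothing is written to disk.
        root = ET.fromstring(archive.read("maindoc.xml"))
    counts = Counter(element.tag for element in root.iter())
    return root.tag, dict(counts)
```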

documentinfo.xml

This file contains the document information, that is, the information entered in the dialog box opened by selecting File -> Document Information from the menubar. It is useful for tracking authors, contact information, etc.

The DTD for KOffice 1.3 is located at: https://koffice.kde.org/DTD/document-info-1.3.dtd.

mimetype

This file contains the mimetype for KWord files. This information is used by KDE to determine that this is a KWord file.

This file always contains: application/x-kword

In addition, there may be other files included in the KWord document file. Pictures, embedded documents and other binary information are stored within the KWord document as separate files.
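
To see which additional files a particular document carries, the archive listing can be filtered against the three files named above. This is a minimal Python sketch. The function name extra_components is hypothetical, and the names and layout of any extra entries depend on the document, so nothing is assumed about them here.

```python
import zipfile

# The three files described in this section.
CORE_FILES = {"maindoc.xml", "documentinfo.xml", "mimetype"}


def extra_components(path):
    """Return the sorted names of stored files other than the three core files."""
    with zipfile.ZipFile(path) as archive:
        return sorted(
            name
            for name in archive.namelist()
            # Skip the core files and pure directory entries (names ending in "/").
            if name not in CORE_FILES and not name.endswith("/")
        )
```

An empty list means the document contains only text and document information.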

For more specific information on KWord file storage or other internal information, please see The KOffice API and the General KDE developer information pages.




 
 
Published under the terms of the GNU General Public License