Web Archive (.mht) File Generated by Microsoft Word

Q

What contents are stored in a Web Archive (.mht) file? When you convert a word document into a single file Web page, Word generates a single with the extension of .mht. How text, pictures, and objects from the original word document are stored in this single .mht file?

✍: FYIcenter.com

A

Word uses the Single File Web Page format to store text, pictures and objects from the original word document into a single file with the Web Archive (.mht) file extension.

The Single File Web Page format is also known as a Web Archive File format. It uses the MIME Content-Type "multipart/related" to bundle multiple files into a single archive file. The first file bundled in the archive file is normally the HTML document of the Web page. Subsequent files bundled in the archive file are additional supporting files to the Web page.

Image files and other binary files are encoded in Base64 format in a Web Archive file.

Here is an example of Web Archive (.mht) file generated by Word:

MIME-Version: 1.0
Content-Type: multipart/related; boundary="----=_NextPart_01CA4105.13A346F0"

This document is a Single File Web Page, also known as a Web Archive file.
If you are seeing this message, your browser or editor doesn't support
Web Archive files. Please download a browser that supports Web Archive,
such as Windows Internet Explorer.

------=_NextPart_01CA4105.13A346F0
Content-Location: file:///C:/ECC8A9E5/Single.htm
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html; charset="us-ascii"

<html xmlns:v=3D"urn:schemas-microsoft-com:vml"
xmlns:o=3D"urn:schemas-microsoft-com:office:office"
xmlns:w=3D"urn:schemas-microsoft-com:office:word"
xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml"
xmlns=3D"http://www.w3.org/TR/REC-html40">

<head>
<meta http-equiv=3DContent-Type content=3D"text/html; charset=3Dus-ascii">
<meta name=3DProgId content=3DWord.Document>
<meta name=3DGenerator content=3D"Microsoft Word 12">
<meta name=3DOriginator content=3D"Microsoft Word 12">
<link rel=3DFile-List href=3D"Single_files/filelist.xml">
<link rel=3DEdit-Time-Data href=3D"Single_files/editdata.mso">
<link rel=3DOLE-Object-Data href=3D"Single_files/oledata.mso">
<!--[if !mso]>
<style>
v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style>
<![endif]--><!--[if gte mso 9]><xml>
<o:DocumentProperties>
......

</html>

------=_NextPart_01CA4105.13A346F0
Content-Location: file:///C:/ECC8A9E5/Single_files/themedata.thmx
Content-Transfer-Encoding: base64
Content-Type: application/vnd.ms-officetheme

UEsDBBQABgAIAAAAIQCCirwT+gAAABwCAAATAAAAW0NvbnRlbnRfVHlwZXNdLnhtbKyRy2rDMBBF
94X+g9C22HK6KKXYzqJJd30s0g8Y5LEtao+ENAnJ33fsuFC6CC10IxBizpl7Va6P46AOGJPzVOlV
XmiFZH3jqKv0++4pu9cqMVADgyes9AmTXtfXV+XuFDApmaZU6Z45PBiTbI8jpNwHJHlpfRyB5Ro7
......

------=_NextPart_01CA4105.13A346F0--

 

Converting Microsoft Word to/from Other Format

⇒⇒MS Word - Frequently Asked Questions

2016-11-05, 961👍, 0💬