Wednesday, May 07, 2008

How to - Convert Word Docs to Web Pages

Useful ...

Convert Word Docs to Web Pages

http://howto.wired.com/wiki/Convert_Word_Docs_to_Web_Pages

Microsoft Word has its place, but that place isn't the web. If you've ever tried to convert a Word document to an HTML document, you know that Word's built-in tools can have disastrous results -- bloated files, proprietary markup and exposed personal information are among the gems you'll get with Word's "Convert to HTML" function.

To get to a semi-sane starting point, try using Word's "Save As: Web Page, Filtered" rather than the regular web page option. This will strip out many of the proprietary tags and won't include potentially personal and revealing info contained in the File Properties dialog.

TinyMCE

Another viable option is TinyMCE, a JavaScript Rich Text Editor that offers a "Paste from Word" option. Paste From Word is intended to be used by those who would like to just "Select All" in Word and paste the content into TinyMCE. Depending on the complexity of your document, TinyMCE may be able to fix some of Word's styling quirks and output usable HTML.

Textism

The good folks over at Textism have a tool that will, to quote the Textism website, "strip Microsoft's proprietary tags and other superfluous noise from Word-generated HTML documents." The results are not only much closer to standards compliant web markup, they also create much much smaller, quickly loading pages.

I used Textism to convert this document from Word to clean HTML. I think, it did a pretty good job.

Labels: , , ,

1 Comments:

Blogger Alex said...

I know a good tool for work with word file-Fix Word, can recover information from corrupted Microsoft Word documents and templates via the local area network, export the recovered information both to Microsoft Word and to a plain text file, recover doc file software can fix word file docx of Office 2007, allows users to avoid losing data important for them. Indeed, Microsoft Word is nowadays the most popular tool for creating any documents, including corporate ones.

2:23 PM  

Post a Comment

<< Home