Jump to content

Word to XHTML Converter


vchris
 Share

Recommended Posts

Anyone know of a great script that convert word files to valid xhtml? I don't like programs people create, they're usually slow and I can't put any custom stuff in there. A small script in coldfusion or other server side language could do the job a lot faster and could be customized easily. I'm converting large documents with large tables, images, table of contents, graphics...

Link to comment
Share on other sites

Can I do batch jobs with this program? Will it automatically clean the html create by MS word?edit: forgot to mention free!

Link to comment
Share on other sites

Can I do batch jobs with this program? Will it automatically clean the html create by MS word?edit: forgot to mention free!
No, it won't allow batch jobs. You have to do it all at once. But it will clean it up and also it is free!
Link to comment
Share on other sites

HTML-Kit is a nice little program. It's light which is fast and as powerful as dreamweaver with it's plugins. HTML Tidy cleans up the Word HTML but not as much as I need. I'm creating a plugin to do the rest of the work :)

Link to comment
Share on other sites

What is the code required to remove all attributes except for colspan and rowspan? I got this:\swidth="?[0-9]+"?|\sheight="?[0-9]+"?|\salign="?.*"?|\svalign="?.*"?|\sstyle=['|"](.|\s)*['|"]|\snowrap|\sclass="?.*"?|\slang="?.*"?|\sxml:lang="?.*"?The only problem I have with this line is the line returns in the style attribute, \s doesn't seem to work.

Link to comment
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
 Share

×
×
  • Create New...