Click to See Complete Forum and Search --> : DOC to HTML


t0ny
May 15th, 2003, 02:43 PM
Hi,

I have a Word doc file template that I wish users to use to create documents with. Once they have created them I want them to upload the file using a section on my site.

OK. Once I have the file I need to convert it into HTML, but I want it without all the crap that Microsoft add in.

I have been using the resource hungry word object up until now opening the file and then using saveAs to stick it into HTML. If I use the filtered HTML flag then it removes ALL HTML tags and gives up when it meets a picture.

I have heard that I can convert my word doc into XML and then into HTML keeping the formating and ignoring the Microsoft rubbish??

Can anyone help with converting a word doc to html without using the word object, or is this the best way to do it? #

lord loh
May 22nd, 2003, 04:24 AM
If your prime concern is to remove the microsoft add ins, then follow the microsoft procedure and get a HTML from your DOC....

Then doctor the HTML to remove their crap...