t0ny
May 15th, 2003, 02:43 PM
Hi,
I have a Word doc file template that I wish users to use to create documents with. Once they have created them I want them to upload the file using a section on my site.
OK. Once I have the file I need to convert it into HTML, but I want it without all the crap that Microsoft add in.
I have been using the resource hungry word object up until now opening the file and then using saveAs to stick it into HTML. If I use the filtered HTML flag then it removes ALL HTML tags and gives up when it meets a picture.
I have heard that I can convert my word doc into XML and then into HTML keeping the formating and ignoring the Microsoft rubbish??
Can anyone help with converting a word doc to html without using the word object, or is this the best way to do it? #
I have a Word doc file template that I wish users to use to create documents with. Once they have created them I want them to upload the file using a section on my site.
OK. Once I have the file I need to convert it into HTML, but I want it without all the crap that Microsoft add in.
I have been using the resource hungry word object up until now opening the file and then using saveAs to stick it into HTML. If I use the filtered HTML flag then it removes ALL HTML tags and gives up when it meets a picture.
I have heard that I can convert my word doc into XML and then into HTML keeping the formating and ignoring the Microsoft rubbish??
Can anyone help with converting a word doc to html without using the word object, or is this the best way to do it? #