LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Programming (https://www.linuxquestions.org/questions/programming-9/)
-   -   Word To HTML (https://www.linuxquestions.org/questions/programming-9/word-to-html-686405/)

nehaandrew 11-26-2008 12:26 PM

Word To HTML
 
Any ideas how we can convert an MS Word Doc to HTML in PHP?

I tried using a COM object to do this - but some reason it gives me an "No Authorization" error on the Windows 2003 server.

I tried using an external software to do this - still same approach.


As a last resort - I am trying to use Open Office to do this conversion. I read about an approach where you can write a VBA plugin that will use the standard methods (writer_something) in Open Office to achieve this. Has anyone tried this approach?

Does anyone have any better suggestions on how one can achieve a conversion from DOC to HTML from within a PHP code?
(I've been breaking my arse on this for the last 2 weeks now!)


Thanks a zillion in advance!!!

ilikejam 11-26-2008 12:57 PM

Really not an expert on this, but just a suggestion...

Would it be possible to get PHP to call a VBScript to do the export, then pick up the resulting HTML file?

I've used VBScript to export excel spreadsheets to HTML, but only within Office itself, so I don't know if it's possible from an external system call.

Dave

knudfl 11-26-2008 01:51 PM

I once used this ( #2 )
http://www.linuxquestions.org/questi...hlight=abiword

jiml8 11-26-2008 04:42 PM

I would think the easiest way would be to invoke a vb script from PHP as has been suggested.

Note, though, that Word writes simply horrible HTML code.

nehaandrew 11-27-2008 04:38 AM

@ilikejam,
If you dont mind - can you please share the VBScript code that you used to convert excel into HTML. Maybe I'll be able tweak it to convert Word to HTML.
I plan to invoke the VBScript (embedded into MS word or Open Office) from PHP to do the trick.

(Whether it works or not is another story though!)
:D


Thanks a lot for the suggestions folks. You guys are swell ...
:)

Linux

Sergei Steshenko 11-27-2008 05:39 AM

Look for "demoroniser" when dealing with MS tools generated HTML.

ilikejam 11-27-2008 08:59 AM

Hi again. No can do, I'm afraid - I'm off work at the moment, and my VPN password's expired. I did find this though:
http://www.xefteri.com/articles/show.cfm?id=22
Probably better than my efforts.

Dave


All times are GMT -5. The time now is 02:37 PM.