Web Hosting Talk







View Full Version : How to convert .doc to text?


TheHero
03-04-2008, 05:00 PM
Hi,

I am looking for a way to convert MS Word (.doc) files to plain text files from Linux command line.

I know there is catdoc, antiword and wv but there are all pretty old and I either can't find them or they won't convert Office XP documents.

Thank for your help.

Stephane

isurus
03-06-2008, 06:43 AM
I realise that Open Office is a bit of a heavy weight for this task but you could use it from the command line to convert your files...
http://www.oooforum.org/forum/viewtopic.phtml?t=7242
http://www.oooforum.org/forum/viewtopic.phtml?t=4998

OO has a headless mode that may be useful:-headless (supported from srx644)

The headless mode makes it possible to script an Office without any user interface and user interface interaction. It is a special mode used typically by external scripting clients. The Office has no user interface and the lifetime must be controlled by the external scripting client.HTH