Recently I had to automatically extract text from a bunch of Word documents under Windows. I liked the looks of catdoc, but didn’t see a native Win32 port around. The source code looked so very close to compiling under MinGW, so I made the few minor changes necessary and got it working (catdoc, catppt, and xls2csv). Native Win32 executables, support for long filenames, etc.
Basically all I did was:
- Add a glob function from the BSD-licensed unixem library.
- Change a few of the
if defined(__MSDOS__) || defined(_WIN32).
- Make one or two other minor changes to
fileutil.c, including the
Nothing special, and it’s not perfect. But here is a zip of the compiled binaries and (GPL-licensed) source code, just for you:
15 September 2009 by Ben 12 comments