How extract source from MS Word file with php?
Moderator: General Moderators
How extract source from MS Word file with php?
Hello,
I want to extract the source (text, table and other info) from MS Word file. And this source show in HTML code. Are possible make this? If yes, then how can do it with php?
PS. php are installed on Unix OS. In this case I can’t to use COM().
I want to extract the source (text, table and other info) from MS Word file. And this source show in HTML code. Are possible make this? If yes, then how can do it with php?
PS. php are installed on Unix OS. In this case I can’t to use COM().
- jayshields
- DevNet Resident
- Posts: 1912
- Joined: Mon Aug 22, 2005 12:11 pm
- Location: Leeds/Manchester, England
Not possible. Is it absolutely necessary to use MS Word?
If not, Open Office (http://www.openoffice.org) is free and the files are XML encoded, which could be parsed by PHP.
Someone could do with backing me up on the above statements, I'm not 100% although I'm pretty sure
If not, Open Office (http://www.openoffice.org) is free and the files are XML encoded, which could be parsed by PHP.
Someone could do with backing me up on the above statements, I'm not 100% although I'm pretty sure
- Chris Corbyn
- Breakbeat Nuttzer
- Posts: 13098
- Joined: Wed Mar 24, 2004 7:57 am
- Location: Melbourne, Australia
There are classes out there to work with MS Word files but I'm pretty sure they all use com(). Most of them are not free neither if that's any issue.
Most of the new word processors will allow you to save a file as .htm though. Just checked my Open Office on *nix certainly does. Open Office will open the MS Word files too....
Most of the new word processors will allow you to save a file as .htm though. Just checked my Open Office on *nix certainly does. Open Office will open the MS Word files too....
- jayshields
- DevNet Resident
- Posts: 1912
- Joined: Mon Aug 22, 2005 12:11 pm
- Location: Leeds/Manchester, England
nope. Open Office has been using an import filter for opening them since first released.jayshields wrote:Never knew OOo could open/edit MS Word files... I thought MS files were encrypted so that only Microsoft programs could handle them?
Proprietary formats in this area will soon be a thing of the past as the Open Document Format will be adopted by all major software houses.
- Ambush Commander
- DevNet Master
- Posts: 3698
- Joined: Mon Oct 25, 2004 9:29 pm
- Location: New Jersey, US
-
d3ad1ysp0rk
- Forum Donator
- Posts: 1661
- Joined: Mon Oct 20, 2003 8:31 pm
- Location: Maine, USA
Hello to everyone and thanks for replies!
I found the tool which convert from DOC to HTML. Can download it from here:
http://fresh.t-systems-sfr.com/linux/src/
http://fresh.t-systems-sfr.com/linux/sr ... 0.3.tar.gz
The now I want convert *.xls (MS Excel) to *.html. Maybe who anyone to known which tool I must to use?
I found the tool which convert from DOC to HTML. Can download it from here:
http://fresh.t-systems-sfr.com/linux/src/
http://fresh.t-systems-sfr.com/linux/sr ... 0.3.tar.gz
The now I want convert *.xls (MS Excel) to *.html. Maybe who anyone to known which tool I must to use?