searching text in MS Word

PHP programming forum. Ask questions or help people concerning PHP code. Don't understand a function? Need help implementing a class? Don't understand a class? Here is where to ask. Remember to do your homework!

Moderator: General Moderators

Post Reply
mistik
Forum Newbie
Posts: 1
Joined: Sat Apr 01, 2006 4:31 am

searching text in MS Word

Post by mistik »

Hi. For my master thesis at university, I need to code the search engine for e-learning and it should do also fulltext search in Word documents. Can you, please, somebody help me and write how to do it (if it is possible), or write that it is impossible? Thanks a lot
timvw
DevNet Master
Posts: 4897
Joined: Mon Jan 19, 2004 11:11 pm
Location: Leuven, Belgium

Post by timvw »

Since it's your thesis, you should do the invistigations.. Here's a hint: Look at the COM extension... $app = new COM('word.application');
User avatar
dibyendrah
Forum Contributor
Posts: 491
Joined: Wed Oct 19, 2005 5:14 am
Location: Nepal
Contact:

use phpdig to index doc, xls, pdf etc

Post by dibyendrah »

You can use the phpDig to index many documents like pdf, doc, xls etc and later on it search the indexed documents using MySQL Full text. It's an open source project.

You can get the current stable version from this link :

Code: Select all

http://www.phpdig.net/navigation.php?action=download
Hope this might help you in some extent.

Dibyendra
User avatar
dibyendrah
Forum Contributor
Posts: 491
Joined: Wed Oct 19, 2005 5:14 am
Location: Nepal
Contact:

more indexing programs

Post by dibyendrah »

To convert the doc and xls to text and csv you can use catdoc and xls2csv:
http://www.45.free.net/~vitus/ice/catdoc/

To convert the ps to text
http://research.compaq.com/SRC/virtualp ... otext.html

To convert pdf to text
http://public.planetmirror.com/pub/xpdf/


:lol:

Dibyendra
User avatar
Ambush Commander
DevNet Master
Posts: 3698
Joined: Mon Oct 25, 2004 9:29 pm
Location: New Jersey, US

Post by Ambush Commander »

Mm... two dead links and another program in a different language. It'll lots of fun getting it integrated :roll: (go with COM)
Post Reply