Page 1 of 1

searching text in MS Word

Posted: Sat Apr 01, 2006 4:34 am
by mistik
Hi. For my master thesis at university, I need to code the search engine for e-learning and it should do also fulltext search in Word documents. Can you, please, somebody help me and write how to do it (if it is possible), or write that it is impossible? Thanks a lot

Posted: Sat Apr 01, 2006 4:40 am
by timvw
Since it's your thesis, you should do the invistigations.. Here's a hint: Look at the COM extension... $app = new COM('word.application');

use phpdig to index doc, xls, pdf etc

Posted: Sun Apr 02, 2006 1:38 am
by dibyendrah
You can use the phpDig to index many documents like pdf, doc, xls etc and later on it search the indexed documents using MySQL Full text. It's an open source project.

You can get the current stable version from this link :

Code: Select all

http://www.phpdig.net/navigation.php?action=download
Hope this might help you in some extent.

Dibyendra

more indexing programs

Posted: Sun Apr 02, 2006 1:47 am
by dibyendrah
To convert the doc and xls to text and csv you can use catdoc and xls2csv:
http://www.45.free.net/~vitus/ice/catdoc/

To convert the ps to text
http://research.compaq.com/SRC/virtualp ... otext.html

To convert pdf to text
http://public.planetmirror.com/pub/xpdf/


:lol:

Dibyendra

Posted: Sun Apr 02, 2006 2:50 pm
by Ambush Commander
Mm... two dead links and another program in a different language. It'll lots of fun getting it integrated :roll: (go with COM)