searching text in MS Word
Posted: Sat Apr 01, 2006 4:34 am
by mistik
Hi. For my master's thesis at university, I need to code a search engine for e-learning, and it should also do full-text search in Word documents. Could somebody please help me and explain how to do it (if it is possible), or tell me that it is impossible? Thanks a lot.
Posted: Sat Apr 01, 2006 4:40 am
by timvw
Since it's your thesis, you should do the investigations. Here's a hint: look at the COM extension... $app = new COM('word.application');
Use phpDig to index doc, xls, pdf, etc.
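A minimal sketch of the COM route, for what it's worth. It assumes a Windows machine with MS Word installed and PHP's COM extension (com_dotnet) enabled. The function names extractWordText and containsPhrase are made up for illustration, and the plain substring match is only a stand-in for a real index.

```php
<?php
// Sketch only: requires Windows, MS Word, and the com_dotnet extension.

// Hypothetical helper: returns the plain text of a Word document via COM automation.
function extractWordText(string $path): string
{
    $full = realpath($path);            // Word expects an absolute path
    if ($full === false) {
        throw new RuntimeException("File not found: $path");
    }
    $word = new COM('word.application');
    $word->Visible = false;
    try {
        // Arguments: FileName, ConfirmConversions = false, ReadOnly = true
        $doc = $word->Documents->Open($full, false, true);
        $text = (string) $doc->Content->Text;
        $doc->Close(false);             // close without saving changes
    } finally {
        $word->Quit();                  // always shut the Word process down
    }
    return $text;
}

// Hypothetical helper: case-insensitive substring match on the extracted text.
function containsPhrase(string $text, string $phrase): bool
{
    return $phrase !== '' && stripos($text, $phrase) !== false;
}
```

Usage would be something like containsPhrase(extractWordText('C:\\docs\\thesis.doc'), 'e-learning'). Starting Word for every query is slow, so it probably makes more sense to extract the text once per document and store it somewhere searchable.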
Posted: Sun Apr 02, 2006 1:38 am
by dibyendrah
You can use phpDig to index many document types such as pdf, doc, xls, etc., and later search the indexed documents using MySQL full-text search. It's an open source project.
You can get the current stable version from this link:
http://www.phpdig.net/navigation.php?action=download
Hope this helps you to some extent.
Dibyendra
more indexing programs
Posted: Sun Apr 02, 2006 1:47 am
by dibyendrah
To convert doc and xls files to text and CSV, you can use catdoc and xls2csv:
http://www.45.free.net/~vitus/ice/catdoc/
To convert ps to text:
http://research.compaq.com/SRC/virtualp ... otext.html
To convert pdf to text:
http://public.planetmirror.com/pub/xpdf/
Dibyendra
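If you go the catdoc route, shelling out from PHP might look roughly like the sketch below. It assumes catdoc is installed and on the PATH, and that it writes the converted text to standard output when given a file name. The function name docToText and the default binary name are illustrative only.

```php
<?php
// Sketch only: assumes a catdoc binary is installed and reachable.

// Hypothetical helper: returns the text of a .doc file, or null on failure.
function docToText(string $path, string $catdoc = 'catdoc'): ?string
{
    if (!is_file($path) || !is_readable($path)) {
        return null;                    // nothing to convert
    }
    // Escape both parts so odd file names cannot break the command line.
    $cmd = escapeshellcmd($catdoc) . ' ' . escapeshellarg($path);
    $lines = [];
    $status = 1;
    exec($cmd, $lines, $status);        // collects stdout line by line
    return $status === 0 ? implode("\n", $lines) : null;
}
```

The returned text can then be fed into whatever index you build. A null result means the file was missing or the converter exited with an error.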
Posted: Sun Apr 02, 2006 2:50 pm
by Ambush Commander
Mm... two dead links and another program in a different language. It'll be lots of fun getting it integrated.

(go with COM)