Results 1 to 2 of 2

Thread: Search option for extracting content in word document

  1. #1
    Join Date
    Nov 2010
    Posts
    115
    Thanks
    27
    Thanked 0 Times in 0 Posts

    Default Search option for extracting content in word document

    Hi All,

    I just want to know if it is possible to extract the specified search keyword from doc file. I am having a website and i am having many doc files(resumes) in that. Now what i want is if i type JAVA in my search box i need to get all the documents that are having JAVA string in the doc files. If it is possible let me know how to get it.

  2. #2
    Join Date
    Mar 2007
    Location
    New York, NY
    Posts
    557
    Thanks
    8
    Thanked 66 Times in 66 Posts

    Default

    Yes, but if you have a lot of documents, it may take a while to process the search. You would have to loop through all of your doc files, decode them to get their text, and do a regular string REGEX search in the document. If the search returns true, the doc file contains the string you're looking for, and based on that you can return your data.

    I know this is a pretty vague response, but it is possible.

    Antiword is a library that will allow you to read .doc files:
    http://www.winfield.demon.nl/
    - Josh

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •