Results 1 to 7 of 7

Thread: The Open library Project

  1. #1
    Join Date
    Aug 2005
    Posts
    971
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Lightbulb The Open library Project

    Hello everyone!
    The Open Library Project: http://openlibrary.org/

    seems to have very neat features, but the feature that I am most interested is in the text-search feature(click on a book). As one can see it searches the book and highlights the text in the book, which is an image. How can such an effect be possible? Anyone have any idea how one can search an image for matching text? Just curious.

    Thanks.

  2. #2
    Join Date
    Jun 2005
    Location
    英国
    Posts
    11,876
    Thanks
    1
    Thanked 180 Times in 172 Posts
    Blog Entries
    2

    Default

    The search doesn't work for me so I can't be entirely sure, but I'd suspect the highlighted page images are generated server-side, and the text of the book is also stored.
    Twey | I understand English | 日本語が分かります | mi jimpe fi le jbobau | mi esperanton komprenas | je comprends franšais | entiendo espa˝ol | t˘i Ýt hiểu tiếng Việt | ich verstehe ein bisschen Deutsch | beware XHTML | common coding mistakes | tutorials | various stuff | argh PHP!

  3. #3
    Join Date
    Aug 2005
    Posts
    971
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Default

    Quote Originally Posted by Twey
    The search doesn't work for me so I can't be entirely sure
    Oh.

    Quote Originally Posted by Twey
    but I'd suspect the highlighted page images are generated server-side,
    True, but how unless they've stored the text of the book as well.

    I'd love to know how it's done without actually storing the text.

    Thanks for your reply.

  4. #4
    Join Date
    Jun 2005
    Location
    英国
    Posts
    11,876
    Thanks
    1
    Thanked 180 Times in 172 Posts
    Blog Entries
    2

    Default

    Quote Originally Posted by myself
    and the text of the book is also stored.
    It is possible that it could have been done via a text-recognition algorithm.
    Twey | I understand English | 日本語が分かります | mi jimpe fi le jbobau | mi esperanton komprenas | je comprends franšais | entiendo espa˝ol | t˘i Ýt hiểu tiếng Việt | ich verstehe ein bisschen Deutsch | beware XHTML | common coding mistakes | tutorials | various stuff | argh PHP!

  5. #5
    Join Date
    Aug 2005
    Posts
    971
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Default

    Well, that seems pretty reasonable, I wonder how they do it. What I'd love to see is a text-recognition algorithm in PHP. That'd be quite a challenge.

  6. #6
    Join Date
    Mar 2006
    Location
    Illinois, USA
    Posts
    12,164
    Thanks
    265
    Thanked 690 Times in 678 Posts

    Default

    I think it's stored serverside. That image may be rendered, as well. I doubt they are storing full images for every single page.
    If you notice, the audio is just a recording of the book, so it's probably the same with text... just interrelated pieces.
    Daniel - Freelance Web Design | <?php?> | <html>| espa˝ol | Deutsch | italiano | portuguŕs | catalÓ | un peu de franšais | some knowledge of several other languages: I can sometimes help translate here on DD | Linguistics Forum

  7. #7
    Join Date
    Aug 2005
    Posts
    971
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Default

    drj33: True, I completely missed that.

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •