TIP: Click on subject to list as thread! ANSI
echo: nthelp
to: Bill Lucy
from: Geo.
date: 2004-02-24 21:37:22
subject: Re: Word Compression

From: "Geo." 

"Bill Lucy"  wrote in message
news:MPG.1aa5a3fd2b192c8b98bf9e{at}news.barkto.com...

> To index the text content, SimpleSearch uses PaperPort’s OCR software to
extract
> and copy textual content from the items, and creates a database of the
words or
> phrases in those items, much like the index of a book.

You gotta be kidding me , I want to see it work on this:

http://www.budgetmolders.com/PDF03/196-198.pdf

OCR stuff typically fails on tables where there is no context check like
there is for a paragraph of text. If it can index the second page
accurately I'll be really impressed.

Geo (try the search feature on that site, works perfectly, search for
"gkcu01620")

--- BBBS/NT v4.01 Flag-5
* Origin: Barktopia BBS Site http://HarborWebs.com:8081 (1:379/45)
SEEN-BY: 633/267 270
@PATH: 379/45 1 633/267

SOURCE: echomail via fidonet.ozzmosis.com

Email questions or comments to sysop@ipingthereforeiam.com
All parts of this website painstakingly hand-crafted in the U.S.A.!
IPTIA BBS/MUD/Terminal/Game Server List, © 2025 IPTIA Consulting™.