| TIP: Click on subject to list as thread! | ANSI |
| echo: | |
|---|---|
| to: | |
| from: | |
| date: | |
| subject: | Re: Word Compression |
From: Bill Lucy From the Mars probe, Antti Kurenniemi says... > I was actually today looking into PaperPort. I was hoping to find a demo > version but didn't (at least yet - I almost died after watching that stupid > flash "demonstration" stuff and had to take a break), so maybe you could > answer one question: we have a bunch of contracts on paper that we'd now > like to scan (yes I know, how about some planning ahead before there are > 15.000 contracts ) and store in a server. It would be alright to have the > contract number as a file name, and we've already got a big scanner in which > we can just pile the paper in and it will scan one page as one image (I > think it can do pdf as well but haven't gotten that far yet). > > So: is PaperPort capable of finding a number (8 digits mostly) from a > (almost) fixed point in a scanned image or pdf file, and then use it as file > name, in a batch mode? It seems that we could do it one by one, but that's a > definite no-no as there are roughly 15.000 pages of this stuff... > > Any info appreciated. This is from the help file on searches: earching by text content SimpleSearch can index the text content of PaperPort Image (.max), PDF, TIFF, and DCX files. It can also index the content of text items, including Word, Notepad, WordPad, Excel, and HTML files. To index the text content, SimpleSearch uses PaperPort’s OCR software to extract and copy textual content from the items, and creates a database of the words or phrases in those items, much like the index of a book. You can then find scanned items by searching on words contained in those items. For example, if you have scanned items from different investment companies, you might search for words such as bonds, gold, or mutual funds, to find items that contain those words. -+- So, yes, you can. With 15,000 pages, it might take awhile to index your PDFs, but it will do it. --- BBBS/NT v4.01 Flag-5* Origin: Barktopia BBS Site http://HarborWebs.com:8081 (1:379/45) SEEN-BY: 633/267 270 @PATH: 379/45 1 633/267 |
|
| SOURCE: echomail via fidonet.ozzmosis.com | |
Email questions or comments to sysop@ipingthereforeiam.com
All parts of this website painstakingly hand-crafted in the U.S.A.!
IPTIA BBS/MUD/Terminal/Game Server List, © 2025 IPTIA Consulting™.