[ih] RAND Unix Port code
pnr at planet.nl
Sun Feb 19 01:24:31 PST 2017
From recent experience I can confirm that OCR'ing a faded 40-year-old printout from a drum line printer is hard. I found that correcting the OCR output was as time-consuming as retyping, so after doing some 10 pages the OCR way I switched to retyping from scratch for the rest. For C code I got to about 5 pages per hour.
On 19 Feb 2017, at 0:49, Scott Brim wrote:
> On Mon, Feb 13, 2017 at 3:33 PM, Jack Haverty <jack at 3kitty.org> wrote:
>> OK, OK, I'll get my old listing of the Unix TCP and start scanning.
>> This will generate some huge files. It's probably about 100 8.5x11
>> pages of stuff. Has anybody figured out the "right" way to do this to
>> maximize usability (in case someone really wants to OCR or whatever)?
>> 600DPI TIFs?
> For 100 pages, could you get some eager (cheap) undergrads to type it in again and then correct the typos? Give five kids 20 pages each?