[ih] archiving the history of Internet-History
touch at strayalpha.com
touch at strayalpha.com
Fri Aug 29 07:40:10 PDT 2025
Well, it’s a publicly available list, so anyone can jump in and help and your help is appreciated.
Joe (list admin/owner)
—
Dr. Joe Touch, temporal epistemologist
www.strayalpha.com
> On Aug 29, 2025, at 1:30 AM, John Gilmore via Internet-history <internet-history at elists.isoc.org> wrote:
>
> John Levine via Internet-history <internet-history at elists.isoc.org> wrote:
>> FWIW, the list archives are here and the Internet Archive drops by to save a copy every month or so.They go back to 2001.
>> https://elists.isoc.org/pipermail/internet-history/
>
> The Internet Archive crawls and saves the list archives bimonthly, only
> because I have an Archive-It.org account (a service run by the Archive)
> and ask them to. See:
>
> https://archive-it.org/collections/15071
>
> The saved web pages go into the Wayback Machine's permanent collection
> as well as into my own little collection hosted at the Archive. The
> full text from the saved pages can also be searched in a search box at
> the above URL (e.g. "flag day" produces hits on 9 sites, including 406
> messages from this list).
>
> If you know of other web sites that should be in such an archive of
> Internet and Unix history, please suggest them to me. I ask for
> one-time or annual crawls of historical (unchanging) sites, and annual
> or more frequent crawls of ones that get regular updates.
>
> John
>
> PS: The Internet Archive's regular web crawls focus more on breadth than
> depth, so they would otherwise miss crawling down to thousands of
> messages from this list. Also, a lot of web sites contain unintentional
> (or sometimes intentional) "crawler traps" that require manual
> configuration to avoid the crawler going down a rathole of
> www.foo.bar/symlink/symlink/symlink/symlink/foo.html forever.
> Archive-It provides ways to manually avoid such traps once your crawl
> has run down them once.
>
> PPS:
>> Related museums have been contacted and are not available to support us.
>
> I bet I know somebody smart enough at the Computer History Museum to at
> least subscribe a logfile to the existing list, which would accumulate
> new messages on a CHM server. And a simple wget command, run once,
> would pull in the already-archived logfiles saved by Date to get all the
> past postings to the list. At current storage prices, this would cost
> them a small rounding error. Yes, it would break someday for a random
> reason, but at least it would get all the discussions up to that point,
> archived in a third place where neither an ISOC nor an IA failure could
> touch them.
> --
> Internet-history mailing list
> Internet-history at elists.isoc.org
> https://elists.isoc.org/mailman/listinfo/internet-history
> -
> Unsubscribe: https://app.smartsheet.com/b/form/9b6ef0621638436ab0a9b23cb0668b0b?The%20list%20to%20be%20unsubscribed%20from=Internet-history
More information about the Internet-history
mailing list