[Info-vax] AltaVista background information
dsnyder
danieldsnyder at gmail.com
Mon Oct 17 07:34:50 EDT 2011
On Oct 15, 3:52 pm, MG <marcog... at SPAMxs4all.nl> wrote:
> Recently I thought about AltaVista, the search engine I used in the
> 1990s, without any doubt my favourite one back then. I remember the
> "[d|i|g|i|t|a|l]" branding well, but unfortunately didn't really look
> beyond that back then. (I was also quite young.)
>
> I was wondering, are there documents (articles, books and so forth)
> telling a thing or two about how it came into being and outlining
> the technological foundation? I'm, needless to say, interested if
> VMS was involved. (Else Tru64 or any other DEC operating system
> and hardware platform of interest.)
>
> - MG
>From the archives, a bit of info on AltaVista:
The Hardware Behind AltaVista
AltaVista: AlphaStation 500, 256 MB memory, 6GB disk.
AlphaStation 500's handle all external traffic to the site.
They run a custom multi-threaded Web server which sends
queries to the Web indexer and News indexer.
Web Indexer: AlphaServer 8400 5/300, 10 processors, 6 GB
memory, 210 GB RAID disk. This model is the most powerful
computer built by Digital. These servers run the query
engine. The Web index is larger than 40 GB, but most
requests take less than a second.
Scooter: AlphaServer 4100 5/300, 1.5 GB memory, 30 GB RAID
disk. The super-spider runs from this machine. It fetches
pages from the Web and sends them to Vista, our primary web
indexer.
Vista: AlphaServer 4100 5/300, 2 processors, 2GB memory,
180GB RAID disk. This machine indexes Scooter output and
serves as a central distribution point for new index data.
News Indexer: AlphaServer 600 5/333, 896MB memory, 13 GB
disk. This machine keeps an up-to-date index of the news
spool: since new articles appear and old articles expire all
the time, it is in fact quite busy, even though the index it
serves is much smaller than the Web index.
News Server: AlphaServer 600 5/333, 896MB memory, 24 GB RAID
disks. It maintains a current news spool for the News
Indexer. It also serves the articles via http to those of
you who don't want to know about news servers but want to
read news.
Those aren't typos, either -- those are machines with many, MANY
megabytes
of RAM, and they're hooked up to some of the fastest random-access
backing
store in the world with capacities in the many, many gigabyte range.
AltaVista is fast because DEC threw some phenomenally capable
resources at
the problem.
More information about the Info-vax
mailing list