NIST's Text REtrieval Conference (TREC) has been gathering data sets for information retrieval research for over two decades. As these collections have grown, it has become impractical to ship disks to each researcher participating in TREC. Thanks to Amazon Web Services, it is now easy to process large data sets and run entire TREC systems in the cloud.
TREC collections currently available in AWS: