How to determine the size of the Web? ... #pages, #hyperlinks
* get random selection of pages
* find how many are indexed by Google
* multiply Google index size by ratio (pages sampled / sampled pages indexed)
* look up DNS records (how many) ... estimate # sites
* count average pages on a selection of these
* multiply #records by avg pages
* relies on Google index
* counting publicly reachable web
* blocked by robots.txt
* http://www.cse.unsw.edu.au/xyz?id=123
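
The two estimates in these notes are simple arithmetic, and a minimal sketch may make the multiplication steps concrete. The function names, parameters and any numbers used with them are assumptions for illustration only; they do not come from the notes or from real measurements. The sketch also assumes the sample is uniformly random, which is hard to achieve in practice.

```python
def estimate_by_index_ratio(index_size, sampled, sampled_indexed):
    """Estimate #pages from a search index size and a random page sample.

    index_size      -- number of pages the search engine reports as indexed
    sampled         -- number of randomly selected pages
    sampled_indexed -- how many of those sampled pages were found in the index
    """
    if sampled_indexed <= 0 or sampled_indexed > sampled:
        raise ValueError("need 0 < sampled_indexed <= sampled")
    # If only a fraction of the sample is indexed, scale the index up
    # by the inverse of that fraction.
    return index_size * sampled / sampled_indexed


def estimate_by_dns(num_records, pages_per_sampled_site):
    """Estimate #pages from a DNS record count and per-site page counts.

    num_records            -- number of DNS records taken as the number of sites
    pages_per_sampled_site -- page counts measured on a selection of sites
    """
    if not pages_per_sampled_site:
        raise ValueError("need at least one sampled site")
    avg_pages = sum(pages_per_sampled_site) / len(pages_per_sampled_site)
    return num_records * avg_pages
```

For example, with made-up inputs, `estimate_by_index_ratio(1_000_000, 200, 50)` gives 4000000.0 (a quarter of the sample is indexed, so the index is scaled by four), and `estimate_by_dns(1000, [10, 20, 30])` gives 20000.0. Both figures only cover pages that can actually be reached and counted, so the caveats listed above still apply.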