Internet Archive Web Crawls

The Internet Archive discovers and captures web pages through many different web crawls.



Government Web & Data Archive
collection
6,269 items
8.4M views
This collaborative project is an extension of the 2016 End of Term project, intended to document the federal government's web presence by archiving government websites and data. As part of this preservation effort, URLs supplied by partner institutions, as well as those nominated by the public, will be crawled regularly to provide an ongoing view of federal agencies' web and social media presence. Key partners on this effort are the Environmental Data & Governance...
Topics: government, data, federal, congress
Live Web Proxy Crawls
web
eye 8M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 8M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 6.6M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 4.7M
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl344.us.archive.org:survey from Thu Oct 12 08:48:34 PDT 2017 to Thu Oct 12 01:56:31 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 2.9M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 2.6M
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl422.us.archive.org:wide from Wed Jan 4 01:00:14 PST 2017 to Tue Jan 3 19:50:56 PST 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 2.3M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 2.1M
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl428.us.archive.org:wide from Tue Jun 13 00:55:34 PDT 2017 to Mon Jun 12 19:36:27 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 1.9M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.8M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.6M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.5M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.5M
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl423.us.archive.org:wide from Tue Jun 6 06:46:26 PDT 2017 to Tue Jun 6 01:11:01 PDT 2017.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl809.us.archive.org:wide from Tue Jun 6 00:51:50 PDT 2017 to Mon Jun 5 19:34:08 PDT 2017.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl801.us.archive.org:wide from Tue Jun 6 07:52:14 PDT 2017 to Tue Jun 6 03:02:49 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 1.3M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.3M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.3M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.2M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.2M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.1M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.1M
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl421.us.archive.org:wide from Sat Jun 3 01:20:30 PDT 2017 to Sat Jun 3 14:17:04 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 1.1M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.1M
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 1.1M
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl813.us.archive.org:wide from Tue Jun 6 23:57:11 PDT 2017 to Tue Jun 6 17:57:05 PDT 2017.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl808.us.archive.org:wide from Wed Jun 7 01:07:21 PDT 2017 to Tue Jun 6 19:06:13 PDT 2017.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl422.us.archive.org:wide from Wed Jun 7 00:20:30 PDT 2017 to Tue Jun 6 18:18:45 PDT 2017.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl808.us.archive.org:wide from Tue Jun 6 23:48:51 PDT 2017 to Tue Jun 6 17:43:39 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 1M
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl803.us.archive.org:wide from Sat Jun 3 01:16:09 PDT 2017 to Sat Jun 3 14:12:26 PDT 2017.
Topic: crawldata
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 996,644
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl429.us.archive.org:wide from Sat Jun 3 01:16:43 PDT 2017 to Sat Jun 3 14:13:27 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 996,178
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 980,433
favorite 0
comment 0
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 977,276
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl425.us.archive.org:wide from Sat Jun 3 01:22:54 PDT 2017 to Sat Jun 3 14:21:33 PDT 2017.
Topic: crawldata
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 977,237
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl807.us.archive.org:wide from Sat Jun 3 01:18:39 PDT 2017 to Sat Jun 3 14:15:01 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 962,892
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 952,238
favorite 0
comment 0
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 922,148
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl344.us.archive.org:survey from Sun Oct 29 08:31:23 PDT 2017 to Sun Oct 29 02:17:27 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 908,636
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl339.us.archive.org:survey from Sat Sep 30 04:27:26 PDT 2017 to Fri Sep 29 21:56:42 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 879,942
favorite 0
comment 0
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 852,748
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl814.us.archive.org:wide from Wed Jun 7 01:39:13 PDT 2017 to Tue Jun 6 19:25:06 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 811,990
favorite 0
comment 0
Live Web Proxy Crawls
web
eye 804,676
favorite 0
comment 0
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 799,617
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl425.us.archive.org:wide from Sat Jun 3 01:04:23 PDT 2017 to Sat Jun 3 14:02:05 PDT 2017.
Topic: crawldata
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 799,382
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl812.us.archive.org:wide from Sat Jun 3 01:15:27 PDT 2017 to Sat Jun 3 13:58:41 PDT 2017.
Topic: crawldata
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 783,165
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl803.us.archive.org:wide from Sat Jun 3 01:02:37 PDT 2017 to Fri Jun 2 18:23:04 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 779,362
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl344.us.archive.org:survey from Sat Oct 7 15:34:23 PDT 2017 to Sat Oct 7 08:50:07 PDT 2017.
Topic: crawldata
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 774,865
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl429.us.archive.org:wide from Sat Jun 3 01:09:57 PDT 2017 to Fri Jun 2 18:21:38 PDT 2017.
Topic: crawldata
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
web
eye 759,664
favorite 0
comment 0
Internet Archive crawldata from Webwide Crawl, captured by crawl421.us.archive.org:wide from Sat Jun 3 01:10:47 PDT 2017 to Fri Jun 2 18:30:57 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 754,897
favorite 0
comment 0
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 748,593
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl825.us.archive.org:survey from Tue Sep 12 05:26:47 PDT 2017 to Wed Sep 13 06:35:45 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 741,127
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl339.us.archive.org:survey from Mon Sep 11 13:31:15 PDT 2017 to Mon Sep 11 22:22:53 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 741,021
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl824.us.archive.org:survey from Sat Sep 16 02:29:17 PDT 2017 to Tue Sep 19 12:30:46 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 740,969
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl818.us.archive.org:survey from Mon Sep 11 13:36:04 PDT 2017 to Mon Sep 11 22:51:52 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 737,807
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl825.us.archive.org:survey from Mon Sep 11 13:31:57 PDT 2017 to Mon Sep 11 22:26:47 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 736,040
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl835.us.archive.org:survey from Mon Sep 11 13:32:08 PDT 2017 to Mon Sep 11 22:37:40 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 735,578
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl817.us.archive.org:survey from Mon Sep 11 04:34:29 PDT 2017 to Mon Sep 11 22:23:11 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 732,068
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl824.us.archive.org:survey from Mon Sep 11 13:31:49 PDT 2017 to Mon Sep 11 22:26:17 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 730,209
favorite 0
comment 0
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 729,693
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl824.us.archive.org:survey from Tue Sep 12 05:26:17 PDT 2017 to Wed Sep 13 06:44:40 PDT 2017.
Topic: crawldata
Live Web Proxy Crawls
web
eye 727,662
favorite 0
comment 0
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 726,927
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl818.us.archive.org:survey from Tue Sep 12 05:51:52 PDT 2017 to Wed Sep 13 02:15:41 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 725,562
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl835.us.archive.org:survey from Tue Sep 12 05:37:40 PDT 2017 to Wed Sep 13 02:01:07 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 723,198
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl818.us.archive.org:survey from Wed Sep 13 09:15:41 PDT 2017 to Fri Sep 15 04:22:03 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 723,024
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl836.us.archive.org:survey from Tue Sep 12 05:54:14 PDT 2017 to Wed Sep 13 08:42:57 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 722,294
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl838.us.archive.org:survey from Mon Sep 11 13:32:37 PDT 2017 to Mon Sep 11 21:52:51 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 722,140
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl339.us.archive.org:survey from Sat Sep 16 03:22:07 PDT 2017 to Tue Sep 19 12:55:17 PDT 2017.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
eye 721,900
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl825.us.archive.org:survey from Wed Sep 13 13:35:46 PDT 2017 to Fri Sep 15 23:02:56 PDT 2017.
Topic: crawldata