Crawl data donated by Alexa Internet. This data is currently not publicly accessible
Topic: crawldata
Domain crawl of the New Zealand web domain (.nz) performed by Internet Archive on behalf of the National Library of New Zealand in January-March, 2021.
Topic: crawldata
5.8M
5.8M
May 14, 2020
05/20
by
Internet Archive
web
eye 5.8M
favorite 0
comment 0
"Internet Archive crawldata from feed-driven by 1.2 million top ranked domains from data.domainrank.io - captured by crawl423.us.archive.org:survey_00010 from Mon May 11 14:14:43 PDT 2020 to Mon May 11 09:09:55 PDT 2020."
Topics: survey_00010, crawldata
485,410
485K
May 9, 2020
05/20
by
Internet Archive
web
eye 485,410
favorite 0
comment 0
Internet Archive crawldata from Twitter Outlinks Crawl, captured by crawl504.us.archive.org:twitter_outlinks from Sat May 9 10:59:17 PDT 2020 to Sat May 9 05:39:43 PDT 2020.
Topics: twitter, crawldata
485,081
485K
May 10, 2020
05/20
by
Internet Archive
web
eye 485,081
favorite 0
comment 0
Internet Archive crawldata from Twitter Outlinks Crawl, captured by crawl502.us.archive.org:twitter_outlinks from Sat May 9 11:34:27 PDT 2020 to Sat May 9 05:46:43 PDT 2020.
Topics: twitter, crawldata
480,832
481K
May 9, 2020
05/20
by
Internet Archive
web
eye 480,832
favorite 0
comment 0
Internet Archive crawldata from Twitter Outlinks Crawl, captured by crawl503.us.archive.org:twitter_outlinks from Sat May 9 17:55:40 PDT 2020 to Sat May 9 12:53:23 PDT 2020.
Topics: twitter, crawldata
482,823
483K
May 9, 2020
05/20
by
Internet Archive
web
eye 482,823
favorite 0
comment 0
Internet Archive crawldata from Twitter Outlinks Crawl, captured by crawl505.us.archive.org:twitter_outlinks from Sat May 9 12:46:01 PDT 2020 to Sat May 9 07:08:37 PDT 2020.
Topics: twitter, crawldata
This crawl of online resources of the 115th US Congress was performed on behalf of The United States National Archives & Records
Topic: crawldata
TEST COLLECTION: Crawl of .edu and .gov sites started in June 2010.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl428.us.archive.org:wide from Tue Jun 13 00:55:34 PDT 2017 to Mon Jun 12 19:36:27 PDT 2017.
Topic: crawldata
8.8M
8.8M
Jul 6, 2018
07/18
by
Internet Archive
web
eye 8.8M
favorite 0
comment 1
Internet Archive crawldata from feed-driven Twitter Outlinks Crawl, captured by crawl345.us.archive.org:twitter from Fri Jul 6 13:02:39 PDT 2018 to Fri Jul 6 11:11:04 PDT 2018.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: twitter, crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl344.us.archive.org:survey from Thu Oct 12 08:48:34 PDT 2017 to Thu Oct 12 01:56:31 PDT 2017.
Topic: crawldata
202,527
203K
Feb 3, 2019
02/19
by
Internet Archive
web
eye 202,527
favorite 0
comment 0
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl817.us.archive.org:survey from Sun Feb 3 02:47:29 PST 2019 to Sat Feb 2 23:56:30 PST 2019.
Topic: crawldata
52,609
53K
Feb 15, 2021
02/21
by
Internet Archive
web
eye 52,609
favorite 0
comment 0
The Internet Archive's crawl data capture by the TikTok crawl project. Captured by crawl802.us.archive.org:tiktok from Mon Feb 15 02:27:55 PST 2021 to Sun Feb 14 19:43:50 PST 2021.
Topic: crawldata
17.5M
18M
May 3, 2011
05/11
by
Internet Archive
web
eye 17.5M
favorite 6
comment 1
Internet Archive Liveweb Capture from WaybackMachine, captured by wwwb-proxy0.us.archive.org:wbm from Sun Mar 27 22:10:09 PDT 2011 to Mon Mar 28 05:27:05 PDT 2011.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
4.3M
4.3M
Mar 31, 2018
03/18
by
countess
web
eye 4.3M
favorite 0
comment 0
Alexa crawl
Topic: crawldata
This crawl of online resources of the 116th US Congress was performed on behalf of The United States National Archives & Records
Topic: crawldata
8.3M
8.3M
web
eye 8.3M
favorite 0
comment 0
Data crawled by National Endowment for the Humanities and JISC on behalf of Internet Archive from Fri Aug 08 00:17:40 PDT 2008 to Thu Jun 26 05:29:33 PDT 2008
Topic: crawldata
1.7M
1.7M
Jan 27, 2020
01/20
by
Internet Archive
web
eye 1.7M
favorite 0
comment 0
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl892.us.archive.org:wikipedia-eventstream from Mon Jan 27 19:40:21 PST 2020 to Mon Jan 27 12:56:59 PST 2020.
Topic: crawldata
155,675
156K
Apr 16, 2012
04/12
by
thumper2.php
web
eye 155,675
favorite 0
comment 0
Alexa crawl
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl426.us.archive.org:wide from Fri Oct 7 04:29:15 PDT 2011 to Fri Oct 7 00:11:17 PDT 2011.
Topic: crawldata
Data crawled by Institut national de laudiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl421.us.archive.org:wide from Mon Feb 12 21:42:38 PST 2018 to Mon Feb 12 15:20:34 PST 2018.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl426.us.archive.org:wide from Fri Oct 7 14:36:17 PDT 2011 to Fri Oct 7 08:44:43 PDT 2011.
Topic: crawldata
Data crawled by Institut national de laudiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
31,362
31K
Feb 19, 2021
02/21
by
Internet Archive
web
eye 31,362
favorite 0
comment 0
Internet Archive crawldata from GDELT1 Crawl, captured by crawl501.us.archive.org:gdelt1_seeds from Thu Feb 18 20:38:21 PST 2021 to Fri Feb 19 09:05:21 PST 2021.
Topics: GDELT, crawldata
Data crawled by Institut national de laudiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
favorite ( 1 reviews )
Topic: crawldata
30,736
31K
Feb 18, 2021
02/21
by
Internet Archive
web
eye 30,736
favorite 0
comment 0
Internet Archive crawldata from GDELT1 Crawl, captured by crawl501.us.archive.org:gdelt1_seeds from Thu Feb 18 03:05:33 PST 2021 to Thu Feb 18 10:06:32 PST 2021.
Topics: GDELT, crawldata
386,071
386K
Nov 14, 2019
11/19
by
Internet Archive
web
eye 386,071
favorite 0
comment 0
Internet Archive crawldata from feed-driven Twitter Outlinks Crawl, captured by crawl863.us.archive.org:twitter from Wed Nov 13 22:13:55 PST 2019 to Wed Nov 13 17:06:12 PST 2019.
Topics: twitter, crawldata
385,477
385K
Nov 14, 2019
11/19
by
Internet Archive
web
eye 385,477
favorite 0
comment 0
Internet Archive crawldata from feed-driven Twitter Outlinks Crawl, captured by crawl856.us.archive.org:twitter from Thu Nov 14 16:56:34 PST 2019 to Thu Nov 14 09:55:13 PST 2019.
Topics: twitter, crawldata
29,027
29K
Feb 17, 2021
02/21
by
Internet Archive
web
eye 29,027
favorite 0
comment 0
Internet Archive crawldata from GDELT1 Crawl, captured by crawl501.us.archive.org:gdelt1_seeds from Tue Feb 16 10:53:57 PST 2021 to Tue Feb 16 23:38:58 PST 2021.
Topics: GDELT, crawldata
28,781
29K
Feb 17, 2021
02/21
by
Internet Archive
web
eye 28,781
favorite 0
comment 0
Internet Archive crawldata from GDELT1 Crawl, captured by crawl501.us.archive.org:gdelt1_seeds from Tue Feb 16 22:10:49 PST 2021 to Wed Feb 17 11:06:43 PST 2021.
Topics: GDELT, crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl426.us.archive.org:wide from Wed Feb 19 07:58:38 PST 2014 to Wed Feb 19 05:13:46 PST 2014.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl453.us.archive.org:wide from Wed Feb 19 01:09:37 PST 2014 to Tue Feb 18 21:33:27 PST 2014.
Topic: crawldata
381,199
381K
Aug 30, 2019
08/19
by
countess
web
eye 381,199
favorite 0
comment 0
Alexa crawl
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl420.us.archive.org:wide from Tue Feb 18 17:01:58 PST 2014 to Tue Feb 18 13:14:06 PST 2014.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl454.us.archive.org:wide from Wed Feb 19 05:20:19 PST 2014 to Wed Feb 19 01:54:33 PST 2014.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl427.us.archive.org:wide from Wed Feb 19 09:49:01 PST 2014 to Wed Feb 19 06:07:15 PST 2014.
Topic: crawldata
Data crawled by Institut national de laudiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
84,742
85K
Oct 19, 2020
10/20
by
Internet Archive
web
eye 84,742
favorite 0
comment 0
Internet Archive crawldata from GDELT0 Crawl, captured by crawl500.us.archive.org:gdelt0 from Mon Oct 19 00:56:50 PDT 2020 to Sun Oct 18 18:27:51 PDT 2020.
Topics: GDELT, crawldata
368,810
369K
Dec 8, 2019
12/19
by
Internet Archive
web
eye 368,810
favorite 0
comment 0
Internet Archive crawldata from feed-driven Twitter Outlinks Crawl, captured by crawl861.us.archive.org:twitter from Sun Dec 8 09:13:02 PST 2019 to Sun Dec 8 07:44:42 PST 2019.
Topics: twitter, crawldata
366,436
366K
Nov 19, 2019
11/19
by
Internet Archive
web
eye 366,436
favorite 0
comment 0
Internet Archive crawldata from feed-driven Twitter Outlinks Crawl, captured by crawl856.us.archive.org:twitter from Tue Nov 19 02:13:43 PST 2019 to Mon Nov 18 19:41:35 PST 2019.
Topics: twitter, crawldata
25,650
26K
Feb 18, 2021
02/21
by
Internet Archive
web
eye 25,650
favorite 0
comment 0
Internet Archive crawldata from GDELT1 Crawl, captured by crawl501.us.archive.org:gdelt1_seeds from Wed Feb 17 13:23:38 PST 2021 to Thu Feb 18 02:22:19 PST 2021.
Topics: GDELT, crawldata
Data crawled by Institut national de laudiovisuel on behalf of Institut national de l’audiovisuel from Thu Aug 12 00:00:00 PDT 2010 to Thu Aug 12 00:00:00 PDT 2010
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl453.us.archive.org:survey from Mon May 26 22:33:15 PDT 2014 to Mon May 26 23:08:51 PDT 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl419.us.archive.org:survey from Tue May 27 01:34:25 PDT 2014 to Mon May 26 22:52:57 PDT 2014.
Topic: crawldata
Internet Archive crawldata from Webwide Crawl, captured by crawl429.us.archive.org:wide from Sun Apr 14 23:48:38 PDT 2019 to Sun Apr 14 20:03:48 PDT 2019.
Topic: crawldata
24,768
25K
Feb 19, 2021
02/21
by
Internet Archive
web
eye 24,768
favorite 0
comment 0
Internet Archive crawldata from GDELT1 Crawl, captured by crawl501.us.archive.org:gdelt1_seeds from Thu Feb 18 13:20:04 PST 2021 to Thu Feb 18 21:52:26 PST 2021.
Topics: GDELT, crawldata
330,665
331K
Mar 11, 2020
03/20
by
Internet Archive
web
eye 330,665
favorite 0
comment 0
Internet Archive crawldata from feed-driven Twitter Outlinks Crawl, captured by crawl864.us.archive.org:twitter from Sat Mar 7 20:32:15 PST 2020 to Tue Mar 10 18:17:02 PDT 2020.
Topics: twitter, crawldata
24,476
24K
Feb 21, 2021
02/21
by
Internet Archive
web
eye 24,476
favorite 0
comment 0
Internet Archive crawldata from GDELT1 Crawl, captured by crawl501.us.archive.org:gdelt1_seeds from Fri Feb 19 21:50:55 PST 2021 to Sat Feb 20 23:25:40 PST 2021.
Topics: GDELT, crawldata