Utent:ReyBrujo/Dumps/20070213
Dumps
February 13, 2007
External link dumps
Articles with 10 or more external links as of February 13, 2007. Only articles in the main space are considered.
External links | Article ID | Article |
---|---|---|
24 | 1563 | Tram: ligamm da föra |
17 | 2711 | Girona |
14 | 2810 | Jean Cocteau |
14 | 1559 | Discüssiun sura la fundazziun |
13 | 7518 | Badalona |
10 | 4 | Lengua Lumbarda |
SELECT COUNT(el_from) AS total, el_from, page_title FROM externallinks, page WHERE externallinks.el_from = page_id AND page_is_redirect = 0 AND page_namespace = 0 GROUP BY el_from ORDER BY total DESC;
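To show how the query above counts links per article, here is a minimal sketch that runs the same statement against an in-memory SQLite database. The two tables are cut-down stand-ins for MediaWiki's `page` and `externallinks` tables, modelling only the columns the query touches, and every row is invented for illustration.

```python
import sqlite3

# Toy stand-ins for the `page` and `externallinks` tables.
# Only the columns used by the query exist; all rows are made up.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE page (
    page_id INTEGER PRIMARY KEY,
    page_title TEXT,
    page_namespace INTEGER,
    page_is_redirect INTEGER
);
CREATE TABLE externallinks (el_from INTEGER, el_to TEXT);
""")
conn.executemany(
    "INSERT INTO page VALUES (?, ?, ?, ?)",
    [
        (1, "Alpha", 0, 0),  # main-space article
        (2, "Beta", 0, 0),   # main-space article
        (3, "Gamma", 0, 1),  # main-space redirect: filtered out
        (4, "Alpha", 1, 0),  # namespace 1 (talk): filtered out
    ],
)
conn.executemany(
    "INSERT INTO externallinks VALUES (?, ?)",
    [
        (1, "http://example.org/a"),
        (1, "http://example.org/b"),
        (1, "http://example.net/"),
        (2, "http://example.org/c"),
        (3, "http://example.org/d"),
        (4, "http://example.org/e"),
    ],
)

# Same statement as in the dump, only split across lines.
QUERY = """
SELECT COUNT(el_from) AS total, el_from, page_title
FROM externallinks, page
WHERE externallinks.el_from = page_id
  AND page_is_redirect = 0
  AND page_namespace = 0
GROUP BY el_from
ORDER BY total DESC;
"""

rows = conn.execute(QUERY).fetchall()
for total, page_id, title in rows:
    print(total, page_id, title)
```

The query itself has no threshold, so it returns every main-space article that has at least one external link. The listing above was presumably cut off at 10 links by hand; a `HAVING` clause on the count would be one way to apply the cutoff in the query.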
External link ranking
Sites linked 10 or more times as of February 13, 2007. Only articles in the main space are considered.
Link count | Site |
---|---|
1000 | http://lmo.wikipedia.org |
724 | http://kvaleberg.com |
582 | http://www.idescat.es |
576 | http://www.municat.net:8000 |
179 | http://www.metlinkmelbourne.com.au |
119 | http://www.citytrain.com.au |
102 | http://www.tfl.gov.uk |
87 | http://www.cityrail.info |
72 | http://www.street-directory.com.au |
66 | http://maps.google.com |
39 | http://www.ddgi.es |
37 | http://www.gencat.net:8000 |
36 | http://www.idescat.net |
31 | http://www.iana.org |
31 | http://www.poblesdecatalunya.cat |
23 | http://www.Guidamanresa.com |
22 | http://tools.wikimedia.de |
11 | http://www.rail-reg.gov.uk |
10 | http://www.diba.es |
SELECT COUNT(el_to) AS total, SUBSTRING_INDEX(el_to, '/', 3) AS search FROM externallinks, page WHERE page_id = el_from AND page_namespace = 0 GROUP BY search ORDER BY total DESC;
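The grouping key in the query above is `SUBSTRING_INDEX(el_to, '/', 3)`, which keeps everything before the third `/` in a URL, that is, the scheme plus the host and any port. The sketch below mimics that behaviour in Python; `site_prefix` is a hypothetical helper name and the sample URLs are invented.

```python
from collections import Counter


def site_prefix(url: str) -> str:
    """Mimic SUBSTRING_INDEX(url, '/', 3) for a positive count:
    keep everything before the third '/', or the whole string
    when it contains fewer than three slashes."""
    return "/".join(url.split("/")[:3])


# Hypothetical link targets, invented for illustration.
links = [
    "http://www.idescat.es/territ/a",
    "http://www.idescat.es/territ/b",
    "http://www.municat.net:8000/index.html",
    "http://maps.google.com",
]

# Count links per site prefix, most frequent first.
ranking = Counter(site_prefix(u) for u in links).most_common()
for site, count in ranking:
    print(count, site)
```

Because the prefix is taken literally, a port number stays part of the site name, and hosts that differ only in subdomain or top-level domain are counted separately, as with `www.idescat.es` and `www.idescat.net` in the table.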
Additional information
Some more information about this dump:
- 6657 articles that are in the main space and not redirects
- 6818 articles and redirects in the main space
- 8649 pages in all namespaces
- 221 redirects in all namespaces
- 5086 external links in every namespace
- 4686 external links in the main space
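A few further figures follow from the counts above by subtraction. The short sketch below works them out; the variable names are chosen here for illustration, and the derived values assume the listed counts are mutually consistent.

```python
# Figures copied from the list above.
main_articles = 6657   # main space, redirects excluded
main_pages = 6818      # main space, articles and redirects
all_redirects = 221    # redirects in all namespaces
links_all = 5086       # external links in every namespace
links_main = 4686      # external links in the main space

# Derived figures.
main_redirects = main_pages - main_articles       # redirects in the main space
other_redirects = all_redirects - main_redirects  # redirects outside the main space
links_outside_main = links_all - links_main       # external links outside the main space

print(main_redirects, other_redirects, links_outside_main)
```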
Very probable spambot pages
If index.php is found in a page title, it is very likely that the article's talk page was created by a spambot. These pages should be deleted and, if possible, protected.
Article ID | Article |
---|---|
Possible spambot pages
Pages possibly created by spambots, with titles ending in /.
Article ID | Article |
---|---|
SELECT page_id, page_title, page_namespace FROM page WHERE page_title LIKE '%index.php%' OR page_title LIKE '%/wiki/%' OR page_title LIKE '%/w/%' OR page_title LIKE '%/';
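The four `LIKE` patterns in the query above amount to three substring tests and one suffix test. As a rough sketch, the same check can be written in Python; `looks_like_spambot_title` is a hypothetical helper, the titles are invented, and matching is case-sensitive here, which is an assumption about how the database compares titles.

```python
def looks_like_spambot_title(title: str) -> bool:
    """Apply the same tests as the four LIKE patterns in the query."""
    return (
        "index.php" in title     # LIKE '%index.php%'
        or "/wiki/" in title     # LIKE '%/wiki/%'
        or "/w/" in title        # LIKE '%/w/%'
        or title.endswith("/")   # LIKE '%/'
    )


# Hypothetical page titles, invented for illustration.
titles = ["Girona", "Foo/index.php", "Bar/w/Baz", "Qux/", "Tram/Melbourne"]
flagged = [t for t in titles if looks_like_spambot_title(t)]
print(flagged)
```

A title with an ordinary subpage slash, such as the last one, is not flagged; only a trailing slash or one of the URL-like fragments triggers a match.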