Utent:ReyBrujo/Dumps/20070406
Istrument
Sgeneral
Stampa/esporta
In di alter proget
Aspet
De Wikipedia
< Utent:ReyBrujo | Dumps
Dumps
April 06, 2007
[Modifega | modifica 'l sorgent]External link dumps
[Modifega | modifica 'l sorgent]Articles with more than 10 external links as of April 06, 2007. Only articles in the main space are considered.
External links |
Article ID | Article |
---|---|---|
24 | 1563 | Tram: ligamm da föra |
17 | 2711 | Girona |
14 | 2810 | Jean Cocteau |
14 | 1559 | Discüssiun sura la fundazziun |
13 | 7518 | Badalona |
12 | 14034 | Vílnius |
10 | 21331 | Ayerbe |
10 | 4 | Lengua Lumbarda |
SELECT COUNT(el_from) AS total, el_from, page_title FROM externallinks, page WHERE externallinks.el_from = page_id AND page_is_redirect = 0 AND page_namespace = 0 GROUP BY el_from ORDER BY total DESC;
External link ranking
[Modifega | modifica 'l sorgent]Dump table | Hits |
---|---|
Sites linked more than 10 times | 41 |
SELECT COUNT(el_to) AS total, SUBSTRING_INDEX(el_to, '/', 3) AS search FROM externallinks, page WHERE page_id = el_from AND page_namespace = 0 GROUP BY search ORDER BY total DESC;
Additional information
[Modifega | modifica 'l sorgent]Some more information about this dump:
- 24463 articles that are in the main space and not redirects
- 25005 articles and redirects in the main space
- 27139 pages in all namespaces
- 610 redirects in all namespaces
- 10603 external links in every namespace
- 10105 external links in the main space
Very probable spambot pages
[Modifega | modifica 'l sorgent]If index.php is found in a page title, it is very likely the article talk page has been created by a spambot. These pages should be deleted and protected if possible.
Article ID | Article |
---|---|
21328 | W/index.php |
Possible spambot pages
[Modifega | modifica 'l sorgent]Possible pages created by spambots ending with /.
Article ID | Article |
---|
SELECT page_id, page_title, page_namespace FROM page WHERE page_title LIKE '%index.php%' OR page_title LIKE '%/wiki/%' OR page_title LIKE '%/w/%' OR page_title LIKE '%/';