PDA

View Full Version : EchO! Spider



Masetek
04-17-2006, 08:23 PM
Anyone noticed this new spider? I've never seen it before and its been crawling all my sites in the last week. Any idea who it belongs to? I googled it and checked robotstxt.org but can't find any records of it. It seems to be doing a much better job of crawling my sites than Inktomi and googlebot :)

KelliShaver
04-17-2006, 09:15 PM
I think it's the same spider that generates google sitemaps from http://sitemap.xmlecho.org - I used their service to generate a sitemap for a large site of mine and that's when it hit me. Why it would be hitting your site when you're not using that service, though, I'm not sure.

Masetek
04-17-2006, 10:37 PM
Makes sense. I did use that tool for 2 sites. Dont know about the others though... :confused:

Paul Kemp
01-15-2007, 09:03 PM
I'm pretty much positive I've never used that tool... I just started researching it myself, when I noticed in my logs that it's using a LOT more bandwidth per hit than any other spider...



Robots/Spiders visitors
14 different robots Hits Bandwidth Last visit
Inktomi Slurp 9276+838 61.12 MB 15 Jan 2007 - 21:39
MSNBot 3191+679 236.43 MB 15 Jan 2007 - 21:37
Googlebot 3249+8 179.88 MB 15 Jan 2007 - 04:19
Unknown robot (identified by 'crawl') 785+30 6.46 MB 15 Jan 2007 - 12:12
Alexa (IA Archiver) 425+110 55.78 MB 15 Jan 2007 - 21:39
EchO! 457+8 317.41 MB 15 Jan 2007 - 00:28
Unknown robot (identified by 'spider') 227+52 2.04 MB 15 Jan 2007 - 20:56
LinkWalker 202+1 4.92 MB 08 Jan 2007 - 00:56
Unknown robot (identified by hit on 'robots.txt') 0+173 133.24 KB 15 Jan 2007 - 19:19
AskJeeves 12+11 72.52 KB 15 Jan 2007 - 11:06
WISENutbot 17+1 82.51 KB 15 Jan 2007 - 07:29
Voyager 11+5 223.62 KB 14 Jan 2007 - 23:34
Unknown robot (identified by 'robot') 3 206.27 KB 14 Jan 2007 - 01:46
Scooter 0+1 817 Bytes 11 Jan 2007 - 08:28

Chris
01-15-2007, 09:04 PM
If in doubt, ban.

Paul Kemp
01-15-2007, 09:17 PM
Just found another tidbit:
http://www.psychedelix.com/agents/index.shtml

They list 'EchO!/2.0' as belonging to 'echo.fr' which redirects to http://www.francetelecom.com/en/

Edit: AHA!


Hello,
we (http://www.echo.fr) currently used metaprogramming in our
www search engine (http://www.voila.fr or http://www.wanadoo.fr).
http://www.oonumerics.org/oon/oon-list/archive/0607.html