[Systems] Excessive scraper queries

Bernie Innocenti bernie at sugarlabs.org
Tue Apr 1 16:01:10 EDT 2014


Hello Ignacio,

while I was investigating a logs bloat issue on sunjammer, I noticed
that apache is being hit by about 1 million requests per day from a
python script running locally:

  activities.sugarlabs.org:80 2001:4830:134:7::11 - -
[23/Mar/2014:01:59:36 -0400] "GET /es-ES/sugar/addon/4461 HTTP/1.1" 200 3
7522 "-" "Python-urllib/2.6"
 activities.sugarlabs.org:80 2001:4830:134:7::11 - -
[23/Mar/2014:01:59:38 -0400] "GET /es-ES/sugar/addon/4467 HTTP/1.1" 200 8
55 "-" "Python-urllib/2.6"
 [...]

This is not necessarily forbidden, but I'd like to understand what this
script does and whether it *really* needs to run so aggressively.

Note that the files are available on the local filesystem, there's no
need to read them periodically over http.

-- 
Bernie Innocenti
Sugar Labs Infrastructure Team
http://wiki.sugarlabs.org/go/Infrastructure_Team


More information about the Systems mailing list