#############################################################################
# Sample actions file for the Internet Junkbuster 2.9.x
# For information, see http://ijbswa.sourceforge.net/
# $Id: ijb.action,v 1.6 2002/03/08 17:25:52 oes Exp $
#############################################################################
#############################################################################
# To determine which actions apply to a request, the URL of the request is
# compared to all patterns in this file. Every time it matches, the list of
# applicable actions for this URL is incrementally updated. You can trace
# this process by visiting http://i.j.b/show-url-info
# There are 4 types of lines in this file: comments (like this line),
# actions, aliases and patterns, all of which are explained below.
#############################################################################
#############################################################################
# 1. On Domains and Paths
# -----------------------
# Generally, a pattern has the form <domain>/<path>, where both the <domain>
# and <path> part are optional. If you only specify a domain part, the "/"
# is a domain-only pattern and will match any request to www.yahoo.com
# means exactly the same (but is slightly less efficient)
# www.example.com/index.html
# matches only the document /index.html on www.example.com
# matches the document /index.html, regardless of the domain
# matches nothing, since it would be interpreted as a domain name and
# there is no top-level domain called ".html".
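#
# Recap of the cases described above, as a sketch. www.example.com is only a
# placeholder host, and the pattern spellings in the left column are assumed
# from the descriptions rather than quoted from them:
#
#   www.example.com               domain only: any request to that host
#   www.example.com/              the same, with an explicit empty path
#   www.example.com/index.html    domain and path: only that document
#   /index.html                   path only: that document on any domain
#   index.html                    read as a domain name, so it matches nothing
#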
# The matching of the domain part offers some flexible options: If the
# domain starts or ends with a dot, it becomes unanchored at that end:
# matches only www.example.com
# matches any domain that ENDS in .example.com
# matches any domain that STARTS with www.
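#
# The three anchoring cases above, side by side. This is a sketch: the
# pattern spellings are assumed from the descriptions, and example.com is a
# placeholder domain:
#
#   www.example.com    anchored at both ends: only www.example.com
#   .example.com       unanchored at the start: any domain ending in .example.com
#   www.               unanchored at the end: any domain starting with www.
#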
# Additionally, there are wildcards that you can use in the domain names
# themselves. They work much like shell wildcards: "*" stands for
# zero or more arbitrary characters, "?" stands for exactly one, and you can
# define character classes in square brackets. All of these can be freely mixed:
# matches adserver.example.com, ads.example.com, etc but not sfads.example.com
# matches all of the above
# matches www.ipix.com, pictures.epix.com, a.b.c.d.e.upix.com etc
# www[1-9a-ez].example.com
# matches www1.example.com, www4.example.com, wwwd.example.com,
# wwwz.example.com etc, but not wwww.example.com
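#
# Patterns that would fit the first three descriptions above. These are an
# assumed illustration derived from the stated matches, not necessarily the
# exact patterns the text originally showed:
#
#   ad*.example.com     adserver.example.com, ads.example.com; not sfads.example.com
#   *ad*.example.com    all of the above, including sfads.example.com
#   .?pix.com           www.ipix.com, pictures.epix.com, a.b.c.d.e.upix.com
#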
# Paths are specified as regular expressions. A comprehensive discussion of
# regular expressions wouldn't fit here, but (FIXME) someone should paste
# a concise intro to the regex language here.
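#
# A few constructs that cover most of the path patterns in this file. This is
# a brief, non-exhaustive reminder of standard Perl-style regex syntax, not a
# replacement for the regex documentation:
#
#   .        any single character
#   .*       any run of characters, including none
#   x?       x is optional
#   x+       one or more x
#   [abc]    one character from the set; [0-9] is a range
#   (a|b)    either a or b; parentheses also group
#   \.  \?   a literal dot or question mark
#
# For example, the path pattern
#   /.*/advert[0-9]+\.jpg
# would match /img/advert12.jpg but not /img/advert.jpg (both URLs are
# hypothetical).
#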
# If Junkbuster was compiled with pcre support (default), Perl compatible
# regular expressions are used. See the pcre/docs/ directory or man perlre
# (also available on http://www.perldoc.com/perl5.6/pod/perlre.html) for
# Please note that matching in the path is CASE INSENSITIVE by default, but
# you can switch to case sensitive by starting the pattern with the "(?-i)"
# www.example.com/(?-i)PaTtErN.*
# will match only documents whose path starts with PaTtErN in exactly this
# Partially case-sensitive and partially case-insensitive patterns are
# possible, but the rules about splitting them up are extremely complex
# - see the PCRE documentation for more information.
#############################################################################
#############################################################################
# There are 3 kinds of action:
# Boolean (e.g. "block"):
# Parameterized (e.g. "hide-user-agent"):
# +name{param} # enable and set parameter to "param"
# Multi-value (e.g. "add-header", "wafer"):
# +name{param} # enable and add parameter "param"
# -name{param} # remove the parameter "param"
# -name # disable totally
# The default (if you don't specify anything in this file) is not to take
# any actions, i.e. everything is disabled, so JunkBuster will just be a
# normal, non-blocking, non-anonymizing proxy. You must specifically
# enable the privacy and blocking features you need (although the
# provided default actions file will do that for you).
# Later actions always override earlier ones. For multi-valued actions,
# the actions are applied in the order they are specified.
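#
# A sketch of how overriding plays out. The host is hypothetical, and the
# layout (an {actions} line followed by patterns) follows the sections
# further down in this file:
#
#   {+block +image}
#   ads.example.com
#
#   {-block}
#   ads.example.com/legal/.*
#
# A request for ads.example.com/legal/terms.html matches both patterns. The
# second section comes later, so its "-block" overrides the earlier "+block",
# while "+image" stays set because nothing later changes it.
#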
#############################################################################
#############################################################################
# +add-header{Name: value}
# Adds the specified HTTP header, which is not checked for validity.
# You may specify this many times to specify many headers.
# +deanimate-gifs{last}
# +deanimate-gifs{first}
# Deanimate all animated GIF images, i.e. reduce them to a single
# frame. This will also shrink the images considerably. (In bytes,
# If the option "first" is given, the first frame of the animation
# is used as the replacement. If "last" is given, the last frame of
# the animation is used instead, which probably makes more sense for
# most banner animations, but also has the risk of not showing the
# entire last frame (if it is only a delta to an earlier frame).
# Downgrade HTTP/1.1 client requests to HTTP/1.0 and downgrade the
# responses as well. Use this action for servers that use HTTP/1.1
# protocol features that Junkbuster can't handle yet.
# Many sites, like yahoo.com, don't just link to other sites.
# Instead, they will link to some script on their own server,
# giving the destination as a parameter, which will then redirect
# you to the final target.
# URLs resulting from this scheme typically look like:
# http://some.place/some_script?http://some.where-else
# Sometimes, there are even multiple consecutive redirects encoded
# in the URL. These redirections via scripts make your web browsing
# more traceable, since the server from which you follow such a link
# can see where you go to. Apart from that, valuable bandwidth and
# time are wasted while your browser asks the server for one redirect
# after the other. Plus, it feeds the advertisers.
# The +fast-redirects option enables interception of these requests
# by junkbuster, which will cut off all but the last valid URL in the
# request and send a local redirect back to your browser without
# contacting the remote site.
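#
# Using the URL shape shown above, the effect can be sketched like this (an
# illustration of the described behaviour, not a trace of real output):
#
#   browser requests:      http://some.place/some_script?http://some.where-else
#   junkbuster redirects:  http://some.where-else
#
# The script on some.place is never contacted, so it cannot log the click.
#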
# Filter the website through the re_filterfile
# FIXME: The syntax should be +filter{filename}
# Block any existing X-Forwarded-For header, and do not add a new one.
# +hide-from{spam@sittingduck.xqq}
# If the browser sends a "From:" header containing your e-mail address,
# either completely remove the header ("block"), or change it to the
# specified e-mail address.
# +hide-referer{block}
# +hide-referer{forge}
# +hide-referer{http://nowhere.com}
# Don't send the "Referer:" (sic) header to the web site. You can
# block it, forge a URL to the same server as the request (which is
# preferred because some sites will not send images otherwise) or
# set it to a constant string.
# +hide-referrer{...}
# Alternative spelling of +hide-referer. Has the same parameters,
# and can be freely mixed with "+hide-referer". ("referrer" is the
# correct English spelling, however the HTTP specification has a
# bug - it requires it to be spelt "referer").
# +hide-user-agent{browser-type}
# Change the "User-Agent:" header so web servers can't tell your
# browser type. (Breaks many web sites). Specify the user-agent
# value you want - e.g., to pretend to be using Netscape on Linux:
# +hide-user-agent{Mozilla (X11; I; Linux 2.0.32 i586)}
# Or to identify yourself explicitly as a JunkBuster user:
# +hide-user-agent{JunkBuster/1.0}
# (Don't change the version number from 1.0 - after all, why tell them?)
# Treat this URL as an image. This only matters if it's also "+block"ed,
# in which case a "blocked" image can be sent rather than an HTML page.
# See +image-blocker{} for the control over what is actually sent.
# +image-blocker{logo}
# +image-blocker{blank}
# +image-blocker{pattern}
# +image-blocker{<URL>} with <URL> being any valid image URL
# Decides what to do with URLs that end up tagged with {+block +image}.
# There are 5 options. "-image-blocker" will send an HTML "blocked" page,
# usually resulting in a "broken image" icon. "+image-blocker{logo}"
# will send a "JunkBuster" image. "+image-blocker{blank}" will send
# a 1x1 transparent image, "+image-blocker{pattern}" will send a 4x4
# grey/white pattern which is less intrusive than the logo but easier
# to recognize than the transparent one. And finally, "+image-blocker{<URL>}"
# will send an HTTP temporary redirect to the specified image URL.
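#
# For instance, to answer blocked banner requests with the transparent
# image, a section could look like this (a sketch; the host is hypothetical):
#
#   {+block +image +image-blocker{blank}}
#   banners.example.com
#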
# +limit-connect{portlist}
# The CONNECT method exists in HTTP to allow access to secure websites
# (https:// URLs) through proxies. It works very simply: The proxy
# connects to the server on the specified port, and then short-circuits
# its connections to the client and to the remote server.
# This can be a big security hole, since CONNECT-enabled proxies can
# be abused as TCP relays very easily.
# By default, i.e. in the absence of a +limit-connect action, Junkbuster
# will only allow CONNECT requests to port 443, which is the standard port
# If you want to allow CONNECT for more ports than that, or want to forbid
# CONNECT altogether, you can specify a comma separated list of ports and port
# ranges (the latter using dashes, with the minimum defaulting to 0 and max to 65K):
# +limit-connect{443} # This is the default and need not be specified.
# +limit-connect{80,443} # Ports 80 and 443 are OK.
# +limit-connect{-3, 7, 20-100, 500-} # Ports less than 3, 7, 20 to 100, and above 500 are OK.
# Prevent the website from compressing the data. Some websites do
# that, which is a problem for junkbuster, since +filter, +no-popup
# and +deanimate-gifs will not work on compressed data. This will slow
# down connections to those websites, though.
# If the website sets cookies, make sure they are erased when you exit
# and restart your web browser. This makes profiling cookies useless,
# but won't break sites which require cookies so that you can log in
# or for transactions.
# Prevent the website from reading cookies
# Prevent the website from setting cookies
# Filter the website through a built-in filter to disable
# window.open() etc. The two alternative spellings are
# This action only applies if you are using a jarfile. It sends a
# cookie to every site stating that you do not accept any copyright
# on cookies sent to you, and asking them not to track you. Of
# course, this is a (relatively) unique header they could use to
# This allows you to add an arbitrary cookie. Specify it multiple
# times in order to add several cookies.
#############################################################################
#############################################################################
#############################################################################
#############################################################################
# You can define a short form for a list of permissions - e.g., instead
# of "-no-cookies-set -no-cookies-read -filter -fast-redirects", you can
# just write "shop". This is called an alias.
# Currently, an alias can contain any character except space, tab, '=', '{'
# But please use only 'a'-'z', '0'-'9', '+', and '-'.
# Alias names are not case sensitive.
# Aliases beginning with '+' or '-' may be used for system permission names
# in future releases - so try to avoid alias names like this. (e.g.
# "+no-cookies" below is not a good name)
# Aliases must be defined before they are used.
+no-cookies = +no-cookies-set +no-cookies-read
-no-cookies = -no-cookies-set -no-cookies-read
+imageblock = +block +image
+filter-all = +filter +no-compression
# Fragile sites should have the minimum changes
fragile = -block -deanimate-gifs -fast-redirects -filter -hide-referer -no-cookies -no-popups
# Shops should be allowed to set persistent cookies
shop = -filter -no-cookies -no-cookies-keep
#... etc. Customize to your heart's content.
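#
# Once defined, an alias is used like any other action name. A sketch with a
# hypothetical host, assuming aliases are written inside the braces just as
# actions are:
#
#   {shop}
#   www.shop.example.com
#
# Per the definitions above, this stands for
# {-filter -no-cookies -no-cookies-keep}, where "-no-cookies" is itself an
# alias for "-no-cookies-set -no-cookies-read".
#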
#############################################################################
#############################################################################
+hide-referer{forge} \
+image-blocker{http://i.j.b/send-banner} \
#############################################################################
# A useful site for testing - shows all headers:
# http://privacy.net/analyze/
#############################################################################
{+add-header{X-Privacy: Yes please} \
+add-header{X-User-Tracking: No thanks!} -filter}
#############################################################################
# Test for new GIF deanimation feature.
# Just try http://www.oesterhelt.org/deanimate-demo with and without it.
#############################################################################
{+deanimate-gifs{last}}
www.oesterhelt.org/deanimate-demo
#############################################################################
# Sites that need cookies
# FIXME: Now that cookies are allowed by default, do any of these sites
# need persistent cookies?
#############################################################################
#############################################################################
# These sites are very complex and require
# minimal interference.
#############################################################################
.office.microsoft.com
.windowsupdate.microsoft.com
#############################################################################
# Shopping sites - still want to block ads.
#############################################################################
.worldpay.com # for quietpc.com
#############################################################################
# These shops require pop-ups
#############################################################################
#############################################################################
# Sometimes fast-redirects catches things by mistake
#############################################################################
www.ukc.ac.uk/cgi-bin/wac\.cgi\?
edit.europe.yahoo.com
.altavista.com/.*(like|url|link):http
.altavista.com/trans.*urltext=http
#############################################################################
# Please don't re_filter code!
#############################################################################
#############################################################################
#############################################################################
#############################################################################
#############################################################################
#############################################################################
.ad.preferences.com/image.*
.ad-adex3.flycast.com
.connect.247media.ads.link4ads.com
.mojofarm.mediaplex.com/ad/
www.carbuyer.com/cgi-carbuyer/getimage.cgi
/phpAds(New)?/viewbanner\.php
.ad.de.doubleclick.net
/.*/count\.cgi\?.*df=
*.fxweb.com/v2-trackrun\.cgi
a196.g.akamai.net/7/196/2670/000[1-3]/images\.gmx\.net/.*images/.*/.*/
.smartclicks.com/.*/smart(img|banner|host|bar|site)
.linkexchange.com/.*/showl(ogo|e)
pixel.intares.net/cgi-bin/janus
ar.atwola.com # This serves all ads for CNN and AOL
#############################################################################
#############################################################################
#############################################################################
/.*/(.*[-_.])?ads?[0-9]?(/|[-_.].*|\.(gif|jpe?g))
/.*/(.*[-_.])?count(er)?(\.cgi|\.dll|\.exe|[?/])
/.*/(ng)?adclient\.cgi
/.*/(plain|live|rotate)[-_.]?ads?/
/.*/(sponsor)s?[0-9]?/
###/*.*/(sponsor|banner)s?[0-9]?/
###/*.*/.*banner([-_]?[a-z0-9]+)?\.(gif|jpg)
/?.*/_?(plain|live)?ads?(-banners)?/
/?.*/ad(sdna_image|gifs?)/
/?.*/ad(server|stream|juggler)\.(cgi|pl|dll|exe)
/?.*/adv((er)?ts?|ertis(ing|ements?))?/
/?.*/banner_?anzeigen
/?.*/cgi-bin/centralad/getimage
/?.*/images/addver\.gif
/?.*/images/advert\.gif
/?.*/images/marketing/.*\.(gif|jpe?g)
/?.*/randomads/.*\.(gif|jpe?g)
/?.*/rekla(ma|me|am)/.*\.(gif|jpe?g)
/?.*/sponsors?[0-9]?/
/?.*/werbung/.*\.(gif|jpe?g)
/?.*/adv\. # www.telegraaf.nl
/?.*/advert[0-9]+\.jpg
/bin/getimage.cgi/...\?AD
/bin/nph-oma.count/ct/default.shtml
/bin/nph-oma.count/ix/default.html
/cgi-bin/getimage.cgi/....\?GROUP=
/cgi-bin/webad.dll/ad
/cwmail/amzn-bm1\.gif
/image\.ng/transactionID
/images/.*/.*_anim\.gif # alvin brattli
/ip_img/.*\.(gif|jpe?g)
/netscapeworld/nw-ad/
/promotions/houseads/
/torget/jobline/.*\.gif
/cgi-bin/nph-adclick.exe/
/?.*/Image/BannerAdvertising/
/?.*/adlib/server\.cgi
/?.*/gsa_bs/gsa_bs.cmdl
# for our Finnish friends, by Kai Puolamaki <Kai.Puolamaki@iki.fi>
/?.*/mainos/*.*/.*\.gif
/?.*/mainos/*.*/.*\.jpe?g
# more from a Finnish friend Petri Haapio <pha@iki.fi>
.keltaisetsivut.fi/web/img/\.*gif
.haku.net/pics/pana\.*gif
/?.*/(.*[-_.].*)?maino(kset|nta|s).*(/|\.(gif|html?|jpe?g|png))
/?.*/(ilm(oitus)?|kampanja)(hallinta|kuvat?)(/|\.(gif|html?|jpe?g|png))
# and even more from a Finnish friend Hannu Napari <Hannu.Napari@hut.fi>
194.251.243.50/cgi-bin/banner
www.iltalehti.fi/ilmkuvat
www.mtv3.fi/mainoskuvat
/?.*/images/topics/topicgimp\.gif
.discovery.com/.*banner_id
.idrink.com/frm_bottom.htm
/?.*/ph-ad.*\.focalink\.com
/we_ba/ # hausfrauenseite.de *bwhahahaaaaa*
/.*(ms)?backoff(ice)?.*\.(gif|jpe?g)
/.*(/ie4|/ie3|msie|sqlbans|powrbybo|activex|backoffice|explorer|netnow|getpoint|ntbutton|hmlink).*\.(gif|jpe?g)
/.*activex.*(gif|jpe?g)
/.*explorer?.(gif|jpe?g)
/.*freeie\.(gif|jpe?g)
/.*/ie_?(buttonlogo|static?|anim.*)?\.(gif|jpe?g)
/.*ie_sm\.(gif|jpe?g)
/.*msie(30)?\.(gif|jpe?g)
/.*msnlogo\.(gif|jpe?g)
/.*office97_ad1\.(gif|jpe?g)
/.*pbbobansm\.(gif|jpe?g)
/.*powrbybo\.(gif|jpe?g)
/.*sqlbans\.(gif|jpe?g)
/.*ie4get_animated\.gif
# generally useless information and promo stuff (commented out)
#/.*/(counter|getpcbutton|BuiltByNOF|netscape|hotmail|vcr(rated)?|rsaci(rated)?|freeloader|cache_now(_anim)?|apache_pb|now_(anim_)?button|ie_?(buttonlogo|static?|.*ani.*)?)\.(gif|jpe?g)
/?.*/images/na/us/brand/
/?.*/advantage\.(gif|jpg)
/?.*/advanbar\.(gif|jpg)
/?.*/advanbtn\.(gif|jpg)
/?.*/biznetsmall\.(gif|jpg)
/?.*/utopiad\.(gif|jpg)
/?.*/epipo\.(gif|jpg)
/?.*/amazon([a-zA-Z0-9]+)\.(gif|jpg)
/?.*/bnlogo.(gif|jpg)
/?.*/buynow([a-zA-Z0-9]+)\.(gif|jpg)
# for the Dutch folks by a Dutch friend gertjan@west.nl
.netdirect.nl/nd_servlet/___
# --------------------------------------------------------------------------
# --------------------------------------------------------------------------
# the next two lines work
193.158.37.3/cgi-bin/impact
195.63.104.*/(inbox|log|meld|folderlu|folderru|log(in|out)[lmr]u|)
206.165.5.162/images/gcanim\.gif
207.159.129.131/abacus
207.87.27.10/tool/includes/gifs/
209.1.112.252/adgraph/
209.1.135.14[24]:1971
209.207.224.22[02]/servfu.pl
209.239.37.214/cgi-pilotfaq/getimage\.cgi
209.85.89.183/cgi-bin/cycle\?host
212.63.155.122/(banner|concret|softwareclub)
216.49.10.236/web1000/
.ICDirect.com/cgi-bin
.Shannon.Austria.Eu.net/\.cgi/
# generic hosts (probably most effective)
#/.*/*preferences.com*
.akamaitech.net/.*/Banners/
.altavista.telia.com/av/pix/sponsors/
.amazon.com/g/associates/logos/
.asinglesplace.com/asplink\.gif
.automatiseringgids.nl/gfx/advertenties/
#avenuea.com/Banners/
.befriends.net/personals/matchmaking\.jpg
.bizad.nikkeibp.co.jp
.bs.gsanet.com/gsa_bs/
.cgicounter.puretec.de/cgi-bin/
.ciec.org/images/countdown\.gif
.classic.adlink.de/cgi-bin/accipiter/adserver.exe
#.clickhere.egroups.com/img/
.commonwealth.riddler.com/Commonwealth/bin/statdeploy\?[0-9]+
.dagbladet.no/ann-gif
.dn.adzerver.com/image.ad
.eur.a1.yimg.com/eur.yimg.com/a/
.us.a1.yimg.com/us.yimg.com/a/
#fastcounter.linkexchange.com
.focalink.com/SmartBanner
.freepage.de/cgi-bin/feets/freepage_ext/.*/rw_banner
.freespace.virgin.net/andy.drake
.futurecard.com/images/
.go.com/cimages\?SEEK_
.home.miningco.com/event.ng/.*AdID
image*.narrative.com/news/.*\.(gif|jpe?g)
#image.linkexchange.com
.images.yahoo.com/adv/
.images.yahoo.com/promotions/
.impartner.de/cgi-bin
informer2.comdirect.de:6004/cd/banner2
.infoseek.go.com/cimages
.kaufwas.com/cgi-bin/zentralbanner\.cgi
#leader.linkexchange.com
.linktrader.com/cgi-bin/
.logiclink.nl/cgi-bin/
lucky.theonion.com/cgi-bin/oniondirectin\.cgi
lucky.theonion.com/cgi-bin/onionimp\.cgi
lucky.theonion.com/cgi-bin/onionimpin\.cgi
.mailorderbrides.com/mlbrd2\.gif
.members.sexroulette.com
.messenger.netscape.com
# movielink became moviefone
.moviefone.com/.*(banner|newbutton|(ad|poster).*?\.gif|mmail|bytb|h_(guy|showtick|aML)|m_|icon_|NF_.*?back|h_.*?gif|media/(art|imagelinks(/MF.(ad|sponsor))))
mqgraphics.mapquest.com/graphics/Advertisements/
.news.com/cgi-bin/acc_clickthru
.ngserve.pcworld.com/adgifs/
.promotions.yahoo.com
.qsound.com/tracker/tracker.exe
.resource-marketing.com/tb/
.rtl.de/homepage/wb/images/
.schnellsuche.de/images/*
.shout-ads.com/cgibin/shout.php3
.sjmercury.com/advert/
.smartclicks.com/.*/smart(img|banner|host|bar|site)
.static.wired.com/advertising/
.sysdoc.pair.com/cgi-sys/cgiwrap/sysdoc/sponsor\.gif
.t-online.de/home/040255162-001/*
.teleauskunft.de/commercial/
.tvguide.com/rbitmaps/
.ultra.multimania.com
.us.yimg.com/promotions/
.videoserver.kpix.com
.washingtonpost.com/wp-adv/
.webconnect.net/cgi-bin/webconnect.dll
.webserv.vnunet.com/ip_img/.*ban
.werbung.pro-sieben.de/cgi-bin
.whatis.com/cgi-bin/getimage.exe/
www..bigyellow.com/......mat.*
www.addme.com/link8\.gif
www.aftonbladet.se/annons
www.americanpassage.com/
www.angelfire.com/in/twistriot/images/wish4\.gif
www.bizlink.ru/cgi-bin/irads\.cgi
www.blacklightmedia.com/adlemur
www.bluesnews.com/flameq\.gif
www.bluesnews.com/images/ad[0-9]+\.gif
www.bluesnews.com/images/gcanim3\.gif
www.bluesnews.com/images/throbber2\.gif
www.bluesnews.com/miscimages/fragbutton\.gif
www.businessweek.com/sponsors/
www.canoe.ca/AdsCanoe/
www.cdnow.com/MN/client.banners
www.clicmoi.com/cgi-bin/pub\.exe
www.dailycal.org/graphics/adbanner-ab\.gif
www.detelefoongids.com/pic/[0-9]*
www.dhd.de/CGI/werbepic
www.dsf.de/cgi-bin/site_newiac.adpos
www.firsttarget.com/cgi-bin/klicklog.cgi
www.forbes.com/forbes/gifs/ads
www.forbes.com/tool/includes/gifs/
www.fxweb.holowww.com/.*\.cgi
www.geocities.com/TimesSquare/Zone/5267/
www.goto.com/images-promoters/
www.handelsblatt.de/hbad
www.hotlinks.de/cgi-bin/barimage\.cgi
www.infoseek.com/cimages
www.infoworld.com/pageone/gif
www.isys.net/customer/images
www.javaworld.com/javaworld/jw-ad
www.kron.com/place-ads/
www.leo.org/leoclick/
#www.linkexchange.ru/cgi-bin/erle\.cgi
www.linkstation.de/cgi-bin/zeige
www.linux.org/graphic/miniature/
www.linux.org/graphic/square/
www.linux.org/graphic/standard/
www.luncha.se/annonsering
www.ml.org/gfx/spon/icom/
www.ml.org/gfx/spon/wmv
www.musicblvd.com/mb2/graphics/netgravity/
www.news.com/Midas/Images/
www.newscientist.com/houseads
www.nextcard.com/affiliates/
www.nikkeibp.asiabiztech.com/image/NAIS4\.gif
www.nordlys.no/imaker/.*/.*/.*/.....\.gif # alvin brattli
www.nordlys.no/imaker/.*/.*/.*/..003 # alvin brattli
www.oanda.com/server/banner
www.oneandonlynetwork.com
www.page2page.de/cgi-bin/
www.prnet.de/.*/bannerschnippel/.*\.(gif|jpe?g)
www.promptsoftware.com/marketing/
#www.reklama.ru/cgi-bin/banners/
www.riddler.com/sponsors/
www.rle.ru/cgi-bin/erle\.cgi
www.rock.com/images/affiliates/search_black\.gif
www.rtl.de/search/.*kunde
#www.search.com/Banners
www.sfgate.com/place-ads/
www.shareware.com/midas/images/borders-btn\.gif
#www.sjmercury.com/products/marcom/banners/
www.smartclicks.com:81
www.sol.dk/graphics/portalmenu
www.sponsornetz.de/jump/show.exe
www.sunworld.com/sunworldonline/icons/adinfo.sm\.gif
www.swwwap.com/cgi-bin/
www.telecom.at/icons/.*film\.(gif|jpe?g)
www.theonion.com/bin/
www.topsponsor.de/cgi-bin/show.exe
www.ugu.com/images/EJ\.gif
www.warzone.com/pics/banner/
www.warzone.com/wzfb/ads.cgi
www.websitepromote.com/partner/img/
www.winjey.com/onlinewerbung/*\.gif
www.wishing.com/webaudit
www.www-pool.de/cgi-bin/banner-pool
www2.blol.com/agrJRU\.gif
.yahoo.com/CategoryID=0
www.bannerland.de/click.exe
www.slate.com/redirect/
www.slate.com/articleimages/
www.forbes.com/tool/images/frontend/
.pathfinder.com/shopping/marketplace/images/
static.wired.com/images
.perso.estat.com/cgi-bin/perso/
#dinoadserver1.roka.net
.fooladclient*.fool.com
.affiliate.aol.com/static/
# www.sunday-times.co.uk
www.sunday-times.co.uk/standing/newsint/ticker
#NeXgo (ex Germany.Net)
# Block as much of GeoCities as possible
# All geocities-owned images
www.geocities.com/images
www.geocities.com/MemberBanners/live/
pic.geocities.com/images
# And the popup (it still pops up, but does not eat up precious bandwidth)
#www.geocities.com/ad_container/pop.html # already fixed by other regexp
# from corion@informatik.uni-frankfurt.de
#ads.xmonitor.net/xadengine.cgi # fixed by above regexp
# Also block the Japanese geocities popups
www.geocities.co.jp/images
# Also block the come.to, surf.to etc. popups
# Also block the xoom stuff.
home.talkcity.com/homepopup.html.*
# Max Maischein <max.maischein@econsult.de> again ...
# Halflife.net uses WON banners
# Banners from Freeserve
#banner.freeservers.com/cgi-bin/fs_adbar # fixed by above regexp
# And those nasty va-popups !
/?.*/?va_banner.html
# And an all-around hit against advert*.jpg
/?.*/advert[0-9]+\.jpg
# And yet another Internet Explorer gif ...
# Some uninteresting buttons I think...
.mircx.com/images/buttons/
services.mircx.com/.*\.gif
# Easyspace - yet another "free disk space" provider with <yuck> banner popups
www.easyspace.com/(fpub)?banner.html
www.easyspace.com/100\.gif
# Some Russian banner exchanges
.banner.ricor.ru/cgi-bin/banner.pl
#www.bizlink.ru/cgi-bin/irads.cgi # already fixed by other regexp
stx9.sextracker.com/stx/send/
# And even more of geocities :
www.geocities.com/pictures/
# Gaah - www.angelfire.com - another webspace provider with popups
.angelfire.com/sys/download.html
# Gamasutra.com uses this ad provider
sally.songline.com/@
# Eule.de (search engine)
# maybe images.eule.de as a whole...
www.eule.de/cgi-bin/
images.eule.de/comdirect\.gif
images.eule.de/wp\.gif
.aladin.de/125_1\.gif
images.eule.de/neu/books\.gif
# --------------------------------------------------------------------------
# --------------------------------------------------------------------------
# some images on cnn's website just suck!
/.*cnnpostopinionhome.\.gif
/.*custom_feature\.gif
/.*explore.anim.*gif
/.*pathnet.warner\.gif
/.*images/cnnfn_infoseek\.gif
/.*images/pathfinder_btn2\.gif
/.*img/gen/fosz_front_em_abc\.gif
/.*img/promos/bnsearch\.gif
/.*navbars/nav_partner_logos\.gif
/BarnesandNoble/images/bn.recommend.box.*
/digitaljam/images/digital_ban\.gif
/hotstories/companies/images/companies_banner\.gif
/markets/images/markets_banner\.gif
/ows-img/bnoble\.gif
/ows-img/nb_Infoseek\.gif
.cnn.com/images/custom/totale\.gif
.cnn.com/images/lotd/custom.wheels\.gif
.cnn.com/images/.*/by/main.12\.gif
.cnn.com/images/.*/find115\.gif
.cnn.com/.*/free.email.120\.gif
.cnnfn.com/images/left_banner\.gif
www.cnn.com/images/.*/bn/books\.gif
www.cnn.com/images/.*/pointcast\.gif
www.cnn.com/images/.*/fusa\.gif
.cnn.com/images/.*/start120\.gif
images.cnn.com/SHOP/
1107 # the / indicates the beginning of the path (and no longer the FQDN)
1113 /gif/buttons/banner_
1114 /gif/buttons/cd_shop_
1115 /gif/cd_shop/cd_shop_ani_
1118 /av/gifs/av_map\.gif
1119 /av/gifs/av_logo\.gif
1120 /av/gifs/new/ns\.gif
1121 altavista.com/i/valsdc3\.gif
1122 jump.altavista.com/gn_sf
1125 tucows./images/locallogo\.gif
1130 # simpliemu.hypermart.net/frames.html
1131 .go2net.com/mgic/adpopup
1132 .go2net.com/metaspy/images/exposed\.gif
1133 .go2net.com/metaspy/images/ms_un\.gif
1136 www.cebu-usa.com/cwbanim1\.gif
1137 www.cebu-usa.com/Connection\.jpg
1138 www.cebu-usa.com/phonead\.gif
1139 www.cebu-usa.com/ban3\.jpg
1140 www.cebu-usa.com/tlban\.gif
1141 www.cebu-usa.com/apwlogo1\.gif
1142 www.cebu-usa.com/rose\.gif
1145 www.fnet.de/img/geldboerselogo\.jpg
1147 # hirsch@mathcs.emory.edu
1148 /images/getareal2\.gif
1150 www.assalom.com/aziza/logos/cniaffil\.gif
1151 www.assalom.com/aziza/logos/4starrl1\.gif
1152 www.phantomstar.com/images/media/m1\.gif
1155 .wahlstreet.de/MediaW\$/tsponline\.gif
1156 .wahlstreet.de/MediaW\$/dzii156x60\.gif
1157 .wahlstreet.de/MediaW\$/etban156x60_2_opt2\.gif
1161 /pics/getareal1\.gif
1163 /ltbs/cgi-bin/click.cgi
1164 .linuxtoday.com/ltbs/pics/
1168 /include/watermark/v2/
1170 # Reinier Bikker <R.P.Bikker@phys.uu.nl>
1173 # Mark Lutz <luma@nikocity.de>
1174 /.*/*werb.*\.(gif|jpe?g) # hope that's not to restrictive
1176 #Free Yellow thing at bottom of pages (HereticPC)
1177 www.freeyellow.com/images/powerlink5a\.gif
1178 www.freeyellow.com/images/powerlink5b\.gif
1179 www.freeyellow.com/images/powerlink5c\.gif
1180 www.freeyellow.com/images/powerlink5d\.gif
1181 www.freeyellow.com/images/powerlink5e\.gif
1184 www.eads.com/images/refbutton\.gif
1185 www.fortunecity.com/console2/newnav/*
1186 www.goldetc.net/search\.gif
1187 www.cris.com/~Lzrdking/carpix/cars3-le\.gif
1188 www.justfreestuff.com/scott\.gif
1189 www.cyberthrill.com/entrance\.gif
1190 secure.pec.net/images/pec69ani\.gif
1191 www.new-direction.com/avviva\.gif
1192 /.*internetmarketingcenter\.gif
1193 www.new-direction.com/wp-linkexchange-loop\.gif
1194 www.new-direction.com/windough\.gif
1195 www.digitalwork.com/universal_images/affiliate/dw_le_3\.gif
1196 service.bfast.com/bfast/click/*
1197 www.new-direction.com/magiclearning\.gif
1198 www.new-direction.com/mailloop\.gif
1200 www.free-banners.com/images/hitslogo\.gif
1201 rob.simplenet.com/dyndns/fortune5\.gif
1202 .nasdaq-amex.com/images/bn_ticker\.gif
1205 # navilor@hotmail.com
1208 # wayne@staff.msen.com
1210 a*.*.*.yimg.com/([0-9]*|\/)*us.yimg.com/*
1213 www.realtop50.com/cgi-bin/ad
1217 www.yacht.de/images/(my_ani|eissingani|chartertrans|fum|schnupper|fysshop|garmin)\.gif
1218 www.sponsorweb.de/web-sponsor/nt-bin/show.exe
1221 # Club-internet pops up a complaint if you refuse the cookie (still pops up...)
1222 perso.club-internet.fr/html/Popup/popup_frame_nocookie.html
1223 perso.club-internet.fr/pagesperso/popup_nocookie.html
1225 .gmx.net/images/newsbanner/
1228 .quicken.lexware.de/images/us7-468x60.gif
1229 /img/special/chatpromo\.gif
1230 www.travelocity.com/images/promos/
1232 # wonder what that does...
1235 #/*.*/phpAds/viewbanner.php
1236 #/*.*/phpAds/phpads.php
1238 www.linux-magazin.de/banner
1239 .comtrack.comclick.com
1241 .iac-online.de/filler
1243 .media.interadnet.com
1244 .stat.www.fi/cgi-bin
1248 .disneystoreaffiliates.com
1250 .powerwork.mobile.de/cgi-bin/getimage\.cgi
1254 ####################################################
1257 # The Register ads - oh, and all images in Register stories (sigh).
1258 www.theregister.co.uk/media/
1260 # Used on http://www.theregister.co.uk/
1261 # Sample advert URL:
1262 # http://secure.webconnect.net/cgi-bin/webconnecthome.dll?F467
1266 www.dilbert.com/comics/dilbert/images/.*_140x800.*\.gif
1269 # Uses URL: http://www.stattrack.com/cgi-bin/stats/image.cgi
1271 # And loads JavaScript from http://www.stattrack.com/stats/code
1272 www.stattrack.com/stats/
1274 #Now that they're Yahoo GeoCities, their junk is in a different place.
1275 ##geo.yahoo.com/serv
1276 ##visit.geocities.com/visit.gif
1277 .yimg.com/?.*/www.geocities.com/js_source
1278 #http://us.toto.geo.yahoo.com/toto?s=76001086
1280 .visit.geocities.com
1281 .yimg.com/?.*/www.geocities.com/
1283 #http://counter16.bravenet.com/counter.php
1286 #http://stat.cybermonitor.com/7emezone_p?1707_USdvd
1289 #http://members.tripod.com/adm/popup/.....
1290 members.tripod.com/adm/popup/
1292 #This is the worst ad idea ever!
1293 #count.exitexchange.com/exit/1100661
1294 #count.exitexchange.com/clients/navbar.html
1295 #(used in http://skyhivisuals.tripod.com/malfunctions_.htm)
1301 #This site traps the browser
1304 #privacy.net runs ads
1307 #Lindsay.Marshall@newcastle.ac.uk suggested these, to kill Opera adverts:
1312 dinoadserver*.roka.net
1314 logout.tvspielfilm.de
1316 www.freenet.de/customerindex\.html
1318 .fxweb.com/v2-trackrun\.cgi
1319 rtldating.peopleunited.de
1321 www.zdnet.com/fcgi-bin/
1322 service.bfast.com/bfast/serve
1324 fourohfour.nbci.com/Members404Error.php3
1327 www.fair-ist-mehr.de/cgi-bin/bt.pl
1337 #############################################################################
1339 #############################################################################
1342 www.userfriendly.org/images/banners/banner_dp_heart\.gif
1345 #Why were these in the Waldherr blockfile?
1347 #a*.*.*.yimg.com/([0-9]|\/)*us.yimg.com/i/*
1349 # some regexps are simply too aggressive ...
1351 # equalizer to /*.*(.*[-_.])?ads?[0-9]?(/|[-_.].*|.(gif|jpe?g))
1362 .ad.siemens.de # SIEMENS Automation & Drives
1363 #add-url.altavista.com
1370 # universities don't advertise, do they? :-)
1372 .ac.uk # English Universities too! - Jon
1373 .uni-*.de # What about Germany? --oes
1374 www.ugu.com/sui/ugu/adv
1378 clubs.yahoo.com/clubs
1379 edit.my.yahoo.com/config/show_identity
1380 www.ix.de/newsticker/data/ad
1381 www.heise.de/newsticker/data/ad
1382 www.careernet.de/anzeige
1383 www.careernet.de/bewerber/stellenanzeigen
1384 www.baumgartner.de/stellenmarkt/anzeigen
1385 www.dspartner.de/Anzeigen
1386 www.aws-jobs.de/Anzeigen
1387 www.jobware.de/.*/anzeigen/
1388 www.jobworld.de/bilder/
1389 www.cnn.com/TECH/computing/.*/internet.ads/
1390 www.financial.de/shop/
1394 194.221.152.2/phptelefontmp
1395 .harvard.edu/images/banner/
1398 www.dhd.de/CGI/anzeigen/
1401 .img.web.de/web/img/
1403 www.segel.de/menu/bilder/anzeigen\.gif
1404 www.corel.com/graphics/banners/
1405 www.software.ibm.com/ad/
1406 www.omg.org/docs/ad/
1408 .sperrmuell.de/scripts/anzeigen
1409 www.freenet.de/index.html
1410 www.01019freenet.de/index.html
1411 www.freenet.de/freenet/
1412 www.01019freenet.de/freenet/
1413 webfactory.de/anzeigen.php
1415 www.internatif.org/bortzmeyer/debian/sponsor/
1418 www.software.hosting.ibm.com/ad/
1419 www.ibm.com/software/ad/
1422 www.debian.org/Pics/banner-blue\.gif
1423 www.linux.de/pics/Nachrichten_banner\.gif
1426 finder.shopping.yahoo.com/shop/
1436 .consumer-direct.com
1441 # my banking stuff => no ads.
1447 # Jon's addition: MSDN
1452 .freemail*.web.de/online/ordner/anzeigen
1453 foggy.sda.t-online.de
1454 .us.i1.yimg.com/us.yimg.com/i/pim/ad2.gif
1455 www.nexgo.de/.*/bg_banner.jpg
1457 # .*ads. matches prdownloads.sourceforge.net and many other download sites