1 #############################################################################
2 # Sample actions file for the Internet Junkbuster 2.9.x
4 # For information, see http://ijbswa.sourceforge.net/
6 # $Id: ijb.action,v 1.4 2002/03/06 21:08:00 oes Exp $
8 #############################################################################
10 #############################################################################
12 # To determine which actions apply to a request, the URL of the request is
13 # compared to all patterns in this file. Every time it matches, the list of
14 # applicable actions for this URL is incrementally updated. You can trace
15 # this process by visiting http://i.j.b/show-url-info
17 # There are 4 types of lines in this file: comments (like this line),
18 # actions, aliases and patterns, all of which are explained below.
20 #############################################################################
22 #############################################################################
24 # 1. On Domains and Paths
25 # -----------------------
27 # Generally, a pattern has the form <domain>/<path>, where both the <domain>
28 # and <path> part are optional. If you only specify a domain part, the "/"
32 # is a domain-only pattern and will match any request to www.yahoo.com
35 # means exactly the same (but is slightly less efficient)
37 # www.example.com/index.html
38 # matches only the document /index.html on www.example.com
41 # matches the document /index.html, regardless of the domain
44 # matches nothing, since it would be interpreted as a domain name and
45 # there is no top-level domain called ".html".
50 # The matching of the domain part offers some flexible options: If the
51 # domain starts or ends with a dot, it becomes unanchored at that end:
54 # matches only www.example.com
57 # matches any domain that ENDS in .example.com
60 # matches any domain that STARTS with www.
62 # Additionally, there are wildcards that you can use in the domain names
63 # themselves. They work pretty similar to shell wildcards: "*" stands for
64 # zero or more arbitrary characters, "?" stands for one, and you can define
65 # charachter classes in square brackets and they can be freely mixed:
68 # matches adserver.example.com, ads.example.com, etc but not sfads.example.com
71 # matches all of the above
74 # matches www.ipix.com, pictures.epix.com, a.b.c.d.e.upix.com etc
76 # www[1-9a-ez].example.com
77 # matches www1.example.com, www4.example.com, wwwd.example.com,
78 # wwwz.example.com etc, but not wwww.example.com
85 # Paths are specified as regular expressions. A comprehensive discussion of
86 # regular expressions wouldn't fit here, but (FIXME) someone should paste
87 # a concise intro to the regex language here.
89 # If Junkbuster was compiled with pcre support (default), Perl compatible
90 # regular expressions are used. See the pcre/docs/ direcory or man perlre
91 # (also available on http://www.perldoc.com/perl5.6/pod/perlre.html) for
94 # Please note that matching in the path is CASE INSENSITIVE by default, but
95 # you can switch to case sensitive by starting the pattern with the "(?-i)"
98 # www.example.com/(?-i)PaTtErN.*
99 # will match only documents whose path starts with PaTtErN in exactly this
102 # Partially case-sensetive and partially case-insensitive patterns are
103 # possible, but the rules about splitting them up are extremely complex
104 # - see the PCRE documentation for more information.
106 #############################################################################
108 #############################################################################
110 # There are 3 kinds of action:
112 # Boolean (e.g. "block"):
116 # Parameterized (e.g. "hide-user-agent"):
117 # +name{param} # enable and set parameter to "param"
120 # Multi-value (e.g. "add-header", "wafer"):
121 # +name{param} # enable and add parameter "param"
122 # -name{param} # remove the parameter "param"
123 # -name # disable totally
125 # The default (if you don't specify anything in this file) is not to take
126 # any actions - i.e completely disabled, so JunkBuster will just be a
127 # normal, non-blocking, non-anonymizing proxy. You must specifically
128 # enable the privacy and blocking features you need (although the
129 # provided default actions file will do that for you).
131 # Later actions always override earlier ones. For multi-valued actions,
132 # the actions are applied in the order they are specified.
134 #############################################################################
136 #############################################################################
138 # +add-header{Name: value}
139 # Adds the specified HTTP header, which is not checked for validity.
140 # You may specify this many times to specify many headers.
145 # +deanimate-gifs{last}
146 # +deanimate-gifs{first}
147 # Deanimate all animated GIF images, i.e. reduce them to their last
148 # frame. This will also shrink the images considerably. (In bytes,
150 # If the option "first" is given, the first frame of the animation
151 # is used as the replacement. If "last" is given, the last frame of
152 # the animation is used instead, which propably makes more sense for
153 # most banner animations, but also has the risk of not showing the
154 # entire last frame (if it is only a delta to an earlier frame).
157 # Downgrade HTTP/1.1 client requests to HTTP/1.0 and downgrade the
158 # responses as well. Use this action for servers that use HTTP/1.1
159 # protocol features that Junkbuster currently can't handle yet.
162 # Many sites, like yahoo.com, don't just link to other sites.
163 # Instead, they will link to some script on their own server,
164 # giving the destination as a parameter, which will then redirect
165 # you to the final target.
167 # URLs resulting from this scheme typically look like:
168 # http://some.place/some_script?http://some.where-else
170 # Sometimes, there are even multiple consecutive redirects encoded
171 # in the URL. These redirections via scripts make your web browing
172 # more traceable, since the server from which you follow such a link
173 # can see where you go to. Apart from that, valuable bandwidth and
174 # time is wasted, while your browser aks the server for one redirect
175 # after the other. Plus, it feeds the advertisers.
177 # The +fast-redirects option enables interception of these requests
178 # by junkbuster, who will cut off all but the last valid URL in the
179 # request and send a local redirect back to your browser without
180 # contacting the remote site.
183 # Filter the website through the re_filterfile
184 # FIXME: The syntax should be +filter{filename}
187 # Block any existing X-Forwarded-for header, and do not add a new one.
190 # +hide-from{spam@sittingduck.xqq}
191 # If the browser sends a "From:" header containing your e-mail address,
192 # either completely removes the header ("block"), or change it to the
193 # specified e-mail address.
195 # +hide-referer{block}
196 # +hide-referer{forge}
197 # +hide-referer{http://nowhere.com}
198 # Don't send the "Referer:" (sic) header to the web site. You can
199 # block it, forge a URL to the same server as the request (which is
200 # preferred because some sites will not send images otherwise) or
201 # set it to a constant string.
203 # +hide-referrer{...}
204 # Alternative spelling of +hide-referer. Has the same parameters,
205 # and can be freely mixed with, "+hide-referer". ("referrer" is the
206 # correct English spelling, however the HTTP specification has a
207 # bug - it requires it to be spelt "referer").
209 # +hide-user-agent{browser-type}
210 # Change the "User-Agent:" header so web servers can't tell your
211 # browser type. (Breaks many web sites). Specify the user-agent
212 # value you want - e.g., to pretend to be using Netscape on Linux:
213 # +hide-user-agent{Mozilla (X11; I; Linux 2.0.32 i586)}
214 # Or to identify yourself explicitly as a JunkBuster user:
215 # +hide-user-agent{JunkBuster/1.0}
216 # (Don't change the version number from 1.0 - after all, why tell them?)
219 # Treat this URL as an image. This only matters if it's also "+block"ed,
220 # in which case a "blocked" image can be sent rather than a HTML page.
221 # See +image-blocker{} for the control over what is actually sent.
223 # +image-blocker{logo}
224 # +image-blocker{blank}
225 # +image-blocker{pattern}
226 # +image-blocker{<URL>} with <url> being any valid image URL
227 # Decides what to do with URLs that end up tagged with {+block +image}.
228 # There are 5 options. "-image-blocker" will send a HTML "blocked" page,
229 # usually resulting in a "broken image" icon. "+image-blocker{logo}"
230 # will send a "JunkBuster" image. "+image-blocker{blank}" will send
231 # a 1x1 transparent PNG, "+image-blocker{pattern}" will send a 4x4
232 # grey/white pattern which is less intrusive than the logo but easier
233 # to recognize than the transparent one. And finally, "+image-blocker{<URL>}"
234 # will send a HTTP temporary redirect to the specified image URL.
237 # +limit-connect{portlist}
238 # The CONNECT methods exists in HTTP to allow access to secure websites
239 # (https:// URLs) through proxies. It works very simply: The proxy
240 # connects to the server on the specified port, and then short-circuits
241 # its connections to the cliant and to the remote proxy.
242 # This can be a big security hole, since CONNECT-enabled proxies can
243 # be abused as TCP relays very easily.
244 # By default, i.e. in the absence of a +limit-connect action, Junkbuster
245 # will only allow CONNECT requests to port 443, which is the standard port
247 # If you want to allow CONNECT for more ports than that, or want to forbid
248 # CONNECT altogether, you can specify a comma separated list of ports and port
249 # ranges (the latter using dashes, with the minimum defaulting to 0 and max to 65K):
251 # +limit-connect{443} # This is the default and need no be specified.
252 # +limit-connect{80,443} # Ports 80 and 443 are OK.
253 # +limit-connect{-3, 7, 20-100, 500-} # Port less than 3, 7, 20 to 100, and above 500 are OK.
256 # Prevent the website from compressing the data. Some websites do
257 # that, which is a problem for junkbuster, since +filter, +no-popup
258 # and +gif-deanimate will not work on compressed data. Will slow down
259 # connections to those websites, though.
262 # If the website sets cookies, make sure they are erased when you exit
263 # and restart your web browser. This makes profiling cookies useless,
264 # but won't break sites which require cookies so that you can log in
265 # or for transactions.
268 # Prevent the website from reading cookies
271 # Prevent the website from setting cookies
275 # Filter the website through a built-in filter to disable
276 # window.open() etc. The two alternative spellings are
280 # This action only applies if you are using a jarfile. It sends a
281 # cookie to every site stating that you do not accept any copyright
282 # on cookies sent to you, and asking them not to track you. Of
283 # course, this is a (relatively) unique header they could use to
287 # This allows you to add an arbitrary cookie. Specify it multiple
288 # times in order to add several cookies.
290 #############################################################################
293 #############################################################################
295 #############################################################################
297 #############################################################################
299 # You can define a short form for a list of permissions - e.g., instead
300 # of "-no-cookies-set -no-cookies-read -filter -fast-redirects", you can
301 # just write "shop". This is called an alias.
303 # Currently, an alias can contain any character except space, tab, '=', '{'
305 # But please use only 'a'-'z', '0'-'9', '+', and '-'.
307 # Alias names are not case sensitive.
309 # Aliases beginning with '+' or '-' may be used for system permission names
310 # in future releases - so try to avoid alias names like this. (e.g.
311 # "+no-cookies" below is not a good name)
313 # Aliases must be defined before they are used.
317 +no-cookies = +no-cookies-set +no-cookies-read
318 -no-cookies = -no-cookies-set -no-cookies-read
319 +imageblock = +block +image
320 +filter-all = +filter +no-compression
322 # Fragile sites should have the minimum changes
323 fragile = -block -deanimate-gifs -fast-redirects -filter -hide-referer -no-cookies -no-popups
325 # Shops should be allowed to set persistent cookies
326 shop = -filter -no-cookies -no-cookies-keep
328 #... etc. Customize to your heart's content.
330 #############################################################################
332 #############################################################################
344 +hide-referer{forge} \
347 +image-blocker{http://i.j.b/send-banner} \
358 #############################################################################
359 # A useful site for testing - shows all headers:
360 # http://privacy.net/analyze/
361 #############################################################################
362 {+add-header{X-Privacy: Yes please} \
363 +add-header{X-User-Tracking: No thanks!} -filter}
367 #############################################################################
368 # Test for new GIF deanimation feature.
369 # Just try http://www.oesterhelt.org/deanimate-demo with and without it.
370 #############################################################################
371 {+deanimate-gifs{last}}
372 www.oesterhelt.org/deanimate-demo
375 #############################################################################
376 # Sites that need cookies
378 # FIXME: Now cookies are allowed by default, do any of these sites
379 # need persistent cookies?
380 #############################################################################
395 #############################################################################
396 # These sites are very complex and require
397 # minimal interference.
398 #############################################################################
400 .office.microsoft.com
401 .windowsupdate.microsoft.com
404 #############################################################################
405 # Shopping sites - still want to block ads.
406 #############################################################################
409 .worldpay.com # for quietpc.com
413 #############################################################################
414 # These shops require pop-ups
415 #############################################################################
420 #############################################################################
421 # Sometimes fast-redirects catches things by mistake
422 #############################################################################
424 www.ukc.ac.uk/cgi-bin/wac\.cgi\?
426 edit.europe.yahoo.com
428 .altavista.com/.*(like|url|link):http
429 .altavista.com/trans.*urltext=http
433 #############################################################################
434 # Please don't re_filter code!
435 #############################################################################
440 #############################################################################
442 #############################################################################
444 #############################################################################
449 #############################################################################
451 #############################################################################
453 .ad.preferences.com/image.*
456 .ad-adex3.flycast.com
458 .connect.247media.ads.link4ads.com
460 .mojofarm.mediaplex.com/ad/
461 www.carbuyer.com/cgi-carbuyer/getimage.cgi
462 /phpAds(New)?/viewbanner\.php
463 .ad.de.doubleclick.net
464 /.*/count\.cgi\?.*df=
465 *.fxweb.com/v2-trackrun\.cgi
471 a196.g.akamai.net/7/196/2670/000[1-3]/images\.gmx\.net/.*images/.*/.*/
475 .smartclicks.com/.*/smart(img|banner|host|bar|site)
476 .linkexchange.com/.*/showl(ogo|e)
478 pixel.intares.net/cgi-bin/janus
479 ar.atwola.com # This serves all ads for CNN and AOL
481 #############################################################################
483 #############################################################################
485 #############################################################################
486 /.*/(.*[-_.])?ads?[0-9]?(/|[-_.].*|\.(gif|jpe?g))
487 /.*/(.*[-_.])?count(er)?(\.cgi|\.dll|\.exe|[?/])
488 /.*/(ng)?adclient\.cgi
489 /.*/(plain|live|rotate)[-_.]?ads?/
491 /.*/(sponsor)s?[0-9]?/
492 ###/*.*/(sponsor|banner)s?[0-9]?/
493 ###/*.*/.*banner([-_]?[a-z0-9]+)?\.(gif|jpg)
495 /?.*/_?(plain|live)?ads?(-banners)?/
497 /?.*/ad(sdna_image|gifs?)/
498 /?.*/ad(server|stream|juggler)\.(cgi|pl|dll|exe)
502 /?.*/adv((er)?ts?|ertis(ing|ements?))?/
506 /?.*/banner_?anzeigen
510 /?.*/cgi-bin/centralad/getimage
511 /?.*/images/addver\.gif
512 /?.*/images/advert\.gif
513 /?.*/images/marketing/.*\.(gif|jpe?g)
518 /?.*/randomads/.*\.(gif|jpe?g)
519 /?.*/rekla(ma|me|am)/.*\.(gif|jpe?g)
522 /?.*/sponsors?[0-9]?/
526 /?.*/werbung/.*\.(gif|jpe?g)
527 /?.*/adv\. # www.telegraaf.nl
528 /?.*/advert[0-9]+\.jpg
543 /bin/getimage.cgi/...\?AD
544 /bin/nph-oma.count/ct/default.shtml
545 /bin/nph-oma.count/ix/default.html
546 /cgi-bin/getimage.cgi/....\?GROUP=
548 /cgi-bin/webad.dll/ad
550 /cwmail/amzn-bm1\.gif
558 /image\.ng/transactionID
559 /images/.*/.*_anim\.gif # alvin brattli
560 /ip_img/.*\.(gif|jpe?g)
563 /netscapeworld/nw-ad/
564 /promotions/houseads/
568 /torget/jobline/.*\.gif
573 /cgi-bin/nph-adclick.exe/
574 /?.*/Image/BannerAdvertising/
576 /?.*/adlib/server\.cgi
577 /?.*/gsa_bs/gsa_bs.cmdl
581 # for our finnish friends, by Kai Puolamaki <Kai.Puolamaki@iki.fi>
582 /?.*/mainos/*.*/.*\.gif
583 /?.*/mainos/*.*/.*\.jpe?g
585 # more from a finnish friend Petri Haapio <pha@iki.fi>
587 .keltaisetsivut.fi/web/img/\.*gif
588 .haku.net/pics/pana\.*gif
590 /?.*/(.*[-_.].*)?maino(kset|nta|s).*(/|\.(gif|html?|jpe?g|png))
591 /?.*/(ilm(oitus)?|kampanja)(hallinta|kuvat?)(/|\.(gif|html?|jpe?g|png))
593 # and even more from a finnish friend Hannu Napari <Hannu.Napari@hut.fi>
594 194.251.243.50/cgi-bin/banner
598 www.iltalehti.fi/ilmkuvat
599 www.mtv3.fi/mainoskuvat
610 /?.*/images/topics/topicgimp\.gif
611 .discovery.com/.*banner_id
614 .idrink.com/frm_bottom.htm
616 /?.*/ph-ad.*\.focalink\.com
619 /we_ba/ # hausfrauenseite.de *bwhahahaaaaa*
622 /.*(ms)?backoff(ice)?.*\.(gif|jpe?g)
623 /.*(/ie4|/ie3|msie|sqlbans|powrbybo|activex|backoffice|explorer|netnow|getpoint|ntbutton|hmlink).*\.(gif|jpe?g)
624 /.*activex.*(gif|jpe?g)
625 /.*explorer?.(gif|jpe?g)
626 /.*freeie\.(gif|jpe?g)
627 /.*/ie_?(buttonlogo|static?|anim.*)?\.(gif|jpe?g)
628 /.*ie_sm\.(gif|jpe?g)
629 /.*msie(30)?\.(gif|jpe?g)
630 /.*msnlogo\.(gif|jpe?g)
631 /.*office97_ad1\.(gif|jpe?g)
632 /.*pbbobansm\.(gif|jpe?g)
633 /.*powrbybo\.(gif|jpe?g)
634 /.*sqlbans\.(gif|jpe?g)
636 /.*ie4get_animated\.gif
661 # generally useless information and promo stuff (commented out)
662 #/.*/(counter|getpcbutton|BuiltByNOF|netscape|hotmail|vcr(rated)?|rsaci(rated)?|freeloader|cache_now(_anim)?|apache_pb|now_(anim_)?button|ie_?(buttonlogo|static?|.*ani.*)?)\.(gif|jpe?g)
664 /?.*/images/na/us/brand/
665 /?.*/advantage\.(gif|jpg)
666 /?.*/advanbar\.(gif|jpg)
667 /?.*/advanbtn\.(gif|jpg)
668 /?.*/biznetsmall\.(gif|jpg)
669 /?.*/utopiad\.(gif|jpg)
670 /?.*/epipo\.(gif|jpg)
671 /?.*/amazon([a-zA-Z0-9]+)\.(gif|jpg)
672 /?.*/bnlogo.(gif|jpg)
673 /?.*/buynow([a-zA-Z0-9]+)\.(gif|jpg)
678 # for the dutch folks by a dutch friend gertjan@west.nl
681 .netdirect.nl/nd_servlet/___
683 # --------------------------------------------------------------------------
687 # --------------------------------------------------------------------------
689 # the next two lines work
692 193.158.37.3/cgi-bin/impact
699 195.63.104.*/(inbox|log|meld|folderlu|folderru|log(in|out)[lmr]u|)
707 206.165.5.162/images/gcanim\.gif
711 207.159.129.131/abacus
715 207.87.27.10/tool/includes/gifs/
718 209.1.112.252/adgraph/
719 209.1.135.14[24]:1971
724 209.207.224.22[02]/servfu.pl
725 209.239.37.214/cgi-pilotfaq/getimage\.cgi
728 209.85.89.183/cgi-bin/cycle\?host
729 212.63.155.122/(banner|concret|softwareclub)
732 216.49.10.236/web1000/
735 .ICDirect.com/cgi-bin
736 .Shannon.Austria.Eu.net/\.cgi/
741 # generic hosts (probably most effective)
749 #/.*/*preferences.com*
752 .akamaitech.net/.*/Banners/
753 .altavista.telia.com/av/pix/sponsors/
754 .amazon.com/g/associates/logos/
756 .asinglesplace.com/asplink\.gif
758 .automatiseringgids.nl/gfx/advertenties/
759 #avenuea.com/Banners/
762 .befriends.net/personals/matchmaking\.jpg
763 .bizad.nikkeibp.co.jp
764 .bs.gsanet.com/gsa_bs/
767 .cgicounter.puretec.de/cgi-bin/
768 .ciec.org/images/countdown\.gif
769 .classic.adlink.de/cgi-bin/accipiter/adserver.exe
771 #.clickhere.egroups.com/img/
773 .commonwealth.riddler.com/Commonwealth/bin/statdeploy\?[0-9]+
775 .dagbladet.no/ann-gif
778 .dn.adzerver.com/image.ad
783 .eur.a1.yimg.com/eur.yimg.com/a/
784 .us.a1.yimg.com/us.yimg.com/a/
786 #fastcounter.linkexchange.com
788 .focalink.com/SmartBanner
789 .freepage.de/cgi-bin/feets/freepage_ext/.*/rw_banner
790 .freespace.virgin.net/andy.drake
791 .futurecard.com/images/
795 .go.com/cimages\?SEEK_
797 .home.miningco.com/event.ng/.*AdID
801 image*.narrative.com/news/.*\.(gif|jpe?g)
803 #image.linkexchange.com
805 .images.yahoo.com/adv/
806 .images.yahoo.com/promotions/
809 .impartner.de/cgi-bin
810 informer2.comdirect.de:6004/cd/banner2
811 .infoseek.go.com/cimages
813 .kaufwas.com/cgi-bin/zentralbanner\.cgi
814 #leader.linkexchange.com
817 .linktrader.com/cgi-bin/
818 .logiclink.nl/cgi-bin/
819 lucky.theonion.com/cgi-bin/oniondirectin\.cgi
820 lucky.theonion.com/cgi-bin/onionimp\.cgi
821 lucky.theonion.com/cgi-bin/onionimpin\.cgi
823 .mailorderbrides.com/mlbrd2\.gif
826 .members.sexroulette.com
827 .messenger.netscape.com
829 # movielink became moviefone
830 .moviefone.com/.*(banner|newbutton|(ad|poster).*?\.gif|mmail|bytb|h_(guy|showtick|aML)|m_|icon_|NF_.*?back|h_.*?gif|media/(art|imagelinks(/MF.(ad|sponsor))))
831 mqgraphics.mapquest.com/graphics/Advertisements/
834 .news.com/cgi-bin/acc_clickthru
836 .ngserve.pcworld.com/adgifs/
844 .promotions.yahoo.com
846 .qsound.com/tracker/tracker.exe
847 .resource-marketing.com/tb/
849 .rtl.de/homepage/wb/images/
850 .schnellsuche.de/images/*
851 .shout-ads.com/cgibin/shout.php3
852 .sjmercury.com/advert/
853 .smartclicks.com/.*/smart(img|banner|host|bar|site)
856 .static.wired.com/advertising/
858 .sysdoc.pair.com/cgi-sys/cgiwrap/sysdoc/sponsor\.gif
859 .t-online.de/home/040255162-001/*
862 .teleauskunft.de/commercial/
865 .tvguide.com/rbitmaps/
868 .ultra.multimania.com
872 .us.yimg.com/promotions/
876 .videoserver.kpix.com
877 .washingtonpost.com/wp-adv/
878 .webconnect.net/cgi-bin/webconnect.dll
880 .webserv.vnunet.com/ip_img/.*ban
881 .werbung.pro-sieben.de/cgi-bin
882 .whatis.com/cgi-bin/getimage.exe/
883 www..bigyellow.com/......mat.*
885 www.addme.com/link8\.gif
886 www.aftonbladet.se/annons
887 www.americanpassage.com/
888 www.angelfire.com/in/twistriot/images/wish4\.gif
889 www.bizlink.ru/cgi-bin/irads\.cgi
890 www.blacklightmedia.com/adlemur
891 www.bluesnews.com/flameq\.gif
892 www.bluesnews.com/images/ad[0-9]+\.gif
893 www.bluesnews.com/images/gcanim3\.gif
894 www.bluesnews.com/images/throbber2\.gif
895 www.bluesnews.com/miscimages/fragbutton\.gif
896 www.businessweek.com/sponsors/
897 www.canoe.ca/AdsCanoe/
898 www.cdnow.com/MN/client.banners
901 www.clicmoi.com/cgi-bin/pub\.exe
902 www.dailycal.org/graphics/adbanner-ab\.gif
903 www.detelefoongids.com/pic/[0-9]*
904 www.dhd.de/CGI/werbepic
905 www.dsf.de/cgi-bin/site_newiac.adpos
906 www.firsttarget.com/cgi-bin/klicklog.cgi
907 www.forbes.com/forbes/gifs/ads
908 www.forbes.com/tool/includes/gifs/
909 www.fxweb.holowww.com/.*\.cgi
910 www.geocities.com/TimesSquare/Zone/5267/
911 www.goto.com/images-promoters/
912 www.handelsblatt.de/hbad
913 www.hotlinks.de/cgi-bin/barimage\.cgi
914 www.infoseek.com/cimages
915 www.infoworld.com/pageone/gif
916 www.isys.net/customer/images
917 www.javaworld.com/javaworld/jw-ad
918 www.kron.com/place-ads/
919 www.leo.org/leoclick/
920 #www.linkexchange.ru/cgi-bin/erle\.cgi
921 www.linkstation.de/cgi-bin/zeige
922 www.linux.org/graphic/miniature/
923 www.linux.org/graphic/square/
924 www.linux.org/graphic/standard/
925 www.luncha.se/annonsering
927 www.ml.org/gfx/spon/icom/
928 www.ml.org/gfx/spon/wmv
929 www.musicblvd.com/mb2/graphics/netgravity/
931 www.news.com/Midas/Images/
932 www.newscientist.com/houseads
933 www.nextcard.com/affiliates/
934 www.nikkeibp.asiabiztech.com/image/NAIS4\.gif
935 www.nordlys.no/imaker/.*/.*/.*/.....\.gif # alvin brattli
936 www.nordlys.no/imaker/.*/.*/.*/..003 # alvin brattli
937 www.oanda.com/server/banner
939 www.oneandonlynetwork.com
940 www.page2page.de/cgi-bin/
941 www.prnet.de/.*/bannerschnippel/.*\.(gif|jpe?g)
942 www.promptsoftware.com/marketing/
943 #www.reklama.ru/cgi-bin/banners/
944 www.riddler.com/sponsors/
945 www.rle.ru/cgi-bin/erle\.cgi
946 www.rock.com/images/affiliates/search_black\.gif
947 www.rtl.de/search/.*kunde
948 #www.search.com/Banners
949 www.sfgate.com/place-ads/
950 www.shareware.com/midas/images/borders-btn\.gif
951 #www.sjmercury.com/products/marcom/banners/
952 www.smartclicks.com:81
953 www.sol.dk/graphics/portalmenu
954 www.sponsornetz.de/jump/show.exe
956 www.sunworld.com/sunworldonline/icons/adinfo.sm\.gif
957 www.swwwap.com/cgi-bin/
959 www.telecom.at/icons/.*film\.(gif|jpe?g)
960 www.theonion.com/bin/
961 www.topsponsor.de/cgi-bin/show.exe
963 www.ugu.com/images/EJ\.gif
964 www.warzone.com/pics/banner/
965 www.warzone.com/wzfb/ads.cgi
967 www.websitepromote.com/partner/img/
968 www.winjey.com/onlinewerbung/*\.gif
969 www.wishing.com/webaudit
970 www.www-pool.de/cgi-bin/banner-pool
971 www2.blol.com/agrJRU\.gif
973 .yahoo.com/CategoryID=0
977 www.bannerland.de/click.exe
982 www.slate.com/redirect/
983 www.slate.com/articleimages/
985 www.forbes.com/tool/images/frontend/
988 .pathfinder.com/shopping/marketplace/images/
991 static.wired.com/images
992 .perso.estat.com/cgi-bin/perso/
993 #dinoadserver1.roka.net
994 .fooladclient*.fool.com
995 .affiliate.aol.com/static/
1003 # www.sunday-times.co.uk
1004 www.sunday-times.co.uk/standing/newsint/ticker
1006 #NeXgo (ex Germany.Net)
1010 # Block as much of GeoCities as possible
1011 # All geocities-owned images
1012 www.geocities.com/images
1013 www.geocities.com/MemberBanners/live/
1014 pic.geocities.com/images
1015 # And the popup (it still pops up, but does not eat up precious bandwidth)
1016 #www.geocities.com/ad_container/pop.html # already fixed by other regexp
1018 # from corion@informatik.uni-frankfurt.de
1021 #ads.xmonitor.net/xadengine.cgi # fixed by above regexp
1022 # Also block the japanese geocities popups
1023 www.geocities.co.jp/images
1024 # Also block the come.to, surf.to etc. popups
1027 # Also block the xoom stuff.
1029 home.talkcity.com/homepopup.html.*
1031 # Max Maischein <max.maischein@econsult.de> again ...
1032 # Halflife.net uses WON banners
1033 # Banners from Freeserve
1034 #banner.freeservers.com/cgi-bin/fs_adbar # fixed by above regexp
1035 # And those nasty va-popups !
1036 /?.*/?va_banner.html
1037 # And an all-around hit against advert*.jpg
1038 /?.*/advert[0-9]+\.jpg
1039 # And yet another Internet Explorer gif ...
1041 # Some uninteresting buttons I think...
1042 .mircx.com/images/buttons/
1043 services.mircx.com/.*\.gif
1044 # Easyspace - yet another "free disk space" provider with <yuck> banner popups
1045 www.easyspace.com/(fpub)?banner.html
1046 www.easyspace.com/100\.gif
1047 # Some russian banner exchanges
1048 .banner.ricor.ru/cgi-bin/banner.pl
1049 #www.bizlink.ru/cgi-bin/irads.cgi # already fixed by other regexp
1050 stx9.sextracker.com/stx/send/
1051 # And even more of geocities :
1052 www.geocities.com/pictures/
1053 # Gaah - www.angelfire.com - another webspace provider with popups
1054 .angelfire.com/sys/download.html
1055 # Gamasutra.com uses this ad provider
1056 sally.songline.com/@
1058 # Eule.de (search engine)
1059 # maybe images.eule.de as a whole...
1060 www.eule.de/cgi-bin/
1061 images.eule.de/comdirect\.gif
1062 images.eule.de/wp\.gif
1063 .aladin.de/125_1\.gif
1064 images.eule.de/neu/books\.gif
1066 # --------------------------------------------------------------------------
1070 # --------------------------------------------------------------------------
1072 # some images on cnn's website just suck!
1075 /.*cnnpostopinionhome.\.gif
1076 /.*custom_feature\.gif
1077 /.*explore.anim.*gif
1079 /.*pathnet.warner\.gif
1080 /.*images/cnnfn_infoseek\.gif
1081 /.*images/pathfinder_btn2\.gif
1082 /.*img/gen/fosz_front_em_abc\.gif
1083 /.*img/promos/bnsearch\.gif
1084 /.*navbars/nav_partner_logos\.gif
1085 /BarnesandNoble/images/bn.recommend.box.*
1086 /digitaljam/images/digital_ban\.gif
1087 /hotstories/companies/images/companies_banner\.gif
1088 /markets/images/markets_banner\.gif
1089 /ows-img/bnoble\.gif
1090 /ows-img/nb_Infoseek\.gif
1091 .cnn.com/images/custom/totale\.gif
1092 .cnn.com/images/lotd/custom.wheels\.gif
1093 .cnn.com/images/.*/by/main.12\.gif
1094 .cnn.com/images/.*/find115\.gif
1095 .cnn.com/.*/free.email.120\.gif
1096 .cnnfn.com/images/left_banner\.gif
1098 www.cnn.com/images/.*/bn/books\.gif
1099 www.cnn.com/images/.*/pointcast\.gif
1100 www.cnn.com/images/.*/fusa\.gif
1101 .cnn.com/images/.*/start120\.gif
1102 images.cnn.com/SHOP/
1106 # the / indicates the beginning of the path (and no longer the FQDN)
1112 /gif/buttons/banner_
1113 /gif/buttons/cd_shop_
1114 /gif/cd_shop/cd_shop_ani_
1117 /av/gifs/av_map\.gif
1118 /av/gifs/av_logo\.gif
1119 /av/gifs/new/ns\.gif
1120 altavista.com/i/valsdc3\.gif
1121 jump.altavista.com/gn_sf
1124 tucows./images/locallogo\.gif
1129 # simpliemu.hypermart.net/frames.html
1130 .go2net.com/mgic/adpopup
1131 .go2net.com/metaspy/images/exposed\.gif
1132 .go2net.com/metaspy/images/ms_un\.gif
1135 www.cebu-usa.com/cwbanim1\.gif
1136 www.cebu-usa.com/Connection\.jpg
1137 www.cebu-usa.com/phonead\.gif
1138 www.cebu-usa.com/ban3\.jpg
1139 www.cebu-usa.com/tlban\.gif
1140 www.cebu-usa.com/apwlogo1\.gif
1141 www.cebu-usa.com/rose\.gif
1144 www.fnet.de/img/geldboerselogo\.jpg
1146 # hirsch@mathcs.emory.edu
1147 /images/getareal2\.gif
1149 www.assalom.com/aziza/logos/cniaffil\.gif
1150 www.assalom.com/aziza/logos/4starrl1\.gif
1151 www.phantomstar.com/images/media/m1\.gif
1154 .wahlstreet.de/MediaW\$/tsponline\.gif
1155 .wahlstreet.de/MediaW\$/dzii156x60\.gif
1156 .wahlstreet.de/MediaW\$/etban156x60_2_opt2\.gif
1160 /pics/getareal1\.gif
1162 /ltbs/cgi-bin/click.cgi
1163 .linuxtoday.com/ltbs/pics/
1167 /include/watermark/v2/
1169 # Reinier Bikker <R.P.Bikker@phys.uu.nl>
1172 # Mark Lutz <luma@nikocity.de>
1173 /.*/*werb.*\.(gif|jpe?g) # hope that's not to restrictive
1175 #Free Yellow thing at bottom of pages (HereticPC)
1176 www.freeyellow.com/images/powerlink5a\.gif
1177 www.freeyellow.com/images/powerlink5b\.gif
1178 www.freeyellow.com/images/powerlink5c\.gif
1179 www.freeyellow.com/images/powerlink5d\.gif
1180 www.freeyellow.com/images/powerlink5e\.gif
1183 www.eads.com/images/refbutton\.gif
1184 www.fortunecity.com/console2/newnav/*
1185 www.goldetc.net/search\.gif
1186 www.cris.com/~Lzrdking/carpix/cars3-le\.gif
1187 www.justfreestuff.com/scott\.gif
1188 www.cyberthrill.com/entrance\.gif
1189 secure.pec.net/images/pec69ani\.gif
1190 www.new-direction.com/avviva\.gif
1191 /.*internetmarketingcenter\.gif
1192 www.new-direction.com/wp-linkexchange-loop\.gif
1193 www.new-direction.com/windough\.gif
1194 www.digitalwork.com/universal_images/affiliate/dw_le_3\.gif
1195 service.bfast.com/bfast/click/*
1196 www.new-direction.com/magiclearning\.gif
1197 www.new-direction.com/mailloop\.gif
1199 www.free-banners.com/images/hitslogo\.gif
1200 rob.simplenet.com/dyndns/fortune5\.gif
1201 .nasdaq-amex.com/images/bn_ticker\.gif
1204 # navilor@hotmail.com
1207 # wayne@staff.msen.com
1209 a*.*.*.yimg.com/([0-9]*|\/)*us.yimg.com/*
1212 www.realtop50.com/cgi-bin/ad
1216 www.yacht.de/images/(my_ani|eissingani|chartertrans|fum|schnupper|fysshop|garmin)\.gif
1217 www.sponsorweb.de/web-sponsor/nt-bin/show.exe
1220 # Club-internet pops up a complain if you refuse cookie (still pops up...)
1221 perso.club-internet.fr/html/Popup/popup_frame_nocookie.html
1222 perso.club-internet.fr/pagesperso/popup_nocookie.html
1224 .gmx.net/images/newsbanner/
1227 .quicken.lexware.de/images/us7-468x60.gif
1228 /img/special/chatpromo\.gif
1229 www.travelocity.com/images/promos/
1231 # wonder that that does...
1234 #/*.*/phpAds/viewbanner.php
1235 #/*.*/phpAds/phpads.php
1237 www.linux-magazin.de/banner
1238 .comtrack.comclick.com
1240 .iac-online.de/filler
1242 .media.interadnet.com
1243 .stat.www.fi/cgi-bin
1247 .disneystoreaffiliates.com
1249 .powerwork.mobile.de/cgi-bin/getimage\.cgi
1253 ####################################################
1256 # The Register ads - oh, and all images in Register stories (sigh).
1257 www.theregister.co.uk/media/
1259 # Used on http://www.theregister.co.uk/
1260 # Sample advert URL:
1261 # http://secure.webconnect.net/cgi-bin/webconnecthome.dll?F467
1265 www.dilbert.com/comics/dilbert/images/.*_140x800.*\.gif
1268 # Uses URL: http://www.stattrack.com/cgi-bin/stats/image.cgi
1270 # And loads JavaScript from http://www.stattrack.com/stats/code
1271 www.stattrack.com/stats/
1273 #Now they're Yahoo GeoCities, their junk is in a different place.
1274 ##geo.yahoo.com/serv
1275 ##visit.geocities.com/visit.gif
1276 .yimg.com/?.*/www.geocities.com/js_source
1277 #http://us.toto.geo.yahoo.com/toto?s=76001086
1279 .visit.geocities.com
1280 .yimg.com/?.*/www.geocities.com/
1282 #http://counter16.bravenet.com/counter.php
1285 #http://stat.cybermonitor.com/7emezone_p?1707_USdvd
1288 #http://members.tripod.com/adm/popup/.....
1289 members.tripod.com/adm/popup/
1291 #This is the worst ad idea ever!
1292 #count.exitexchange.com/exit/1100661
1293 #count.exitexchange.com/clients/navbar.html
1294 #(used in http://skyhivisuals.tripod.com/malfunctions_.htm)
1300 #This site traps the browser
1303 #privacy.net runs ads
1306 #Lindsay.Marshall@newcastle.ac.uk suggested these, to kill Opera adverts:
1311 dinoadserver*.roka.net
1313 logout.tvspielfilm.de
1315 www.freenet.de/customerindex\.html
1317 .fxweb.com/v2-trackrun\.cgi
1318 rtldating.peopleunited.de
1320 www.zdnet.com/fcgi-bin/
1321 service.bfast.com/bfast/serve
1323 fourohfour.nbci.com/Members404Error.php3
1326 www.fair-ist-mehr.de/cgi-bin/bt.pl
1336 #############################################################################
1338 #############################################################################
1341 www.userfriendly.org/images/banners/banner_dp_heart\.gif
1344 #Why were these in the Waldherr blockfile?
1346 #a*.*.*.yimg.com/([0-9]|\/)*us.yimg.com/i/*
1348 # some regexps are simply too aggressive ...
1350 # equalizer to /*.*(.*[-_.])?ads?[0-9]?(/|[-_.].*|.(gif|jpe?g))
1361 .ad.siemens.de # SIEMENS Automation & Drives
1362 #add-url.altavista.com
1369 # univ. don't advertise, do they :-)
1371 .ac.uk # English Universities too! - Jon
1372 .uni-*.de # What about Germany? --oes
1373 www.ugu.com/sui/ugu/adv
1377 clubs.yahoo.com/clubs
1378 edit.my.yahoo.com/config/show_identity
1379 www.ix.de/newsticker/data/ad
1380 www.heise.de/newsticker/data/ad
1381 www.careernet.de/anzeige
1382 www.careernet.de/bewerber/stellenanzeigen
1383 www.baumgartner.de/stellenmarkt/anzeigen
1384 www.dspartner.de/Anzeigen
1385 www.aws-jobs.de/Anzeigen
1386 www.jobware.de/.*/anzeigen/
1387 www.jobworld.de/bilder/
1388 www.cnn.com/TECH/computing/.*/internet.ads/
1389 www.financial.de/shop/
1393 194.221.152.2/phptelefontmp
1394 .harvard.edu/images/banner/
1397 www.dhd.de/CGI/anzeigen/
1400 .img.web.de/web/img/
1402 www.segel.de/menu/bilder/anzeigen\.gif
1403 www.corel.com/graphics/banners/
1404 www.software.ibm.com/ad/
1405 www.omg.org/docs/ad/
1407 .sperrmuell.de/scripts/anzeigen
1408 www.freenet.de/index.html
1409 www.01019freenet.de/index.html
1410 www.freenet.de/freenet/
1411 www.01019freenet.de/freenet/
1412 webfactory.de/anzeigen.php
1414 www.internatif.org/bortzmeyer/debian/sponsor/
1417 www.software.hosting.ibm.com/ad/
1418 www.ibm.com/software/ad/
1421 www.debian.org/Pics/banner-blue\.gif
1422 www.linux.de/pics/Nachrichten_banner\.gif
1425 finder.shopping.yahoo.com/shop/
1435 .consumer-direct.com
1440 # my banking stuff => no ads.
1446 # Jon's addition: MSDN
1451 .freemail*.web.de/online/ordner/anzeigen
1452 foggy.sda.t-online.de
1453 .us.i1.yimg.com/us.yimg.com/i/pim/ad2.gif
1454 www.nexgo.de/.*/bg_banner.jpg
1456 # .*ads. matches prdownloads.sourceforge.net and many other download sites