1 #############################################################################
2 # Sample actions file for the Internet Junkbuster 2.9.x
4 # For information, see http://ijbswa.sourceforge.net/
6 # $Id: actionsfile,v 1.9 2001/10/15 22:06:16 joergs Exp $
8 #############################################################################
10 #############################################################################
12 # To determine which actions apply to a request, the URL of the request is
13 # compared to all patterns in this file. Every time it matches, the list of
14 # applicable actions for this URL is incrementally updated. You can trace
15 # this process by visiting http://i.j.b/show-url-info
17 # There are 4 types of lines in this file: comments (like this line),
18 # actions, aliases and patterns, all of which are explained below.
20 #############################################################################
22 #############################################################################
24 # 1. On Domains and Paths
25 # -----------------------
27 # Generally, a pattern has the form <domain>/<path>, where both the <domain>
28 # and <path> part are optional. If you only specify a domain part, the "/"
32 # is a domain-only pattern and will match any request to www.yahoo.com
35 # means exactly the same (but is slightly less efficient)
37 # www.example.com/index.html
38 # matches only the document /index.html on www.example.com
41 # matches the document /index.html, regardless of the domain
44 # matches nothing, since it would be interpreted as a domain name and
45 # there is no top-level domain called ".html".
50 # The matching of the domain part offers some flexible options: If the
51 # domain starts or ends with a dot, it becomes unanchored at that end:
54 # matches only www.example.com
57 # matches any domain that ENDS in .example.com
60 # matches any domain that STARTS with www.
62 # Additionally, there are wildcards that you can use in the domain names
63 # themselves. They work pretty similar to shell wildcards: "*" stands for
64 # zero or more arbitrary characters, "?" stands for one, and you can define
65 # charachter classes in square brackets and they can be freely mixed:
68 # matches adserver.example.com, ads.example.com, etc but not sfads.example.com
71 # matches all of the above
74 # matches www.ipix.com, pictures.epix.com, a.b.c.d.e.upix.com etc
76 # www[1-9a-ez].example.com
77 # matches www1.example.com, www4.example.com, wwwd.example.com,
78 # wwwz.example.com etc, but not wwww.example.com
85 # Paths are specified as regular expressions. A comprehensive discussion of
86 # regular expressions wouldn't fit here, but (FIXME) someone should paste
87 # a concise intro to the regex language here.
89 # If Junkbuster was compiled with pcre support (default), Perl compatible
90 # regular expressions are used. See the pcre/docs/ direcory or man perlre
91 # (also available on http://www.perldoc.com/perl5.6/pod/perlre.html) for
94 # Please note that matching in the path is CASE INSENSITIVE by default, but
95 # you can switch to case sensitive by starting the pattern with the "(?-i)"
98 # www.example.com/(?-i)PaTtErN.*
99 # will match only documents whose path starts with PaTtErN in exactly this
102 # Partially case-sensetive and partially case-insensitive patterns are
103 # possible, but the rules about splitting them up are extremely complex
104 # - see the PCRE documentation for more information.
106 #############################################################################
108 #############################################################################
110 # There are 3 kinds of action:
112 # Boolean (e.g. "block"):
116 # Parameterized (e.g. "hide-user-agent"):
117 # +name{param} # enable and set parameter to "param"
120 # Multi-value (e.g. "add-header", "wafer"):
121 # +name{param} # enable and add parameter "param"
122 # -name{param} # remove the parameter "param"
123 # -name # disable totally
125 # The default (if you don't specify anything in this file) is not to take
126 # any actions - i.e completely disabled, so JunkBuster will just be a
127 # normal, non-blocking, non-anonymizing proxy. You must specifically
128 # enable the privacy and blocking features you need (although the
129 # provided default actions file will do that for you).
131 # Later actions always override earlier ones. For multi-valued actions,
132 # the actions are applied in the order they are specified.
134 #############################################################################
136 #############################################################################
138 # +add-header{Name: value}
139 # Adds the specified HTTP header, which is not checked for validity.
140 # You may specify this many times to specify many headers.
145 # +deanimate-gifs{last}
146 # +deanimate-gifs{first}
147 # Deanimate all animated GIF images, i.e. reduce them to their last
148 # frame. This will also shrink the images considerably. (In bytes,
150 # If the option "first" is given, the first frame of the animation
151 # is used as the replacement. If "last" is given, the last frame of
152 # the animation is used instead, which propably makes more sense for
153 # most banner animations, but also has the risk of not showing the
154 # entire last frame (if it is only a delta to an earlier frame).
157 # Downgrade HTTP/1.1 client requests to HTTP/1.0 and downgrade the
158 # responses as well. Use this action for servers that use HTTP/1.1
159 # protocol features that Junkbuster currently can't handle yet.
162 # Many sites, like yahoo.com, don't just link to other sites.
163 # Instead, they will link to some script on their own server,
164 # giving the destination as a parameter, which will then redirect
165 # you to the final target.
167 # URLs resulting from this scheme typically look like:
168 # http://some.place/some_script?http://some.where-else
170 # Sometimes, there are even multiple consecutive redirects encoded
171 # in the URL. These redirections via scripts make your web browing
172 # more traceable, since the server from which you follow such a link
173 # can see where you go to. Apart from that, valuable bandwidth and
174 # time is wasted, while your browser aks the server for one redirect
175 # after the other. Plus, it feeds the advertisers.
177 # The +fast-redirects option enables interception of these requests
178 # by junkbuster, who will cut off all but the last valid URL in the
179 # request and send a local redirect back to your browser without
180 # contacting the remote site.
183 # Filter the website through the re_filterfile
184 # FIXME: The syntax should be +filter{filename}
187 # Block any existing X-Forwarded-for header, and do not add a new one.
190 # +hide-from{spam@sittingduck.xqq}
191 # If the browser sends a "From:" header containing your e-mail address,
192 # either completely removes the header ("block"), or change it to the
193 # specified e-mail address.
195 # +hide-referer{block}
196 # +hide-referer{forge}
197 # +hide-referer{http://nowhere.com}
198 # Don't send the "Referer:" (sic) header to the web site. You can
199 # block it, forge a URL to the same server as the request (which is
200 # preferred because some sites will not send images otherwise) or
201 # set it to a constant string.
203 # +hide-referrer{...}
204 # Alternative spelling of +hide-referer. Has the same parameters,
205 # and can be freely mixed with, "+hide-referer". ("referrer" is the
206 # correct English spelling, however the HTTP specification has a
207 # bug - it requires it to be spelt "referer").
209 # +hide-user-agent{browser-type}
210 # Change the "User-Agent:" header so web servers can't tell your
211 # browser type. (Breaks many web sites). Specify the user-agent
212 # value you want - e.g., to pretend to be using Netscape on Linux:
213 # +hide-user-agent{Mozilla (X11; I; Linux 2.0.32 i586)}
214 # Or to identify yourself explicitly as a JunkBuster user:
215 # +hide-user-agent{JunkBuster/1.0}
216 # (Don't change the version number from 1.0 - after all, why tell them?)
219 # Treat this URL as an image. This only matters if it's also "+block"ed,
220 # in which case a "blocked" image can be sent rather than a HTML page.
221 # See +image-blocker{} for the control over what is actually sent.
223 # +image-blocker{logo}
224 # +image-blocker{blank}
225 # +image-blocker{http://i.j.b/send-banner}
226 # Decides what to do with URLs that end up tagged with {+block +image}.
227 # There are 4 options. "-image-blocker" will send a HTML "blocked" page,
228 # usually resulting in a "broken image" icon. "+image-blocker{logo}"
229 # will send a "JunkBuster" image. "+image-blocker{blank}" will send
230 # a 1x1 transparent GIF. And finally, "+image-blocker{http://xyz.com}"
231 # will send a HTTP temporary redirect to the specified image - this
232 # has the advantage of the icon being beeing cached by the browser,
233 # which will speed up the display.
236 # +limit-connect{portlist}
237 # The CONNECT methods exists in HTTP to allow access to secure websites
238 # (https:// URLs) through proxies. It works very simply: The proxy
239 # connects to the server on the specified port, and then short-circuits
240 # its connections to the cliant and to the remote proxy.
241 # This can be a big security hole, since CONNECT-enabled proxies can
242 # be abused as TCP relays very easily.
243 # By default, i.e. in the absence of a +limit-connect action, Junkbuster
244 # will only allow CONNECT requests to port 443, which is the standard port
246 # If you want to allow CONNECT for more ports than that, or want to forbid
247 # CONNECT altogether, you can specify a comma separated list of ports and port
248 # ranges (the latter using dashes, with the minimum defaulting to 0 and max to 65K):
250 # +limit-connect{443} # This is the default and need no be specified.
251 # +limit-connect{80,443} # Ports 80 and 443 are OK.
252 # +limit-connect{-3, 7, 20-100, 500-} # Port less than 3, 7, 20 to 100, and above 500 are OK.
255 # Prevent the website from compressing the data. Some websites do
256 # that, which is a problem for junkbuster, since +filter, +no-popup
257 # and +gif-deanimate will not work on compressed data. Will slow down
258 # connections to those websites, though.
261 # If the website sets cookies, make sure they are erased when you exit
262 # and restart your web browser. This makes profiling cookies useless,
263 # but won't break sites which require cookies so that you can log in
264 # or for transactions.
267 # Prevent the website from reading cookies
270 # Prevent the website from setting cookies
274 # Filter the website through a built-in filter to disable
275 # window.open() etc. The two alternative spellings are
279 # This action only applies if you are using a jarfile. It sends a
280 # cookie to every site stating that you do not accept any copyright
281 # on cookies sent to you, and asking them not to track you. Of
282 # course, this is a (relatively) unique header they could use to
286 # This allows you to add an arbitrary cookie. Specify it multiple
287 # times in order to add several cookies.
289 #############################################################################
292 #############################################################################
294 #############################################################################
296 #############################################################################
298 # You can define a short form for a list of permissions - e.g., instead
299 # of "-no-cookies-set -no-cookies-read -filter -fast-redirects", you can
300 # just write "shop". This is called an alias.
302 # Currently, an alias can contain any character except space, tab, '=', '{'
304 # But please use only 'a'-'z', '0'-'9', '+', and '-'.
306 # Alias names are not case sensitive.
308 # Aliases beginning with '+' or '-' may be used for system permission names
309 # in future releases - so try to avoid alias names like this. (e.g.
310 # "+no-cookies" below is not a good name)
312 # Aliases must be defined before they are used.
316 +no-cookies = +no-cookies-set +no-cookies-read
317 -no-cookies = -no-cookies-set -no-cookies-read
318 +imageblock = +block +image
319 +filter-all = +filter +no-compression
321 # Fragile sites should have the minimum changes
322 fragile = -block -deanimate-gifs -fast-redirects -filter -hide-referer -no-cookies -no-popups
324 # Shops should be allowed to set persistent cookies
325 shop = -filter -no-cookies -no-cookies-keep
327 #... etc. Customize to your heart's content.
329 #############################################################################
331 #############################################################################
343 +hide-referer{forge} \
346 +image-blocker{http://i.j.b/send-banner} \
357 #############################################################################
358 # A useful site for testing - shows all headers:
359 # http://privacy.net/analyze/
360 #############################################################################
361 {+add-header{X-Privacy: Yes please} \
362 +add-header{X-User-Tracking: No thanks!} -filter}
366 #############################################################################
367 # Test for new GIF deanimation feature.
368 # Just try http://www.oesterhelt.org/deanimate-demo with and without it.
369 #############################################################################
370 {+deanimate-gifs{last}}
371 www.oesterhelt.org/deanimate-demo
374 #############################################################################
375 # Sites that need cookies
377 # FIXME: Now cookies are allowed by default, do any of these sites
378 # need persistent cookies?
379 #############################################################################
394 #############################################################################
395 # These sites are very complex and require
396 # minimal interference.
397 #############################################################################
399 .office.microsoft.com
400 .windowsupdate.microsoft.com
403 #############################################################################
404 # Shopping sites - still want to block ads.
405 #############################################################################
408 .worldpay.com # for quietpc.com
412 #############################################################################
413 # These shops require pop-ups
414 #############################################################################
419 #############################################################################
420 # Sometimes fast-redirects catches things by mistake
421 #############################################################################
423 www.ukc.ac.uk/cgi-bin/wac\.cgi\?
425 edit.europe.yahoo.com
427 .altavista.com/.*(like|url|link):http
428 .altavista.com/trans.*urltext=http
432 #############################################################################
433 # Please don't re_filter code!
434 #############################################################################
439 #############################################################################
441 #############################################################################
443 #############################################################################
448 #############################################################################
450 #############################################################################
452 .ad.preferences.com/image.*
455 .ad-adex3.flycast.com
457 .connect.247media.ads.link4ads.com
459 .mojofarm.mediaplex.com/ad/
460 www.carbuyer.com/cgi-carbuyer/getimage.cgi
461 /phpAds(New)?/viewbanner\.php
462 .ad.de.doubleclick.net
463 /.*/count\.cgi\?.*df=
464 *.fxweb.com/v2-trackrun\.cgi
470 a196.g.akamai.net/7/196/2670/000[1-3]/images\.gmx\.net/.*images/.*/.*/
474 .smartclicks.com/.*/smart(img|banner|host|bar|site)
475 .linkexchange.com/.*/showl(ogo|e)
477 pixel.intares.net/cgi-bin/janus
479 #############################################################################
481 #############################################################################
483 #############################################################################
484 /.*/(.*[-_.])?ads?[0-9]?(/|[-_.].*|\.(gif|jpe?g))
485 /.*/(.*[-_.])?count(er)?(\.cgi|\.dll|\.exe|[?/])
486 /.*/(ng)?adclient\.cgi
487 /.*/(plain|live|rotate)[-_.]?ads?/
489 /.*/(sponsor)s?[0-9]?/
490 ###/*.*/(sponsor|banner)s?[0-9]?/
491 ###/*.*/.*banner([-_]?[a-z0-9]+)?\.(gif|jpg)
493 /?.*/_?(plain|live)?ads?(-banners)?/
495 /?.*/ad(sdna_image|gifs?)/
496 /?.*/ad(server|stream|juggler)\.(cgi|pl|dll|exe)
500 /?.*/adv((er)?ts?|ertis(ing|ements?))?/
504 /?.*/banner_?anzeigen
508 /?.*/cgi-bin/centralad/getimage
509 /?.*/images/addver\.gif
510 /?.*/images/advert\.gif
511 /?.*/images/marketing/.*\.(gif|jpe?g)
516 /?.*/randomads/.*\.(gif|jpe?g)
517 /?.*/rekla(ma|me|am)/.*\.(gif|jpe?g)
520 /?.*/sponsors?[0-9]?/
524 /?.*/werbung/.*\.(gif|jpe?g)
525 /?.*/adv\. # www.telegraaf.nl
526 /?.*/advert[0-9]+\.jpg
541 /bin/getimage.cgi/...\?AD
542 /bin/nph-oma.count/ct/default.shtml
543 /bin/nph-oma.count/ix/default.html
544 /cgi-bin/getimage.cgi/....\?GROUP=
546 /cgi-bin/webad.dll/ad
548 /cwmail/amzn-bm1\.gif
556 /image\.ng/transactionID
557 /images/.*/.*_anim\.gif # alvin brattli
558 /ip_img/.*\.(gif|jpe?g)
561 /netscapeworld/nw-ad/
562 /promotions/houseads/
566 /torget/jobline/.*\.gif
571 /cgi-bin/nph-adclick.exe/
572 /?.*/Image/BannerAdvertising/
574 /?.*/adlib/server\.cgi
575 /?.*/gsa_bs/gsa_bs.cmdl
579 # for our finnish friends, by Kai Puolamaki <Kai.Puolamaki@iki.fi>
580 /?.*/mainos/*.*/.*\.gif
581 /?.*/mainos/*.*/.*\.jpe?g
583 # more from a finnish friend Petri Haapio <pha@iki.fi>
585 .keltaisetsivut.fi/web/img/\.*gif
586 .haku.net/pics/pana\.*gif
588 /?.*/(.*[-_.].*)?maino(kset|nta|s).*(/|\.(gif|html?|jpe?g|png))
589 /?.*/(ilm(oitus)?|kampanja)(hallinta|kuvat?)(/|\.(gif|html?|jpe?g|png))
591 # and even more from a finnish friend Hannu Napari <Hannu.Napari@hut.fi>
592 194.251.243.50/cgi-bin/banner
596 www.iltalehti.fi/ilmkuvat
597 www.mtv3.fi/mainoskuvat
608 /?.*/images/topics/topicgimp\.gif
609 .discovery.com/.*banner_id
612 .idrink.com/frm_bottom.htm
614 /?.*/ph-ad.*\.focalink\.com
617 /we_ba/ # hausfrauenseite.de *bwhahahaaaaa*
620 /.*(ms)?backoff(ice)?.*\.(gif|jpe?g)
621 /.*(/ie4|/ie3|msie|sqlbans|powrbybo|activex|backoffice|explorer|netnow|getpoint|ntbutton|hmlink).*\.(gif|jpe?g)
622 /.*activex.*(gif|jpe?g)
623 /.*explorer?.(gif|jpe?g)
624 /.*freeie\.(gif|jpe?g)
625 /.*/ie_?(buttonlogo|static?|anim.*)?\.(gif|jpe?g)
626 /.*ie_sm\.(gif|jpe?g)
627 /.*msie(30)?\.(gif|jpe?g)
628 /.*msnlogo\.(gif|jpe?g)
629 /.*office97_ad1\.(gif|jpe?g)
630 /.*pbbobansm\.(gif|jpe?g)
631 /.*powrbybo\.(gif|jpe?g)
632 /.*sqlbans\.(gif|jpe?g)
634 /.*ie4get_animated\.gif
659 # generally useless information and promo stuff (commented out)
660 #/.*/(counter|getpcbutton|BuiltByNOF|netscape|hotmail|vcr(rated)?|rsaci(rated)?|freeloader|cache_now(_anim)?|apache_pb|now_(anim_)?button|ie_?(buttonlogo|static?|.*ani.*)?)\.(gif|jpe?g)
662 /?.*/images/na/us/brand/
663 /?.*/advantage\.(gif|jpg)
664 /?.*/advanbar\.(gif|jpg)
665 /?.*/advanbtn\.(gif|jpg)
666 /?.*/biznetsmall\.(gif|jpg)
667 /?.*/utopiad\.(gif|jpg)
668 /?.*/epipo\.(gif|jpg)
669 /?.*/amazon([a-zA-Z0-9]+)\.(gif|jpg)
670 /?.*/bnlogo.(gif|jpg)
671 /?.*/buynow([a-zA-Z0-9]+)\.(gif|jpg)
676 # for the dutch folks by a dutch friend gertjan@west.nl
679 .netdirect.nl/nd_servlet/___
681 # --------------------------------------------------------------------------
685 # --------------------------------------------------------------------------
687 # the next two lines work
690 193.158.37.3/cgi-bin/impact
697 195.63.104.*/(inbox|log|meld|folderlu|folderru|log(in|out)[lmr]u|)
705 206.165.5.162/images/gcanim\.gif
709 207.159.129.131/abacus
713 207.87.27.10/tool/includes/gifs/
716 209.1.112.252/adgraph/
717 209.1.135.14[24]:1971
722 209.207.224.22[02]/servfu.pl
723 209.239.37.214/cgi-pilotfaq/getimage\.cgi
726 209.85.89.183/cgi-bin/cycle\?host
727 212.63.155.122/(banner|concret|softwareclub)
730 216.49.10.236/web1000/
733 .ICDirect.com/cgi-bin
734 .Shannon.Austria.Eu.net/\.cgi/
739 # generic hosts (probably most effective)
747 #/.*/*preferences.com*
750 .akamaitech.net/.*/Banners/
751 .altavista.telia.com/av/pix/sponsors/
752 .amazon.com/g/associates/logos/
754 .asinglesplace.com/asplink\.gif
756 .automatiseringgids.nl/gfx/advertenties/
757 #avenuea.com/Banners/
760 .befriends.net/personals/matchmaking\.jpg
761 .bizad.nikkeibp.co.jp
762 .bs.gsanet.com/gsa_bs/
765 .cgicounter.puretec.de/cgi-bin/
766 .ciec.org/images/countdown\.gif
767 .classic.adlink.de/cgi-bin/accipiter/adserver.exe
769 #.clickhere.egroups.com/img/
771 .commonwealth.riddler.com/Commonwealth/bin/statdeploy\?[0-9]+
773 .dagbladet.no/ann-gif
776 .dn.adzerver.com/image.ad
781 .eur.a1.yimg.com/eur.yimg.com/a/
782 .us.a1.yimg.com/us.yimg.com/a/
784 #fastcounter.linkexchange.com
786 .focalink.com/SmartBanner
787 .freepage.de/cgi-bin/feets/freepage_ext/.*/rw_banner
788 .freespace.virgin.net/andy.drake
789 .futurecard.com/images/
793 .go.com/cimages\?SEEK_
795 .home.miningco.com/event.ng/.*AdID
799 image*.narrative.com/news/.*\.(gif|jpe?g)
801 #image.linkexchange.com
803 .images.yahoo.com/adv/
804 .images.yahoo.com/promotions/
807 .impartner.de/cgi-bin
808 informer2.comdirect.de:6004/cd/banner2
809 .infoseek.go.com/cimages
811 .kaufwas.com/cgi-bin/zentralbanner\.cgi
812 #leader.linkexchange.com
815 .linktrader.com/cgi-bin/
816 .logiclink.nl/cgi-bin/
817 lucky.theonion.com/cgi-bin/oniondirectin\.cgi
818 lucky.theonion.com/cgi-bin/onionimp\.cgi
819 lucky.theonion.com/cgi-bin/onionimpin\.cgi
821 .mailorderbrides.com/mlbrd2\.gif
824 .members.sexroulette.com
825 .messenger.netscape.com
827 # movielink became moviefone
828 .moviefone.com/.*(banner|newbutton|(ad|poster).*?\.gif|mmail|bytb|h_(guy|showtick|aML)|m_|icon_|NF_.*?back|h_.*?gif|media/(art|imagelinks(/MF.(ad|sponsor))))
829 mqgraphics.mapquest.com/graphics/Advertisements/
832 .news.com/cgi-bin/acc_clickthru
834 .ngserve.pcworld.com/adgifs/
842 .promotions.yahoo.com
844 .qsound.com/tracker/tracker.exe
845 .resource-marketing.com/tb/
847 .rtl.de/homepage/wb/images/
848 .schnellsuche.de/images/*
849 .shout-ads.com/cgibin/shout.php3
850 .sjmercury.com/advert/
851 .smartclicks.com/.*/smart(img|banner|host|bar|site)
854 .static.wired.com/advertising/
856 .sysdoc.pair.com/cgi-sys/cgiwrap/sysdoc/sponsor\.gif
857 .t-online.de/home/040255162-001/*
860 .teleauskunft.de/commercial/
863 .tvguide.com/rbitmaps/
866 .ultra.multimania.com
870 .us.yimg.com/promotions/
874 .videoserver.kpix.com
875 .washingtonpost.com/wp-adv/
876 .webconnect.net/cgi-bin/webconnect.dll
878 .webserv.vnunet.com/ip_img/.*ban
879 .werbung.pro-sieben.de/cgi-bin
880 .whatis.com/cgi-bin/getimage.exe/
881 www..bigyellow.com/......mat.*
883 www.addme.com/link8\.gif
884 www.aftonbladet.se/annons
885 www.americanpassage.com/
886 www.angelfire.com/in/twistriot/images/wish4\.gif
887 www.bizlink.ru/cgi-bin/irads\.cgi
888 www.blacklightmedia.com/adlemur
889 www.bluesnews.com/flameq\.gif
890 www.bluesnews.com/images/ad[0-9]+\.gif
891 www.bluesnews.com/images/gcanim3\.gif
892 www.bluesnews.com/images/throbber2\.gif
893 www.bluesnews.com/miscimages/fragbutton\.gif
894 www.businessweek.com/sponsors/
895 www.canoe.ca/AdsCanoe/
896 www.cdnow.com/MN/client.banners
899 www.clicmoi.com/cgi-bin/pub\.exe
900 www.dailycal.org/graphics/adbanner-ab\.gif
901 www.detelefoongids.com/pic/[0-9]*
902 www.dhd.de/CGI/werbepic
903 www.dsf.de/cgi-bin/site_newiac.adpos
904 www.firsttarget.com/cgi-bin/klicklog.cgi
905 www.forbes.com/forbes/gifs/ads
906 www.forbes.com/tool/includes/gifs/
907 www.fxweb.holowww.com/.*\.cgi
908 www.geocities.com/TimesSquare/Zone/5267/
909 www.goto.com/images-promoters/
910 www.handelsblatt.de/hbad
911 www.hotlinks.de/cgi-bin/barimage\.cgi
912 www.infoseek.com/cimages
913 www.infoworld.com/pageone/gif
914 www.isys.net/customer/images
915 www.javaworld.com/javaworld/jw-ad
916 www.kron.com/place-ads/
917 www.leo.org/leoclick/
918 #www.linkexchange.ru/cgi-bin/erle\.cgi
919 www.linkstation.de/cgi-bin/zeige
920 www.linux.org/graphic/miniature/
921 www.linux.org/graphic/square/
922 www.linux.org/graphic/standard/
923 www.luncha.se/annonsering
925 www.ml.org/gfx/spon/icom/
926 www.ml.org/gfx/spon/wmv
927 www.musicblvd.com/mb2/graphics/netgravity/
929 www.news.com/Midas/Images/
930 www.newscientist.com/houseads
931 www.nextcard.com/affiliates/
932 www.nikkeibp.asiabiztech.com/image/NAIS4\.gif
933 www.nordlys.no/imaker/.*/.*/.*/.....\.gif # alvin brattli
934 www.nordlys.no/imaker/.*/.*/.*/..003 # alvin brattli
935 www.oanda.com/server/banner
937 www.oneandonlynetwork.com
938 www.page2page.de/cgi-bin/
939 www.prnet.de/.*/bannerschnippel/.*\.(gif|jpe?g)
940 www.promptsoftware.com/marketing/
941 #www.reklama.ru/cgi-bin/banners/
942 www.riddler.com/sponsors/
943 www.rle.ru/cgi-bin/erle\.cgi
944 www.rock.com/images/affiliates/search_black\.gif
945 www.rtl.de/search/.*kunde
946 #www.search.com/Banners
947 www.sfgate.com/place-ads/
948 www.shareware.com/midas/images/borders-btn\.gif
949 #www.sjmercury.com/products/marcom/banners/
950 www.smartclicks.com:81
951 www.sol.dk/graphics/portalmenu
952 www.sponsornetz.de/jump/show.exe
954 www.sunworld.com/sunworldonline/icons/adinfo.sm\.gif
955 www.swwwap.com/cgi-bin/
957 www.telecom.at/icons/.*film\.(gif|jpe?g)
958 www.theonion.com/bin/
959 www.topsponsor.de/cgi-bin/show.exe
961 www.ugu.com/images/EJ\.gif
962 www.warzone.com/pics/banner/
963 www.warzone.com/wzfb/ads.cgi
965 www.websitepromote.com/partner/img/
966 www.winjey.com/onlinewerbung/*\.gif
967 www.wishing.com/webaudit
968 www.www-pool.de/cgi-bin/banner-pool
969 www2.blol.com/agrJRU\.gif
971 .yahoo.com/CategoryID=0
975 www.bannerland.de/click.exe
980 www.slate.com/redirect/
981 www.slate.com/articleimages/
983 www.forbes.com/tool/images/frontend/
986 .pathfinder.com/shopping/marketplace/images/
989 static.wired.com/images
990 .perso.estat.com/cgi-bin/perso/
991 #dinoadserver1.roka.net
992 .fooladclient*.fool.com
993 .affiliate.aol.com/static/
1001 # www.sunday-times.co.uk
1002 www.sunday-times.co.uk/standing/newsint/ticker
1004 #NeXgo (ex Germany.Net)
1008 # Block as much of GeoCities as possible
1009 # All geocities-owned images
1010 www.geocities.com/images
1011 www.geocities.com/MemberBanners/live/
1012 pic.geocities.com/images
1013 # And the popup (it still pops up, but does not eat up precious bandwidth)
1014 #www.geocities.com/ad_container/pop.html # already fixed by other regexp
1016 # from corion@informatik.uni-frankfurt.de
1019 #ads.xmonitor.net/xadengine.cgi # fixed by above regexp
1020 # Also block the japanese geocities popups
1021 www.geocities.co.jp/images
1022 # Also block the come.to, surf.to etc. popups
1025 # Also block the xoom stuff.
1027 home.talkcity.com/homepopup.html.*
1029 # Max Maischein <max.maischein@econsult.de> again ...
1030 # Halflife.net uses WON banners
1031 # Banners from Freeserve
1032 #banner.freeservers.com/cgi-bin/fs_adbar # fixed by above regexp
1033 # And those nasty va-popups !
1034 /?.*/?va_banner.html
1035 # And an all-around hit against advert*.jpg
1036 /?.*/advert[0-9]+\.jpg
1037 # And yet another Internet Explorer gif ...
1039 # Some uninteresting buttons I think...
1040 .mircx.com/images/buttons/
1041 services.mircx.com/.*\.gif
1042 # Easyspace - yet another "free disk space" provider with <yuck> banner popups
1043 www.easyspace.com/(fpub)?banner.html
1044 www.easyspace.com/100\.gif
1045 # Some russian banner exchanges
1046 .banner.ricor.ru/cgi-bin/banner.pl
1047 #www.bizlink.ru/cgi-bin/irads.cgi # already fixed by other regexp
1048 stx9.sextracker.com/stx/send/
1049 # And even more of geocities :
1050 www.geocities.com/pictures/
1051 # Gaah - www.angelfire.com - another webspace provider with popups
1052 .angelfire.com/sys/download.html
1053 # Gamasutra.com uses this ad provider
1054 sally.songline.com/@
1056 # Eule.de (search engine)
1057 # maybe images.eule.de as a whole...
1058 www.eule.de/cgi-bin/
1059 images.eule.de/comdirect\.gif
1060 images.eule.de/wp\.gif
1061 .aladin.de/125_1\.gif
1062 images.eule.de/neu/books\.gif
1064 # --------------------------------------------------------------------------
1068 # --------------------------------------------------------------------------
1070 # some images on cnn's website just suck!
1073 /.*cnnpostopinionhome.\.gif
1074 /.*custom_feature\.gif
1075 /.*explore.anim.*gif
1077 /.*pathnet.warner\.gif
1078 /.*images/cnnfn_infoseek\.gif
1079 /.*images/pathfinder_btn2\.gif
1080 /.*img/gen/fosz_front_em_abc\.gif
1081 /.*img/promos/bnsearch\.gif
1082 /.*navbars/nav_partner_logos\.gif
1083 /BarnesandNoble/images/bn.recommend.box.*
1084 /digitaljam/images/digital_ban\.gif
1085 /hotstories/companies/images/companies_banner\.gif
1086 /markets/images/markets_banner\.gif
1087 /ows-img/bnoble\.gif
1088 /ows-img/nb_Infoseek\.gif
1089 .cnn.com/images/custom/totale\.gif
1090 .cnn.com/images/lotd/custom.wheels\.gif
1091 .cnn.com/images/.*/by/main.12\.gif
1092 .cnn.com/images/.*/find115\.gif
1093 .cnn.com/.*/free.email.120\.gif
1094 .cnnfn.com/images/left_banner\.gif
1096 www.cnn.com/images/.*/bn/books\.gif
1097 www.cnn.com/images/.*/pointcast\.gif
1098 www.cnn.com/images/.*/fusa\.gif
1099 .cnn.com/images/.*/start120\.gif
1100 images.cnn.com/SHOP/
1104 # the / indicates the beginning of the path (and no longer the FQDN)
1110 /gif/buttons/banner_
1111 /gif/buttons/cd_shop_
1112 /gif/cd_shop/cd_shop_ani_
1115 /av/gifs/av_map\.gif
1116 /av/gifs/av_logo\.gif
1117 /av/gifs/new/ns\.gif
1118 altavista.com/i/valsdc3\.gif
1119 jump.altavista.com/gn_sf
1122 tucows./images/locallogo\.gif
1127 # simpliemu.hypermart.net/frames.html
1128 .go2net.com/mgic/adpopup
1129 .go2net.com/metaspy/images/exposed\.gif
1130 .go2net.com/metaspy/images/ms_un\.gif
1133 www.cebu-usa.com/cwbanim1\.gif
1134 www.cebu-usa.com/Connection\.jpg
1135 www.cebu-usa.com/phonead\.gif
1136 www.cebu-usa.com/ban3\.jpg
1137 www.cebu-usa.com/tlban\.gif
1138 www.cebu-usa.com/apwlogo1\.gif
1139 www.cebu-usa.com/rose\.gif
1142 www.fnet.de/img/geldboerselogo\.jpg
1144 # hirsch@mathcs.emory.edu
1145 /images/getareal2\.gif
1147 www.assalom.com/aziza/logos/cniaffil\.gif
1148 www.assalom.com/aziza/logos/4starrl1\.gif
1149 www.phantomstar.com/images/media/m1\.gif
1152 .wahlstreet.de/MediaW\$/tsponline\.gif
1153 .wahlstreet.de/MediaW\$/dzii156x60\.gif
1154 .wahlstreet.de/MediaW\$/etban156x60_2_opt2\.gif
1158 /pics/getareal1\.gif
1160 /ltbs/cgi-bin/click.cgi
1161 .linuxtoday.com/ltbs/pics/
1165 /include/watermark/v2/
1167 # Reinier Bikker <R.P.Bikker@phys.uu.nl>
1170 # Mark Lutz <luma@nikocity.de>
1171 /.*/*werb.*\.(gif|jpe?g) # hope that's not to restrictive
1173 #Free Yellow thing at bottom of pages (HereticPC)
1174 www.freeyellow.com/images/powerlink5a\.gif
1175 www.freeyellow.com/images/powerlink5b\.gif
1176 www.freeyellow.com/images/powerlink5c\.gif
1177 www.freeyellow.com/images/powerlink5d\.gif
1178 www.freeyellow.com/images/powerlink5e\.gif
1181 www.eads.com/images/refbutton\.gif
1182 www.fortunecity.com/console2/newnav/*
1183 www.goldetc.net/search\.gif
1184 www.cris.com/~Lzrdking/carpix/cars3-le\.gif
1185 www.justfreestuff.com/scott\.gif
1186 www.cyberthrill.com/entrance\.gif
1187 secure.pec.net/images/pec69ani\.gif
1188 www.new-direction.com/avviva\.gif
1189 /.*internetmarketingcenter\.gif
1190 www.new-direction.com/wp-linkexchange-loop\.gif
1191 www.new-direction.com/windough\.gif
1192 www.digitalwork.com/universal_images/affiliate/dw_le_3\.gif
1193 service.bfast.com/bfast/click/*
1194 www.new-direction.com/magiclearning\.gif
1195 www.new-direction.com/mailloop\.gif
1197 www.free-banners.com/images/hitslogo\.gif
1198 rob.simplenet.com/dyndns/fortune5\.gif
1199 .nasdaq-amex.com/images/bn_ticker\.gif
1202 # navilor@hotmail.com
1205 # wayne@staff.msen.com
1207 a*.*.*.yimg.com/([0-9]*|\/)*us.yimg.com/*
1210 www.realtop50.com/cgi-bin/ad
1214 www.yacht.de/images/(my_ani|eissingani|chartertrans|fum|schnupper|fysshop|garmin)\.gif
1215 www.sponsorweb.de/web-sponsor/nt-bin/show.exe
1218 # Club-internet pops up a complain if you refuse cookie (still pops up...)
1219 perso.club-internet.fr/html/Popup/popup_frame_nocookie.html
1220 perso.club-internet.fr/pagesperso/popup_nocookie.html
1222 .gmx.net/images/newsbanner/
1225 .quicken.lexware.de/images/us7-468x60.gif
1226 /img/special/chatpromo\.gif
1227 www.travelocity.com/images/promos/
1229 # wonder that that does...
1232 #/*.*/phpAds/viewbanner.php
1233 #/*.*/phpAds/phpads.php
1235 www.linux-magazin.de/banner
1236 .comtrack.comclick.com
1238 .iac-online.de/filler
1240 .media.interadnet.com
1241 .stat.www.fi/cgi-bin
1245 .disneystoreaffiliates.com
1247 .powerwork.mobile.de/cgi-bin/getimage\.cgi
1251 ####################################################
1254 # The Register ads - oh, and all images in Register stories (sigh).
1255 www.theregister.co.uk/media/
1257 # Used on http://www.theregister.co.uk/
1258 # Sample advert URL:
1259 # http://secure.webconnect.net/cgi-bin/webconnecthome.dll?F467
1263 www.dilbert.com/comics/dilbert/images/.*_140x800.*\.gif
1266 # Uses URL: http://www.stattrack.com/cgi-bin/stats/image.cgi
1268 # And loads JavaScript from http://www.stattrack.com/stats/code
1269 www.stattrack.com/stats/
1271 #Now they're Yahoo GeoCities, their junk is in a different place.
1272 ##geo.yahoo.com/serv
1273 ##visit.geocities.com/visit.gif
1274 .yimg.com/?.*/www.geocities.com/js_source
1275 #http://us.toto.geo.yahoo.com/toto?s=76001086
1277 .visit.geocities.com
1278 .yimg.com/?.*/www.geocities.com/
1280 #http://counter16.bravenet.com/counter.php
1283 #http://stat.cybermonitor.com/7emezone_p?1707_USdvd
1286 #http://members.tripod.com/adm/popup/.....
1287 members.tripod.com/adm/popup/
1289 #This is the worst ad idea ever!
1290 #count.exitexchange.com/exit/1100661
1291 #count.exitexchange.com/clients/navbar.html
1292 #(used in http://skyhivisuals.tripod.com/malfunctions_.htm)
1298 #This site traps the browser
1301 #privacy.net runs ads
1304 #Lindsay.Marshall@newcastle.ac.uk suggested these, to kill Opera adverts:
1309 dinoadserver*.roka.net
1311 logout.tvspielfilm.de
1313 www.freenet.de/customerindex\.html
1315 .fxweb.com/v2-trackrun\.cgi
1316 rtldating.peopleunited.de
1318 www.zdnet.com/fcgi-bin/
1319 service.bfast.com/bfast/serve
1321 fourohfour.nbci.com/Members404Error.php3
1324 www.fair-ist-mehr.de/cgi-bin/bt.pl
1334 #############################################################################
1336 #############################################################################
1339 www.userfriendly.org/images/banners/banner_dp_heart\.gif
1341 #Why were these in the Waldherr blockfile?
1343 #a*.*.*.yimg.com/([0-9]|\/)*us.yimg.com/i/*
1345 # some regexps are simply too aggressive ...
1347 # equalizer to /*.*(.*[-_.])?ads?[0-9]?(/|[-_.].*|.(gif|jpe?g))
1358 .ad.siemens.de # SIEMENS Automation & Drives
1359 #add-url.altavista.com
1366 # univ. don't advertise, do they :-)
1368 .ac.uk # English Universities too! - Jon
1369 .uni-*.de # What about Germany? --oes
1370 www.ugu.com/sui/ugu/adv
1374 clubs.yahoo.com/clubs
1375 edit.my.yahoo.com/config/show_identity
1376 www.ix.de/newsticker/data/ad
1377 www.heise.de/newsticker/data/ad
1378 www.careernet.de/anzeige
1379 www.careernet.de/bewerber/stellenanzeigen
1380 www.baumgartner.de/stellenmarkt/anzeigen
1381 www.dspartner.de/Anzeigen
1382 www.aws-jobs.de/Anzeigen
1383 www.jobware.de/.*/anzeigen/
1384 www.jobworld.de/bilder/
1385 www.cnn.com/TECH/computing/.*/internet.ads/
1386 www.financial.de/shop/
1390 194.221.152.2/phptelefontmp
1391 .harvard.edu/images/banner/
1394 www.dhd.de/CGI/anzeigen/
1397 .img.web.de/web/img/
1399 www.segel.de/menu/bilder/anzeigen\.gif
1400 www.corel.com/graphics/banners/
1401 www.software.ibm.com/ad/
1402 www.omg.org/docs/ad/
1404 .sperrmuell.de/scripts/anzeigen
1405 www.freenet.de/index.html
1406 www.01019freenet.de/index.html
1407 www.freenet.de/freenet/
1408 www.01019freenet.de/freenet/
1409 webfactory.de/anzeigen.php
1411 www.internatif.org/bortzmeyer/debian/sponsor/
1414 www.software.hosting.ibm.com/ad/
1415 www.ibm.com/software/ad/
1418 www.debian.org/Pics/banner-blue\.gif
1419 www.linux.de/pics/Nachrichten_banner\.gif
1422 finder.shopping.yahoo.com/shop/
1432 .consumer-direct.com
1437 # my banking stuff => no ads.
1443 # Jon's addition: MSDN
1448 .freemail*.web.de/online/ordner/anzeigen
1449 foggy.sda.t-online.de
1450 .us.i1.yimg.com/us.yimg.com/i/pim/ad2.gif
1451 www.nexgo.de/.*/bg_banner.jpg
1453 # .*ads. matches prdownloads.sourceforge.net and many other download sites