#############################################################################
# Sample actions file for the Internet Junkbuster 2.9.x
# For information, see http://ijbswa.sourceforge.net/
# $Id: actionsfile,v 1.7 2001/10/10 16:43:39 oes Exp $
#############################################################################
#############################################################################
# To determine which actions apply to a request, the URL of the request is
# compared to all patterns in this file. Every time it matches, the list of
# applicable actions for this URL is incrementally updated. You can trace
# this process by visiting http://i.j.b/show-url-info
#
# There are 4 types of lines in this file: comments (like this line),
# actions, aliases and patterns, all of which are explained below.
#############################################################################
#############################################################################
# 1. On Domains and Paths
# -----------------------
#
# Generally, a pattern has the form <domain>/<path>, where both the <domain>
# and <path> parts are optional. If you specify only a domain part, the "/"
# can be left out:
#
# www.yahoo.com
#   is a domain-only pattern and will match any request to www.yahoo.com
#
# www.yahoo.com/
#   means exactly the same
#
# www.example.com/index.html
#   matches only the document /index.html on www.example.com
#
# /index.html
#   matches the document /index.html, regardless of the domain
#
# index.html
#   matches nothing, since it would be interpreted as a domain name and
#   there is no top-level domain called ".html".
#
# The matching of the domain part offers some flexible options: if the
# domain starts or ends with a dot, it becomes unanchored at that end:
#
# www.example.com
#   matches only www.example.com
#
# .example.com
#   matches any domain that ENDS in .example.com
#
# www.
#   matches any domain that STARTS with www.
#
# Additionally, there are wildcards that you can use in the domain names
# themselves. They work much like shell wildcards: "*" stands for
# zero or more arbitrary characters, "?" stands for exactly one character,
# and character classes can be defined in square brackets. All of these
# can be freely mixed:
#
# ad*.example.com
#   matches adserver.example.com, ads.example.com, etc., but not sfads.example.com
#
# *ad*.example.com
#   matches all of the above
#
# .?pix.com
#   matches www.ipix.com, pictures.epix.com, a.b.c.d.e.upix.com, etc.
#
# www[1-9a-ez].example.com
#   matches www1.example.com, www4.example.com, wwwd.example.com,
#   wwwz.example.com, etc., but not wwww.example.com
#
# Paths are specified as regular expressions. A comprehensive discussion of
# regular expressions wouldn't fit here, but (FIXME) someone should paste
# a concise intro to the regex language here.
#
# If Junkbuster was compiled with pcre support (default), Perl compatible
# regular expressions are used. See the pcre/docs/ directory or man perlre
# (also available on http://www.perldoc.com/perl5.6/pod/perlre.html) for
# details.
#
# Please note that matching in the path is CASE INSENSITIVE by default, but
# you can switch to case sensitive at any point in the pattern by using
# the "(?-i)" switch:
#
# www.example.com/(?-i)PaTtErN.*
#   will match only documents whose path starts with PaTtErN in exactly this
#   capitalization.
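#
# A few path patterns taken from further down in this file may serve as a
# rough illustration. This is only a sketch, not a regex tutorial, and the
# file names mentioned in the explanations are made up:
#
# /.*/advert[0-9]+\.jpg
#   should match files such as advert1.jpg or advert42.jpg that sit below
#   at least one directory level, on any domain
#
# /.*/reklama/.*\.(gif|jpe?g)
#   should match any .gif, .jpg or .jpeg file below a directory called
#   reklama, on any domain
#
# www.bluesnews.com/images/ad[0-9]+\.gif
#   should match /images/ad1.gif, /images/ad17.gif, etc., but only on
#   www.bluesnews.com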
#############################################################################
#############################################################################
# There are 3 kinds of action:
#
# Boolean (e.g. "block"):
#   +name        # enable
#   -name        # disable
#
# Parameterized (e.g. "hide-user-agent"):
#   +name{param} # enable and set parameter to "param"
#   -name        # disable
#
# Multi-value (e.g. "add-header", "wafer"):
#   +name{param} # enable and add parameter "param"
#   -name{param} # remove the parameter "param"
#   -name        # disable totally
#
# The default (if you don't specify anything in this file) is not to take
# any actions, i.e. everything is disabled, so Junkbuster will just be a
# normal, non-blocking, non-anonymizing proxy. You must specifically
# enable the privacy and blocking features you need (although the
# provided default actions file will do that for you).
#
# Later actions always override earlier ones. For multi-valued actions,
# the actions are applied in the order they are specified.
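#
# As a rough sketch of how these pieces fit together (ads.example.com and
# its path are made up for illustration): actions are written in curly
# braces and apply to the patterns that follow them, up to the next set
# of braces.
#
# {+block +image}
# ads.example.com
#
# {-block}
# ads.example.com/not-an-ad/
#
# A request for ads.example.com/not-an-ad/page.html matches both patterns.
# Since the second section comes later, its "-block" should win, so the
# request is not blocked, while "+image" from the first section stays set.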
#############################################################################
#############################################################################
# +add-header{Name: value}
#   Adds the specified HTTP header, which is not checked for validity.
#   You may specify this many times to specify many headers.
#
# +deanimate-gifs{last}
# +deanimate-gifs{first}
#   Deanimate all animated GIF images, i.e. reduce them to a single
#   frame. This will also shrink the images considerably in terms of bytes.
#   If the option "first" is given, the first frame of the animation
#   is used as the replacement. If "last" is given, the last frame of
#   the animation is used instead, which probably makes more sense for
#   most banner animations, but also has the risk of not showing the
#   entire last frame (if it is only a delta to an earlier frame).
#
#   Downgrade HTTP/1.1 client requests to HTTP/1.0 and downgrade the
#   responses as well. Use this action for servers that use HTTP/1.1
#   protocol features that Junkbuster can't handle yet.
#
# +fast-redirects
#   Many sites, like yahoo.com, don't just link to other sites.
#   Instead, they will link to some script on their own server,
#   giving the destination as a parameter, which will then redirect
#   you to the final target.
#
#   URLs resulting from this scheme typically look like:
#   http://some.place/some_script?http://some.where-else
#
#   Sometimes, there are even multiple consecutive redirects encoded
#   in the URL. These redirections via scripts make your web browsing
#   more traceable, since the server from which you follow such a link
#   can see where you go to. Apart from that, valuable bandwidth and
#   time is wasted, while your browser asks the server for one redirect
#   after the other. Plus, it feeds the advertisers.
#
#   The +fast-redirects option enables interception of these requests
#   by Junkbuster, which will cut off all but the last valid URL in the
#   request and send a local redirect back to your browser without
#   contacting the remote site.
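#
#   For illustration, using the made-up URL from above and assuming that
#   +fast-redirects is in effect for some.place, a request for
#     http://some.place/some_script?http://some.where-else
#   should be answered by Junkbuster itself with a redirect to
#     http://some.where-else
#   so that some.place never gets to see the request.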
#
# +filter
#   Filter the website through the re_filterfile
#   FIXME: The syntax should be +filter{filename}
#
#   Block any existing X-Forwarded-for header, and do not add a new one.
#
# +hide-from{block}
# +hide-from{spam@sittingduck.xqq}
#   If the browser sends a "From:" header containing your e-mail address,
#   either completely remove the header ("block"), or change it to the
#   specified e-mail address.
#
# +hide-referer{block}
# +hide-referer{forge}
# +hide-referer{http://nowhere.com}
#   Don't send the "Referer:" (sic) header to the web site. You can
#   block it, forge a URL to the same server as the request (which is
#   preferred because some sites will not send images otherwise) or
#   set it to a constant string.
#
# +hide-referrer{...}
#   Alternative spelling of +hide-referer. It has the same parameters
#   and can be freely mixed with "+hide-referer". ("referrer" is the
#   correct English spelling, however the HTTP specification has a
#   bug - it requires it to be spelt "referer").
#
# +hide-user-agent{browser-type}
#   Change the "User-Agent:" header so web servers can't tell your
#   browser type. (This breaks many web sites.) Specify the user-agent
#   value you want - e.g., to pretend to be using Netscape on Linux:
#   +hide-user-agent{Mozilla (X11; I; Linux 2.0.32 i586)}
#   Or to identify yourself explicitly as a JunkBuster user:
#   +hide-user-agent{JunkBuster/1.0}
#   (Don't change the version number from 1.0 - after all, why tell them?)
#
# +image
#   Treat this URL as an image. This only matters if it's also "+block"ed,
#   in which case a "blocked" image can be sent rather than an HTML page.
#   See +image-blocker{} for the control over what is actually sent.
#
# +image-blocker{logo}
# +image-blocker{blank}
# +image-blocker{http://i.j.b/send-banner}
#   Decides what to do with URLs that end up tagged with {+block +image}.
#   There are 4 options. "-image-blocker" will send an HTML "blocked" page,
#   usually resulting in a "broken image" icon. "+image-blocker{logo}"
#   will send a "JunkBuster" image. "+image-blocker{blank}" will send
#   a 1x1 transparent GIF. And finally, "+image-blocker{http://xyz.com}"
#   will send an HTTP temporary redirect to the specified image - this
#   has the advantage of the icon being cached by the browser,
#   which will speed up the display.
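#
#   A minimal sketch (ads.example.com is a made-up host): to answer every
#   request to an ad server with a 1x1 transparent GIF instead of an HTML
#   page, one might write
#
#   {+block +image +image-blocker{blank}}
#   ads.example.com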
#
# +limit-connect{portlist}
#   The CONNECT method exists in HTTP to allow access to secure websites
#   (https:// URLs) through proxies. It works very simply: The proxy
#   connects to the server on the specified port, and then short-circuits
#   its connections to the client and to the remote server.
#   This can be a big security hole, since CONNECT-enabled proxies can
#   be abused as TCP relays very easily.
#   By default, i.e. in the absence of a +limit-connect action, Junkbuster
#   will only allow CONNECT requests to port 443, which is the standard port
#   for https.
#   If you want to allow CONNECT for more ports than that, or want to forbid
#   CONNECT altogether, you can specify a comma separated list of ports and port
#   ranges (the latter using dashes, with the minimum defaulting to 0 and max to 65K):
#
#   +limit-connect{443} # This is the default and need not be specified.
#   +limit-connect{80,443} # Ports 80 and 443 are OK.
#   +limit-connect{-3, 7, 20-100, 500-} # Ports up to 3, port 7, ports 20 to 100, and ports from 500 up are OK.
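#
#   As a sketch of how this might be used (secure.example.com is a made-up
#   host, and 8443 is just an example port), CONNECT could be opened up for
#   one additional port on a single site only:
#
#   {+limit-connect{443,8443}}
#   secure.example.com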
#
# +no-compression
#   Prevent the website from compressing the data. Some websites do
#   that, which is a problem for Junkbuster, since +filter, +no-popup
#   and +deanimate-gifs will not work on compressed data. Will slow down
#   connections to those websites, though.
#
# +no-cookies-read
#   Prevent the website from reading cookies
#
# +no-cookies-set
#   Prevent the website from setting cookies
#
# +no-popup
# +no-popups
#   Filter the website through a built-in filter to disable
#   window.open() etc. The two alternative spellings are
#   equivalent.
#
#   This action only applies if you are using a jarfile. It sends a
#   cookie to every site stating that you do not accept any copyright
#   on cookies sent to you, and asking them not to track you. Of
#   course, this is a (relatively) unique header they could use to
#   track you.
#
#   This allows you to add an arbitrary cookie. Specify it multiple
#   times in order to add several cookies.
#############################################################################
#############################################################################
#############################################################################
#############################################################################
# You can define a short form for a list of permissions - e.g., instead
# of "-no-cookies-set -no-cookies-read -filter -fast-redirects", you can
# just write "shop". This is called an alias.
#
# Currently, an alias can contain any character except space, tab, '=', '{'
# But please use only 'a'-'z', '0'-'9', '+', and '-'.
#
# Alias names are not case sensitive.
#
# Aliases beginning with '+' or '-' may be used for system permission names
# in future releases - so try to avoid alias names like this. (e.g.
# "+no-cookies" below is not a good name)
#
# Aliases must be defined before they are used.
+no-cookies = +no-cookies-set +no-cookies-read
-no-cookies = -no-cookies-set -no-cookies-read
fragile = -block -no-cookies -filter -fast-redirects -hide-referer -no-popups
shop = -no-cookies -filter -fast-redirects
+imageblock = +block +image
+filter-all = +filter +no-compression
#... etc. Customize to your heart's content.
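#
# A sketch of how an alias is then used (www.example.com is a made-up
# host): it goes in curly braces just like any other action.
#
# {shop}
# www.example.com
#
# With the definitions above, this should be equivalent to writing
# {-no-cookies-set -no-cookies-read -filter -fast-redirects}.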
#############################################################################
#############################################################################
+hide-referer{forge} \
+image-blocker{http://i.j.b/send-banner} \
#############################################################################
# A useful site for testing - shows all headers:
# http://privacy.net/analyze/
#############################################################################
{+add-header{X-Privacy: Yes please} #-add-header{*} \
+add-header{X-User-Tracking: No thanks!} -filter}
#############################################################################
#############################################################################
# Sites that need cookies
# These sites are very complex and require
# minimal interference.
.office.microsoft.com
.windowsupdate.microsoft.com
# Shopping sites - still want to block ads.
.worldpay.com # for quietpc.com
# These shops require pop-ups
www.ukc.ac.uk/cgi-bin/wac\.cgi\?
# Please don't re_filter code!
# Hal reported that fast-redirects break this site
{-no-cookies -fast-redirects}
# Test for new GIF deanimation feature.
# Just try http://www.oesterhelt.org/deanimate-demo with and without it.
{+deanimate-gifs{last}}
www.oesterhelt.org/deanimate-demo
#############################################################################
#############################################################################
#############################################################################
#############################################################################
#############################################################################
.ad.preferences.com/image.*
.ad-adex3.flycast.com
.connect.247media.ads.link4ads.com
.mojofarm.mediaplex.com/ad/
www.carbuyer.com/cgi-carbuyer/getimage.cgi
/phpAds(New)?/viewbanner\.php
.ad.de.doubleclick.net
/.*/count\.cgi\?.*df=
*.fxweb.com/v2-trackrun\.cgi
a196.g.akamai.net/7/196/2670/000[12]/images.gmx.net/i4/images/.*/
#############################################################################
#############################################################################
#############################################################################
/.*/(.*[-_.])?ads?[0-9]?(/|[-_.].*|\.(gif|jpe?g))
/.*/(.*[-_.])?count(er)?(\.cgi|\.dll|\.exe|[?/])
/.*/(ng)?adclient\.cgi
/.*/(plain|live|rotate)[-_.]?ads?/
/.*/(sponsor)s?[0-9]?/
###/*.*/(sponsor|banner)s?[0-9]?/
###/*.*/.*banner([-_]?[a-z0-9]+)?\.(gif|jpg)
/.*/_?(plain|live)?ads?(-banners)?/
/.*/ad(sdna_image|gifs?)/
/.*/ad(server|stream|juggler)\.(cgi|pl|dll|exe)
/.*/adv((er)?ts?|ertis(ing|ements?))?/
/.*/cgi-bin/centralad/getimage
/.*/images/addver\.gif
/.*/images/advert\.gif
/.*/images/marketing/.*\.(gif|jpe?g)
/.*/randomads/.*\.(gif|jpe?g)
/.*/reklama/.*\.(gif|jpe?g)
/.*/reklame/.*\.(gif|jpe?g)
/.*/reklaam/.*\.(gif|jpe?g)
/.*/werbung/.*\.(gif|jpe?g)
/.*/adv\. # www.telegraaf.nl
/.*/advert[0-9]+\.jpg
/bin/getimage.cgi/...\?AD
/bin/nph-oma.count/ct/default.shtml
/bin/nph-oma.count/ix/default.html
/cgi-bin/getimage.cgi/....\?GROUP=
/cgi-bin/webad.dll/ad
/cwmail/amzn-bm1\.gif
/image\.ng/transactionID
/images/.*/.*_anim\.gif # alvin brattli
/ip_img/.*\.(gif|jpe?g)
/netscapeworld/nw-ad/
/promotions/houseads/
/torget/jobline/.*\.gif
/cgi-bin/nph-adclick.exe/
/.*/Image/BannerAdvertising/
/.*/adlib/server\.cgi
/.*/gsa_bs/gsa_bs.cmdl
# for our Finnish friends, by Kai Puolamaki <Kai.Puolamaki@iki.fi>
/.*/mainos/*.*/.*\.gif
/.*/mainos/*.*/.*\.jpe?g
# more from a Finnish friend Petri Haapio <pha@iki.fi>
.keltaisetsivut.fi/web/img/\.*gif
.haku.net/pics/pana\.*gif
/.*/(.*[-_.].*)?maino(kset|nta|s).*(/|\.(gif|html?|jpe?g|png))
/.*/(ilm(oitus)?|kampanja)(hallinta|kuvat?)(/|\.(gif|html?|jpe?g|png))
# and even more from a Finnish friend Hannu Napari <Hannu.Napari@hut.fi>
194.251.243.50/cgi-bin/banner
www.iltalehti.fi/ilmkuvat
www.mtv3.fi/mainoskuvat
/.*/images/topics/topicgimp\.gif
.discovery.com/.*banner_id
.idrink.com/frm_bottom.htm
/.*/ph-ad.*\.focalink\.com
/we_ba/ # hausfrauenseite.de *bwhahahaaaaa*
/.*(ms)?backoff(ice)?.*\.(gif|jpe?g)
/.*(/ie4|/ie3|msie|sqlbans|powrbybo|activex|backoffice|explorer|netnow|getpoint|ntbutton|hmlink).*\.(gif|jpe?g)
/.*activex.*(gif|jpe?g)
/.*explorer?.(gif|jpe?g)
/.*freeie\.(gif|jpe?g)
/.*/ie_?(buttonlogo|static?|anim.*)?\.(gif|jpe?g)
/.*ie_sm\.(gif|jpe?g)
/.*msie(30)?\.(gif|jpe?g)
/.*msnlogo\.(gif|jpe?g)
/.*office97_ad1\.(gif|jpe?g)
/.*pbbobansm\.(gif|jpe?g)
/.*powrbybo\.(gif|jpe?g)
/.*sqlbans\.(gif|jpe?g)
/.*ie4get_animated\.gif
# generally useless information and promo stuff (commented out)
#/.*/(counter|getpcbutton|BuiltByNOF|netscape|hotmail|vcr(rated)?|rsaci(rated)?|freeloader|cache_now(_anim)?|apache_pb|now_(anim_)?button|ie_?(buttonlogo|static?|.*ani.*)?)\.(gif|jpe?g)
/.*/images/na/us/brand/
/.*/advantage\.(gif|jpg)
/.*/advanbar\.(gif|jpg)
/.*/advanbtn\.(gif|jpg)
/.*/biznetsmall\.(gif|jpg)
/.*/utopiad\.(gif|jpg)
/.*/amazon([a-zA-Z0-9]+)\.(gif|jpg)
/.*/buynow([a-zA-Z0-9]+)\.(gif|jpg)
# for the Dutch folks by a Dutch friend gertjan@west.nl
.netdirect.nl/nd_servlet/___
# --------------------------------------------------------------------------
# --------------------------------------------------------------------------
# the next two lines work
193.158.37.3/cgi-bin/impact
195.63.104.*/(inbox|log|meld|folderlu|folderru|log(in|out)[lmr]u|)
206.165.5.162/images/gcanim\.gif
207.159.129.131/abacus
207.87.27.10/tool/includes/gifs/
209.1.112.252/adgraph/
209.1.135.14[24]:1971
209.207.224.22[02]/servfu.pl
209.239.37.214/cgi-pilotfaq/getimage\.cgi
209.85.89.183/cgi-bin/cycle\?host
212.63.155.122/(banner|concret|softwareclub)
216.49.10.236/web1000/
.ICDirect.com/cgi-bin
.Shannon.Austria.Eu.net/\.cgi/
# generic hosts (probably most effective)
#/.*/*preferences.com*
.akamaitech.net/.*/Banners/
.altavista.telia.com/av/pix/sponsors/
.amazon.com/g/associates/logos/
.asinglesplace.com/asplink\.gif
.automatiseringgids.nl/gfx/advertenties/
#avenuea.com/Banners/
.befriends.net/personals/matchmaking\.jpg
.bizad.nikkeibp.co.jp
.bs.gsanet.com/gsa_bs/
.cgicounter.puretec.de/cgi-bin/
.ciec.org/images/countdown\.gif
.classic.adlink.de/cgi-bin/accipiter/adserver.exe
#.clickhere.egroups.com/img/
.commonwealth.riddler.com/Commonwealth/bin/statdeploy\?[0-9]+
.dagbladet.no/ann-gif
.dn.adzerver.com/image.ad
.eur.a1.yimg.com/eur.yimg.com/a/
.us.a1.yimg.com/us.yimg.com/a/
#fastcounter.linkexchange.com
.focalink.com/SmartBanner
.freepage.de/cgi-bin/feets/freepage_ext/.*/rw_banner
.freespace.virgin.net/andy.drake
.futurecard.com/images/
.go.com/cimages\?SEEK_
.home.miningco.com/event.ng/.*AdID
image*.narrative.com/news/.*\.(gif|jpe?g)
#image.linkexchange.com
.images.yahoo.com/adv/
.images.yahoo.com/promotions/
.impartner.de/cgi-bin
informer2.comdirect.de:6004/cd/banner2
.infoseek.go.com/cimages
.kaufwas.com/cgi-bin/zentralbanner\.cgi
#leader.linkexchange.com
.linktrader.com/cgi-bin/
.logiclink.nl/cgi-bin/
lucky.theonion.com/cgi-bin/oniondirectin\.cgi
lucky.theonion.com/cgi-bin/onionimp\.cgi
lucky.theonion.com/cgi-bin/onionimpin\.cgi
.mailorderbrides.com/mlbrd2\.gif
.members.sexroulette.com
.messenger.netscape.com
# movielink became moviefone
.moviefone.com/.*(banner|newbutton|(ad|poster).*?\.gif|mmail|bytb|h_(guy|showtick|aML)|m_|icon_|NF_.*?back|h_.*?gif|media/(art|imagelinks(/MF.(ad|sponsor))))
mqgraphics.mapquest.com/graphics/Advertisements/
.news.com/cgi-bin/acc_clickthru
.ngserve.pcworld.com/adgifs/
.promotions.yahoo.com
.qsound.com/tracker/tracker.exe
.resource-marketing.com/tb/
.rtl.de/homepage/wb/images/
.schnellsuche.de/images/*
.shout-ads.com/cgibin/shout.php3
.sjmercury.com/advert/
.smartclicks.com/.*/smart(img|banner|host|bar|site)
.static.wired.com/advertising/
.sysdoc.pair.com/cgi-sys/cgiwrap/sysdoc/sponsor\.gif
.t-online.de/home/040255162-001/*
.teleauskunft.de/commercial/
.tvguide.com/rbitmaps/
.ultra.multimania.com
.us.yimg.com/promotions/
.videoserver.kpix.com
.washingtonpost.com/wp-adv/
.webconnect.net/cgi-bin/webconnect.dll
.webserv.vnunet.com/ip_img/.*ban
.werbung.pro-sieben.de/cgi-bin
.whatis.com/cgi-bin/getimage.exe/
www..bigyellow.com/......mat.*
www.addme.com/link8\.gif
www.aftonbladet.se/annons
www.americanpassage.com/
www.angelfire.com/in/twistriot/images/wish4\.gif
www.bizlink.ru/cgi-bin/irads\.cgi
www.blacklightmedia.com/adlemur
www.bluesnews.com/flameq\.gif
www.bluesnews.com/images/ad[0-9]+\.gif
www.bluesnews.com/images/gcanim3\.gif
www.bluesnews.com/images/throbber2\.gif
www.bluesnews.com/miscimages/fragbutton\.gif
www.businessweek.com/sponsors/
www.canoe.ca/AdsCanoe/
www.cdnow.com/MN/client.banners
www.clicmoi.com/cgi-bin/pub\.exe
www.dailycal.org/graphics/adbanner-ab\.gif
www.detelefoongids.com/pic/[0-9]*
www.dhd.de/CGI/werbepic
www.dsf.de/cgi-bin/site_newiac.adpos
www.firsttarget.com/cgi-bin/klicklog.cgi
www.forbes.com/forbes/gifs/ads
www.forbes.com/tool/includes/gifs/
www.fxweb.holowww.com/.*\.cgi
www.geocities.com/TimesSquare/Zone/5267/
www.goto.com/images-promoters/
www.handelsblatt.de/hbad
www.hotlinks.de/cgi-bin/barimage\.cgi
www.infoseek.com/cimages
www.infoworld.com/pageone/gif
www.isys.net/customer/images
www.javaworld.com/javaworld/jw-ad
www.kron.com/place-ads/
www.leo.org/leoclick/
#www.linkexchange.ru/cgi-bin/erle\.cgi
www.linkstation.de/cgi-bin/zeige
www.linux.org/graphic/miniature/
www.linux.org/graphic/square/
www.linux.org/graphic/standard/
www.luncha.se/annonsering
www.ml.org/gfx/spon/icom/
www.ml.org/gfx/spon/wmv
www.musicblvd.com/mb2/graphics/netgravity/
www.news.com/Midas/Images/
www.newscientist.com/houseads
www.nextcard.com/affiliates/
www.nikkeibp.asiabiztech.com/image/NAIS4\.gif
www.nordlys.no/imaker/.*/.*/.*/.....\.gif # alvin brattli
www.nordlys.no/imaker/.*/.*/.*/..003 # alvin brattli
www.oanda.com/server/banner
www.oneandonlynetwork.com
www.page2page.de/cgi-bin/
www.prnet.de/.*/bannerschnippel/.*\.(gif|jpe?g)
www.promptsoftware.com/marketing/
#www.reklama.ru/cgi-bin/banners/
www.riddler.com/sponsors/
www.rle.ru/cgi-bin/erle\.cgi
www.rock.com/images/affiliates/search_black\.gif
www.rtl.de/search/.*kunde
#www.search.com/Banners
www.sfgate.com/place-ads/
www.shareware.com/midas/images/borders-btn\.gif
#www.sjmercury.com/products/marcom/banners/
www.smartclicks.com:81
www.sol.dk/graphics/portalmenu
www.sponsornetz.de/jump/show.exe
www.sunworld.com/sunworldonline/icons/adinfo.sm\.gif
www.swwwap.com/cgi-bin/
www.telecom.at/icons/.*film\.(gif|jpe?g)
www.theonion.com/bin/
www.topsponsor.de/cgi-bin/show.exe
www.ugu.com/images/EJ\.gif
www.warzone.com/pics/banner/
www.warzone.com/wzfb/ads.cgi
www.websitepromote.com/partner/img/
www.winjey.com/onlinewerbung/*\.gif
www.wishing.com/webaudit
www.www-pool.de/cgi-bin/banner-pool
www2.blol.com/agrJRU\.gif
.yahoo.com/CategoryID=0
www.bannerland.de/click.exe
www.slate.com/redirect/
www.slate.com/articleimages/
www.forbes.com/tool/images/frontend/
.pathfinder.com/shopping/marketplace/images/
static.wired.com/images
.perso.estat.com/cgi-bin/perso/
#dinoadserver1.roka.net
.fooladclient*.fool.com
.affiliate.aol.com/static/
# www.sunday-times.co.uk
www.sunday-times.co.uk/standing/newsint/ticker
#NeXgo (ex Germany.Net)
# Block as much of GeoCities as possible
# All geocities-owned images
www.geocities.com/images
www.geocities.com/MemberBanners/live/
pic.geocities.com/images
# And the popup (it still pops up, but does not eat up precious bandwidth)
#www.geocities.com/ad_container/pop.html # already fixed by other regexp
# from corion@informatik.uni-frankfurt.de
#ads.xmonitor.net/xadengine.cgi # fixed by above regexp
# Also block the Japanese geocities popups
www.geocities.co.jp/images
# Also block the come.to, surf.to etc. popups
# Also block the xoom stuff.
home.talkcity.com/homepopup.html.*
# Max Maischein <max.maischein@econsult.de> again ...
# Halflife.net uses WON banners
# Banners from Freeserve
#banner.freeservers.com/cgi-bin/fs_adbar # fixed by above regexp
# And those nasty va-popups !
# And an all-around hit against advert*.jpg
/.*/advert[0-9]+\.jpg
# And yet another Internet Explorer gif ...
# Some uninteresting buttons I think...
.mircx.com/images/buttons/
services.mircx.com/.*\.gif
# Easyspace - yet another "free disk space" provider with <yuck> banner popups
www.easyspace.com/(fpub)?banner.html
www.easyspace.com/100\.gif
# Some Russian banner exchanges
.banner.ricor.ru/cgi-bin/banner.pl
#www.bizlink.ru/cgi-bin/irads.cgi # already fixed by other regexp
stx9.sextracker.com/stx/send/
# And even more of geocities :
www.geocities.com/pictures/
# Gaah - www.angelfire.com - another webspace provider with popups
.angelfire.com/sys/download.html
# Gamasutra.com uses this ad provider
sally.songline.com/@
# Eule.de (search engine)
# maybe images.eule.de as a whole...
www.eule.de/cgi-bin/
images.eule.de/comdirect\.gif
images.eule.de/wp\.gif
.aladin.de/125_1\.gif
images.eule.de/neu/books\.gif
# --------------------------------------------------------------------------
# --------------------------------------------------------------------------
# some images on cnn's website just suck!
/.*cnnpostopinionhome.\.gif
/.*custom_feature\.gif
/.*explore.anim.*gif
/.*pathnet.warner\.gif
/.*images/cnnfn_infoseek\.gif
/.*images/pathfinder_btn2\.gif
/.*img/gen/fosz_front_em_abc\.gif
/.*img/promos/bnsearch\.gif
/.*navbars/nav_partner_logos\.gif
/BarnesandNoble/images/bn.recommend.box.*
/digitaljam/images/digital_ban\.gif
/hotstories/companies/images/companies_banner\.gif
/markets/images/markets_banner\.gif
/ows-img/bnoble\.gif
/ows-img/nb_Infoseek\.gif
.cnn.com/images/custom/totale\.gif
.cnn.com/images/lotd/custom.wheels\.gif
.cnn.com/images/.*/by/main.12\.gif
.cnn.com/images/.*/find115\.gif
.cnn.com/.*/free.email.120\.gif
.cnnfn.com/images/left_banner\.gif
www.cnn.com/images/.*/bn/books\.gif
www.cnn.com/images/.*/pointcast\.gif
www.cnn.com/images/.*/fusa\.gif
.cnn.com/images/.*/start120\.gif
images.cnn.com/SHOP/
# the / indicates the beginning of the path (and no longer the FQDN)
/gif/buttons/banner_
/gif/buttons/cd_shop_
/gif/cd_shop/cd_shop_ani_
/av/gifs/av_map\.gif
/av/gifs/av_logo\.gif
/av/gifs/new/ns\.gif
altavista.com/i/valsdc3\.gif
jump.altavista.com/gn_sf
tucows./images/locallogo\.gif
# simpliemu.hypermart.net/frames.html
.go2net.com/mgic/adpopup
.go2net.com/metaspy/images/exposed\.gif
.go2net.com/metaspy/images/ms_un\.gif
www.cebu-usa.com/cwbanim1\.gif
www.cebu-usa.com/Connection\.jpg
www.cebu-usa.com/phonead\.gif
www.cebu-usa.com/ban3\.jpg
www.cebu-usa.com/tlban\.gif
www.cebu-usa.com/apwlogo1\.gif
www.cebu-usa.com/rose\.gif
www.fnet.de/img/geldboerselogo\.jpg
# hirsch@mathcs.emory.edu
/images/getareal2\.gif
www.assalom.com/aziza/logos/cniaffil\.gif
www.assalom.com/aziza/logos/4starrl1\.gif
www.phantomstar.com/images/media/m1\.gif
.wahlstreet.de/MediaW\$/tsponline\.gif
.wahlstreet.de/MediaW\$/dzii156x60\.gif
.wahlstreet.de/MediaW\$/etban156x60_2_opt2\.gif
/pics/getareal1\.gif
/ltbs/cgi-bin/click.cgi
.linuxtoday.com/ltbs/pics/
/include/watermark/v2/
# Reinier Bikker <R.P.Bikker@phys.uu.nl>
# Mark Lutz <luma@nikocity.de>
/.*/*werb.*\.(gif|jpe?g) # hope that's not too restrictive
#Free Yellow thing at bottom of pages (HereticPC)
www.freeyellow.com/images/powerlink5a\.gif
www.freeyellow.com/images/powerlink5b\.gif
www.freeyellow.com/images/powerlink5c\.gif
www.freeyellow.com/images/powerlink5d\.gif
www.freeyellow.com/images/powerlink5e\.gif
www.eads.com/images/refbutton\.gif
www.fortunecity.com/console2/newnav/*
www.goldetc.net/search\.gif
www.cris.com/~Lzrdking/carpix/cars3-le\.gif
www.justfreestuff.com/scott\.gif
www.cyberthrill.com/entrance\.gif
secure.pec.net/images/pec69ani\.gif
www.new-direction.com/avviva\.gif
/.*internetmarketingcenter\.gif
www.new-direction.com/wp-linkexchange-loop\.gif
www.new-direction.com/windough\.gif
www.digitalwork.com/universal_images/affiliate/dw_le_3\.gif
service.bfast.com/bfast/click/*
www.new-direction.com/magiclearning\.gif
www.new-direction.com/mailloop\.gif
www.free-banners.com/images/hitslogo\.gif
rob.simplenet.com/dyndns/fortune5\.gif
.nasdaq-amex.com/images/bn_ticker\.gif
# navilor@hotmail.com
# wayne@staff.msen.com
a*.*.*.yimg.com/([0-9]*|\/)*us.yimg.com/*
www.realtop50.com/cgi-bin/ad
www.yacht.de/images/(my_ani|eissingani|chartertrans|fum|schnupper|fysshop|garmin)\.gif
www.sponsorweb.de/web-sponsor/nt-bin/show.exe
# Club-internet pops up a complaint if you refuse cookies (still pops up...)
perso.club-internet.fr/html/Popup/popup_frame_nocookie.html
perso.club-internet.fr/pagesperso/popup_nocookie.html
.gmx.net/images/newsbanner/
.quicken.lexware.de/images/us7-468x60.gif
/img/special/chatpromo\.gif
www.travelocity.com/images/promos/
# wonder what that does...
#/*.*/phpAds/viewbanner.php
#/*.*/phpAds/phpads.php
www.linux-magazin.de/banner
.comtrack.comclick.com
.iac-online.de/filler
.media.interadnet.com
.stat.www.fi/cgi-bin
.disneystoreaffiliates.com
.powerwork.mobile.de/cgi-bin/getimage\.cgi
####################################################
# The Register ads - oh, and all images in Register stories (sigh).
www.theregister.co.uk/media/
www.dilbert.com/comics/dilbert/images/.*_140x800.*\.gif
# Uses URL: http://www.stattrack.com/cgi-bin/stats/image.cgi
# And loads JavaScript from http://www.stattrack.com/stats/code
www.stattrack.com/stats/
#Now they're Yahoo GeoCities, their junk is in a different place.
##geo.yahoo.com/serv
##visit.geocities.com/visit.gif
.yimg.com/.*/www.geocities.com/js_source
#http://us.toto.geo.yahoo.com/toto?s=76001086
.visit.geocities.com
.yimg.com/.*/www.geocities.com/
#http://counter16.bravenet.com/counter.php
#http://stat.cybermonitor.com/7emezone_p?1707_USdvd
#http://members.tripod.com/adm/popup/.....
members.tripod.com/adm/popup/
#This is the worst ad idea ever!
#count.exitexchange.com/exit/1100661
#count.exitexchange.com/clients/navbar.html
#(used in http://skyhivisuals.tripod.com/malfunctions_.htm)
#This site traps the browser
#privacy.net runs ads
#Lindsay.Marshall@newcastle.ac.uk suggested these, to kill Opera adverts:
dinoadserver*.roka.net
logout.tvspielfilm.de
www.freenet.de/customerindex\.html
.fxweb.com/v2-trackrun\.cgi
rtldating.peopleunited.de
www.zdnet.com/fcgi-bin/
service.bfast.com/bfast/serve
fourohfour.nbci.com/Members404Error.php3
www.fair-ist-mehr.de/cgi-bin/bt.pl
#############################################################################
#############################################################################
www.userfriendly.org/images/banners/banner_dp_heart\.gif
#Why were these in the Waldherr blockfile?
#a*.*.*.yimg.com/([0-9]|\/)*us.yimg.com/i/*
# some regexps are simply too aggressive ...
# equalizer to /*.*(.*[-_.])?ads?[0-9]?(/|[-_.].*|.(gif|jpe?g))
.ad.siemens.de # SIEMENS Automation & Drives
#add-url.altavista.com
# univ. don't advertise, do they :-)
.ac.uk # English Universities too! - Jon
.uni-*.de # What about Germany? --oes
www.ugu.com/sui/ugu/adv
clubs.yahoo.com/clubs
edit.my.yahoo.com/config/show_identity
www.ix.de/newsticker/data/ad
www.heise.de/newsticker/data/ad
www.careernet.de/anzeige
www.careernet.de/bewerber/stellenanzeigen
www.baumgartner.de/stellenmarkt/anzeigen
www.dspartner.de/Anzeigen
www.aws-jobs.de/Anzeigen
www.jobware.de/.*/anzeigen/
www.jobworld.de/bilder/
www.cnn.com/TECH/computing/.*/internet.ads/
www.financial.de/shop/
194.221.152.2/phptelefontmp
.harvard.edu/images/banner/
www.dhd.de/CGI/anzeigen/
.img.web.de/web/img/
www.segel.de/menu/bilder/anzeigen\.gif
www.corel.com/graphics/banners/
www.software.ibm.com/ad/
www.omg.org/docs/ad/
.sperrmuell.de/scripts/anzeigen
www.freenet.de/index.html
www.01019freenet.de/index.html
www.freenet.de/freenet/
www.01019freenet.de/freenet/
webfactory.de/anzeigen.php
www.internatif.org/bortzmeyer/debian/sponsor/
www.software.hosting.ibm.com/ad/
www.ibm.com/software/ad/
www.debian.org/Pics/banner-blue\.gif
www.linux.de/pics/Nachrichten_banner\.gif
finder.shopping.yahoo.com/shop/
.consumer-direct.com
# my banking stuff => no ads.
# Jon's addition: MSDN
.freemail*.web.de/online/ordner/anzeigen
foggy.sda.t-online.de
.us.i1.yimg.com/us.yimg.com/i/pim/ad2.gif
www.nexgo.de/.*/bg_banner.jpg
prdownloads.sourceforge.net