#############################################################################
# Sample actions file for the Internet Junkbuster 2.9.x
# For information, see http://ijbswa.sourceforge.net/
# $Id: ijb.action,v 1.7 2002/03/08 18:01:48 morcego Exp $
#############################################################################
#############################################################################
# To determine which actions apply to a request, the URL of the request is
# compared to all patterns in this file. Every time it matches, the list of
# applicable actions for this URL is incrementally updated. You can trace
# this process by visiting http://i.j.b/show-url-info
#
# There are 4 types of lines in this file: comments (like this line),
# actions, aliases and patterns, all of which are explained below.
#############################################################################
#############################################################################
# 1. On Domains and Paths
# -----------------------
# Generally, a pattern has the form <domain>/<path>, where both the <domain>
# and <path> part are optional. If you only specify a domain part, the "/"
# can be left out:
#
# www.yahoo.com
#   is a domain-only pattern and will match any request to www.yahoo.com
#
# www.yahoo.com/
#   means exactly the same (but is slightly less efficient)
#
# www.example.com/index.html
#   matches only the document /index.html on www.example.com
#
# /index.html
#   matches the document /index.html, regardless of the domain
#
# index.html
#   matches nothing, since it would be interpreted as a domain name and
#   there is no top-level domain called ".html".
#
# The matching of the domain part offers some flexible options: If the
# domain starts or ends with a dot, it becomes unanchored at that end:
#
# www.example.com
#   matches only www.example.com
#
# .example.com
#   matches any domain that ENDS in .example.com
#
# www.
#   matches any domain that STARTS with www.
#
# Additionally, there are wildcards that you can use in the domain names
# themselves. They work much like shell wildcards: "*" stands for
# zero or more arbitrary characters, "?" stands for any single character,
# and you can define character classes in square brackets. All of these
# can be freely mixed:
#
# ad*.example.com
#   matches adserver.example.com, ads.example.com, etc., but not
#   sfads.example.com
#
# *ad*.example.com
#   matches all of the above
#
# .?pix.com
#   matches www.ipix.com, pictures.epix.com, a.b.c.d.e.upix.com, etc.
#
# www[1-9a-ez].example.com
#   matches www1.example.com, www4.example.com, wwwd.example.com,
#   wwwz.example.com, etc., but not wwww.example.com
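#
# As a further illustration (these hosts are invented for the example, and
# the results are read off the rules above rather than tested against a
# running Junkbuster):
#
# img?.example.net
#   would match img1.example.net and imgx.example.net, but not
#   img10.example.net, because "?" stands for exactly one character
#
# .banner[0-9].example.net
#   would match www.banner7.example.net, since the leading dot leaves the
#   start of the domain unanchored, but not www.bannerx.example.net,
#   because "x" is not in the character class [0-9]
#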
# Paths are specified as regular expressions. A comprehensive discussion of
# regular expressions wouldn't fit here, but (FIXME) someone should paste
# a concise intro to the regex language here.
#
# If Junkbuster was compiled with pcre support (default), Perl compatible
# regular expressions are used. See the pcre/docs/ directory or man perlre
# (also available on http://www.perldoc.com/perl5.6/pod/perlre.html) for
# details.
#
# Please note that matching in the path is CASE INSENSITIVE by default, but
# you can switch to case sensitive by starting the pattern with the "(?-i)"
# switch:
#
# www.example.com/(?-i)PaTtErN.*
#   will match only documents whose path starts with PaTtErN in exactly this
#   capitalization.
#
# Partially case-sensitive and partially case-insensitive patterns are
# possible, but the rules about splitting them up are extremely complex
# - see the PCRE documentation for more information.
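#
# Until a fuller introduction is written, here is a brief, unofficial
# cheat sheet of the regex constructs that the patterns in this file rely
# on (standard Perl/PCRE syntax; see perlre for the authoritative rules):
#
#   .        any single character
#   \.       a literal dot (likewise \? for a literal question mark)
#   x*       zero or more repetitions of x; ".*" means "anything"
#   x+       one or more repetitions of x
#   x?       x is optional
#   [abc]    one character out of a, b or c; [0-9] is any digit
#   (a|b)    either a or b
#
# For example, the path pattern
#
#   /.*/(plain|live|rotate)[-_.]?ads?/
#
# (taken from the generic patterns further down) reads: any leading
# directories, then "plain", "live" or "rotate", optionally followed by
# one of "-", "_" or ".", then "ad" or "ads", then a slash. A made-up path
# such as /img/rotate-ads/banner.gif should therefore match, assuming that
# a path pattern only has to match the beginning of the path.
#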
#############################################################################
#############################################################################
# There are 3 kinds of action:
#
# Boolean (e.g. "block"):
#   +name         # enable
#   -name         # disable
#
# Parameterized (e.g. "hide-user-agent"):
#   +name{param}  # enable and set parameter to "param"
#   -name         # disable
#
# Multi-value (e.g. "add-header", "wafer"):
#   +name{param}  # enable and add parameter "param"
#   -name{param}  # remove the parameter "param"
#   -name         # disable totally
#
# The default (if you don't specify anything in this file) is not to take
# any actions - i.e. completely disabled, so JunkBuster will just be a
# normal, non-blocking, non-anonymizing proxy. You must specifically
# enable the privacy and blocking features you need (although the
# provided default actions file will do that for you).
#
# Later actions always override earlier ones. For multi-valued actions,
# the actions are applied in the order they are specified.
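#
# A small worked example of this rule (the hosts are invented, and the
# outcome is derived from the description above, not from a test run):
#
#   {+block}
#   .example.net
#
#   {-block}
#   www.example.net
#
# A request to www.example.net matches both patterns. The earlier section
# turns "block" on and the later one turns it off again, so the request
# would go through. A request to ads.example.net matches only the first
# pattern and would stay blocked.
#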
#############################################################################
#############################################################################
# +add-header{Name: value}
#   Adds the specified HTTP header, which is not checked for validity.
#   You may specify this many times to specify many headers.
#
# +deanimate-gifs{last}
# +deanimate-gifs{first}
#   Deanimate all animated GIF images, i.e. reduce them to a single
#   frame. This also shrinks the images considerably in size (bytes).
#   If the option "first" is given, the first frame of the animation
#   is used as the replacement. If "last" is given, the last frame of
#   the animation is used instead, which probably makes more sense for
#   most banner animations, but also has the risk of not showing the
#   entire last frame (if it is only a delta to an earlier frame).
#
# +downgrade
#   Downgrade HTTP/1.1 client requests to HTTP/1.0 and downgrade the
#   responses as well. Use this action for servers that use HTTP/1.1
#   protocol features that Junkbuster cannot handle yet.
#
# +fast-redirects
#   Many sites, like yahoo.com, don't just link to other sites.
#   Instead, they will link to some script on their own server,
#   giving the destination as a parameter, which will then redirect
#   you to the final target.
#
#   URLs resulting from this scheme typically look like:
#   http://some.place/some_script?http://some.where-else
#
#   Sometimes, there are even multiple consecutive redirects encoded
#   in the URL. These redirections via scripts make your web browsing
#   more traceable, since the server from which you follow such a link
#   can see where you go to. Apart from that, valuable bandwidth and
#   time is wasted, while your browser asks the server for one redirect
#   after the other. Plus, it feeds the advertisers.
#
#   The +fast-redirects option enables interception of these requests
#   by Junkbuster, which will cut off all but the last valid URL in the
#   request and send a local redirect back to your browser without
#   contacting the remote site.
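#
#   As a sketch of the effect, using the example URL above: with
#   +fast-redirects active, a request for
#
#     http://some.place/some_script?http://some.where-else
#
#   would not be sent to some.place at all. Junkbuster would instead answer
#   with a redirect to
#
#     http://some.where-else
#
#   Sites whose own URLs legitimately contain "http://" can be exempted
#   with a -fast-redirects section, for instance (made-up host, shown only
#   to illustrate the syntax):
#
#     {-fast-redirects}
#     search.example.org/cgi-bin/lookup\?
#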
# +filter{name}
#   Filter the website through one or more regular expression filters.
#   Repeat for multiple filters.
#
#   Filters predefined in the supplied re_filterfile include:
#
#     html-annoyances:  Get rid of particularly annoying HTML abuse
#     js-annoyances:    Get rid of particularly annoying JavaScript abuse
#     no-popups:        Kill all popups in JS and HTML
#     frameset-borders: Give frames a border
#     webbugs:          Squish WebBugs (1x1 invisible GIFs used for user tracking)
#     no-refresh:       Automatic refresh sucks on auto-dialup lines
#     fun:              Text replacements for subversive browsing fun!
#     nimda:            Remove (virus) Nimda code.
#     banners-by-size:  Kill banners by size
#     crude-parental:   Kill all web pages that contain the words "sex" or "warez"
#
# +hide-forwarded
#   Block any existing X-Forwarded-for header, and do not add a new one.
#
# +hide-from{block}
# +hide-from{spam@sittingduck.xqq}
#   If the browser sends a "From:" header containing your e-mail address,
#   either completely remove the header ("block"), or change it to the
#   specified e-mail address.
#
# +hide-referer{block}
# +hide-referer{forge}
# +hide-referer{http://nowhere.com}
#   Don't send the "Referer:" (sic) header to the web site. You can
#   block it, forge a URL to the same server as the request (which is
#   preferred because some sites will not send images otherwise) or
#   set it to a constant string.
#
# +hide-referrer{...}
#   Alternative spelling of +hide-referer. Has the same parameters,
#   and can be freely mixed with "+hide-referer". ("referrer" is the
#   correct English spelling, however the HTTP specification has a
#   bug - it requires it to be spelt "referer").
#
# +hide-user-agent{browser-type}
#   Change the "User-Agent:" header so web servers can't tell your
#   browser type. (Breaks many web sites). Specify the user-agent
#   value you want - e.g., to pretend to be using Netscape on Linux:
#     +hide-user-agent{Mozilla (X11; I; Linux 2.0.32 i586)}
#   Or to identify yourself explicitly as a JunkBuster user:
#     +hide-user-agent{JunkBuster/1.0}
#   (Don't change the version number from 1.0 - after all, why tell them?)
#
# +image
#   Treat this URL as an image. This only matters if it's also "+block"ed,
#   in which case a "blocked" image can be sent rather than an HTML page.
#   See +image-blocker{} for the control over what is actually sent.
#
# +image-blocker{logo}
# +image-blocker{blank}
# +image-blocker{pattern}
# +image-blocker{<URL>}    (with <URL> being any valid image URL)
#   Decides what to do with URLs that end up tagged with {+block +image}.
#   There are 5 options. "-image-blocker" will send an HTML "blocked" page,
#   usually resulting in a "broken image" icon. "+image-blocker{logo}"
#   will send a "JunkBuster" image. "+image-blocker{blank}" will send
#   a 1x1 transparent image, "+image-blocker{pattern}" will send a 4x4
#   grey/white pattern which is less intrusive than the logo but easier
#   to recognize than the transparent one. And finally, "+image-blocker{<URL>}"
#   will send an HTTP temporary redirect to the specified image URL.
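#
#   Put together, a section along these lines (a sketch with a made-up
#   host, not one of the shipped rules) would block a banner server and
#   answer each blocked request with the 1x1 transparent image:
#
#     {+block +image +image-blocker{blank}}
#     banners.example.net
#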
# +limit-connect{portlist}
#   The CONNECT method exists in HTTP to allow access to secure websites
#   (https:// URLs) through proxies. It works very simply: The proxy
#   connects to the server on the specified port, and then short-circuits
#   its connections to the client and to the remote server.
#   This can be a big security hole, since CONNECT-enabled proxies can
#   be abused as TCP relays very easily.
#   By default, i.e. in the absence of a +limit-connect action, Junkbuster
#   will only allow CONNECT requests to port 443, which is the standard port
#   for https.
#   If you want to allow CONNECT for more ports than that, or want to forbid
#   CONNECT altogether, you can specify a comma separated list of ports and port
#   ranges (the latter using dashes, with the minimum defaulting to 0 and max to 65K):
#
#   +limit-connect{443} # This is the default and need not be specified.
#   +limit-connect{80,443} # Ports 80 and 443 are OK.
#   +limit-connect{-3, 7, 20-100, 500-} # Ports less than 3, 7, 20 to 100, and above 500 are OK.
#
# +no-compression
#   Prevent the website from compressing the data. Some websites do
#   that, which is a problem for Junkbuster, since +filter, +no-popup
#   and +deanimate-gifs will not work on compressed data. Will slow down
#   connections to those websites, though.
#
# +no-cookies-keep
#   If the website sets cookies, make sure they are erased when you exit
#   and restart your web browser. This makes profiling cookies useless,
#   but won't break sites which require cookies so that you can log in
#   or for transactions.
#
# +no-cookies-read
#   Prevent the website from reading cookies
#
# +no-cookies-set
#   Prevent the website from setting cookies
#
# +no-popup
# +no-popups
#   Filter the website through a built-in filter to disable
#   window.open() etc. The two alternative spellings are equivalent.
#
# +vanilla-wafer
#   This action only applies if you are using a jarfile. It sends a
#   cookie to every site stating that you do not accept any copyright
#   on cookies sent to you, and asking them not to track you. Of
#   course, this is a (relatively) unique header they could use to
#   track you.
#
# +wafer{name=value}
#   This allows you to add an arbitrary cookie. Specify it multiple
#   times in order to add several cookies.
#############################################################################
#############################################################################
#############################################################################
#############################################################################
# You can define a short form for a list of permissions - e.g., instead
# of "-no-cookies-set -no-cookies-read -filter -fast-redirects", you can
# just write "shop". This is called an alias.
#
# Currently, an alias can contain any character except space, tab, '=', '{'
# or '}'.
# But please use only 'a'-'z', '0'-'9', '+', and '-'.
#
# Alias names are not case sensitive.
#
# Aliases beginning with '+' or '-' may be used for system permission names
# in future releases - so try to avoid alias names like this. (e.g.
# "+no-cookies" below is not a good name)
#
# Aliases must be defined before they are used.
+no-cookies = +no-cookies-set +no-cookies-read
-no-cookies = -no-cookies-set -no-cookies-read
+imageblock = +block +image
# Fragile sites should have the minimum changes
fragile = -block -deanimate-gifs -fast-redirects -filter -hide-referer -no-cookies -no-popups
# Shops should be allowed to set persistent cookies
shop = -filter -no-cookies -no-cookies-keep
# Your favourite blend of filters:
myfilters = +filter{html-annoyances} +filter{js-annoyances} +filter{no-popups}\
+filter{webbugs} +filter{nimda} +filter{banners-by-size}
#... etc. Customize to your heart's content.
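#
# Once defined, an alias is used like a built-in action name. For instance
# (a sketch with a made-up host, not a shipped rule):
#
#   {shop}
#   .shop.example.com
#
# should have the same effect as writing
#
#   {-filter -no-cookies-set -no-cookies-read -no-cookies-keep}
#   .shop.example.com
#
# because "shop" expands to "-filter -no-cookies -no-cookies-keep" and
# "-no-cookies" in turn expands to "-no-cookies-set -no-cookies-read".
#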
#############################################################################
#############################################################################
+hide-referer{forge} \
+image-blocker{http://i.j.b/send-banner} \
#############################################################################
# A useful site for testing - shows all headers:
# http://privacy.net/analyze/
#############################################################################
{+add-header{X-Privacy: Yes please} \
+add-header{X-User-Tracking: No thanks!} -filter}
#############################################################################
# Test for new GIF deanimation feature.
# Just try http://www.oesterhelt.org/deanimate-demo with and without it.
#############################################################################
{+deanimate-gifs{last}}
www.oesterhelt.org/deanimate-demo
#############################################################################
# Sites that need cookies
# FIXME: Now cookies are allowed by default, do any of these sites
# need persistent cookies?
#############################################################################
#############################################################################
# These sites are very complex and require
# minimal interference.
#############################################################################
.office.microsoft.com
.windowsupdate.microsoft.com
#############################################################################
# Shopping sites - still want to block ads.
#############################################################################
.worldpay.com # for quietpc.com
#############################################################################
# These shops require pop-ups
#############################################################################
#############################################################################
# Sometimes fast-redirects catches things by mistake
#############################################################################
www.ukc.ac.uk/cgi-bin/wac\.cgi\?
edit.europe.yahoo.com
.altavista.com/.*(like|url|link):http
.altavista.com/trans.*urltext=http
#############################################################################
# Please don't re_filter code!
#############################################################################
#############################################################################
#############################################################################
#############################################################################
#############################################################################
#############################################################################
.ad.preferences.com/image.*
.ad-adex3.flycast.com
.connect.247media.ads.link4ads.com
.mojofarm.mediaplex.com/ad/
www.carbuyer.com/cgi-carbuyer/getimage.cgi
/phpAds(New)?/viewbanner\.php
.ad.de.doubleclick.net
/.*/count\.cgi\?.*df=
*.fxweb.com/v2-trackrun\.cgi
a196.g.akamai.net/7/196/2670/000[1-3]/images\.gmx\.net/.*images/.*/.*/
.smartclicks.com/.*/smart(img|banner|host|bar|site)
.linkexchange.com/.*/showl(ogo|e)
pixel.intares.net/cgi-bin/janus
ar.atwola.com # This serves all ads for CNN and AOL
#############################################################################
#############################################################################
#############################################################################
/.*/(.*[-_.])?ads?[0-9]?(/|[-_.].*|\.(gif|jpe?g))
/.*/(.*[-_.])?count(er)?(\.cgi|\.dll|\.exe|[?/])
/.*/(ng)?adclient\.cgi
/.*/(plain|live|rotate)[-_.]?ads?/
/.*/(sponsor)s?[0-9]?/
###/*.*/(sponsor|banner)s?[0-9]?/
###/*.*/.*banner([-_]?[a-z0-9]+)?\.(gif|jpg)
/?.*/_?(plain|live)?ads?(-banners)?/
/?.*/ad(sdna_image|gifs?)/
/?.*/ad(server|stream|juggler)\.(cgi|pl|dll|exe)
/?.*/adv((er)?ts?|ertis(ing|ements?))?/
/?.*/banner_?anzeigen
/?.*/cgi-bin/centralad/getimage
/?.*/images/addver\.gif
/?.*/images/advert\.gif
/?.*/images/marketing/.*\.(gif|jpe?g)
/?.*/randomads/.*\.(gif|jpe?g)
/?.*/rekla(ma|me|am)/.*\.(gif|jpe?g)
/?.*/sponsors?[0-9]?/
/?.*/werbung/.*\.(gif|jpe?g)
/?.*/adv\. # www.telegraaf.nl
/?.*/advert[0-9]+\.jpg
/bin/getimage.cgi/...\?AD
/bin/nph-oma.count/ct/default.shtml
/bin/nph-oma.count/ix/default.html
/cgi-bin/getimage.cgi/....\?GROUP=
/cgi-bin/webad.dll/ad
/cwmail/amzn-bm1\.gif
/image\.ng/transactionID
/images/.*/.*_anim\.gif # alvin brattli
/ip_img/.*\.(gif|jpe?g)
/netscapeworld/nw-ad/
/promotions/houseads/
/torget/jobline/.*\.gif
/cgi-bin/nph-adclick.exe/
/?.*/Image/BannerAdvertising/
/?.*/adlib/server\.cgi
/?.*/gsa_bs/gsa_bs.cmdl
# for our Finnish friends, by Kai Puolamaki <Kai.Puolamaki@iki.fi>
/?.*/mainos/*.*/.*\.gif
/?.*/mainos/*.*/.*\.jpe?g
# more from a Finnish friend Petri Haapio <pha@iki.fi>
.keltaisetsivut.fi/web/img/\.*gif
.haku.net/pics/pana\.*gif
/?.*/(.*[-_.].*)?maino(kset|nta|s).*(/|\.(gif|html?|jpe?g|png))
/?.*/(ilm(oitus)?|kampanja)(hallinta|kuvat?)(/|\.(gif|html?|jpe?g|png))
# and even more from a Finnish friend Hannu Napari <Hannu.Napari@hut.fi>
194.251.243.50/cgi-bin/banner
www.iltalehti.fi/ilmkuvat
www.mtv3.fi/mainoskuvat
/?.*/images/topics/topicgimp\.gif
.discovery.com/.*banner_id
.idrink.com/frm_bottom.htm
/?.*/ph-ad.*\.focalink\.com
/we_ba/ # hausfrauenseite.de *bwhahahaaaaa*
/.*(ms)?backoff(ice)?.*\.(gif|jpe?g)
/.*(/ie4|/ie3|msie|sqlbans|powrbybo|activex|backoffice|explorer|netnow|getpoint|ntbutton|hmlink).*\.(gif|jpe?g)
/.*activex.*(gif|jpe?g)
/.*explorer?.(gif|jpe?g)
/.*freeie\.(gif|jpe?g)
/.*/ie_?(buttonlogo|static?|anim.*)?\.(gif|jpe?g)
/.*ie_sm\.(gif|jpe?g)
/.*msie(30)?\.(gif|jpe?g)
/.*msnlogo\.(gif|jpe?g)
/.*office97_ad1\.(gif|jpe?g)
/.*pbbobansm\.(gif|jpe?g)
/.*powrbybo\.(gif|jpe?g)
/.*sqlbans\.(gif|jpe?g)
/.*ie4get_animated\.gif
# generally useless information and promo stuff (commented out)
#/.*/(counter|getpcbutton|BuiltByNOF|netscape|hotmail|vcr(rated)?|rsaci(rated)?|freeloader|cache_now(_anim)?|apache_pb|now_(anim_)?button|ie_?(buttonlogo|static?|.*ani.*)?)\.(gif|jpe?g)
/?.*/images/na/us/brand/
/?.*/advantage\.(gif|jpg)
/?.*/advanbar\.(gif|jpg)
/?.*/advanbtn\.(gif|jpg)
/?.*/biznetsmall\.(gif|jpg)
/?.*/utopiad\.(gif|jpg)
/?.*/epipo\.(gif|jpg)
/?.*/amazon([a-zA-Z0-9]+)\.(gif|jpg)
/?.*/bnlogo.(gif|jpg)
/?.*/buynow([a-zA-Z0-9]+)\.(gif|jpg)
# for the Dutch folks by a Dutch friend gertjan@west.nl
.netdirect.nl/nd_servlet/___
# --------------------------------------------------------------------------
# --------------------------------------------------------------------------
# the next two lines work
193.158.37.3/cgi-bin/impact
195.63.104.*/(inbox|log|meld|folderlu|folderru|log(in|out)[lmr]u|)
206.165.5.162/images/gcanim\.gif
207.159.129.131/abacus
207.87.27.10/tool/includes/gifs/
209.1.112.252/adgraph/
209.1.135.14[24]:1971
209.207.224.22[02]/servfu.pl
209.239.37.214/cgi-pilotfaq/getimage\.cgi
209.85.89.183/cgi-bin/cycle\?host
212.63.155.122/(banner|concret|softwareclub)
216.49.10.236/web1000/
.ICDirect.com/cgi-bin
.Shannon.Austria.Eu.net/\.cgi/
# generic hosts (probably most effective)
#/.*/*preferences.com*
.akamaitech.net/.*/Banners/
.altavista.telia.com/av/pix/sponsors/
.amazon.com/g/associates/logos/
.asinglesplace.com/asplink\.gif
.automatiseringgids.nl/gfx/advertenties/
#avenuea.com/Banners/
.befriends.net/personals/matchmaking\.jpg
.bizad.nikkeibp.co.jp
.bs.gsanet.com/gsa_bs/
.cgicounter.puretec.de/cgi-bin/
.ciec.org/images/countdown\.gif
.classic.adlink.de/cgi-bin/accipiter/adserver.exe
#.clickhere.egroups.com/img/
.commonwealth.riddler.com/Commonwealth/bin/statdeploy\?[0-9]+
.dagbladet.no/ann-gif
.dn.adzerver.com/image.ad
.eur.a1.yimg.com/eur.yimg.com/a/
.us.a1.yimg.com/us.yimg.com/a/
#fastcounter.linkexchange.com
.focalink.com/SmartBanner
.freepage.de/cgi-bin/feets/freepage_ext/.*/rw_banner
.freespace.virgin.net/andy.drake
.futurecard.com/images/
.go.com/cimages\?SEEK_
.home.miningco.com/event.ng/.*AdID
image*.narrative.com/news/.*\.(gif|jpe?g)
#image.linkexchange.com
.images.yahoo.com/adv/
.images.yahoo.com/promotions/
.impartner.de/cgi-bin
informer2.comdirect.de:6004/cd/banner2
.infoseek.go.com/cimages
.kaufwas.com/cgi-bin/zentralbanner\.cgi
#leader.linkexchange.com
.linktrader.com/cgi-bin/
.logiclink.nl/cgi-bin/
lucky.theonion.com/cgi-bin/oniondirectin\.cgi
lucky.theonion.com/cgi-bin/onionimp\.cgi
lucky.theonion.com/cgi-bin/onionimpin\.cgi
.mailorderbrides.com/mlbrd2\.gif
.members.sexroulette.com
.messenger.netscape.com
# movielink became moviefone
.moviefone.com/.*(banner|newbutton|(ad|poster).*?\.gif|mmail|bytb|h_(guy|showtick|aML)|m_|icon_|NF_.*?back|h_.*?gif|media/(art|imagelinks(/MF.(ad|sponsor))))
mqgraphics.mapquest.com/graphics/Advertisements/
.news.com/cgi-bin/acc_clickthru
.ngserve.pcworld.com/adgifs/
.promotions.yahoo.com
.qsound.com/tracker/tracker.exe
.resource-marketing.com/tb/
.rtl.de/homepage/wb/images/
.schnellsuche.de/images/*
.shout-ads.com/cgibin/shout.php3
.sjmercury.com/advert/
.smartclicks.com/.*/smart(img|banner|host|bar|site)
.static.wired.com/advertising/
.sysdoc.pair.com/cgi-sys/cgiwrap/sysdoc/sponsor\.gif
.t-online.de/home/040255162-001/*
.teleauskunft.de/commercial/
.tvguide.com/rbitmaps/
.ultra.multimania.com
.us.yimg.com/promotions/
.videoserver.kpix.com
.washingtonpost.com/wp-adv/
.webconnect.net/cgi-bin/webconnect.dll
.webserv.vnunet.com/ip_img/.*ban
.werbung.pro-sieben.de/cgi-bin
.whatis.com/cgi-bin/getimage.exe/
www..bigyellow.com/......mat.*
www.addme.com/link8\.gif
www.aftonbladet.se/annons
www.americanpassage.com/
www.angelfire.com/in/twistriot/images/wish4\.gif
www.bizlink.ru/cgi-bin/irads\.cgi
www.blacklightmedia.com/adlemur
www.bluesnews.com/flameq\.gif
www.bluesnews.com/images/ad[0-9]+\.gif
www.bluesnews.com/images/gcanim3\.gif
www.bluesnews.com/images/throbber2\.gif
www.bluesnews.com/miscimages/fragbutton\.gif
www.businessweek.com/sponsors/
www.canoe.ca/AdsCanoe/
www.cdnow.com/MN/client.banners
www.clicmoi.com/cgi-bin/pub\.exe
www.dailycal.org/graphics/adbanner-ab\.gif
www.detelefoongids.com/pic/[0-9]*
www.dhd.de/CGI/werbepic
www.dsf.de/cgi-bin/site_newiac.adpos
www.firsttarget.com/cgi-bin/klicklog.cgi
www.forbes.com/forbes/gifs/ads
www.forbes.com/tool/includes/gifs/
www.fxweb.holowww.com/.*\.cgi
www.geocities.com/TimesSquare/Zone/5267/
www.goto.com/images-promoters/
www.handelsblatt.de/hbad
www.hotlinks.de/cgi-bin/barimage\.cgi
www.infoseek.com/cimages
www.infoworld.com/pageone/gif
www.isys.net/customer/images
www.javaworld.com/javaworld/jw-ad
www.kron.com/place-ads/
www.leo.org/leoclick/
#www.linkexchange.ru/cgi-bin/erle\.cgi
www.linkstation.de/cgi-bin/zeige
www.linux.org/graphic/miniature/
www.linux.org/graphic/square/
www.linux.org/graphic/standard/
www.luncha.se/annonsering
www.ml.org/gfx/spon/icom/
www.ml.org/gfx/spon/wmv
www.musicblvd.com/mb2/graphics/netgravity/
www.news.com/Midas/Images/
www.newscientist.com/houseads
www.nextcard.com/affiliates/
www.nikkeibp.asiabiztech.com/image/NAIS4\.gif
www.nordlys.no/imaker/.*/.*/.*/.....\.gif # alvin brattli
www.nordlys.no/imaker/.*/.*/.*/..003 # alvin brattli
www.oanda.com/server/banner
www.oneandonlynetwork.com
www.page2page.de/cgi-bin/
www.prnet.de/.*/bannerschnippel/.*\.(gif|jpe?g)
www.promptsoftware.com/marketing/
#www.reklama.ru/cgi-bin/banners/
www.riddler.com/sponsors/
www.rle.ru/cgi-bin/erle\.cgi
www.rock.com/images/affiliates/search_black\.gif
www.rtl.de/search/.*kunde
#www.search.com/Banners
www.sfgate.com/place-ads/
www.shareware.com/midas/images/borders-btn\.gif
#www.sjmercury.com/products/marcom/banners/
www.smartclicks.com:81
www.sol.dk/graphics/portalmenu
www.sponsornetz.de/jump/show.exe
www.sunworld.com/sunworldonline/icons/adinfo.sm\.gif
www.swwwap.com/cgi-bin/
www.telecom.at/icons/.*film\.(gif|jpe?g)
www.theonion.com/bin/
www.topsponsor.de/cgi-bin/show.exe
www.ugu.com/images/EJ\.gif
www.warzone.com/pics/banner/
www.warzone.com/wzfb/ads.cgi
www.websitepromote.com/partner/img/
www.winjey.com/onlinewerbung/*\.gif
www.wishing.com/webaudit
www.www-pool.de/cgi-bin/banner-pool
www2.blol.com/agrJRU\.gif
.yahoo.com/CategoryID=0
www.bannerland.de/click.exe
www.slate.com/redirect/
www.slate.com/articleimages/
www.forbes.com/tool/images/frontend/
.pathfinder.com/shopping/marketplace/images/
static.wired.com/images
.perso.estat.com/cgi-bin/perso/
#dinoadserver1.roka.net
.fooladclient*.fool.com
.affiliate.aol.com/static/
# www.sunday-times.co.uk
www.sunday-times.co.uk/standing/newsint/ticker
#NeXgo (ex Germany.Net)
# Block as much of GeoCities as possible
# All geocities-owned images
www.geocities.com/images
www.geocities.com/MemberBanners/live/
pic.geocities.com/images
# And the popup (it still pops up, but does not eat up precious bandwidth)
#www.geocities.com/ad_container/pop.html # already fixed by other regexp
# from corion@informatik.uni-frankfurt.de
#ads.xmonitor.net/xadengine.cgi # fixed by above regexp
# Also block the Japanese geocities popups
www.geocities.co.jp/images
# Also block the come.to, surf.to etc. popups
# Also block the xoom stuff.
home.talkcity.com/homepopup.html.*
# Max Maischein <max.maischein@econsult.de> again ...
# Halflife.net uses WON banners
# Banners from Freeserve
#banner.freeservers.com/cgi-bin/fs_adbar # fixed by above regexp
# And those nasty va-popups !
/?.*/?va_banner.html
# And an all-around hit against advert*.jpg
/?.*/advert[0-9]+\.jpg
# And yet another Internet Explorer gif ...
# Some uninteresting buttons I think...
.mircx.com/images/buttons/
services.mircx.com/.*\.gif
# Easyspace - yet another "free disk space" provider with <yuck> banner popups
www.easyspace.com/(fpub)?banner.html
www.easyspace.com/100\.gif
# Some Russian banner exchanges
.banner.ricor.ru/cgi-bin/banner.pl
#www.bizlink.ru/cgi-bin/irads.cgi # already fixed by other regexp
stx9.sextracker.com/stx/send/
# And even more of geocities :
www.geocities.com/pictures/
# Gaah - www.angelfire.com - another webspace provider with popups
.angelfire.com/sys/download.html
# Gamasutra.com uses this ad provider
sally.songline.com/@
# Eule.de (search engine)
# maybe images.eule.de as a whole...
www.eule.de/cgi-bin/
images.eule.de/comdirect\.gif
images.eule.de/wp\.gif
.aladin.de/125_1\.gif
images.eule.de/neu/books\.gif
# --------------------------------------------------------------------------
# --------------------------------------------------------------------------
# some images on cnn's website just suck!
/.*cnnpostopinionhome.\.gif
/.*custom_feature\.gif
/.*explore.anim.*gif
/.*pathnet.warner\.gif
/.*images/cnnfn_infoseek\.gif
/.*images/pathfinder_btn2\.gif
/.*img/gen/fosz_front_em_abc\.gif
/.*img/promos/bnsearch\.gif
/.*navbars/nav_partner_logos\.gif
/BarnesandNoble/images/bn.recommend.box.*
/digitaljam/images/digital_ban\.gif
/hotstories/companies/images/companies_banner\.gif
/markets/images/markets_banner\.gif
/ows-img/bnoble\.gif
/ows-img/nb_Infoseek\.gif
.cnn.com/images/custom/totale\.gif
.cnn.com/images/lotd/custom.wheels\.gif
.cnn.com/images/.*/by/main.12\.gif
.cnn.com/images/.*/find115\.gif
.cnn.com/.*/free.email.120\.gif
.cnnfn.com/images/left_banner\.gif
www.cnn.com/images/.*/bn/books\.gif
www.cnn.com/images/.*/pointcast\.gif
www.cnn.com/images/.*/fusa\.gif
.cnn.com/images/.*/start120\.gif
images.cnn.com/SHOP/
# the / indicates the beginning of the path (and no longer the FQDN)
/gif/buttons/banner_
/gif/buttons/cd_shop_
/gif/cd_shop/cd_shop_ani_
/av/gifs/av_map\.gif
/av/gifs/av_logo\.gif
/av/gifs/new/ns\.gif
altavista.com/i/valsdc3\.gif
jump.altavista.com/gn_sf
tucows./images/locallogo\.gif
# simpliemu.hypermart.net/frames.html
.go2net.com/mgic/adpopup
.go2net.com/metaspy/images/exposed\.gif
.go2net.com/metaspy/images/ms_un\.gif
www.cebu-usa.com/cwbanim1\.gif
www.cebu-usa.com/Connection\.jpg
www.cebu-usa.com/phonead\.gif
www.cebu-usa.com/ban3\.jpg
www.cebu-usa.com/tlban\.gif
www.cebu-usa.com/apwlogo1\.gif
www.cebu-usa.com/rose\.gif
www.fnet.de/img/geldboerselogo\.jpg
# hirsch@mathcs.emory.edu
/images/getareal2\.gif
www.assalom.com/aziza/logos/cniaffil\.gif
www.assalom.com/aziza/logos/4starrl1\.gif
www.phantomstar.com/images/media/m1\.gif
.wahlstreet.de/MediaW\$/tsponline\.gif
.wahlstreet.de/MediaW\$/dzii156x60\.gif
.wahlstreet.de/MediaW\$/etban156x60_2_opt2\.gif
/pics/getareal1\.gif
/ltbs/cgi-bin/click.cgi
.linuxtoday.com/ltbs/pics/
/include/watermark/v2/
1188 # Reinier Bikker <R.P.Bikker@phys.uu.nl>
1191 # Mark Lutz <luma@nikocity.de>
1192 /.*/*werb.*\.(gif|jpe?g) # hope that's not to restrictive
1194 #Free Yellow thing at bottom of pages (HereticPC)
1195 www.freeyellow.com/images/powerlink5a\.gif
1196 www.freeyellow.com/images/powerlink5b\.gif
1197 www.freeyellow.com/images/powerlink5c\.gif
1198 www.freeyellow.com/images/powerlink5d\.gif
1199 www.freeyellow.com/images/powerlink5e\.gif
1202 www.eads.com/images/refbutton\.gif
1203 www.fortunecity.com/console2/newnav/*
1204 www.goldetc.net/search\.gif
1205 www.cris.com/~Lzrdking/carpix/cars3-le\.gif
1206 www.justfreestuff.com/scott\.gif
1207 www.cyberthrill.com/entrance\.gif
1208 secure.pec.net/images/pec69ani\.gif
1209 www.new-direction.com/avviva\.gif
1210 /.*internetmarketingcenter\.gif
1211 www.new-direction.com/wp-linkexchange-loop\.gif
1212 www.new-direction.com/windough\.gif
1213 www.digitalwork.com/universal_images/affiliate/dw_le_3\.gif
1214 service.bfast.com/bfast/click/*
1215 www.new-direction.com/magiclearning\.gif
1216 www.new-direction.com/mailloop\.gif
1218 www.free-banners.com/images/hitslogo\.gif
1219 rob.simplenet.com/dyndns/fortune5\.gif
1220 .nasdaq-amex.com/images/bn_ticker\.gif
1223 # navilor@hotmail.com
1226 # wayne@staff.msen.com
1228 a*.*.*.yimg.com/([0-9]*|\/)*us.yimg.com/*
1231 www.realtop50.com/cgi-bin/ad
1235 www.yacht.de/images/(my_ani|eissingani|chartertrans|fum|schnupper|fysshop|garmin)\.gif
1236 www.sponsorweb.de/web-sponsor/nt-bin/show.exe
1239 # Club-internet pops up a complaint if you refuse the cookie (still pops up...)
1240 perso.club-internet.fr/html/Popup/popup_frame_nocookie.html
1241 perso.club-internet.fr/pagesperso/popup_nocookie.html
1243 .gmx.net/images/newsbanner/
1246 .quicken.lexware.de/images/us7-468x60.gif
1247 /img/special/chatpromo\.gif
1248 www.travelocity.com/images/promos/
1250 # wonder what that does...
1253 #/*.*/phpAds/viewbanner.php
1254 #/*.*/phpAds/phpads.php
1256 www.linux-magazin.de/banner
1257 .comtrack.comclick.com
1259 .iac-online.de/filler
1261 .media.interadnet.com
1262 .stat.www.fi/cgi-bin
1266 .disneystoreaffiliates.com
1268 .powerwork.mobile.de/cgi-bin/getimage\.cgi
1272 ####################################################
1275 # The Register ads - oh, and all images in Register stories (sigh).
1276 www.theregister.co.uk/media/
1278 # Used on http://www.theregister.co.uk/
1279 # Sample advert URL:
1280 # http://secure.webconnect.net/cgi-bin/webconnecthome.dll?F467
1284 www.dilbert.com/comics/dilbert/images/.*_140x800.*\.gif
1287 # Uses URL: http://www.stattrack.com/cgi-bin/stats/image.cgi
1289 # And loads JavaScript from http://www.stattrack.com/stats/code
1290 www.stattrack.com/stats/
1292 #Now that they're Yahoo GeoCities, their junk is in a different place.
1293 ##geo.yahoo.com/serv
1294 ##visit.geocities.com/visit.gif
1295 .yimg.com/?.*/www.geocities.com/js_source
1296 #http://us.toto.geo.yahoo.com/toto?s=76001086
1298 .visit.geocities.com
1299 .yimg.com/?.*/www.geocities.com/
1301 #http://counter16.bravenet.com/counter.php
1304 #http://stat.cybermonitor.com/7emezone_p?1707_USdvd
1307 #http://members.tripod.com/adm/popup/.....
1308 members.tripod.com/adm/popup/
1310 #This is the worst ad idea ever!
1311 #count.exitexchange.com/exit/1100661
1312 #count.exitexchange.com/clients/navbar.html
1313 #(used in http://skyhivisuals.tripod.com/malfunctions_.htm)
1319 #This site traps the browser
1322 #privacy.net runs ads
1325 #Lindsay.Marshall@newcastle.ac.uk suggested these, to kill Opera adverts:
1330 dinoadserver*.roka.net
1332 logout.tvspielfilm.de
1334 www.freenet.de/customerindex\.html
1336 .fxweb.com/v2-trackrun\.cgi
1337 rtldating.peopleunited.de
1339 www.zdnet.com/fcgi-bin/
1340 service.bfast.com/bfast/serve
1342 fourohfour.nbci.com/Members404Error.php3
1345 www.fair-ist-mehr.de/cgi-bin/bt.pl
1355 #############################################################################
1357 #############################################################################
1360 www.userfriendly.org/images/banners/banner_dp_heart\.gif
1363 #Why were these in the Waldherr blockfile?
1365 #a*.*.*.yimg.com/([0-9]|\/)*us.yimg.com/i/*
1367 # some regexps are simply too aggressive ...
1369 # equalizer to /*.*(.*[-_.])?ads?[0-9]?(/|[-_.].*|.(gif|jpe?g))
1380 .ad.siemens.de # SIEMENS Automation & Drives
1381 #add-url.altavista.com
1388 # universities don't advertise, do they :-)
1390 .ac.uk # English Universities too! - Jon
1391 .uni-*.de # What about Germany? --oes
1392 www.ugu.com/sui/ugu/adv
1396 clubs.yahoo.com/clubs
1397 edit.my.yahoo.com/config/show_identity
1398 www.ix.de/newsticker/data/ad
1399 www.heise.de/newsticker/data/ad
1400 www.careernet.de/anzeige
1401 www.careernet.de/bewerber/stellenanzeigen
1402 www.baumgartner.de/stellenmarkt/anzeigen
1403 www.dspartner.de/Anzeigen
1404 www.aws-jobs.de/Anzeigen
1405 www.jobware.de/.*/anzeigen/
1406 www.jobworld.de/bilder/
1407 www.cnn.com/TECH/computing/.*/internet.ads/
1408 www.financial.de/shop/
1412 194.221.152.2/phptelefontmp
1413 .harvard.edu/images/banner/
1416 www.dhd.de/CGI/anzeigen/
1419 .img.web.de/web/img/
1421 www.segel.de/menu/bilder/anzeigen\.gif
1422 www.corel.com/graphics/banners/
1423 www.software.ibm.com/ad/
1424 www.omg.org/docs/ad/
1426 .sperrmuell.de/scripts/anzeigen
1427 www.freenet.de/index.html
1428 www.01019freenet.de/index.html
1429 www.freenet.de/freenet/
1430 www.01019freenet.de/freenet/
1431 webfactory.de/anzeigen.php
1433 www.internatif.org/bortzmeyer/debian/sponsor/
1436 www.software.hosting.ibm.com/ad/
1437 www.ibm.com/software/ad/
1440 www.debian.org/Pics/banner-blue\.gif
1441 www.linux.de/pics/Nachrichten_banner\.gif
1444 finder.shopping.yahoo.com/shop/
1454 .consumer-direct.com
1459 # my banking stuff => no ads.
1465 # Jon's addition: MSDN
1470 .freemail*.web.de/online/ordner/anzeigen
1471 foggy.sda.t-online.de
1472 .us.i1.yimg.com/us.yimg.com/i/pim/ad2.gif
1473 www.nexgo.de/.*/bg_banner.jpg
1475 # .*ads. matches prdownloads.sourceforge.net and many other download sites