1 ######################################################################
3 # File : $Source: /cvsroot/ijbswa/current/basic.action,v $
5 # $Id: basic.action,v 1.3 2002/03/26 22:29:54 swa Exp $
7 # Purpose : Default actions file, see
8 # http://www.privoxy.org/faq/questions.html#CONFIGFILES
10 # Copyright : Written by and Copyright
11 # Privoxy team. http://www.privoxy.org/
13 # Based on the Internet Junkbuster originally written
14 # by and Copyright (C) 1997 Anonymous Coders and
15 # Junkbusters Corporation. http://www.junkbusters.com
17 # We value your feedback. However, to provide you with the best support,
20 # * Use the support forum to get help:
21 # http://sourceforge.net/tracker/?group_id=11118&atid=211118
22 # * Submit bugs only thru our bug forum:
23 # http://sourceforge.net/tracker/?group_id=11118&atid=111118
24 # Make sure that the bug has not already been submitted. Please try
25 # to verify that it is a Junkbuster bug, and not a browser or site
26 # bug first. If you are using your own custom configuration, please
27 # try the stock configs to see if the problem is a configuration
28 # related bug. And if not using the latest development snapshot,
29 # please try the latest one. Or even better, CVS sources.
30 # * Submit feature requests only thru our feature request forum:
31 # http://sourceforge.net/tracker/?atid=361118&group_id=11118&func=browse
33 # For any other issues, feel free to use the mailing lists:
34 # http://sourceforge.net/mail/?group_id=11118
36 # Anyone interested in actively participating in development and related
37 # discussions can join the appropriate mailing list here:
38 # http://sourceforge.net/mail/?group_id=11118. Archives are available
41 #############################################################################
43 #############################################################################
45 # To determine which actions apply to a request, the URL of the request is
46 # compared to all patterns in this file. Every time it matches, the list of
47 # applicable actions for this URL is incrementally updated. You can trace
48 # this process by visiting http://i.j.b/show-url-info
50 # There are 4 types of lines in this file: comments (like this line),
51 # actions, aliases and patterns, all of which are explained below.
53 #############################################################################
55 #############################################################################
57 # 1. On Domains and Paths
58 # -----------------------
60 # Generally, a pattern has the form <domain>/<path>, where both the <domain>
61 # and <path> part are optional. If you only specify a domain part, the "/"
65 # is a domain-only pattern and will match any request to www.yahoo.com
68 # means exactly the same (but is slightly less efficient)
70 # www.example.com/index.html
71 # matches only the document /index.html on www.example.com
74 # matches the document /index.html, regardless of the domain
77 # matches nothing, since it would be interpreted as a domain name and
78 # there is no top-level domain called ".html".
83 # The matching of the domain part offers some flexible options: If the
84 # domain starts or ends with a dot, it becomes unanchored at that end:
87 # matches only www.example.com
90 # matches any domain that ENDS in .example.com
93 # matches any domain that STARTS with www.
95 # Additionally, there are wildcards that you can use in the domain names
96 # themselves. They work pretty similar to shell wildcards: "*" stands for
97 # zero or more arbitrary characters, "?" stands for one, and you can define
98 # charachter classes in square brackets and they can be freely mixed:
101 # matches adserver.example.com, ads.example.com, etc but not sfads.example.com
104 # matches all of the above
107 # matches www.ipix.com, pictures.epix.com, a.b.c.d.e.upix.com etc
109 # www[1-9a-ez].example.com
110 # matches www1.example.com, www4.example.com, wwwd.example.com,
111 # wwwz.example.com etc, but not wwww.example.com
118 # Paths are specified as regular expressions. A comprehensive discussion of
119 # regular expressions wouldn't fit here, but (FIXME) someone should paste
120 # a concise intro to the regex language here.
122 # If Junkbuster was compiled with pcre support (default), Perl compatible
123 # regular expressions are used. See the pcre/docs/ direcory or man perlre
124 # (also available on http://www.perldoc.com/perl5.6/pod/perlre.html) for
127 # Please note that matching in the path is CASE INSENSITIVE by default, but
128 # you can switch to case sensitive by starting the pattern with the "(?-i)"
131 # www.example.com/(?-i)PaTtErN.*
132 # will match only documents whose path starts with PaTtErN in exactly this
135 # Partially case-sensetive and partially case-insensitive patterns are
136 # possible, but the rules about splitting them up are extremely complex
137 # - see the PCRE documentation for more information.
139 #############################################################################
141 #############################################################################
143 # There are 3 kinds of action:
145 # Boolean (e.g. "block"):
149 # Parameterized (e.g. "hide-user-agent"):
150 # +name{param} # enable and set parameter to "param"
153 # Multi-value (e.g. "add-header", "wafer"):
154 # +name{param} # enable and add parameter "param"
155 # -name{param} # remove the parameter "param"
156 # -name # disable totally
158 # The default (if you don't specify anything in this file) is not to take
159 # any actions - i.e completely disabled, so JunkBuster will just be a
160 # normal, non-blocking, non-anonymizing proxy. You must specifically
161 # enable the privacy and blocking features you need (although the
162 # provided default actions file will do that for you).
164 # Later actions always override earlier ones. For multi-valued actions,
165 # the actions are applied in the order they are specified.
167 #############################################################################
169 #############################################################################
171 # +add-header{Name: value}
172 # Adds the specified HTTP header, which is not checked for validity.
173 # You may specify this many times to specify many headers.
178 # +deanimate-gifs{last}
179 # +deanimate-gifs{first}
180 # Deanimate all animated GIF images, i.e. reduce them to their last
181 # frame. This will also shrink the images considerably. (In bytes,
183 # If the option "first" is given, the first frame of the animation
184 # is used as the replacement. If "last" is given, the last frame of
185 # the animation is used instead, which propably makes more sense for
186 # most banner animations, but also has the risk of not showing the
187 # entire last frame (if it is only a delta to an earlier frame).
190 # Downgrade HTTP/1.1 client requests to HTTP/1.0 and downgrade the
191 # responses as well. Use this action for servers that use HTTP/1.1
192 # protocol features that Junkbuster currently can't handle yet.
195 # Many sites, like yahoo.com, don't just link to other sites.
196 # Instead, they will link to some script on their own server,
197 # giving the destination as a parameter, which will then redirect
198 # you to the final target.
200 # URLs resulting from this scheme typically look like:
201 # http://some.place/some_script?http://some.where-else
203 # Sometimes, there are even multiple consecutive redirects encoded
204 # in the URL. These redirections via scripts make your web browing
205 # more traceable, since the server from which you follow such a link
206 # can see where you go to. Apart from that, valuable bandwidth and
207 # time is wasted, while your browser aks the server for one redirect
208 # after the other. Plus, it feeds the advertisers.
210 # The +fast-redirects option enables interception of these requests
211 # by junkbuster, who will cut off all but the last valid URL in the
212 # request and send a local redirect back to your browser without
213 # contacting the remote site.
216 # Filter the website through one or more regular expression filters.
217 # Repeat for multiple filters.
219 # Filters predefined in the supplied re_filterfile include:
221 # html-annoyances: Get rid of particularly annoying HTML abuse
222 # js-annoyances: Get rid of particularly annoying JavaScript abuse
223 # no-poups: Kill all popups in JS and HTML
224 # frameset-borders: Give frames a border
225 # webbugs: Squish WebBugs (1x1 invisible GIFs used for user tracking)
226 # no-refresh: Automatic refresh sucks on auto-dialup lines
227 # fun: Text replacements for subversive browsing fun!
228 # nimda: Remove (virus) Nimda code.
229 # banners-by-size: Kill banners by size
230 # crude-parental: Kill all web pages that contain the words "sex" or "warez"
234 # Block any existing X-Forwarded-for header, and do not add a new one.
237 # +hide-from{spam@sittingduck.xqq}
238 # If the browser sends a "From:" header containing your e-mail address,
239 # either completely removes the header ("block"), or change it to the
240 # specified e-mail address.
242 # +hide-referer{block}
243 # +hide-referer{forge}
244 # +hide-referer{http://nowhere.com}
245 # Don't send the "Referer:" (sic) header to the web site. You can
246 # block it, forge a URL to the same server as the request (which is
247 # preferred because some sites will not send images otherwise) or
248 # set it to a constant string.
250 # +hide-referrer{...}
251 # Alternative spelling of +hide-referer. Has the same parameters,
252 # and can be freely mixed with, "+hide-referer". ("referrer" is the
253 # correct English spelling, however the HTTP specification has a
254 # bug - it requires it to be spelt "referer").
256 # +hide-user-agent{browser-type}
257 # Change the "User-Agent:" header so web servers can't tell your
258 # browser type. (Breaks many web sites). Specify the user-agent
259 # value you want - e.g., to pretend to be using Netscape on Linux:
260 # +hide-user-agent{Mozilla (X11; I; Linux 2.0.32 i586)}
261 # Or to identify yourself explicitly as a JunkBuster user:
262 # +hide-user-agent{JunkBuster/1.0}
263 # (Don't change the version number from 1.0 - after all, why tell them?)
266 # Treat this URL as an image. This only matters if it's also "+block"ed,
267 # in which case a "blocked" image can be sent rather than a HTML page.
268 # See +image-blocker{} for the control over what is actually sent.
270 # +image-blocker{blank}
271 # +image-blocker{pattern}
272 # +image-blocker{<URL>} with <url> being any valid image URL
273 # Decides what to do with URLs that end up tagged with {+block +image}.
274 # There are 4 options:
275 # * "-image-blocker" will send a HTML "blocked" page, usually
276 # resulting in a "broken image" icon.
277 # * "+image-blocker{blank}" will send a 1x1 transparent image
278 # * "+image-blocker{pattern}" will send a 4x4 grey/white pattern
279 # which is less intrusive than the logo but easier to recognize
280 # than the transparent one.
281 # * "+image-blocker{<URL>}" will send a HTTP temporary redirect
282 # to the specified image URL.
285 # +limit-connect{portlist}
286 # The CONNECT methods exists in HTTP to allow access to secure websites
287 # (https:// URLs) through proxies. It works very simply: The proxy
288 # connects to the server on the specified port, and then short-circuits
289 # its connections to the cliant and to the remote proxy.
290 # This can be a big security hole, since CONNECT-enabled proxies can
291 # be abused as TCP relays very easily.
292 # By default, i.e. in the absence of a +limit-connect action, Junkbuster
293 # will only allow CONNECT requests to port 443, which is the standard port
295 # If you want to allow CONNECT for more ports than that, or want to forbid
296 # CONNECT altogether, you can specify a comma separated list of ports and port
297 # ranges (the latter using dashes, with the minimum defaulting to 0 and max to 65K):
299 # +limit-connect{443} # This is the default and need no be specified.
300 # +limit-connect{80,443} # Ports 80 and 443 are OK.
301 # +limit-connect{-3, 7, 20-100, 500-} # Port less than 3, 7, 20 to 100, and above 500 are OK.
304 # Prevent the website from compressing the data. Some websites do
305 # that, which is a problem for junkbuster, since +filter, +no-popup
306 # and +gif-deanimate will not work on compressed data. Will slow down
307 # connections to those websites, though.
310 # If the website sets cookies, make sure they are erased when you exit
311 # and restart your web browser. This makes profiling cookies useless,
312 # but won't break sites which require cookies so that you can log in
313 # or for transactions.
316 # Prevent the website from reading cookies
319 # Prevent the website from setting cookies
323 # Filter the website through a built-in filter to disable
324 # window.open() etc. The two alternative spellings are
328 # This action only applies if you are using a jarfile. It sends a
329 # cookie to every site stating that you do not accept any copyright
330 # on cookies sent to you, and asking them not to track you. Of
331 # course, this is a (relatively) unique header they could use to
335 # This allows you to add an arbitrary cookie. Specify it multiple
336 # times in order to add several cookies.
338 #############################################################################
341 #############################################################################
343 #############################################################################
345 #############################################################################
347 # You can define a short form for a list of permissions - e.g., instead
348 # of "-no-cookies-set -no-cookies-read -filter -fast-redirects", you can
349 # just write "shop". This is called an alias.
351 # Currently, an alias can contain any character except space, tab, '=', '{'
353 # But please use only 'a'-'z', '0'-'9', '+', and '-'.
355 # Alias names are not case sensitive.
357 # Aliases beginning with '+' or '-' may be used for system permission names
358 # in future releases - so try to avoid alias names like this. (e.g.
359 # "+no-cookies" below is not a good name)
361 # Aliases must be defined before they are used.
365 +no-cookies = +no-cookies-set +no-cookies-read
366 -no-cookies = -no-cookies-set -no-cookies-read
367 +imageblock = +block +image
369 # Fragile sites should have the minimum changes
370 fragile = -block -deanimate-gifs -fast-redirects -filter -hide-referer -no-cookies -no-popups
372 # Shops should be allowed to set persistent cookies
373 shop = -filter -no-cookies -no-cookies-keep
375 # Your favourite blend of filters:
377 myfilters = +filter{html-annoyances} +filter{js-annoyances} +filter{no-popups}\
378 +filter{webbugs} +filter{nimda} +filter{banners-by-size}
380 #... etc. Customize to your heart's content.
382 #############################################################################
384 #############################################################################
396 +hide-referer{forge} \
399 +image-blocker{http://config.privoxy.org/send-banner} \
410 #############################################################################
411 # A useful site for testing - shows all headers:
412 # http://privacy.net/analyze/
413 #############################################################################
414 {+add-header{X-Privacy: Yes please} \
415 +add-header{X-User-Tracking: No thanks!} -filter}
419 #############################################################################
420 # Test for new GIF deanimation feature.
421 # Just try http://www.oesterhelt.org/deanimate-demo with and without it.
422 #############################################################################
423 {+deanimate-gifs{last}}
424 www.oesterhelt.org/deanimate-demo
427 #############################################################################
428 # Sites that need cookies
430 # FIXME: Now cookies are allowed by default, do any of these sites
431 # need persistent cookies?
432 #############################################################################
447 #############################################################################
448 # These sites are very complex and require
449 # minimal interference.
450 #############################################################################
452 .office.microsoft.com
453 .windowsupdate.microsoft.com
456 #############################################################################
457 # Shopping sites - still want to block ads.
458 #############################################################################
461 .worldpay.com # for quietpc.com
465 #############################################################################
466 # These shops require pop-ups
467 #############################################################################
472 #############################################################################
473 # Sometimes fast-redirects catches things by mistake
474 #############################################################################
476 www.ukc.ac.uk/cgi-bin/wac\.cgi\?
478 edit.europe.yahoo.com
480 .altavista.com/.*(like|url|link):http
481 .altavista.com/trans.*urltext=http
485 #############################################################################
486 # Please don't re_filter code!
487 #############################################################################
492 #############################################################################
494 #############################################################################
496 #############################################################################
501 #############################################################################
503 #############################################################################
505 .ad.preferences.com/image.*
508 .ad-adex3.flycast.com
510 .connect.247media.ads.link4ads.com
512 .mojofarm.mediaplex.com/ad/
513 www.carbuyer.com/cgi-carbuyer/getimage.cgi
514 /phpAds(New)?/viewbanner\.php
515 .ad.de.doubleclick.net
516 /.*/count\.cgi\?.*df=
517 *.fxweb.com/v2-trackrun\.cgi
523 a196.g.akamai.net/7/196/2670/000[1-3]/images\.gmx\.net/.*images/.*/.*/
527 .smartclicks.com/.*/smart(img|banner|host|bar|site)
528 .linkexchange.com/.*/showl(ogo|e)
530 pixel.intares.net/cgi-bin/janus
531 ar.atwola.com # This serves all ads for CNN and AOL
533 #############################################################################
535 #############################################################################
537 #############################################################################
538 /.*/(.*[-_.])?ads?[0-9]?(/|[-_.].*|\.(gif|jpe?g))
539 /.*/(.*[-_.])?count(er)?(\.cgi|\.dll|\.exe|[?/])
540 /.*/(ng)?adclient\.cgi
541 /.*/(plain|live|rotate)[-_.]?ads?/
543 /.*/(sponsor)s?[0-9]?/
544 ###/*.*/(sponsor|banner)s?[0-9]?/
545 ###/*.*/.*banner([-_]?[a-z0-9]+)?\.(gif|jpg)
547 /?.*/_?(plain|live)?ads?(-banners)?/
549 /?.*/ad(sdna_image|gifs?)/
550 /?.*/ad(server|stream|juggler)\.(cgi|pl|dll|exe)
555 /?.*/adv((er)?ts?|ertis(ing|ements?))?/
559 /?.*/banner_?anzeigen
563 /?.*/cgi-bin/centralad/getimage
564 /?.*/images/addver\.gif
565 /?.*/images/advert\.gif
566 /?.*/images/marketing/.*\.(gif|jpe?g)
571 /?.*/randomads/.*\.(gif|jpe?g)
572 /?.*/rekla(ma|me|am)/.*\.(gif|jpe?g)
575 /?.*/sponsors?[0-9]?/
579 /?.*/werbung/.*\.(gif|jpe?g)
580 /?.*/adv\. # www.telegraaf.nl
581 /?.*/advert[0-9]+\.jpg
596 /bin/getimage.cgi/...\?AD
597 /bin/nph-oma.count/ct/default.shtml
598 /bin/nph-oma.count/ix/default.html
599 /cgi-bin/getimage.cgi/....\?GROUP=
601 /cgi-bin/webad.dll/ad
603 /cwmail/amzn-bm1\.gif
611 /image\.ng/transactionID
612 /images/.*/.*_anim\.gif # alvin brattli
613 /ip_img/.*\.(gif|jpe?g)
616 /netscapeworld/nw-ad/
617 /promotions/houseads/
621 /torget/jobline/.*\.gif
626 /cgi-bin/nph-adclick.exe/
627 /?.*/Image/BannerAdvertising/
629 /?.*/adlib/server\.cgi
630 /?.*/gsa_bs/gsa_bs.cmdl
634 # for our finnish friends, by Kai Puolamaki <Kai.Puolamaki@iki.fi>
635 /?.*/mainos/*.*/.*\.gif
636 /?.*/mainos/*.*/.*\.jpe?g
638 # more from a finnish friend Petri Haapio <pha@iki.fi>
640 .keltaisetsivut.fi/web/img/\.*gif
641 .haku.net/pics/pana\.*gif
643 /?.*/(.*[-_.].*)?maino(kset|nta|s).*(/|\.(gif|html?|jpe?g|png))
644 /?.*/(ilm(oitus)?|kampanja)(hallinta|kuvat?)(/|\.(gif|html?|jpe?g|png))
646 # and even more from a finnish friend Hannu Napari <Hannu.Napari@hut.fi>
647 194.251.243.50/cgi-bin/banner
651 www.iltalehti.fi/ilmkuvat
652 www.mtv3.fi/mainoskuvat
663 /?.*/images/topics/topicgimp\.gif
664 .discovery.com/.*banner_id
667 .idrink.com/frm_bottom.htm
669 /?.*/ph-ad.*\.focalink\.com
672 /we_ba/ # hausfrauenseite.de *bwhahahaaaaa*
675 /.*(ms)?backoff(ice)?.*\.(gif|jpe?g)
676 /.*(/ie4|/ie3|msie|sqlbans|powrbybo|activex|backoffice|explorer|netnow|getpoint|ntbutton|hmlink).*\.(gif|jpe?g)
677 /.*activex.*(gif|jpe?g)
678 /.*explorer?.(gif|jpe?g)
679 /.*freeie\.(gif|jpe?g)
680 /.*/ie_?(buttonlogo|static?|anim.*)?\.(gif|jpe?g)
681 /.*ie_sm\.(gif|jpe?g)
682 /.*msie(30)?\.(gif|jpe?g)
683 /.*msnlogo\.(gif|jpe?g)
684 /.*office97_ad1\.(gif|jpe?g)
685 /.*pbbobansm\.(gif|jpe?g)
686 /.*powrbybo\.(gif|jpe?g)
687 /.*sqlbans\.(gif|jpe?g)
689 /.*ie4get_animated\.gif
714 # generally useless information and promo stuff (commented out)
715 #/.*/(counter|getpcbutton|BuiltByNOF|netscape|hotmail|vcr(rated)?|rsaci(rated)?|freeloader|cache_now(_anim)?|apache_pb|now_(anim_)?button|ie_?(buttonlogo|static?|.*ani.*)?)\.(gif|jpe?g)
717 /?.*/images/na/us/brand/
718 /?.*/advantage\.(gif|jpg)
719 /?.*/advanbar\.(gif|jpg)
720 /?.*/advanbtn\.(gif|jpg)
721 /?.*/biznetsmall\.(gif|jpg)
722 /?.*/utopiad\.(gif|jpg)
723 /?.*/epipo\.(gif|jpg)
724 /?.*/amazon([a-zA-Z0-9]+)\.(gif|jpg)
725 /?.*/bnlogo.(gif|jpg)
726 /?.*/buynow([a-zA-Z0-9]+)\.(gif|jpg)
731 # for the dutch folks by a dutch friend gertjan@west.nl
734 .netdirect.nl/nd_servlet/___
736 # --------------------------------------------------------------------------
740 # --------------------------------------------------------------------------
742 # the next two lines work
745 193.158.37.3/cgi-bin/impact
752 195.63.104.*/(inbox|log|meld|folderlu|folderru|log(in|out)[lmr]u|)
760 206.165.5.162/images/gcanim\.gif
764 207.159.129.131/abacus
768 207.87.27.10/tool/includes/gifs/
771 209.1.112.252/adgraph/
772 209.1.135.14[24]:1971
777 209.207.224.22[02]/servfu.pl
778 209.239.37.214/cgi-pilotfaq/getimage\.cgi
781 209.85.89.183/cgi-bin/cycle\?host
782 212.63.155.122/(banner|concret|softwareclub)
785 216.49.10.236/web1000/
788 .ICDirect.com/cgi-bin
789 .Shannon.Austria.Eu.net/\.cgi/
794 # generic hosts (probably most effective)
802 #/.*/*preferences.com*
805 .akamaitech.net/.*/Banners/
806 .altavista.telia.com/av/pix/sponsors/
807 .amazon.com/g/associates/logos/
809 .asinglesplace.com/asplink\.gif
811 .automatiseringgids.nl/gfx/advertenties/
812 #avenuea.com/Banners/
815 .befriends.net/personals/matchmaking\.jpg
816 .bizad.nikkeibp.co.jp
817 .bs.gsanet.com/gsa_bs/
820 .cgicounter.puretec.de/cgi-bin/
821 .ciec.org/images/countdown\.gif
822 .classic.adlink.de/cgi-bin/accipiter/adserver.exe
824 #.clickhere.egroups.com/img/
826 .commonwealth.riddler.com/Commonwealth/bin/statdeploy\?[0-9]+
828 .dagbladet.no/ann-gif
831 .dn.adzerver.com/image.ad
836 .eur.a1.yimg.com/eur.yimg.com/a/
837 .us.a1.yimg.com/us.yimg.com/a/
839 #fastcounter.linkexchange.com
841 .focalink.com/SmartBanner
842 .freepage.de/cgi-bin/feets/freepage_ext/.*/rw_banner
843 .freespace.virgin.net/andy.drake
844 .futurecard.com/images/
848 .go.com/cimages\?SEEK_
850 .home.miningco.com/event.ng/.*AdID
854 image*.narrative.com/news/.*\.(gif|jpe?g)
856 #image.linkexchange.com
858 .images.yahoo.com/adv/
859 .images.yahoo.com/promotions/
862 .impartner.de/cgi-bin
863 informer2.comdirect.de:6004/cd/banner2
864 .infoseek.go.com/cimages
866 .kaufwas.com/cgi-bin/zentralbanner\.cgi
867 #leader.linkexchange.com
870 .linktrader.com/cgi-bin/
871 .logiclink.nl/cgi-bin/
872 lucky.theonion.com/cgi-bin/oniondirectin\.cgi
873 lucky.theonion.com/cgi-bin/onionimp\.cgi
874 lucky.theonion.com/cgi-bin/onionimpin\.cgi
876 .mailorderbrides.com/mlbrd2\.gif
879 .members.sexroulette.com
880 .messenger.netscape.com
882 # movielink became moviefone
883 .moviefone.com/.*(banner|newbutton|(ad|poster).*?\.gif|mmail|bytb|h_(guy|showtick|aML)|m_|icon_|NF_.*?back|h_.*?gif|media/(art|imagelinks(/MF.(ad|sponsor))))
884 mqgraphics.mapquest.com/graphics/Advertisements/
887 .news.com/cgi-bin/acc_clickthru
889 .ngserve.pcworld.com/adgifs/
897 .promotions.yahoo.com
899 .qsound.com/tracker/tracker.exe
900 .resource-marketing.com/tb/
902 .rtl.de/homepage/wb/images/
903 .schnellsuche.de/images/*
904 .shout-ads.com/cgibin/shout.php3
905 .sjmercury.com/advert/
906 .smartclicks.com/.*/smart(img|banner|host|bar|site)
909 .static.wired.com/advertising/
911 .sysdoc.pair.com/cgi-sys/cgiwrap/sysdoc/sponsor\.gif
912 .t-online.de/home/040255162-001/*
915 .teleauskunft.de/commercial/
918 .tvguide.com/rbitmaps/
921 .ultra.multimania.com
925 .us.yimg.com/promotions/
929 .videoserver.kpix.com
930 .washingtonpost.com/wp-adv/
931 .webconnect.net/cgi-bin/webconnect.dll
933 .webserv.vnunet.com/ip_img/.*ban
934 .werbung.pro-sieben.de/cgi-bin
935 .whatis.com/cgi-bin/getimage.exe/
936 www..bigyellow.com/......mat.*
938 www.addme.com/link8\.gif
939 www.aftonbladet.se/annons
940 www.americanpassage.com/
941 www.angelfire.com/in/twistriot/images/wish4\.gif
942 www.bizlink.ru/cgi-bin/irads\.cgi
943 www.blacklightmedia.com/adlemur
944 www.bluesnews.com/flameq\.gif
945 www.bluesnews.com/images/ad[0-9]+\.gif
946 www.bluesnews.com/images/gcanim3\.gif
947 www.bluesnews.com/images/throbber2\.gif
948 www.bluesnews.com/miscimages/fragbutton\.gif
949 www.businessweek.com/sponsors/
950 www.canoe.ca/AdsCanoe/
951 www.cdnow.com/MN/client.banners
954 www.clicmoi.com/cgi-bin/pub\.exe
955 www.dailycal.org/graphics/adbanner-ab\.gif
956 www.detelefoongids.com/pic/[0-9]*
957 www.dhd.de/CGI/werbepic
958 www.dsf.de/cgi-bin/site_newiac.adpos
959 www.firsttarget.com/cgi-bin/klicklog.cgi
960 www.forbes.com/forbes/gifs/ads
961 www.forbes.com/tool/includes/gifs/
962 www.fxweb.holowww.com/.*\.cgi
963 www.geocities.com/TimesSquare/Zone/5267/
964 www.goto.com/images-promoters/
965 www.handelsblatt.de/hbad
966 www.hotlinks.de/cgi-bin/barimage\.cgi
967 www.infoseek.com/cimages
968 www.infoworld.com/pageone/gif
969 www.isys.net/customer/images
970 www.javaworld.com/javaworld/jw-ad
971 www.kron.com/place-ads/
972 www.leo.org/leoclick/
973 #www.linkexchange.ru/cgi-bin/erle\.cgi
974 www.linkstation.de/cgi-bin/zeige
975 www.linux.org/graphic/miniature/
976 www.linux.org/graphic/square/
977 www.linux.org/graphic/standard/
978 www.luncha.se/annonsering
980 www.ml.org/gfx/spon/icom/
981 www.ml.org/gfx/spon/wmv
982 www.musicblvd.com/mb2/graphics/netgravity/
984 www.news.com/Midas/Images/
985 www.newscientist.com/houseads
986 www.nextcard.com/affiliates/
987 www.nikkeibp.asiabiztech.com/image/NAIS4\.gif
988 www.nordlys.no/imaker/.*/.*/.*/.....\.gif # alvin brattli
989 www.nordlys.no/imaker/.*/.*/.*/..003 # alvin brattli
990 www.oanda.com/server/banner
992 www.oneandonlynetwork.com
993 www.page2page.de/cgi-bin/
994 www.prnet.de/.*/bannerschnippel/.*\.(gif|jpe?g)
995 www.promptsoftware.com/marketing/
996 #www.reklama.ru/cgi-bin/banners/
997 www.riddler.com/sponsors/
998 www.rle.ru/cgi-bin/erle\.cgi
999 www.rock.com/images/affiliates/search_black\.gif
1000 www.rtl.de/search/.*kunde
1001 #www.search.com/Banners
1002 www.sfgate.com/place-ads/
1003 www.shareware.com/midas/images/borders-btn\.gif
1004 #www.sjmercury.com/products/marcom/banners/
1005 www.smartclicks.com:81
1006 www.sol.dk/graphics/portalmenu
1007 www.sponsornetz.de/jump/show.exe
1009 www.sunworld.com/sunworldonline/icons/adinfo.sm\.gif
1010 www.swwwap.com/cgi-bin/
1011 www.taz.de/~taz/anz/
1012 www.telecom.at/icons/.*film\.(gif|jpe?g)
1013 www.theonion.com/bin/
1014 www.topsponsor.de/cgi-bin/show.exe
1016 www.ugu.com/images/EJ\.gif
1017 www.warzone.com/pics/banner/
1018 www.warzone.com/wzfb/ads.cgi
1020 www.websitepromote.com/partner/img/
1021 www.winjey.com/onlinewerbung/*\.gif
1022 www.wishing.com/webaudit
1023 www.www-pool.de/cgi-bin/banner-pool
1024 www2.blol.com/agrJRU\.gif
1026 .yahoo.com/CategoryID=0
1027 .yahoo.de/adv/images
1030 www.bannerland.de/click.exe
1035 www.slate.com/redirect/
1036 www.slate.com/articleimages/
1038 www.forbes.com/tool/images/frontend/
1041 .pathfinder.com/shopping/marketplace/images/
1044 static.wired.com/images
1045 .perso.estat.com/cgi-bin/perso/
1046 #dinoadserver1.roka.net
1047 .fooladclient*.fool.com
1048 .affiliate.aol.com/static/
1056 # www.sunday-times.co.uk
1057 www.sunday-times.co.uk/standing/newsint/ticker
1059 #NeXgo (ex Germany.Net)
1063 # Block as much of GeoCities as possible
1064 # All geocities-owned images
1065 www.geocities.com/images
1066 www.geocities.com/MemberBanners/live/
1067 pic.geocities.com/images
1068 # And the popup (it still pops up, but does not eat up precious bandwidth)
1069 #www.geocities.com/ad_container/pop.html # already fixed by other regexp
1071 # from corion@informatik.uni-frankfurt.de
1074 #ads.xmonitor.net/xadengine.cgi # fixed by above regexp
1075 # Also block the japanese geocities popups
1076 www.geocities.co.jp/images
1077 # Also block the come.to, surf.to etc. popups
1080 # Also block the xoom stuff.
1082 home.talkcity.com/homepopup.html.*
1084 # Max Maischein <max.maischein@econsult.de> again ...
1085 # Halflife.net uses WON banners
1086 # Banners from Freeserve
1087 #banner.freeservers.com/cgi-bin/fs_adbar # fixed by above regexp
1088 # And those nasty va-popups !
1089 /?.*/?va_banner.html
1090 # And an all-around hit against advert*.jpg
1091 /?.*/advert[0-9]+\.jpg
1092 # And yet another Internet Explorer gif ...
1094 # Some uninteresting buttons I think...
1095 .mircx.com/images/buttons/
1096 services.mircx.com/.*\.gif
1097 # Easyspace - yet another "free disk space" provider with <yuck> banner popups
1098 www.easyspace.com/(fpub)?banner.html
1099 www.easyspace.com/100\.gif
1100 # Some russian banner exchanges
1101 .banner.ricor.ru/cgi-bin/banner.pl
1102 #www.bizlink.ru/cgi-bin/irads.cgi # already fixed by other regexp
1103 stx9.sextracker.com/stx/send/
1104 # And even more of geocities :
1105 www.geocities.com/pictures/
1106 # Gaah - www.angelfire.com - another webspace provider with popups
1107 .angelfire.com/sys/download.html
1108 # Gamasutra.com uses this ad provider
1109 sally.songline.com/@
1111 # Eule.de (search engine)
1112 # maybe images.eule.de as a whole...
1113 www.eule.de/cgi-bin/
1114 images.eule.de/comdirect\.gif
1115 images.eule.de/wp\.gif
1116 .aladin.de/125_1\.gif
1117 images.eule.de/neu/books\.gif
1119 # --------------------------------------------------------------------------
1123 # --------------------------------------------------------------------------
1125 # some images on cnn's website just suck!
1128 /.*cnnpostopinionhome.\.gif
1129 /.*custom_feature\.gif
1130 /.*explore.anim.*gif
1132 /.*pathnet.warner\.gif
1133 /.*images/cnnfn_infoseek\.gif
1134 /.*images/pathfinder_btn2\.gif
1135 /.*img/gen/fosz_front_em_abc\.gif
1136 /.*img/promos/bnsearch\.gif
1137 /.*navbars/nav_partner_logos\.gif
1138 /BarnesandNoble/images/bn.recommend.box.*
1139 /digitaljam/images/digital_ban\.gif
1140 /hotstories/companies/images/companies_banner\.gif
1141 /markets/images/markets_banner\.gif
1142 /ows-img/bnoble\.gif
1143 /ows-img/nb_Infoseek\.gif
1144 .cnn.com/images/custom/totale\.gif
1145 .cnn.com/images/lotd/custom.wheels\.gif
1146 .cnn.com/images/.*/by/main.12\.gif
1147 .cnn.com/images/.*/find115\.gif
1148 .cnn.com/.*/free.email.120\.gif
1149 .cnnfn.com/images/left_banner\.gif
1151 www.cnn.com/images/.*/bn/books\.gif
1152 www.cnn.com/images/.*/pointcast\.gif
1153 www.cnn.com/images/.*/fusa\.gif
1154 .cnn.com/images/.*/start120\.gif
1155 images.cnn.com/SHOP/
1159 # the / indicates the beginning of the path (and no longer the FQDN)
1165 /gif/buttons/banner_
1166 /gif/buttons/cd_shop_
1167 /gif/cd_shop/cd_shop_ani_
1170 /av/gifs/av_map\.gif
1171 /av/gifs/av_logo\.gif
1172 /av/gifs/new/ns\.gif
1173 altavista.com/i/valsdc3\.gif
1174 jump.altavista.com/gn_sf
1177 tucows./images/locallogo\.gif
1182 # simpliemu.hypermart.net/frames.html
1183 .go2net.com/mgic/adpopup
1184 .go2net.com/metaspy/images/exposed\.gif
1185 .go2net.com/metaspy/images/ms_un\.gif
1188 www.cebu-usa.com/cwbanim1\.gif
1189 www.cebu-usa.com/Connection\.jpg
1190 www.cebu-usa.com/phonead\.gif
1191 www.cebu-usa.com/ban3\.jpg
1192 www.cebu-usa.com/tlban\.gif
1193 www.cebu-usa.com/apwlogo1\.gif
1194 www.cebu-usa.com/rose\.gif
1197 www.fnet.de/img/geldboerselogo\.jpg
1199 # hirsch@mathcs.emory.edu
1200 /images/getareal2\.gif
1202 www.assalom.com/aziza/logos/cniaffil\.gif
1203 www.assalom.com/aziza/logos/4starrl1\.gif
1204 www.phantomstar.com/images/media/m1\.gif
1207 .wahlstreet.de/MediaW\$/tsponline\.gif
1208 .wahlstreet.de/MediaW\$/dzii156x60\.gif
1209 .wahlstreet.de/MediaW\$/etban156x60_2_opt2\.gif
1213 /pics/getareal1\.gif
1215 /ltbs/cgi-bin/click.cgi
1216 .linuxtoday.com/ltbs/pics/
1220 /include/watermark/v2/
1222 # Reinier Bikker <R.P.Bikker@phys.uu.nl>
1225 # Mark Lutz <luma@nikocity.de>
1226 /.*/*werb.*\.(gif|jpe?g) # hope that's not to restrictive
1228 #Free Yellow thing at bottom of pages (HereticPC)
1229 www.freeyellow.com/images/powerlink5a\.gif
1230 www.freeyellow.com/images/powerlink5b\.gif
1231 www.freeyellow.com/images/powerlink5c\.gif
1232 www.freeyellow.com/images/powerlink5d\.gif
1233 www.freeyellow.com/images/powerlink5e\.gif
1236 www.eads.com/images/refbutton\.gif
1237 www.fortunecity.com/console2/newnav/*
1238 www.goldetc.net/search\.gif
1239 www.cris.com/~Lzrdking/carpix/cars3-le\.gif
1240 www.justfreestuff.com/scott\.gif
1241 www.cyberthrill.com/entrance\.gif
1242 secure.pec.net/images/pec69ani\.gif
1243 www.new-direction.com/avviva\.gif
1244 /.*internetmarketingcenter\.gif
1245 www.new-direction.com/wp-linkexchange-loop\.gif
1246 www.new-direction.com/windough\.gif
1247 www.digitalwork.com/universal_images/affiliate/dw_le_3\.gif
1248 service.bfast.com/bfast/click/*
1249 www.new-direction.com/magiclearning\.gif
1250 www.new-direction.com/mailloop\.gif
1252 www.free-banners.com/images/hitslogo\.gif
1253 rob.simplenet.com/dyndns/fortune5\.gif
1254 .nasdaq-amex.com/images/bn_ticker\.gif
1257 # navilor@hotmail.com
1260 # wayne@staff.msen.com
1262 a*.*.*.yimg.com/([0-9]*|\/)*us.yimg.com/*
1265 www.realtop50.com/cgi-bin/ad
1269 www.yacht.de/images/(my_ani|eissingani|chartertrans|fum|schnupper|fysshop|garmin)\.gif
1270 www.sponsorweb.de/web-sponsor/nt-bin/show.exe
1273 # Club-internet pops up a complain if you refuse cookie (still pops up...)
1274 perso.club-internet.fr/html/Popup/popup_frame_nocookie.html
1275 perso.club-internet.fr/pagesperso/popup_nocookie.html
1277 .gmx.net/images/newsbanner/
1280 .quicken.lexware.de/images/us7-468x60.gif
1281 /img/special/chatpromo\.gif
1282 www.travelocity.com/images/promos/
1284 # wonder that that does...
1287 #/*.*/phpAds/viewbanner.php
1288 #/*.*/phpAds/phpads.php
1290 www.linux-magazin.de/banner
1291 .comtrack.comclick.com
1293 .iac-online.de/filler
1295 .media.interadnet.com
1296 .stat.www.fi/cgi-bin
1300 .disneystoreaffiliates.com
1302 .powerwork.mobile.de/cgi-bin/getimage\.cgi
1306 ####################################################
1309 # The Register ads - oh, and all images in Register stories (sigh).
1310 www.theregister.co.uk/media/
1312 # Used on http://www.theregister.co.uk/
1313 # Sample advert URL:
1314 # http://secure.webconnect.net/cgi-bin/webconnecthome.dll?F467
1318 www.dilbert.com/comics/dilbert/images/.*_140x800.*\.gif
1321 # Uses URL: http://www.stattrack.com/cgi-bin/stats/image.cgi
1323 # And loads JavaScript from http://www.stattrack.com/stats/code
1324 www.stattrack.com/stats/
1326 #Now they're Yahoo GeoCities, their junk is in a different place.
1327 ##geo.yahoo.com/serv
1328 ##visit.geocities.com/visit.gif
1329 .yimg.com/?.*/www.geocities.com/js_source
1330 #http://us.toto.geo.yahoo.com/toto?s=76001086
1332 .visit.geocities.com
1333 .yimg.com/?.*/www.geocities.com/
1335 #http://counter16.bravenet.com/counter.php
1338 #http://stat.cybermonitor.com/7emezone_p?1707_USdvd
1341 #http://members.tripod.com/adm/popup/.....
1342 members.tripod.com/adm/popup/
1344 #This is the worst ad idea ever!
1345 #count.exitexchange.com/exit/1100661
1346 #count.exitexchange.com/clients/navbar.html
1347 #(used in http://skyhivisuals.tripod.com/malfunctions_.htm)
1353 #This site traps the browser
1356 #privacy.net runs ads
1359 #Lindsay.Marshall@newcastle.ac.uk suggested these, to kill Opera adverts:
1364 dinoadserver*.roka.net
1366 logout.tvspielfilm.de
1368 www.freenet.de/customerindex\.html
1370 .fxweb.com/v2-trackrun\.cgi
1371 rtldating.peopleunited.de
1373 www.zdnet.com/fcgi-bin/
1374 service.bfast.com/bfast/serve
1376 fourohfour.nbci.com/Members404Error.php3
1379 www.fair-ist-mehr.de/cgi-bin/bt.pl
1389 #############################################################################
1391 #############################################################################
1394 www.userfriendly.org/images/banners/banner_dp_heart\.gif
1397 #Why were these in the Waldherr blockfile?
1399 #a*.*.*.yimg.com/([0-9]|\/)*us.yimg.com/i/*
1401 # some regexps are simply too aggressive ...
1403 # equalizer to /*.*(.*[-_.])?ads?[0-9]?(/|[-_.].*|.(gif|jpe?g))
1414 .ad.siemens.de # SIEMENS Automation & Drives
1415 #add-url.altavista.com
1422 # univ. don't advertise, do they :-)
1424 .ac.uk # English Universities too! - Jon
1425 .uni-*.de # What about Germany? --oes
1426 www.ugu.com/sui/ugu/adv
1430 clubs.yahoo.com/clubs
1431 edit.my.yahoo.com/config/show_identity
1432 www.ix.de/newsticker/data/ad
1433 www.heise.de/newsticker/data/ad
1434 www.careernet.de/anzeige
1435 www.careernet.de/bewerber/stellenanzeigen
1436 www.baumgartner.de/stellenmarkt/anzeigen
1437 www.dspartner.de/Anzeigen
1438 www.aws-jobs.de/Anzeigen
1439 www.jobware.de/.*/anzeigen/
1440 www.jobworld.de/bilder/
1441 www.cnn.com/TECH/computing/.*/internet.ads/
1442 www.financial.de/shop/
1446 194.221.152.2/phptelefontmp
1447 .harvard.edu/images/banner/
1450 www.dhd.de/CGI/anzeigen/
1453 .img.web.de/web/img/
1455 www.segel.de/menu/bilder/anzeigen\.gif
1456 www.corel.com/graphics/banners/
1457 www.software.ibm.com/ad/
1458 www.omg.org/docs/ad/
1460 .sperrmuell.de/scripts/anzeigen
1461 www.freenet.de/index.html
1462 www.01019freenet.de/index.html
1463 www.freenet.de/freenet/
1464 www.01019freenet.de/freenet/
1465 webfactory.de/anzeigen.php
1467 www.internatif.org/bortzmeyer/debian/sponsor/
1470 www.software.hosting.ibm.com/ad/
1471 www.ibm.com/software/ad/
1474 www.debian.org/Pics/banner-blue\.gif
1475 www.linux.de/pics/Nachrichten_banner\.gif
1478 finder.shopping.yahoo.com/shop/
1488 .consumer-direct.com
1493 # my banking stuff => no ads.
1499 # Jon's addition: MSDN
1504 .freemail*.web.de/online/ordner/anzeigen
1505 foggy.sda.t-online.de
1506 .us.i1.yimg.com/us.yimg.com/i/pim/ad2.gif
1507 www.nexgo.de/.*/bg_banner.jpg
1509 # .*ads. matches prdownloads.sourceforge.net and many other download sites