From 4932ae61ae408803210af44070ab7b0cd37d8014 Mon Sep 17 00:00:00 2001 From: hal9 Date: Wed, 24 Apr 2002 02:39:31 +0000 Subject: [PATCH] Add 'Chain of Events' section. --- doc/source/user-manual.sgml | 119 ++++++++++++++++++++++++++++++++---- 1 file changed, 108 insertions(+), 11 deletions(-) diff --git a/doc/source/user-manual.sgml b/doc/source/user-manual.sgml index 14c8d286..189aae04 100644 --- a/doc/source/user-manual.sgml +++ b/doc/source/user-manual.sgml @@ -25,7 +25,7 @@ This file belongs into ijbswa.sourceforge.net:/home/groups/i/ij/ijbswa/htdocs/ - $Id: user-manual.sgml,v 1.89 2002/04/23 21:05:28 oes Exp $ + $Id: user-manual.sgml,v 1.90 2002/04/23 21:41:25 hal9 Exp $ Written by and Copyright (C) 2001 the SourceForge Privoxy team. http://www.privoxy.org/ @@ -46,7 +46,7 @@ Privoxy User Manual -$Id: user-manual.sgml,v 1.89 2002/04/23 21:05:28 oes Exp $ +$Id: user-manual.sgml,v 1.90 2002/04/23 21:41:25 hal9 Exp $ @@ -2867,6 +2867,12 @@ Please choose from the following options: and +image, then it can be handled by +image-blocker (see below). + + The +filter action can also perform some of the + same functionality as +block, but by virtue of very + different programming techniques, and is typically used for different + reasons. + @@ -3084,7 +3090,7 @@ Please choose from the following options: Apply page filtering as defined by named sections of the default.filter file to the specified site(s). Filtering can be any modification of the raw - page content, including re-writing or deletion. + page content, including re-writing or deletion of content. @@ -3172,6 +3178,7 @@ Please choose from the following options: This is potentially a very powerful feature! And requires a knowledge of regular expressions if you want to roll your own. + Filtering operates on a line by line basis. Filtering requires buffering the page content, which may appear to @@ -3180,6 +3187,12 @@ Please choose from the following options: since the page is not incrementally displayed.) 
This effect will be more noticeable on slower connections. + + Filtering can achieve some of the same effects as the +block + action, i.e. it can be used to block ads and banners. In the overall + scheme of things, filtering is one of the last things Privoxy + does with a web page. So other actions are applied first. + @@ -4927,6 +4940,85 @@ Requests + + +Chain of Events + + Let's take a quick look at the basic sequence of events when a web page is + requested by your browser and Privoxy is on duty: + + + + + + + First, the web browser requests a page, and this request is intercepted by + Privoxy immediately. + + + + + Privoxy traps any request for internal CGI + pages (e.g. http://p.p/) and relays these back to the browser. + + + + + If the URL matches a +block pattern, then it is blocked + and the banner is displayed. + + + + + Untrusted URLs are blocked. If URLs are being added to the + trust file, then that is done. + + + + + +fast-redirect is processed, stripping unwanted parts + of the requested web page URL. + + + + + At this point, Privoxy relays the request to the + web server, and requests the page (assuming nothing up to this point has + prevented us from getting this far). + + + + + The first few hundred bytes are read from the web server and + +kill-popups is processed, if enabled. + + + + + If +filter applies, the rest of the page is read into + memory and then the filters are processed. Filters are applied in the order they + are specified in the default.filter file. The entire + page, which is now filtered, is then sent by + Privoxy to your browser. + + + + + As the browser receives the filtered page content, it will read and request any + embedded URLs on the page, e.g. an ad image. As the browser requests these + secondary URLs from whatever server they may be on, + Privoxy handles these the same as above, and the process + is repeated for each such URL. Note that a fancy web page may have many, many + such URLs for graphics, frames, etc. 
+ + + + + + + + + Anatomy of an Action @@ -4961,14 +5053,16 @@ Requests First, enter one URL (or partial URL) at the prompt, and then Privoxy will tell us how the current configuration will handle it. This will not - help with filtering effects from the default.filter file! It - also will not tell you about any other URLs that may be embedded within the - URL you are testing (i.e. a web page). For instance, images such as ads are expressed as URLs - within the raw page source of HTML pages. So you will only get info for the - actual URL that is pasted into the prompt area -- not any sub-URLs. If you - want to know about embedded URLs like ads, you will have to dig those out of - the HTML source. Use your browser's View Page Source option - for this. Or right click on the ad, and grab the URL. + help with filtering effects (i.e. the +filter action) from the + default.filter file since this is handled very differently + and is not so easy to trap! It also will not tell you about any other URLs that + may be embedded within the URL you are testing (i.e. a web page). For + instance, images such as ads are expressed as URLs within the raw page source + of HTML pages. So you will only get info for the actual URL that is pasted + into the prompt area -- not any sub-URLs. If you want to know about embedded + URLs like ads, you will have to dig those out of the HTML source. Use your + browser's View Page Source option for this. Or right-click on + the ad, and grab the URL. @@ -5238,6 +5332,9 @@ Requests Temple Place - Suite 330, Boston, MA 02111-1307, USA. $Log: user-manual.sgml,v $ + Revision 1.90 2002/04/23 21:41:25 hal9 + Linuxconf is deprecated on RH, substitute chkconfig. + Revision 1.89 2002/04/23 21:05:28 oes Added hint for startup on Red Hat -- 2.39.2
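Reviewer note on the 'Chain of Events' section added by this patch: the ordering it describes can be modelled roughly as below. This is only a sketch for reasoning about the order of steps, not Privoxy's actual code. The names Config, handle_request and fetch are hypothetical, block patterns are reduced to plain substring matches, filters are reduced to (old, new) string pairs, and the +fast-redirect and +kill-popups steps are left out for brevity.

```python
# Illustrative model only; none of these names are Privoxy internals.
from dataclasses import dataclass, field


@dataclass
class Config:
    # Substrings standing in for +block URL patterns (assumption).
    block_patterns: list = field(default_factory=list)
    # Hosts standing in for "not in the trust file" (assumption).
    untrusted_hosts: list = field(default_factory=list)
    # (old, new) pairs standing in for default.filter sections,
    # kept in the order they appear in the file.
    filters: list = field(default_factory=list)


def handle_request(url, config, fetch):
    """Return (status, body), following the order given in the section."""
    # Internal CGI pages are answered by the proxy itself.
    if url.startswith("http://p.p/"):
        return ("cgi", "internal page")
    # +block is checked before the web server is ever contacted.
    if any(p in url for p in config.block_patterns):
        return ("blocked", "banner")
    # Untrusted URLs are blocked next.
    if any(h in url for h in config.untrusted_hosts):
        return ("untrusted", "banner")
    # Only now is the request relayed to the web server.
    body = fetch(url)
    # Filtering comes last: the whole page is buffered, then each
    # filter is applied in file order.
    for old, new in config.filters:
        body = body.replace(old, new)
    return ("ok", body)
```

Calling handle_request once per embedded URL mirrors the last step of the list: every image or frame the browser requests goes through the same sequence again.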