Sophie

Sophie

distrib > Mandriva > 2008.1 > x86_64 > media > main-release > by-pkgid > 022426f6d0e674ef20930b8eef162e76 > files > 47

amavisd-new-2.5.4-1mdv2008.1.noarch.rpm

<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="#internalStyle" type="text/css"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
    "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
  <title>amavisd-new documentation bits and pieces</title>
  <meta http-equiv="Content-Type" content="text/html;charset=UTF-8" />
  <meta name="AUTHOR" content="Mark Martinec" />
  <link rev="made" href="mailto:mark.martinec@ijs.si" />
<style type="text/css" id="internalStyle">
  body { background: white; color: black }
  kbd { font-family: monospace }
  img.noboarder { color: white; border: none }
</style>
<!-- [link rel="STYLESHEET" href="./a.css" type="text/css"] -->
</head>

<body>
<h1><em><a href="http://www.ijs.si/software/amavisd/">amavisd-new</a></em>
documentation bits and pieces</h1>

<p>The most recent version of this document is available at
<a href="http://www.ijs.si/software/amavisd/amavisd-new-docs.html">
http://www.ijs.si/software/amavisd/amavisd-new-docs.html</a></p>

<ul>
<li><a href="#checks">performing mail checks</a></li>
<li><a href="#actions">acting on mail checks results</a></li>
<li><a href="#tagkill">tag, tag2 and kill levels</a></li>
<li><a href="#quarantine">quarantine</a></li>
<li><a href="#quar-release">releasing from a quarantine</a></li>
<li><a href="#addrext">redirecting malware to a different
mailbox -- plus addressing</a></li>
<li><a href="#wblist">hard black- and whitelisting senders regarding spam</a></li>
<li><a href="#score_sender">soft black- and whitelisting
senders regarding spam -- @score_sender_maps</a></li>
<li><a href="#confvars">configuration variables</a></li>
<li><a href="#pbanks">policy banks</a></li>
<li><a href="#pbanks-ex">putting policy banks to good use -- examples</a></li>
<li><a href="#max_requests">$max_requests</a></li>
<li><a href="#dkim">setting up DKIM (and DomainKeys) mail signing
and verification</a></li>
</ul>


<h2><a name="checks">Performing mail checks</a></h2>

<p>The following checks on mail are available</p>

<ul>
<li>mail header validity checks</li>
<li>banned names and types checks</li>
<li>virus checks</li>
<li>spam checks</li>
<li>is sender white- or blacklisted (regarding spam)</li>
</ul>

<p>Although checks are presently not performed in parallel, it is
best to consider the order of their evaluation unspecified (unknown).
Besides possible future parallel implementation, another reason is
the caching of results, where subsequent mail with the same contents
may benefit from earlier checks if validity of these check results
has not yet expired -- so a check result may be instantly available,
regardless of whether it has been asked for or not.</p>

<p>Using configuration variables @bypass_virus_checks_maps,
@bypass_banned_checks_maps, @bypass_header_checks_maps and
@bypass_spam_checks_maps each recipient (or administrator on their
behalf) may suggest that certain tests are not needed, primarily
for performance reasons. Although the @bypass_*_checks_maps pertain
to individual recipients, a mail check is an operation done on the
whole message, regardless of the number of recipients and their individual
preferences. Suggestion by some of the recipients that certain check is
not needed (is to be bypassed) does not guarantee the test will not be
performed.</p>

<p>Similarly the (hard) blacklisting or whitelisting of sender address
may make running spam check unnecessary, but it does not guarantee
the spam check result will not be available for subsequent decisions.</p>

<p>There are two primary reasons why a check result may still be available
despite the bypass hint or a sender being black- or whitelisted:</p>

<ul>
<li>a check result from some previous mail with the same contents
  has been cached and is still valid;</li>
<li>when mail has multiple recipients and not all of them agree
  that a check should be bypassed.</li>
</ul>

<p>The amavisd-new program is allowed to skip some check for performance
reasons if all recipients agree that a check is not necessary (that it may
be bypassed), or if the outcome of a check to be skipped could not influence
further mail processing and delivery/non-delivery of the message (as is
the case of a sender being black- or whitelisted regarding spam check).</p>

<p>For example spam checks may be skipped if it is already known that
a mail is infected. This is an implementation and optimization issue,
and no guarantee is given about interdependency of checks.
Future version may use a different strategy of performing checks
(e.g. some checks may be performed in parallel), as long as a change
does not affect the final outcome.</p>


<h2><a name="actions">Acting on mail checks results</a></h2>

<p>Based on the outcome of mail checks performed during mail analysis
or cached from previous mail with the same contents, and based on
global settings and individual recipient preferences, the program now
decides what action to perform next. As described in the previous section,
not all results of checks are necessarily known (e.g. if all recipients
voted for some check to be bypassed). For the purpose of deciding further
actions, unknown results of a check are considered equivalent to
negative (false) results, i.e. skipped virus check is treated the same
as non-infected mail, bypassed spam check is equivalent to low spam score
(ham).</p>

<p>The following decisions are made at this stage:</p>
<ul>
<li>whether a mail should be quarantined and how;</li>
<li>whether an administrator (and which administrator)
  should receive a notification (and which notification);</li>
<li>whether recipients should receive a notification;</li>
</ul>

<p>and regarding mail delivery and/or sender (non)delivery notifications:</p>
<ul>
<li>whether a mail should be delivered to each recipient or not;</li>
<li>whether delivered mail should be modified (header edits, defanging);</li>
<li>whether a sender should receive a (non)delivery notification (bounce);</li>
<li>what should be the final status code returned to the mailer
  (reject/pass).</li>
</ul>

<p>For the purpose of deciding on these actions, a mail is classified
based on all available checks results. It is quite possible that more
than one check results would be positive (e.g. virus and banned and
bad header, or spam and bad header, or virus and spam), yet a mail
is considered to be only in one category. The logic is currently
hard-wired into the program and can not be influenced by configuration
variables. The following order is used, the first condition met
decides the outcome:</p>

<ol>
<li>a virus is detected: mail is considered infected;</li>
<li>contains banned name or type: mail is considered banned;</li>
<li>spam level is above kill level for at least one recipient,
  or a sender is blacklisted: mail is considered spam;</li>
<li>bad (invalid) headers: mail is considered as having a bad header.</li>
</ol>

<p>This decision order explains why amavisd-new is not free to skip
(to optimize away) virus checks if a presence of a banned name or a bad
header is already known or can easily be determined. The order was chosen
with the intention that a more informative or a stronger assertion is the
one to base further mail delivery on, and to be quoted in notifications
and in the log. Even at the expense of possibly longer processing time,
it is more important to declare a mail infected than complain about
a bad header, a banned executable or spamy contents.</p>

<p>The determined mail category now governs further action.
Administrators are notified if enabled for the category,
mail is quarantined if quarantining if enabled for the category,
recipients are notified if enabled for the category.</p>

<p>Next a mail delivery is attempted. A decision to deliver depends
on mail category and on global and individual recipient preferences.
The global setting $final_*_destiny=D_PASS or a per-recipient setting
@*_lovers_maps ensure mail delivery for corresponding mail category
even if mail would otherwise be blocked for being infected or banned
or spam or having a bad header.</p>

<p>A mail that is decided to be passed to an individual recipient
undergoes some simple header editing which happens on-the-fly during
mail forwarding. Certain mail header fields may be inserted or removed,
or an existing header field (e.g. Subject) may be modified. This header
editing may be different for each recipient even in multi-recipient
messages. If necessary, a multi-recipient mail is split into more than
one forwarding transaction, grouping (clustering) recipients with same
settings into one SMTP transaction.</p>

<p>Based on decisions to forward or to block mail to each recipient,
and on the global setting for the mail category ($final_*_destiny=D_BOUNCE
or D_REJECT), the sender (non)delivery notification is now prepared
in case of D_BOUNCE, and MTA receives a 2xx status (success); or in
case of D_REJECT the MTA receives a 5xx (reject) status and preparing
sender notifications is thus delegated to MTA (not recommended in
post-queue or dual-MTA content filtering setup).</p>

<p>Even in cases of mail non-delivery when a (non-)delivery status
notification (DSN) for the sender should have been prepared and sent,
there are certain exceptions where the DSN is suppressed, which makes mail
effectively lost as far as the sender and the recipient are concerned
(but quarantining is not affected):</p>

<ul>
<li>when $final_*_destiny=D_DISCARD;</li>
<li>when mail is infected and the detected virus name matches the
  @viruses_that_fake_sender_maps (unconditionally true by default);</li>
<li>when spam score exceeds level determined by @spam_dsn_cutoff_level_maps
  for all recipients;</li>
<li>when mail is coming from a mailing list, as determined by
  examining a mail header <i>Precedence:</i> for containing
  string 'bulk' or 'list' or 'junk';</li>
</ul>


<h2><a name="tagkill">tag, tag2 and kill levels</a></h2>

<p>When SpamAssassin is called upon to analyze a mail message, it returns
a spam score (spam level, hits), which is a numeric representation of
spaminess. The higher the number, the more spamy the message is considered.
Small numbers near zero or negative indicate a clean message, colloquially
called ham. Spam score is a characteristic of the whole message,
and does not depend on recipient preferences. SpamAssassin is called
only once for each message regardless of the number of recipients.</p>

<p>To determine further course of action, amavisd-new compares the spam score
to three numeric values: tag level, tag2 level and kill level. These values
may be different for each recipient, and further actions may be different
for each recipient. If necessary, the mail forwarding is split into more
than one transaction to cater for different recipient preferences.</p>

<dl>
<dt>tag level</dt>
<dd>if spam score is at or above tag level, spam-related header fields
  (X-Spam-Status, X-Spam-Level) are inserted for local recipients;
  undefined (unknown) spam score is interpreted as lower than any
  spam score;</dd>
<dt>tag2 level</dt>
<dd>if spam score is at or above tag2 level, spam-related header fields
  (X-Spam-Status, X-Spam-Level, X-Spam-Flag and X-Spam-Report)
  are inserted for local recipients, and X-Spam-Flag and X-Spam-Status
  bear a YES; also recipient address extension (if enabled) is tacked onto
  recipient address for local recipients; for these actions to have any
  effect, mail must be allowed to be delivered to a recipient;</dd>
<dt>kill level</dt>
<dd>if spam score is at or above kill level, mail is blocked; and
  sender receives a nondelivery notification unless spam score exceeds
  dsn cutoff level.</dd>
</dl>

<p>The general idea is that kill level is what controls the main actions
as far as MTA and amavisd-new is concerned (regardless of what recipients'
MUA later does with the mail).</p>

<p>Reaching kill level for at least one recipient controls the following:</p>

<ul>
<li>mail gets quarantined (unless disabled)</li>
<li>spam administrator gets a notification (unless disabled)</li>
<li>ContentSpamMsgs counter is incremented</li>
<li>sender gets a notification if warnspamsender
  is true and $final_spam_destiny is D_PASS</li>
<li>if message is not delivered, sender gets a nondelivery
  notification (suppressed under certain conditions)</li>
<li>the main log entry says: Passed/Blocked SPAM.</li>
</ul>

<p>On the other hand the tag2 level just adds some mark to the passed
mail (only for local recipients), which recipient or his MUA may decide
to act on or not. Specifically:</p>

<ul>
<li>Subject header field is modified (unless disabled)</li>
<li>X-Spam-Flag and X-Spam-Status header field get a Yes</li>
<li>address extension for spam gets tacked on the recipient address</li>
<li>spam defanging is done (unless disabled)</li>
<li>the main log entry says: Passed/Blocked SPAMMY.</li>
</ul>

<p>For mail below kill level, if a recipient (or his MUA) decides
to discard a message based on tag2 marking, there is no way to
retrieve it later from a quarantine, the sender is never notified,
spam administrator is never notified. As far as the MTA and amavisd-new
are concerned, the message was successfully delivered. Whatever MUA
does with the mail is entirely the responsibility and jurisdiction
of the recipient and his LDA and MUA.</p>


<h2><a name="quarantine">Quarantine</a></h2>

<p>Mail quarantining is attempted when enabled for a given contents category,
which usually includes infected or banned or spam mail with score for
at least one of its recipients at or above his kill level. It is also
possible to enable quarantining of clean messages for archiving or
troubleshooting purposes. The <i>*quarantine_to</i> for each recipient
(when nonempty), along with a global corresponding <i>*_quarantine_method</i>,
determines where the quarantine location should be.</p>

<p>The <i>*_quarantine_method</i> can be considered a static and a site-wide
setting, generally controlling a format and location of the quarantine
on the system. The <i>*quarantine_to</i> can be considered a dynamic
part of the quarantine location, possibly affected by per-recipient settings
and the class of malware (contents category). It serves to fully specify
the final location, e.g. a file or a mailbox.</p>

<p>Depending on mail contents category (type of malware), the following
variables specify the quarantine method: <tt>$virus_quarantine_method</tt>,
<tt>$spam_quarantine_method</tt>, <tt>$banned_files_quarantine_method</tt>,
and <tt>$bad_header_quarantine_method</tt>. One way to globally disable
quarantine is to specify undef or an empty string as a value of these
variables. A nonempty string should follow a syntax:</p>

<ul>
<li><tt>local:</tt><i>filename-template</i></li>
<li><tt>bsmtp:</tt><i>filename-template</i></li>
<li><tt>smtp:[</tt><i>IPv4-or-IPv6-address-or-hostname</i><tt>]:</tt><i>port</i></li>
<li><tt>smtp:</tt><i>hostname</i><tt>:</tt><i>port</i></li>
<li><tt>smtp:</tt><i>/path/to/a/unix/socket</i></li>
<li><tt>lmtp:[</tt><i>IPv4-or-IPv6-address-or-hostname</i><tt>]:</tt><i>port</i></li>
<li><tt>lmtp:</tt><i>hostname</i><tt>:</tt><i>port</i></li>
<li><tt>lmtp:</tt><i>/path/to/a/unix/socket</i></li>
<li><tt>pipe:</tt><i>argv=command args...</i></li>
<li><tt>sql:</tt><i>anything</i></li>

</ul>

<p>The <tt>local:</tt>, <tt>bsmtp:</tt> and <tt>sql:</tt> methods are the
usual methods for quarantining. The <tt>smtp:</tt> or <tt>lmtp:</tt> methods
are only useful for quarantining if quarantine location is some dedicated
mailbox instead of a local file or directory. The <tt>smtp:</tt>, <tt>lmtp:</tt>
and <tt>pipe:</tt> methods are more often used for forwarding and notifications.
The following features became available with version 2.5.0: the <tt>lmtp:</tt>
method, support for IPv6, and specifying a Unix socket to a <tt>smtp:</tt> or
<tt>lmtp:</tt> method.</p>

<p>Depending on the method specified (local/bsmtp/smtp/sql) a per-recipient
setting <i>*quarantine_to</i> adopts different semantics and syntax,
possibly modified by the configuration variable <tt>$QUARANTINEDIR</tt>.</p>

<table border="1">
<tr>
  <th>method</th>
  <th>quarantine_to</th>
  <th><tt>$QUARANTINEDIR</tt></th>
  <th>effect</th></tr>
<tr>
  <td>anything</td>
  <td>empty or undef</td>
  <td>anything</td>
  <td>not quarantined</td></tr>
<tr>
  <td>empty or undef</td>
  <td>anything</td>
  <td>anything</td>
  <td>not quarantined</td></tr>
<tr>
  <td><tt>local:</tt></td>
  <td>pseudo-alias mapped through %local_delivery_aliases</td>
  <td>directory</td>
  <td>stored as an individual file
      below the directory <tt>$QUARANTINEDIR</tt>, file name comes
      from the template specified in the <i>*_quarantine_method</i>
  </td></tr>
<tr>
  <td><tt>local:</tt></td>
  <td>pseudo-alias mapped through %local_delivery_aliases</td>
  <td>filename of a mailbox</td>
  <td>appended to a file <tt>$QUARANTINEDIR</tt> in mbox format</td></tr>
<tr>
  <td><tt>local:</tt></td>
  <td>pseudo-alias mapped through %local_delivery_aliases</td>
  <td>empty or undef</td>
  <td>not quarantined</td></tr>
<tr>
  <td><tt>local:</tt></td>
  <td>e-mail address containing '@'-sign</td>
  <td>anything</td>
  <td>sent via SMTP to a mailer for storage,
      uses $notify_method to specify how to deliver to MTA;
      much like a newer '<tt>smtp:</tt>' entry below</td></tr>
<tr>
  <td><tt>smtp:</tt></td>
  <td>e-mail address</td>
  <td>anything</td>
  <td>sent via SMTP to a mailer for storage,
      uses the specified IP address and port,
      or a Unix socket for delivery; formerly
      a '<tt>local:</tt>' method was used for
      this purpose</td></tr>
<tr>
  <td><tt>lmtp:</tt></td>
  <td>e-mail address</td>
  <td>anything</td>
  <td>sent via LMTP to a mailer for storage,
      uses the specified IP address and port,
      or a Unix socket for delivery</td></tr>
<tr>
  <td><tt>bsmtp:</tt></td>
  <td>anything (nonempty)</td>
  <td>anything</td>
  <td>stored in a file specified in the <i>*_quarantine_method</i>
   in BSMTP format
   (if file name is absolute, i.e. starts with a "/")</td></tr>
<tr>
  <td><tt>bsmtp:</tt></td>
  <td>anything (nonempty)</td>
  <td>directory</td>
  <td>stored in a file specified in the <i>*_quarantine_method</i>
   in BSMTP format
   (file name relative to <tt>$QUARANTINEDIR</tt>)</td></tr>
<tr>
  <td><tt>sql:</tt></td>
  <td>anything (nonempty)</td>
  <td>anything</td>
  <td>stored into SQL database specified by @storage_sql_dsn</td></tr>
<tr>
  <td><tt>sql:</tt></td>
  <td>empty or undef</td>
  <td>anything</td>
  <td>not quarantined</td></tr>
</table>

<p>The <i>*quarantine_to</i> is currently quite limited in functionality,
it is often used only to turn off the quarantining for some user or local
subdomain. The reason for this limited functionality is a more vulnerable
nature of this value, as it may come from SQL or LDAP lookups where
non-careful access controls to these databases might permit users to enter
any value in the <i>*quarantine_to</i> field, which is why we do not let
it control the directory or the exact file name of the quarantine file.
This may be somewhat relaxed in the future.</p>

<p>In common setups the quarantine location (e.g. a directory or a dedicated
mailbox) is the same for all recipients. If at least one recipient specifies
a nonempty <i>*quarantine_to</i> specifying this location, the message is
quarantined (stored) there once, regardless of the number of recipients.</p>

<p>The general algorithm is: the <i>*quarantine_to</i> value associated with
each recipient is looked up. Empty or undef values are ignored and duplicates
are discarded. A mail to be quarantined is then stored/sent to each
unique location remaining on the list.</p>

<p>The "bsmtp:" quarantine method is somewhat special in that the quarantine
file location is entirely determined by the <i>*_quarantine_method</i> setting,
and the value of per-recipient <i>*quarantine_to</i> settings do not influence
the quarantine location, as long as this value is nonempty.</p>

<p>When using the "bsmtp:" quarantine method and versions of amavisd-new
earlier than 2.2.0, the <i>*_quarantine_to</i> was completely ignored,
which made it impossible to turn off quarantining selectively for certain
users by specifying an empty or undef value. Since 2.2.0, an empty
<i>*_quarantine_to</i> turns off quarantine for a recipient regardless
of the quarantine method. A nonempty string in <i>*_quarantine_to</i>
(the exact value is ignored) must now be used even with "bsmtp:" to
enable quarantining.</p>


<h2><a name="quar-release">Releasing from a quarantine</a></h2>

<p>The utility <i>amavisd-release</i> tells the amavisd daemon to fetch
a mail from a local quarantine, and send it to MTA through its regular
channels ($notify_method), bypassing re-checking.</p>

<p>By default it connects to socket /var/amavis/amavisd.sock, on which
amavisd should be listening for AM.PDP protocol, but one can use inet socket
instead of a Unix socket if there is a need to run <i>amavisd-release</i>
from a remote host.</p>

<p>In the amavisd.conf the following should be added:</p>

<pre>
$unix_socketname = "$MYHOME/amavisd.sock";  # listen on Unix socket

# alternatively (less common):
# $inet_socket_port = [10024, 9998];  # listen on listed inet tcp ports

# apply policy bank AM.PDP-SOCK on a Unix socket:
#  (note that this precludes the use of old amavis-milter
#   helper program (with sendmail) on the same socket)
$interface_policy{'SOCK'} = 'AM.PDP-SOCK';

# apply policy bank AM.PDP-INET to some inet tcp socket, e.g. tcp port 9998:
$interface_policy{'9998'} = 'AM.PDP-INET';

$policy_bank{'AM.PDP-SOCK'} = {
  protocol => 'AM.PDP',  # select Amavis policy delegation protocol
  auth_required_release => 0,  # don't require secret_id for amavisd-release
};
$policy_bank{'AM.PDP-INET'} = {
  protocol => 'AM.PDP',  # select Amavis policy delegation protocol
  inet_acl => [qw( 127.0.0.1 [::1] )],  # restrict access to these IP addresses
# auth_required_release => 0,  # don't require secret_id for amavisd-release
};
</pre>

<p>Setting of $auth_required_release decides whether the requestor needs
to specify secret_id in addition to mail_id to authorize a mail release.
The secret_id is stored in SQL table msgs when logging to SQL is enabled,
otherwise this information is not accessible.</p>

<p>Note that turning off $auth_required_release check is safe as long as
access to the socket is restricted, like with file protections on a Unix
socket, or restricted with inet_acl to specific IP addresses. Enabling or
disabling $auth_required_release is a management / setup decision and
convenience.</p>

<p>To release a mail message an exact quarantine location from a log file
should be specified as an argument to amavisd-release, e.g.:</p>

<pre>
amavis[29297]: (29297-01-6) Blocked SPAM,
  ... &lt;xxx&gt; -&gt; &lt;yyy&gt;,
  quarantine: <i>spam/U/UM3XM3XDbN52.gz</i>,
  Message-ID:&lt;...&gt;, mail_id: UM3XM3XDbN52, Hits: 13.365,

$ amavisd-release <i>spam/U/UM3XM3XDbN52.gz</i>
250 2.6.0 Ok, id=rel-UM3XM3XDbN52,
  from MTA([193.2.4.66]:10025): 250 2.0.0 Ok: queued as F137717B88B
</pre>

<p>The <i>amavisd-release</i> utility also accepts <i>mail_id</i> from STDIN
if releasing more than one message in one go is more convenient:</p>

<pre>
$ amavisd-release -
<i>spam/U/UM3XM3XDbN52.gz</i>
<i>spam/g/gnwKVFKiuey3.gz</i>
<i>spam/X/Xpkj9mLLBHTR.gz</i>
</pre>


<h2><a name="addrext">Redirecting malware to a different
mailbox -- plus addressing</a></h2>

<p>Amavisd-new can tag passed malware by appending an address extension
to a recipient address. An address extension is usually a short string
(such as 'spam') appended to the local part of the recipient address,
delimited from it by a single character delimiter, often a '+'
(or sometimes a '-'). This is why address extensions are also known as
"plus addressing". Examples of such mail addresses belonging to user
<i>jim@example.com</i> are: <i>jim+spam@example.com</i>,
<i>jim+cooking@example.com</i>, <i>jim+health@example.com</i>,
<i>jim+postfix@example.com</i>.</p>

<p>Most mailers (MTA), including Postfix and sendmail, have some provision
to put address extensions to good use. Similarly, local delivery agents (LDA)
such as Cyrus or LDAs that come with MTA, can be configured to recognize
and make use of address extensions.</p>

<p>The most common application for address extensions is to provide
additional information to LDA to store mail into a separate mail folder.
Users may for example choose to use this feature to let LDA automatically
file messages from mailing lists to a dedicated subfolder, or to file
spam to a spam folder, just by letting LDA simply and quickly examine
the envelope recipient address, without having to parse mail header
or having to configure and run filters such as procmail or Sieve.</p>

<p>Mailers (MTA and LDA) usually attempt first to examine (to check for
validity, to lookup in virtual or aliases maps) a full unmodified recipient
address. If the attempt is unsuccessful, they strip away the extension part,
and try again. This way a presence of some unknown address extension is
simply ignored. For example, a delivery for <i>jim+health@example.com</i>
would deliver the mail to the main Jim's inbox if he hasn't provided
a subfolder <i>health</i> in his mailbox.</p>

<p>For this fallback to work (to ignore unknown extensions), it is important
that all components that need to deal with address extensions (MTA, LDA,
content filters) have the same notion of the delimiter in use on the
system. For Postfix the configuration option is <tt>recipient_delimiter=+</tt>
(see also propagate_unmatched_extensions), for amavisd-new the option is
<tt>$recipient_delimiter='+';</tt> for Cyrus the delimiter is hardcoded
as '+', see <a href="http://cyrusimap.web.cmu.edu/imapd/faq.html">Cyrus
IMAP FAQ -&gt; plus addressing</a>.</p>

<p>The amavisd-new configuration options for adding address
extensions are @addr_extension_virus_maps, @addr_extension_spam_maps,
@addr_extension_banned_maps, @addr_extension_bad_header_maps.
The configuration must also ensure the malware mail is to be delivered,
otherwise there is nothing to tack an address extension on -- either by
setting kill level sufficiently high, or by declaring spam lovers, or by
<tt>$final_spam_destiny=D_PASS</tt>; an example:</p>

<pre>
$recipient_delimiter = '+';
@addr_extension_spam_maps = ('spam');
$sa_tag2_level_deflt = 6.7 ;    # score above which spam extension is added
$sa_kill_level_deflt = 15;      # block higher score entirely
$final_spam_destiny=D_DISCARD;  # junk all above kill level
</pre>

<p>or provide extension string more selectively for certain users
or subdomains:</p>

<pre>
@addr_extension_spam_maps = (
  { '.sub1.example.com' => 'spam',     # an entire subdomain
    'user1@example.com' => 'spam',     # a particular user
    'user2@example.com' => 'malware',  # another user wants a different ext.
    '.'                 => '' }  # all the rest do not receive an extension
);
</pre>

<p>If one is considering using a quarantine mechanism but wants a per-user
(or perhaps per-subdomain) quarantines, this is not such a good idea,
because quarantined files are not supposed to be directly visible or
handled by recipients: to protect the privacy of the sender, some
header pre-processing must be performed on a quarantined file before
handing it over to a recipient.</p>

<p>The cleanest way to achieve per-user quarantine which may be directly
accessible and/or manipulated by recipients is to turn on adding address
extensions, and configure MTA and/or LDA to store such mail wherever
necessary, either to a user's dedicated subfolder, or perhaps to some
centralized dedicated set of malware mailboxes (per-user or perhaps
per-subdomain).</p>

<p>If it is desired to reroute extension-tagged mail to some mailbox
away from the usual LDA, the virtual alias mapping by MTA is the tool
for the job. With Postfix, a pcre-based virtual map can specify
for example:</p>

<pre>
/^(.*)\+spam@([^@]*)\.example\.com$/   spam-$2-box@example.com
</pre>

<p>which will collect all spam into one mailbox for each subdomain.</p>

<p>For the Postfix local(8) LDA, a presence of a file $HOME/.forward+spam
can redirect mail for user+spam to some dedicated file. For the Postfix
virtual(8) LDA, a virtual_mailbox_maps may contain entries like:</p>

<pre>
user1         mbxfile1
user1+spam    mbxspamfile1
user2         mbxfile2
user2+spam    mbxspamfile2
</pre>


<h2><a name="wblist">Hard black- and whitelisting senders regarding spam</a></h2>

<p>The blacklisting and the whitelisting are ways of telling that we already
know that a message is spam or is ham (non-spam) just by examining the envelope
sender address and comparing it to lists of known spammers or to lists of
known legitimate senders of ham. It is a quick check, potentially saving us
the trouble of examining the mail contents. It has a big drawback however
in that the sender mail address can be (and often is) faked and there is no
guarantee that the claimed sender address represents the actual sender.</p>

<p>The sender address is usually faked for spam messages, so whitelisting
some sender address is a of questionable value, and often lets in far more
spam than it does good by approving legitimate mail. For a reliable way
of permitting certain sending clients to send spamy mail see <i>policy
banks</i>.</p>

<p>Blacklisting however is still useful: spammer has no desire to pretend
to be some blacklisted sending address, when he can choose any other address.
Genuine sender that is intentionally blacklisted can only avoid being
blocked by falsifying his address (joining spammers in his methods)
<em>and</em> sending non-spamy mail, the later being our objective anyway.
Although amavisd-new does provide blacklisting, it is functionally equivalent
but more effective to blacklist senders at the MTA, preventing such mail
from even entering the mail system.</p>

<p>It should be emphasized that whitelisting (and blacklisting) only affects
spam checks. It has no influence on other checks such as virus, banned or
header checks. Infected mail from whitelisted sender would still be blocked
if our policy is to block viruses.</p>

<p>Another point to bear in mind is that the sender address examined
is the one from the SMTP protocol, exactly as provide by MTA to amavisd-new.
It is known as the envelope sender address or return path. This address
does not necessarily match the mail author's address from the mail header
(From:) or the sender's address from the header (Sender:). This is most
obvious with mail from mailing lists, where the envelope sender address
is usually the address of a mailing list management service, while the
author's address (From:) is the address of a person sending the message.
Using the envelope sender address in most cases makes it easier to black-
or whitelist mail from mailing lists, compared to guessing a sender address
by parsing mail header.</p>

<p>To avoid surprises, whitelisted sender suppresses inserting/editing
the tag2-level header fields (X-Spam-*, Subject), appending spam address
extension, and quarantining, even if we know the message is spam (e.g.
because the spam check result on the same mail contents has been cached from
some earlier mail or known from check on behalf of another recipient).</p>

<p>For mail from blacklisted senders, the effect is as if the spam level
were artificially pushed high, resulting in 'X-Spam-Flag: YES', high
'X-Spam-Level' bar and other usual reactions to spam, including possible
rejection. If the message nevertheless still passes (e.g. for spam loving
recipients), it is tagged as BLACKLISTED in the 'X-Spam-Status' header field,
but the reported spam value and set of tests in this report header field
is not adjusted (if available from SpamAssassin, which may or may not have
been called)</p>

<p>If <em>all</em> recipients of a message either white- or blacklist the
sender, amavisd is free to skip spam scanning (calling the SpamAssassin),
saving on time. There is no guarantee however that spam scanning will
actually and always be skipped.</p>

<p>The following variables (lists of lookup tables) are available,
with the semantics and syntax as specified in README.lookups:
@whitelist_sender_maps, @blacklist_sender_maps, which implement
global policy applicable to all recipients. Similarly there are
$per_recip_blacklist_sender_lookup_tables and
$per_recip_whitelist_sender_lookup_tables, which make possible
for each recipient or subdomain to specify its own set of black-
or whitelisted senders. The per-recipient tables take precedence
over global tables.</p>

<p>For SQL lookups, amavisd-new will first lookup the recipient in table
<i>users</i> in order of descending priority, e.g. user@sub.domain.org,
user, @.sub.domain.org, @.domain.org, @.org, and @. (which can be considered
a catchall). Each matching recipient record may have a list of senders
associated (through join on field <i>users.id</i> and <i>wblist.rid</i>).
The sender address is then looked up in the associated list of senders
(<i>wblist</i>) in order of descending priority, e.g. sender@sub.example.com,
@.sub.example.com, @.example.com, @.com, and @. . This search stops at the
first matching sender record with a non-NULL field <i>wblist.wb</i>. The value
of a field <i>wblist.wb</i> from the matched record determines if the sender
is considered whitelisted ('W'), blacklisted ('B') or neutral ('&nbsp;')
for this recipient.</p>

<p>The neutral value is there just as a way to explicitly stop the search,
which may be used by a recipient to overrule site-wide or static
white- or blacklisting defaults for some specific sender, and to
explicitly neither whitelist nor blacklist the sender, letting the
normal spam check determine the spaminess of a mail.</p>

<p>For recipient user@sub.domain.com and sender sender@sub.example.com
the following search is performed:</p>

<pre>
user@sub.domain.org
  sender@sub.example.com @.sub.example.com @.example.com @.com @.

user
  sender@sub.example.com @.sub.example.com @.example.com @.com @.

@.sub.domain.org
  sender@sub.example.com @.sub.example.com @.example.com @.com @.

@.domain.org
  sender@sub.example.com @.sub.example.com @.example.com @.com @.

@.org
  sender@sub.example.com @.sub.example.com @.example.com @.com @.

@.
  sender@sub.example.com @.sub.example.com @.example.com @.com @.
</pre>


<h2><a name="score_sender">Soft black- and whitelisting
senders regarding spam -- @score_sender_maps</a></h2>

<p>Instead of hard black- or whitelisting a sender address (unconditionally
considering mail spam or ham solely based on sender address regardless
of mail contents), a more gentle approach is to add score points (penalties)
to the spam score for mail from certain senders or sending domains.
Positive points lean towards blacklisting, negative towards whitelisting.
This is much like adding SpamAssassin rules or using its white/blacklisting,
except that here only envelope sender addresses are considered (not addresses
in a mail header), and that score points can be assigned per-recipient
(or per-domain or globally), and that the assigned penalties are customarily
much lower than the default SpamAssassin white/blacklisting score.</p>

<p>The table structure of @score_sender_maps is similar to
$per_recip_blacklist_sender_lookup_tables i.e. the first level key is
recipient address, pointing to by-sender lookup tables. The essential
difference is that scores from <em>all</em> matching by-recipient lookups
(not just the first that matches) are summed to give the final score boost.
That means that both the site and domain administrators, as well as the
recipient can have a say on the final score.</p>

<p>For SQL lookups, the mechanism is much like the one described for
hard black- or whitelisting, with the following differences:</p>
<ul>
<li>the field <i>wblist.wb</i> is numeric, representing score points,
  instead of containing a character W or B or space;</li>
<li>the search through matching recipients does not stop at the first
  match, but traverses all matching recipients, summing up the
  corresponding <i>wblist.wb</i> field values.</li>
</ul>

<p>Namely, amavisd will lookup the recipient, e.g. user@sub.domain.org,
user, @.sub.domain.org, @.domain.org, @.org, and @. . Since the search will
not stop at the first recipient match, the search order in this case is
unimportant, although it is actually the same descending-priority order as with
hard b/w listing. Each matching recipient record may have a list of senders
associated (through join on field <i>users.id</i> and <i>wblist.rid</i>).
The sender address is then looked up in the associated list of senders
(<i>wblist</i>) in order of descending priority, e.g. sender@sub.example.com,
@.sub.example.com, @.example.com, @.com, and @. . This search stops at the
first matching sender record with a non-NULL field <i>wblist.wb</i>, but this
does not terminate the outer recipients search. Numeric values of a field
<i>wblist.wb</i> from matched records are summed up across all matching
recipients tables, and the result is added to the spam score as produced
by SpamAssassin.</p>

<p>Unlike static tables, where hard and soft w/b-listing use separate
tables, the SQL-based hard and soft w/b-listing uses the same SQL tables
and the same field <i>wblist.wb</i>. Mixing the 'W', 'B' with numeric values
is somewhat frowned upon, but is supported to facilitate transition.
The search goes like described above as long as only numeric field values
are encountered, summing up the values and adding the accumulated sum
to the final score. If a non-numeric value of field <i>wblist.wb</i>
is encountered during this search, its value (W or B or space) is
interpreted as described for hard w/b listing, and the search stops at
this point.</p>


<h2><a name="confvars">Configuration variables</a></h2>

<p>The behaviour of the amavisd-new is controlled by a set of configuration
variables, which are just normal module-global Perl variables (in package
Amavis::Conf). At daemon startup time these variables are first assigned
an initial value (often just an undefined value, the undef). The default
values of configuration variables are documented in file amavisd.conf-defaults,
which lists all configuration variables.</p>

<p>Next a configuration file amavisd.conf (or other file as specified
by option -c) is read and interpreted by the Perl interpreter itself.
The amavisd.conf is just a normal Perl program, and can in principle
do whatever and however it pleases, but its main purpose is to assign
values to configuration variables.</p>

<p>After execution of amavisd.conf is done, the daemon may correct some
configuration variable values (mainly to maintain backwards compatibility
with earlier version of configuration file), and may assign a default value
to certain variables which are still undefined -- these variables and their
default values are marked "after-defaults" in the documentation file
amavisd.conf-defaults. The main reason for existence of the "after-defaults"
concept is that some default values depend on other configuration variables
and can not be computed before the amavisd.conf is finished. To force such
variables to an off/false/disabled state, one needs to assign some false but
defined value to them, such as '' (an empty string) or a 0 for booleans.</p>

<p>Perl variables always start with a character $, @ or % to indicate a type
of variable. This leading character is part of the variable name for all
practical purposes.</p>

<dl>
<dt>$ (dollar character)</dt>
<dd>indicates a scalar variable (a string, a number, a reference)</dd>
<dt>@ (at sign)</dt>
<dd>indicates an array variable (a list)</dd>
<dt>% (percent character)</dt>
<dd>indicates an associative array (also known as hash),
  which maps keys to values</dd>
</dl>

<p>A couple of Perl syntactical elements deserve mention at this point,
as they are often used in the amavisd.conf configuration file.</p>

<dl>
<dt>"...", a double-quoted string</dt>
<dd>is a string; variables within are evaluated, e.g. "$MYHOME/tmp"</dd>
<dt>'...', a single-quoted string</dt>
<dd>is a string; variables within are not evaluated,
  the $ and @ loose their special meaning, e.g. 'user@example.com'</dd>
<dt>(...)</dt>
<dd>is a list of comma-separated expressions, e.g. (1,2,"test");
  a list is normally assigned to an array variable</dd>
<dt>qw(string)</dt>
<dd>is an operator that interprets its argument as a single string,
  splits it on whitespace to words, and returns a list of words (strings);
  it is a convenience to avoid some typing,
  e.g. qw(user@example.com .example.net .org) is exactly equivalent
  to ('user@example.com', '.example.net', '.org');
</dd>
<dt>[...]</dt>
<dd>is a reference to an anonymous list of comma-separated expressions,
  e.g. [1,2,"test"]; (note: a reference is a scalar)</dd>
<dt>{...}</dt>
<dd>is a reference to an anonymous associative array,
  e.g. {'alfa'=>1, 'beta'=>99, 'other'=>'test'};
  (note: a reference is a scalar)</dd>
<dt>\variable</dt>
<dd>is a reference to a variable, e.g. \$virus_admin, \@mynetworks, \%whitelist_sender;
  (note: a reference is a scalar)</dd>
</dl>

<p>Historically amavisd-new accessed all configuration variables directly
with their name, e.g. %spam_lovers, @spam_lovers_acl, $spam_lovers_re.
Later it became apparent that certain groups of variables (lookups) are
always used together in the same way, so new array variables like
@spam_lovers_maps were introduced. The program now never accesses old
lookup table variables directly, but always through higher level lists.
The solution is fully backwards compatible, as the default value
for the new lists references the old variables, e.g.:</p>
<pre>
@spam_lovers_maps = (\%spam_lovers, \@spam_lovers_acl, \$spam_lovers_re);
</pre>

<p>Administrator is free to modify or replace the lists in variables like
@spam_lovers_maps, perhaps rearranging the order or loosing all references to
legacy variables, and replacing them with other variables, often anonymous
arrays/lists or anonymous associative maps (hashes), or constants which can
serve as a convenient catchall default value when used last in the list.</p>

<p>Since amavisd-new version 2.0, there is one further generalization step
in the way a program accesses configuration variables. More than a hundred
configuration variables which control amavisd-new operation on a by-message
level (as opposed to by-recipient and truly global settings) are now grouped
in associative array called a <i>policy bank</i>. These configuration variables
are no longer accessed directly by their variable name by the program, but
always through a currently installed policy bank. Administrator is free to
modify the policy bank, normally by providing replacement policy banks and
specifying under what conditions the replacement policy bank is to be
automatically installed.</p>


<h2><a name="pbanks">Policy banks</a></h2>

<p>Policy banks hold sets of configuration variables controlling most
of per-message settings, including: static lookup tables, IP interface
access rules, forwarding address, log level, templates, administrator
addresses, spam trigger levels, quarantine rules, lists of anti-virus
scanner entries (or just a subset), banned names rules, defang settings,
etc. The whole set of these settings may be replaced with another
predefined set based on incoming port number, making it possible for one
amavisd daemon to cope with more diverse needs of served user communities
which could so far only be implemented by running more than one instance
of the amavisd daemon, each with its own configuration file.</p>

<p>This mechanism brings new potentials for the future: in principle policy
banks could be swapped not only based on port number or SMTP client
IP address, but on any characteristics pertaining to a mail message as
a whole (not specific to each of its recipients), or to characteristics
of a connection from a mailer (e.g. the interface address or protocol);</p>

<p>Until a better mechanism is available, a policy bank named 'MYNETS' has
special semantics: this policy bank is loaded (if it exists) whenever MTA
supplies a SMTP client's IP address (through Postfix XFORWARD extension to
the SMTP protocol, or via a new AM.PDP protocol) and that address matches
the @mynetworks list (actually: the list referenced by 'mynetworks_maps'
key in the currently installed policy map).</p>

<p>An associative array %interface_policy is a current mechanism of assigning
a policy bank to an incoming TCP port number (port must be in the list
@$inet_socket_port, otherwise amavisd will not listen on that port).
Whenever a connection from MTA is received, first a built-in policy bank
with an empty name -- the $policy_bank{''} gets loaded, which brings in
all the global/legacy settings. Then it is overlaid by whatever configuration
settings are in the bank named in the $interface_policy{$port} if any,
and finally the policy bank named 'MYNETS' (i.e. settings from
$policy_bank{'MYNETS'}) is overlaid if such policy bank exists and the SMTP
client IP address is known (by XFORWARD SMTP extension command from MTA)
and it matches the current mynetworks_maps. See amavisd.conf-sample for
examples.</p>

<p>When a new policy bank is overlaid over an existing set of configuration
variables, the variables not present in the new policy bank retain their
value. This makes it possible to specify new policy banks which carry
only a minimal set of settings that need to be changed.</p>

<p>The built-in policy bank (with empty name) is predefined, and includes
references to most other variables (the dynamic config variables),
which are accessed only indirectly through the currently installed
policy bank. Overlaying a policy bank with another policy bank may
bring in references to entirely different variables, possibly unnamed,
and may remove references to legacy variables if it so chooses.</p>

<p>Configuration variables are referenced from a policy bank (which
is implemented as a perl associative array, i.e. a hash) by keys of the
same name, e.g. { log_level => \$log_level, inet_acl => \@inet_acl, ...}.
For scalars one level of indirection is allowed, e.g.
a policy bank { log_level => \$log_level }; $log_level=2;
is equivalent to { log_level => $log_level } or to { log_level => 2 },
but in the first example with an indirect reference, the $log_level
may be assigned to even _after_ the policy bank has already been formed.</p>

<p>A word of caution: the syntax of entries within a policy bank hash
is slightly different from assignments to configuration variables.
This is because entries within policy bank are not assignments, but
key=>value pairs as in any Perl associative array. And these pairs are
delimited by commas, unlike statements, which are delimited by semicolons.
Value is separated from its key by '=>' (or by a comma), whereas the
assignment operator is '='. Keys of a policy bank are without leading $
or @ or %, unlike variable names. Values of an associative array can only
be scalars (e.g. strings or numbers or references to arrays or references
to associative array).</p>

<p>Compare:</p>
<ul>
<li>value of a policy bank is a reference to a Perl associative array, e.g.:
<pre>
    { log_level => 3,
      forward_method => 'smtp:[127.0.0.1]:10025',
      spam_admin_maps => ["spamalert\@$mydomain"],
      blacklist_sender_maps => [ [qw(.example.org .example.net)] ],
    }
</pre>
</li>
<li>normal assignments look like:
<pre>
      $log_level = 3;
      $forward_method = 'smtp:[127.0.0.1]:10025';
      @spam_admin_maps = ("spamalert\@$mydomain");
      @blacklist_sender_maps = ( [qw(.example.org .example.net)] );
</pre>
</li>
</ul>

<p>And a final note: Perl can detect and report typing mistakes in variable
names, but mistyped key is just some unused associative array entry lurking
in a hash, never used and never reported as mistyped/useless.</p>


<h2><a name="pbanks-ex">Putting policy banks to good use -- examples</a></h2>

<p>The sender address can be faked, so comparing envelope sender address
to @local_domains_maps or some other lookup table to base some important
decisions on would not be trustworthy. The only reliable information is
the recipient's e-mail address and information about client SMTP session,
such as the IP address of the sending SMTP client and the server port number
or the interface address. Such information can be made available by MTA to
amavisd-new through a feeding protocol (e.g. XFORWARD extension or via AM.PDP),
or separate MTA paths can be set up for mail that needs to be treated
differently, such as internally originating and externally originating mail,
or perhaps separating authenticated mail from the rest.</p>

<p>Amavisd-new has two ways of receiving such extra information from MTA:</p>
<ul>
<li>it can listen on more than one TCP port and apply different policy banks
to each port (applicable to any dual-MTA setup including Postfix), and/or</li>
<li>it can accept SMTP client's IP address from MTA by XFORWARD extension
to the SMTP protocol (available in Postfix only), which can control loading
of policy bank MYNETS. This is in addition to loading policy banks based
on TCP port number. The MYNETS policy bank is loaded (if applicable)
<i>after</i> (on top of the) the port-assigned policy bank.</li>
</ul>


<p>The following examples illustrate several ways of distinguishing
between different mail origins. For most common purposes the only distinction
that really matters is separating internally originating mail from the rest,
and for this purpose the use of policy bank MYNETS and a sufficiently
recent version of Postfix supporting XFORWARD suffices -- the complication
with multiple ports and multiple interfaces is needed only for more demanding
sites which prefer maximum flexibility.</p>


<h3>Example 1</h3>

<p>As stated earlier, a policy bank named 'MYNETS' is loaded (if it exists)
whenever MTA supplies an original SMTP client's IP address (e.g. via the
Postfix XFORWARD extension) and that address matches the @mynetworks list.
This covers most common needs to distinguish internally-originating mail
from the rest, and allows them to be treated differently, as illustrated
by the following example:</p>

<pre>
$policy_bank{'MYNETS'} = {  # mail originating from @mynetworks
  virus_admin_maps => ["security\@$mydomain"], # alert of infected local hosts
  spam_admin_maps  => ["abuse\@$mydomain"],    # alert of internal spam
  spam_kill_level_maps => [7.0],  # slightly more permissive spam kill level
  spam_dsn_cutoff_level_maps => [15],
  banned_filename_maps => [
    new_RE(
    # block double extensions in names:
      qr'\.[^./]*\.(exe|vbs|pif|scr|bat|cmd|com|cpl|dll)\.?$'i,
    # allow any name or type (except viruses) within an archive:
      [ qr'^\.(Z|gz|bz2|rpm|cpio|tar|zip|rar|arc|arj|zoo)$' => 0],
    # blocks MS executable file(1) types, unless allowed above:
      qr'^\.(exe-ms)$',
    ),
  ],
};
</pre>


<h3>Example 2</h3>

<p>In the following example some of the external mail is coming in via
fetchmail, the rest of the externally originating mail is coming in via normal
SMTP at tcp port 25, and all internally originating mail is coming to MTA
via mail submission port 587 reserved for that purpose, or via dedicated IP
address accessible only from inside, or through a Postfix pickup service.
We'll use Postfix in this example, although it does not rely on any particular
Postfix capability that wouldn't be available in any general purpose MTA
in some form or another.</p>

<p>Only the specifics of this setup are described here. Missing bits
like the MTA re-entry port 10025 and other options are described in
<a href="./README.postfix">README.postfix</a> and are assumed here.
Specifying additional smtpd restrictions and options may be desired,
and is omitted here for brevity.</p>

<p>To let amavisd-new be able to distinguish between all four
mail entry routes, we let amavisd listen on four TCP ports
(the fifth is for good measure, to be used in the next example):
<tt>$inet_socket_port = [10040,10041,10042,10043,10044];</tt>
(any unused non-privileged TCP ports can be used)</p>

<p>In Postfix configuration file master.cf we attach different
<i>content_filter</i> options to each of the Postfix services
receiving mail. We'll assume the MTA host has two IP addresses 192.0.2.1
and 192.0.2.2 assigned (IP aliases or separate physical interfaces),
which makes it easier to distinguish between internally originating mail
and the rest even if XFORWARD can not be used (older Postfix versions
or some other MTA):</p>

<pre>
# regular incoming mail, originating from anywhere (usually from outside)
# the MX record (or backup mailers) should point to this IP address
192.0.2.1:smtp inet  n  -  n  -  -  smtpd
  -o content_filter=smtp-amavis:[127.0.0.1]:10040

# incoming mail from fetchmail, considered externally originating
# (add 'smtphost localhost/2345' to the poll section in .fetchmailrc)
127.0.0.1:2345 inet  n  -  n  -  -  smtpd
  -o content_filter=smtp-amavis:[127.0.0.1]:10041
  -o smtpd_client_restrictions=permit_mynetworks,reject
  -o mynetworks=127.0.0.0/8

# IP address to be used by internal hosts for mail submission
192.0.2.2:smtp inet  n  -  n  -  -  smtpd
  -o content_filter=smtp-amavis:[127.0.0.1]:10042
  -o smtpd_client_restrictions=permit_mynetworks,reject

# or, tcp port 587 to be used by internal hosts for mail submission
submission inet  n  -  n  -  -  smtpd
  -o content_filter=smtp-amavis:[127.0.0.1]:10042
  -o smtpd_client_restrictions=permit_mynetworks,reject

# locally originating mail submitted on this host through a sendmail binary
pickup     fifo  n  -  n  60  1  pickup
  -o content_filter=smtp-amavis:[127.0.0.1]:10043
</pre>

<p>A global option <i>content_filter</i> in file main.cf could provide
a convenient default, only services that need a different setting would
then need to override it.</p>

<p>Now let's make up names for policy banks which will cover all four cases.
We'll pick names EXT, EXT-FM, INT, INT-HOST for policy banks. The amavisd
needs to be told to load corresponding policy when a request comes in on each
of the listening ports:</p>
<pre>
  $interface_policy{'10040'} = 'EXT';
  $interface_policy{'10041'} = 'EXT-FM';
  $interface_policy{'10042'} = 'INT';
  $interface_policy{'10043'} = 'INT-HOST';
  $interface_policy{'10044'} = 'AUTH';  # to be used in the next example
</pre>

<p>Next we'll prepare each policy and specify there the options which should
be different from global options. Note that the following policies serve
mostly as an example and to provide ideas -- they should not be considered
a recommendation. For example:</p>

<pre>
# regular incoming mail, originating from anywhere (usually from outside)
$policy_bank{'EXT'} = {
  # just use global settings, no special overrides
};

# incoming mail from fetchmail, considered externally originating
$policy_bank{'EXT-FM'} = {
  log_level => 2,
    # no bounces for spam, not even for score below spam_dsn_cutoff_level_maps:
  final_spam_destiny => D_DISCARD,
};

# locally originating mail guaranteed to be from inside
$policy_bank{'INT'} = {
    # enable/redirect admin notifications for locally originating malware:
  virus_admin_maps => ["virusalert\@$mydomain"],
  spam_admin_maps  => ["virusalert\@$mydomain"],
    # be slightly more permissive on spam levels for mail from our hosts:
  spam_kill_level_maps => [7.0],
  spam_dsn_cutoff_level_maps => [15],
  final_virus_destiny => D_BOUNCE,  # (unless in viruses_that_fake_sender_maps)
  final_spam_destiny  => D_BOUNCE,  # (unless above spam_dsn_cutoff_level_maps)
  bypass_banned_checks_maps => [ 1 ],  # allow sending any file type or name
    # provide customized sender notifications for spam from our users:
  notify_spam_sender_templ => read_text("$MYHOME/notify_spam_sender.txt"),
};

# mail locally submitted on the host on which MTA runs
$policy_bank{'INT-HOST'} = {
    # NOTE: this is just an example; ignoring internally generated spam
    # may not be such a good idea, consider zombified infected local PCs
  bypass_spam_checks_maps   => [ 1 ],
  bypass_banned_checks_maps => [ 1 ],
  final_spam_destiny   => D_PASS,
  final_banned_destiny => D_PASS,
};

# authenticated mail (used by the next example)
$policy_bank{'AUTH'} = {
    # enable admin notifications for malware originating from our users:
  virus_admin_maps => ["virusalert\@$mydomain"],
  spam_admin_maps  => ["virusalert\@$mydomain"],
    # be slightly more permissive on spam levels for mail from our users:
  spam_kill_level_maps => 7.0,
  spam_dsn_cutoff_level_maps => 15,
  bypass_banned_checks_maps => 1,  # allow sending any file type or name
  final_bad_header_destiny => D_BOUNCE;  # block invalid headers
};
</pre>

<p>If not all four cases need to be distinguished, the same policy bank
name (or none at all) can be assigned to more than one port. Also the
MTA configuration can use the same amavisd port for more than one
of its incoming services if there is no need for different settings.</p>


<h3>Example 3</h3>

<p>Besides setting different <i>content_filter</i> options for different
Postfix services, one may use the option FILTER in Postfix lookup tables,
as described in Postfix man pages access(5) and header_checks(5), to specify
different <i>content_filter</i> settings based on various conditions,
such as sender domain name or IP address, mail header fields, etc.</p>

<p>Consider the next example which uses the FILTER settings to distinguish
from internally originating, authenticated external mail and the rest.</p>

<pre>
# global default:
content_filter=smtp-amavis:[127.0.0.1]:10044

# note that permit_mynetworks only checks for key presence and ignores rhs
mynetworks = cidr:/etc/postfix/mynetworks-filter.cidr

smtpd_recipient_restrictions =
  reject_unauth_pipelining, reject_non_fqdn_recipient, reject_non_fqdn_sender,
  reject_unknown_recipient_domain, reject_unknown_sender_domain,
  check_client_access cidr:/etc/postfix/mynetworks-filter.cidr,
  permit_mynetworks,
  permit_sasl_authenticated, permit_tls_clientcerts,
  reject_unauth_destination,
  check_sender_access regexp:/etc/postfix/filter-catchall.regexp
</pre>

<p>The <tt>check_client_access cidr:/etc/postfix/mynetworks-filter.cidr</tt>
preceeds the <i>permit_mynetworks</i> (which uses the same cidr table,
but ignores the righthand side), and it serves to override the global
<i>content_filter</i> setting by the use of FILTER for each of the
networks (presumably internal) listed in mynetworks-filter.cidr.
The final effect is that mail matching networks listed in
mynetworks-filter.cidr will be sent for content filtering to tcp port 10042
(the FILTER setting in access map), authenticated non-local mail will be
sent for content filtering to port 10044 (the global setting), while all
the rest will be sent to port 10040 (as specified in catchall filter).
If there are any other overrides in master.cf like in the previous example,
they take precedence over the global settings, but the FILTER rules take
the ultimate precedence.</p>

<p>/etc/postfix/mynetworks-filter.cidr :</p>
<pre>
127.0.0.0/8    FILTER smtp-amavis:[127.0.0.1]:10042
10.0.0.0/8     FILTER smtp-amavis:[127.0.0.1]:10042
172.16.0.0/12  FILTER smtp-amavis:[127.0.0.1]:10042
192.168.0.0/16 FILTER smtp-amavis:[127.0.0.1]:10042
</pre>

<p>/etc/postfix/filter-catchall.regexp:</p>
<pre>
/^/            FILTER smtp-amavis:[127.0.0.1]:10040
</pre>

<p>Note that in place of the last catchall entry:
<tt>check_sender_access regexp:/etc/postfix/filter-catchall.regexp</tt>
one would be tempted to do:
<tt>check_sender_access static:FILTER smtp-amavis:[127.0.0.1]:10040</tt>,
but unfortunately spaces are not allowed within an option value
in master.cf, so we have to resort to a lookup table.</p>


<h2><a name="max_requests">$max_requests</a></h2>

<p>Amavisd-new runs under process control of Net::Server. This is a pre-forked
environment where $max_servers child processes are constantly kept alive and
ready to accept new tasks (mail messages to be checked). Each amavisd child
process is able to handle several tasks in a row, which helps to reduce
startup (fork) costs. In case of SMTP or LMTP protocol, each session may
consist of several SMTP/LMTP transactions. Each SMTP/LMTP transaction is
counted a one task, regardless of whether it came in from the same SMTP/LMTP
client in a multi-transaction session, or as separate sessions, possibly
from different SMTP/LMTP clients.</p>

<p>A configuration variable $max_requests (default value 20) controls the
approximate number of tasks each child process is willing to handle. After
that the child process terminates and Net::Server provides a new child process
to take its place.</p>

<p>The exact value of $max_requests is not critical. There are two
opposing needs, and some in-between value should be chosen.</p>

<p>On the low side, the number should not be too small in order for the
startup cost to be averaged out / sufficiently diluted over an entire
child lifetime. A value above 5 or 10 meets this goal in most amavisd-new
configurations.</p>

<p>On the high side, the value depends on the amavisd-new configuration.
The amavisd daemon itself is conservative in its use of dynamically
allocated memory and does not load mail into memory, but keeps mail
being processed and its components on files. Similarly, most of the
called external virus scanners and decoders are rational in their use
of memory (a notable exception was Archive::Tar which was used if a
pax or cpio command was not available, but is no longer supported).
Unfortunately this is not true for Perl module Mail::SpamAssassin,
which expects to have an entire decoded mail in memory in order
to be able to run its large set of rules on it in reasonable time.
This is a design decision of SpamAssassin.</p>

<p>When amavisd-new is not configured to use SpamAssassin, the value of
$max_requests can be quite high without any known or expected problems.
For general sanity reasons, an upper limit could be a 100 for example,
although anything above 20 or so would not bring measurable benefit to
the maximum sustained mail throughput.</p>

<p>When amavisd-new <em>is</em> configured to use SpamAssassin however,
the slurping of entire mail in memory and decoding it may have implications,
depending on the $sa_mail_body_size_limit value, on the maximum mail size
allowed at the MTA (e.g. Postfix setting for <i>message_size_limit</i>)
and on the mail compression factor. Even though the allocated memory is
reclaimed by Perl after mail processing, and is reused for subsequent
processing, the process virtual memory footprint never shrinks, it can
only expand as needed.</p>

<p>The $sa_mail_body_size_limit sets a limit on a mail size beyond which
SpamAssassin is not called, so it can not contribute to memory usage
much beyond this limit, times a small factor (2-5?, due to multiple
internal representations of a message). If the $sa_mail_body_size_limit
is large, and MTA mail size is not limited, or if mail has a huge
mail header, the memory footprint can become noticable. For the rest
of a lifetime the child process that processed the mail stays at its
high virtual memory size. If this happens frequently, host resources
may become scarce. Limiting the number of tasks is very much desirable
in this case.</p>

<p>The default value of 20 for $max_servers was chosen as a good
compromise between averaging-out the startup costs and not wasting too
much resources on hosts with high message size limit and SpamAssassin
enabled.</p>

<p>In the setup with Postfix where its lmtp client is chosen to
feed amavisd-new, this client tries to keep LMTP session open and
submit several mail messages in multiple transactions. With recent
Postfix versions its SMTP client is capable and willing of using
multiple transaction sessions as well, although it seems to be
less persistent than the LMTP client.</p>

<p>According to SMTP and LMTP protocol specifications, dropping the
session on the server side is considered rude and should be used
only as a last resort. In order to respect the $max_requests setting
(which is not strictly enforced by amavisd, and is considered an
advisory value), the client side should preferably be configured with
a comparable limit. Starting with amavisd-new-2.2.0 the amavisd daemon
is more strict in enforcing the limit and drops the SMTP or LMTP session
after $max_servers is exceeded by one. This was a recommendation from
the Postfix community, as the option of reducing Postfix max_use setting
is considered less appropriate.</p>

<p>Nevertheless, Postfix doesn't take session dropping lightly, it backs
off a while after content filter forcibly drops the session, which is
undesired. Better behaviour is achieved when Postfix voluntarily terminates
a SMTP session before amavisd would reach its $max_requests limit.
This can be achieved by applying max_use to the Postfix smtp service
feeding a content filter (typically this entry in master.cf is named
'amavisfeed').</p>


<h2><a name="dkim">Setting up DKIM mail signing and verification</a></h2>

<p>A DKIM standard (RFC 4871) states the following, which applies
to its predecessor DomainKeys (historical: RFC 4870) as well:</p>

<blockquote>
<p><i>
DomainKeys Identified Mail (DKIM)</i> defines a mechanism by which email
messages can be cryptographically signed, permitting a signing domain
to claim responsibility for the introduction of a message into the
mail stream.  Message recipients can verify the signature by querying
the signer's domain directly to retrieve the appropriate public key,
and thereby confirm that the message was attested to by a party in
possession of the private key for the signing domain.</p>
</blockquote>

<p>With added support in Postfix 2.3 for a milter protocol, it became
possible to use with Postfix many of existing milters (mail filters)
that were originally developed with sendmail in mind. It was hoped that
a widespread use of milters with sendmail offered a fertile ground for
software development, producing software of sufficient quality to be
able to use it with Postfix. It remains to be seen whether quality of
freely available milters comes anywhere close to high standards we
are accustomed to with Postfix, but with a bit of luck and reasonable
expectations, some of it can be put to good use.</p>

<p>Two of such milters are
<a href="http://sourceforge.net/projects/dkim-milter/">dkim-milter</a>
offering support for <i>DomainKeys Identified Mail (DKIM) Signatures</i>,
and <a href="http://sourceforge.net/projects/dk-milter/">dk-milter</a>,
offering support for <i>Domain-based Email Authentication (DomainKeys)</i>.
The DomainKeys (DK) is a predecessor of DKIM, as recognized by RFC 4870:</p>

<blockquote>
<p>The <i>DomainKeys</i> specification was a primary source from which the
<i>DomainKeys Identified Mail [DKIM]</i> specification has been derived.
The purpose in submitting the RFC 4870 document is as an historical reference
for deployed implementations written prior to the DKIM specification.</p>
</blockquote>

<p>The <i>dkim-milter</i> is more reliable and better maintained than
<i>dk-milter</i>, which is slowly fading into oblivion and is no longer
actively maintained. Google.com has already switched to DKIM, and Yahoo
is following shortly. Similar holds true in the world of Perl modules:
there are modules <i>Mail::DomainKeys</i> and <i>Mail::DKIM</i>, both of
which can be used by SpamAssassin plugins. Again the <i>Mail::DKIM</i>
(by Jason Long, based on initial work by Anthony D. Urso) is actively
maintained, compared to its predecessor <i>Mail::DomainKeys</i>, whose
active maintenance has been discontinued. SpamAssassin makes it easy
to use each or both of them (for verification / whitelisting), just by
enabling the already provided plugins. Note that recent versions of
<i>Mail::DKIM</i> support also DomainKeys signatures.</p>

<p>Mail signing and verification is a two-part job: signing of originating
mail (or mail being redistributed) from our domain, and verifying signatures
of incoming mail. Both tasks can be done by the same program, or they can
be performed by separate entities. Traditionally with sendmail, both tasks
are performed by the same milter, which may be easier to maintain, but
has certain disadvantages.</p>

<p>Verifying signatures should be performed early, before any local mail
transformations get a chance of invalidating a signature, e.g. by performing
MIME conversions to quote-printable, by fixing syntactically invalid mail
header section, by frivolously reformatting or reordering some header
fields, by modifying/inserting/removing certain header fields, or by a
local mailing list modifying mail text, e.g. by appending footnotes.</p>

<p>Signing outgoing mail should be performed late, after mail sanitation,
after conversion to 7-bit characters (to avoid later uncontrollable changes
by a relaying or receiving MTA), and after adding header fields by a content
filter. Similar applies to local mailing lists, which may be rewriting
messages, requiring them to be re-signed by the domain hosting a mailing
list, just before being sent out.</p>

<p>Since SpamAssassin only provides signature verification but not signing,
one obvious choice for signing is to use dkim-milter in signing-only mode,
invoked by a Postfix smtpd service which is receiving content-checked mail
from a content filter such as amavisd-new. As this second-stage smtpd
service does not reliably know how a given message came into a mail system
and whether it is supposed to be signed or not, a clean solution is to
provide two parallel paths through a content filter, one used for mail
to be signed (originating mail), the other for all the rest:</p>

<pre>
              +------+            (verify)
              |verify|          SpamAssassin
              +--+---+               ^|
                ^^^                  |v
incoming:       |||             +----++-----+
  MX ---->  25 smtpd ---> 10024 >           >---> 10025 smtpd -->
submission:      ||             |           |
  SASL -->  25 smtpd \          |  amavisd  | (notifications)
                  |   +->       |           >--->_
  mynets->  25 smtpd ---> 10026 >ORIGINATING>---> 10027 smtpd -->
                      +->       +-----------+            |
       --> 587 smtpd /  :                                v
                       (convert to 7-bit)             +------+
                                                      | sign |
                                                      +------+
</pre>

<p>There are other benefits to providing two parallal paths: a content
filter may be configured to apply different rules and settings to mail
that is known to be originating from our users, compared to the rest.
Some suggestions: apply less strict banning rules, enable spam
administrator notifications for internally originating spam and
viruses, letting SpamAssassin rules be conditionalized based on
amavisd-new policy banks loaded, etc.</p>

<p>For verification there are two choices: either a SpamAssassin plugin
can do it by calling a Perl module, or a milter in verification-only
mode can be invoked by the incoming Postfix smtpd service. In the
setup described, SpamAssassin sees almost the same message as a milter
on the incoming smtpd would (just few header fields such as a
<i>Received</i> are prepended, no other changes), so is unlikely
that a signature would become invalid.</p>

<p>Invoking signature verification by SpamAssassin has an advantage that
DKIM-based or DomainKeys-based whitelisting or scoring can be used,
but has a disadvantage that possibly not all mail is checked (e.g. large
mail may be exempt from spam checks).</p>

<p>On the other hand, invoking signature verification by calling a milter
from incoming smtpd service can insert a header field reporting a result
of verification, which may be used by MUA or by SpamAssassin rules,
although currently no such rules are provided ready-made.</p>

<p>For the purpose of gathering experience, both methods for signature
verification are described. It turns out that one or the other
implementation (in C or in Perl) may be buggy, and being able to compare
both results of signature verification can help with troubleshooting.</p>

<p>Let's begin by starting a dkim milter in two instances, one dedicated
to signing, the other to verification. For security reasons all milters
should run under a dedicated username, certainly not as <i>root</i>,
not as user <i>amavis</i> and not as user <i>postfix</i> or <i>mail</i>:
</p>

<p><i>verifying:</i></p>

<pre>
  dkim-filter -u dkfilter -b v \
    -l -p inet:4443@127.0.0.1 -P /var/run/dkim-filter-v.pid
</pre>

<p><i>signing:</i></p>

<pre>
  dkim-filter -u dkimfilter -b s -m ORIGINATING \
    -c relaxed/simple -S rsa-sha1 \
    -d example.com -s myselector -k /var/db/dkim/mykey.pem \
    -l -p inet:4445@127.0.0.1 -P /var/run/dkim-filter-s.pid
</pre>

<p>Generating a public and a private pair of keys and publishing
a public key in DNS is described in the dkim milter documentation
and also in the DKIM RFC document.</p>

<p>We are not specifying option -i to milters, the default of
-i 127.0.0.1 suits our setup just fine, as mail to be signed is
coming from a content filter, usually on a loopback interface
from the IP address 127.0.0.1.</p>

<p>Now we can tie the verifying milter to a Postfix smtpd service
listening for incoming mail:</p>

<p><i>master.cf:</i></p>
<pre>
  smtp inet n - n - 300 smtpd
    -o milter_default_action=accept
    -o milter_macro_daemon_name=MTA
    -o smtpd_milters=inet:127.0.0.1:4443
</pre>

<p>and tie the signing milter to a Postfix smtpd service that is
receiving checked mail from amavisd, intended to be signed:</p>

<p><i>master.cf:</i></p>
<pre>
  # mail return from a content filter (non-signing)
  10025 inet n - n - - smtpd
    -o content_filter=
    ... (other options, mail not to be signed) ...

  # mail from our users returning from a content filter (DKIM signing)
  10027 inet n - n - - smtpd
    -o content_filter=
    ... (other options, mail intended to be signed) ...
    -o milter_default_action=accept
    -o milter_macro_daemon_name=ORIGINATING
    -o smtpd_milters=inet:127.0.0.1:4445
</pre>

<p>As a sidenote, attaching milters to sendmail would use the same order
of invocations: signature verifying milter first, content filters next,
and signing milter last, for example:</p>

<pre>
  dnl Verifiers:
  INPUT_MAIL_FILTER(`dkim-filter-v', `S=inet:4443@127.0.0.1, T=R:2m')

  dnl Content filter:
  INPUT_MAIL_FILTER(`amavisd-milter',
    `S=unix:/var/amavis/amavisd-milter.sock, F=T, T=S:10m;R:10m;E:10m')

  dnl Signers:
  INPUT_MAIL_FILTER(`dkim-filter-s', `S=inet:4445@127.0.0.1, T=R:2m')
</pre>

<p>Now back to a Postfix setup (or any dual-MTA setup). Default content
filter is to be amavisd, listening on port 10026 (intended for signing).
Locally submited or SASL-authenticated mail will go to a content filter
on this default port 10026 and will be signed on its way out. All other
mail (incoming) will be diverted to port 10024 for content filtering
by a final catchall FILTER, and will never hit the signing milter:</p>

<p><i>main.cf:</i></p>
<pre>
  content_filter = smtp-amavis:[127.0.0.1]:10026

  smtpd_recipient_restrictions =
    ...
    permit_mynetworks
    permit_sasl_authenticated
    reject_unauth_destination
    ...
    check_sender_access regexp:/etc/postfix/filter_10024_catchall
    permit
</pre>

<p><i>/etc/postfix/filter_10024_catchall:</i></p>
<pre>
  /^/  FILTER smtp-amavis:[127.0.0.1]:10024
</pre>

<p>In SpamAssassin all that is necessary is to add (or uncomment) a line in
any of the .pre files (e.g. in local.pre, or in init.pre and v320.pre):</p>

<pre>
  loadplugin Mail::SpamAssassin::Plugin::DKIM
</pre>

<p>Perl module Mail::DKIM needs to be installed. Note that Mail::DKIM
starting with version 0.20 also recognizes DomainKeys signatures, so that
Plugin::DomainKeys is not needed any longer, and in fact its underlying
module is not supported any longer. It is advisable to stick to the most
recent version of Mail::DKIM.</p>

<p>The following SpamAssassin rules (in local.cf) work fairly well,
giving verified mail a little bit of advantage and slightly favourize
verified mail from some popular domains, encouraging people to start
signing their mail. Signed spam can be counterbalanced by other
measures.</p>

<pre>
  score DKIM_VERIFIED -0.3
  score DKIM_SIGNED    0
  score DKIM_POLICY_SIGNALL  0
  score DKIM_POLICY_SIGNSOME 0
  score DKIM_POLICY_TESTING  0

  # DKIM-based whitelisting of domains with good reputation:
  score USER_IN_DKIM_WHITELIST -8.0
  whitelist_from_dkim  *@intl.paypal.com  paypal.com
  whitelist_from_dkim  *@*.paypal.com
  whitelist_from_dkim  *@paypal.com
  whitelist_from_dkim  *@*.paypal.be
  whitelist_from_dkim  *@*.ebay.com
  whitelist_from_dkim  *@ebay.com
  whitelist_from_dkim  *@*.ebay.co.uk
  whitelist_from_dkim  *@ebay.co.uk
  whitelist_from_dkim  *@ebay.at
  whitelist_from_dkim  *@ebay.ca
  whitelist_from_dkim  *@ebay.de
  whitelist_from_dkim  *@ebay.fr
  whitelist_from_dkim  *@amazon.com
  whitelist_from_dkim  *@cisco.com
  whitelist_from_dkim  *@cern.ch
  whitelist_from_dkim  *@welcome.skype.com
  whitelist_from_dkim  *@skype.net
  whitelist_from_dkim  *@alert.bankofamerica.com
  whitelist_from_dkim  *@bankofamerica.com
  whitelist_from_dkim  *@*.cnn.com
  whitelist_from_dkim  *@cnn.com

  # DKIM-based whitelisting of domains with less then perfect
  # reputation can be given fewer negative score points:
  score USER_IN_DEF_DKIM_WL -1.5
  score ENV_AND_HDR_DKIM_MATCH 0
  def_whitelist_from_dkim  *@google.com
  def_whitelist_from_dkim  *@googlemail.com
  def_whitelist_from_dkim  *@gmail.com
  def_whitelist_from_dkim  *@yahoo.com
  def_whitelist_from_dkim  *@yahoo.com.au
  def_whitelist_from_dkim  *@yahoo.se
  def_whitelist_from_dkim  *@*  googlegroups.com
  def_whitelist_from_dkim  *@*  yahoogroups.com
  def_whitelist_from_dkim  *@*  yahoogroups.co.uk
  def_whitelist_from_dkim  *@*  yahoogroupes.fr
  def_whitelist_from_dkim  *@rogers.com
</pre>

<p>Another suggestions - penalize mail claiming to be from PayPal,
eBay, Yahoo or Gmail but was not signed by their official mailers:</p>

<pre>
  header __L_ML1  Precedence =~ m{\b(list|bulk)\b}i
  header __L_ML2  exists:List-Id
  header __L_ML3  exists:List-Post
  header __L_ML4  exists:Mailing-List
  header __L_ML5  Return-Path:addr =~ m{^([^\@]+-(request|bounces|admin|owner)|owner-[^\@]+)(\@|\z)}mi
  meta   __L_VIA_ML   __L_ML1 || __L_ML2 || __L_ML3 || __L_ML4 || __L_ML5

  header __L_FROM_Y1  From:addr =~ m{[\@.]yahoo\.com$}mi
  header __L_FROM_Y2  From:addr =~ m{\@yahoo\.com\.(ar|br|cn|hk|my|sg)$}mi
  header __L_FROM_Y3  From:addr =~ m{\@yahoo\.co\.(id|in|jp|nz|uk)$}mi
  header __L_FROM_Y4  From:addr =~ m{\@yahoo\.(ca|de|dk|es|fr|gr|ie|it|pl|se)$}mi
  meta   __L_FROM_YAHOO __L_FROM_Y1 || __L_FROM_Y2 || __L_FROM_Y3 || __L_FROM_Y4
  header __L_FROM_GMAIL From:addr =~ m{\@gmail\.com$}mi
  meta     L_NOTVALID_YAHOO  !DKIM_VERIFIED &amp;&amp; __L_FROM_YAHOO &amp;&amp; !__L_VIA_ML
  priority L_NOTVALID_YAHOO  500
  score    L_NOTVALID_YAHOO  2.8
  meta     L_NOTVALID_GMAIL  !DKIM_VERIFIED &amp;&amp; __L_FROM_GMAIL &amp;&amp; !__L_VIA_ML
  priority L_NOTVALID_GMAIL  500
  score    L_NOTVALID_GMAIL  2.8

  header __L_FROM_PAYPAL From:addr =~ /[\@.]paypal\.com$/mi
  header __L_FROM_EBAY   From:addr =~ /[\@.]ebay\.(com|at|be|ca|ch|de|ee|es|fr|hu|ie|in|it|nl|ph|pl|pt|se|co\.(kr|uk)|com\.(au|cn|hk|mx|my|sg))$/mi
  meta     L_NOTVALID_PAY !DKIM_VERIFIED &amp;&amp; (__L_FROM_PAYPAL || __L_FROM_EBAY)
  score    L_NOTVALID_PAY 6

  # accept replies from abuse@yahoo.com even if not dkim/dk-signed:
  whitelist_from_rcvd abuse@yahoo.com          yahoo.com
  whitelist_from_rcvd MAILER-DAEMON@yahoo.com  yahoo.com
</pre>

<p>In amavisd.conf two parallel paths need to be provided,
one receiving on port 10024 and forwarding to 10025,
the other receiving on port 10026 and forwarding to 10027.</p>

<pre>
  $inet_socket_port = [10024,10026];  # listen on two ports
</pre>

<p>The 10024&gt;10025 path will be controlled by a default policy bank,
the other (10026&gt;10027), dedicated to mail intended to be signed,
will use a policy bank (arbitrarily) named ORIGINATING:</p>

<pre>
  $forward_method = 'smtp:[127.0.0.1]:10025';  # MTA with non-signing service
  $notify_method  = 'smtp:[127.0.0.1]:10027';  # MTA with signing service

  # switch policy bank to 'ORIGINATING' for mail received on port 10026:
  $interface_policy{'10026'} = 'ORIGINATING';

  $policy_bank{'ORIGINATING'} = {  # mail originating from our users
    originating =&gt; 1, # indicates our client, introduced in amavisd-new-2.5.0
    #
    # force MTA to convert mail to 7-bit before DKIM signing
    # to avoid later conversions which could destroy signature:
    smtpd_discard_ehlo_keywords =&gt; ['8BITMIME'],
    #
    # forward to a smtpd service providing DKIM signing service:
    forward_method =&gt; 'smtp:[127.0.0.1]:10027',
    #
    # other special treatment of locally originating mail, e.g.:
    spam_admin_maps  =&gt; ["spamalert\@$mydomain"],  # warn of spam from us
    virus_admin_maps =&gt; ["virusalert\@$mydomain"],
    banned_filename_maps =&gt; ['ALT-RULES'],         # more relaxed rules
    spam_quarantine_cutoff_level_maps =&gt; undef,    # quarantine all spam
    spam_dsn_cutoff_level_maps =&gt; undef,
    spam_dsn_cutoff_level_bysender_maps =&gt; # bounce to local senders only
      [ { lc(".$mydomain") =&gt; undef,  '.' =&gt; 15 } ],
  };
</pre>

<p>The <tt>smtpd_discard_ehlo_keywords=&gt;['8BITMIME']</tt> serves to
persuade Postfix to convert mail to 7-bit quoted-printable before submitting
it to content filtering and signing (this configuration variable was
introduced with version 2.4.3 of amavisd-new). Avoiding 8-bit characters
in mail body makes signatures less susceptible to breaking by some
relaying or receiving MTA over which we have no control. Note that
the same effect (making Postfix convert outgoing mail to 7-bits
before DKIM signing) could be achieved by a Postfix setting
<tt>smtp_discard_ehlo_keywords=8bitmime</tt> on a smtp service
feeding mail to be signed to amavisd, but this would require setting
up two such services, one with the option and one without.</p>

<p>Note that 8-bit to 7-bit conversion may break a S/MIME or PGP signature,
so if mail signing is in use, it may not be desirable to let Postfix
do the conversion, and it may be acceptable to take a risk that a remote
MTA will clobber signatures if it decides the mail text is to be converted
to 7-bits QP. The only reliable solution in this case is to configure
MUA clients to stick to 7-bit characters/encodings before generating
S/MIME or PGP signatures.</p>

<p>The following text from the Postfix documentation file MILTER_README
<em>should be disregarded</em> -- amavisd <em>is</em> 8-bit clean,
and we want Postfix to do conversion to 7-bits on the signing
path but not on the other path:
<span style="text-decoration:line-through;font-style:italic">Content 
filters may break domain key etc. signatures. If you use an SMTP-based
content filter, then you should add a line to master.cf with
"-o disable_mime_output_conversion=yes", as described in the
advanced content filter example.</span></p>

<p>While testing how the configured system plays with some mailing lists
(such as <i>postfix-users</i> or SpamAssassin <i>users</i> list), one has
to keep in mind that amavisd-new caches spam checking results of recently
seen message bodies: a mail going out to a mailing list is not yet signed
as it reaches a content filter, but the SpamAssassin verdict is remembered
at that point (claiming the message is not signed). When this message
with unchanged body comes back from a mailing list, this time signed
in the header section by our domain, the signature should prove correct,
yet the cached result from a minute ago still claims the message is not
signed. If this is of concern, one can turn off caching of spam checking
results for ham by setting:  <tt>$spam_check_negative_ttl = 0;</tt></p>


<h3>Some experience with DKIM and DomainKeys</h3>

<p>Recent versions of software components must be used to avoid bugs and
interoperability problems with earlier versions:</p>

<ul>
<li>use Postfix 2.3.12 or later, or 2.4.5 or later, or 2.5 (or later);</li>
<li>amavisd-new 2.4.3 polished some corner issues on modifying mail header
  section when releasing from a quarantine and defanging, and added some
  goodies affecting DKIM and DomainKeys to facilitate integration;
  version 2.5.2 (or later) is recommended;</li>
<li>Mail::DKIM is very solid; use the latest version, currently 0.29;</li>
<li>SpamAssassin 3.2.3 or later;</li>
<li>At the time of writing the current version of dkim-milter is 2.4.0.</li>
</ul>

<p>Instead of signing mail with <i>dkim-milter</i>, the same can be
achieved by using Jason Long's <i>DKIM Proxy</i>, which is a Perl program
calling a Perl module <i>Mail::DKIM</i>, i.e. the same modules as used
by a SpamAssassin DKIM plugin. As the <i>Mail::DKIM</i> turned out to be
a reliable and quite efficient module, this may be a good alternative
to <i>dkim-milter</i> (which is also quite good in recent versions).</p>

<p>The coming version of amavisd-new 2.6.0 will provide a built-in
signing and verification capability by using a module <i>Mail::DKIM</i>
directly, greatly simplifying the setup currently described here.</p>

<p>Under FreeBSD just install mail/dkim-milter from freshly updated ports,
and add the following to <tt>/etc/rc.conf</tt>:</p>

<pre>
  milterdkim_enable="YES"
  milterdkim_uid='dkimfilter'
  milterdkim_domain='example.com'
  milterdkim_profiles="verifier signer"
  milterdkim_verifier_socket='inet:4443@127.0.0.1'
  milterdkim_verifier_flags='-b v -C dns=a,int=a'
  milterdkim_signer_socket='inet:4445@127.0.0.1'
  milterdkim_signer_selector='myselector'
  milterdkim_signer_key='/var/db/domainkeys/myselector.key.pem'
  milterdkim_signer_alg='rsa-sha1'
  milterdkim_signer_flags='-b s -c relaxed/simple -m ORIGINATING -o - -C int=a'
</pre>

<p>Recent startup script supplied by FreeBSD port already ensures that
deamon runs under its dedicated username, supplying a default value for
option -u.</p>

<p>Mail transformations as performed by some mailing lists are probably
the most challenging problem facing DKIM deployment (and to other schemes
as well). Nevertheless, mailing lists can be configured to either avoid
transformations which invalidate mail signatures, or can re-sign fan-out mail.
Examples of mailing lists which work very well with DKIM (and DomainKeys),
preserving existing signatures provided by posters, are the <i>postfix-users</i>
( postfix-users@postfix.org ) and the <i>SpamAssassin users</i> list
( users@spamassassin.apache.org ). Example of re-signing mailing lists
are Yahoo groups. A representative of another type of mailing lists is
Mailman, which often modifies mail body and strips out original signatures,
unless explicitly configured not to.</p>

<p>Several big players are already signing mail from their customers
or employees: Yahoo! (worldwide), Gmail, eBay, Earthlink, google.com,
Amazon, CNN, Skype, Cisco, etc. (not to mention porcupine.org :)</p>

<p>When signatures are missing on mail from domains which are known to be
signing all their mail (yahoo.com, gmail.com), the most common reason is
that a sender submitted his mail through some other provider, but supplied
his Yahoo or gmail e-mail address in the <i>From</i> header field.
Similar to other schemes designed to prevent faking of sending address,
the DKIM (and the DomainKeys) encourages mail submission only through
a domain which is used in the <i>From</i> address - although there are
other possibilities for roaming users in the DKIM system.</p>

<p>People will need to become aware that their best choice is to submit
mail through their native domain to prevent their messages from being
treated as second-class. With a widespread support for authorized mail
submission for roaming users (SASL, TLS) through a mail submission port
(tcp port 587, RFC 4409), supported by practically all modern clients
and mailers, there is no longer any good excuse for submitting mail
through foreign mail submission agents.</p>

<p>Note that some spam is also being signed by DomainKeys or DKIM lately,
which is a good thing -- it indicates the sender owns (or <i>ownz</i>)
a domain they are sending mail from. This either shows sender's sincere
desire of not hiding behind a faked sender mail address (in which
case such mail can be easily filtered if necessary), or they are using
a short-lived temporary domain (perhaps through <i>domain kiting</i>),
which can be counteracted by black lists of few-days old freshly
registered domains (such as <a href="http://support-intelligence.com/dob/"
>http://support-intelligence.com/dob/</a>) or other reputation schemes.
Signing and verifying mail is a good mechanism for companies to reliably
whitelist mail from their partner companies or frequent clients.</p>

<p>Some references:</p>
<ul>
<li><a href="http://www.dkim.org/info/dkim-faq.html">DKIM
  Frequently Asked Questions</a> (at MIPA)</li>
<li><a href="http://www.rfc-editor.org/rfc/rfc4686.txt">RFC 4686</a>:
  Analysis of Threats Motivating DomainKeys Identified Mail (DKIM)</li>
<li><a href="http://www.rfc-editor.org/rfc/rfc4871.txt">RFC 4871</a>:
  DomainKeys Identified Mail (DKIM) Signatures</li>
<li><a href="http://www.rfc-editor.org/rfc/rfc4870.txt">RFC 4870</a>:
  (historical document) Domain-Based Email Authentication Using
  Public Keys Advertised in the DNS (DomainKeys)</li>
<li><a href="http://www.rfc-editor.org/rfc/rfc5016.txt">RFC 5016</a>:
  Requirements for a DomainKeys Identified Mail (DKIM) Signing Practices
  Protocol</li>
<li><a href="http://ietf.org/html.charters/dkim-charter.html">IETF
  charter: Domain Keys Identified Mail (DKIM)</a></li>
<li><a href="http://www.postfix.org/MILTER_README.html">Postfix
  before-queue Milter support</a> (original Postfix documentation)</li>
<li><a href="http://www.dkim.org/deploy/">DKIM Deployment
  Reports</a> (at <i>Mutual Internet Practices Association</i> - MIPA)</li>
<li><a href="http://jason.long.name/dkimproxy/">Mail::DKIM and dkimproxy</a></li>
<li><a href="http://cpan.perl.org/authors/id/J/JA/JASLONG/"
  >Mail::DKIM</a> module download page at CPAN</li>
<li><a href="http://sourceforge.net/projects/dkim-milter/">dkim-milter</a></li>
<li>See also:
<a href="http://www.arschkrebs.de/postfix/postfix_cisco_pix_bugs.shtml"
>Cisco PIX bugs</a> or this
<a href="http://www.heise-online.co.uk/security/Cisco-PIX-obstructs-anti-spam-protocol--/news/93725"
>heise-online article</a> on handling DKIM-signed mail
(the solution is to upgrade PIX to version 7.2(2.19) or 8.0(2.7),
or to disable a <i>smtp protocol fixup</i> (mis)feature</li>
</ul>

<hr />
<p>
<i><a href="http://www.ijs.si/people/mark/">mm</a></i>
<br />Last updated: 2008-02-22
</p>

<p>
<a href="http://validator.w3.org/check?uri=referer"
><img class="noboarder" src="./valid-xhtml10.png" height="31" width="88"
      alt="Valid XHTML 1.0!" /></a>
</p>

</body>
</html>