Very large mail clusters can be built by keeping the Maildirs on fileservers shared by NFS
External interfaces (public) | | | | | +------+ +------+ +------+ +------+ +-------+ | MX | | MX | | POP3 | | POP3 | |webmail| +------+ +------+ +------+ +------+ +-------+ ^ | | | | | NFS +---------+----+-----+----+-----+----------+ private v | | network +-----+ +-----+ | NFS | | NFS | +-----+ +-----+
Notes:
| | | | +------+ +------+ +------+ +------+ | MX | | MX | | POP3 | | POP3 | |proxy | |proxy | |proxy | |proxy | +------+ +------+ +------+ +------+ | | | | SMTP ^ +-----+----+-----+----+-----+----+ v POP3 | | | +------+ +------+ +------+ | mail | | mail | | mail | +------+ +------+ +------+
Backend servers are mailservers, accessed via SMTP and POP3. Incoming mail is delivered to the appropriate server by means of an SMTP proxy (e.g. Exim); users collect their mail via a POP3 proxy, e.g.
This solution avoids the use of NFS, but is somewhat inelegant and is more difficult to manage (the backends need to have information about user accounts, whereas NFS servers are "dumb"). An NFS solution is likely to be more reliable and easier to scale.
How do you spread the load between your various machines?
Just use equal-priority MX records. This also gives you resilience, because the sender will retry if one of the machines is down.
example.com MX 10 mx-1.mail.example.com. MX 10 mx-2.mail.example.com.
You can use round-robin DNS to share the load:
pop3.example.com A 192.0.2.17 A 192.0.2.18 smtp.example.com A 192.0.2.1 A 192.0.2.2
However, in the event of an outage, some customers may find themselves unable to connect (Windows clients tend to 'stick' with the first IP address they get from the DNS). Solutions:
NOTE: By keeping your SMTP (smarthost) service separate from your POP3 service, the SMTP machines can be scaled separately and do not need to be connected to the shared backend at all.
With any clustering solution, each of the servers needs access to information about usernames, passwords and mail directories. This information needs to be consistent across all the boxes, and updated whenever accounts are added/deleted or people change their passwords. In fact, other parts of your ISP probably need the same information (e.g. RADIUS servers)
Essentially there are two main approaches:
In both cases, you will probably want to tie this information into your billing system so that people do not receive service without paying for it. I strongly recommend that you give each customer a unique 'customerID' (not related to their access username) which links the service(s) they take to their billing account. Note that userdb lets you add additional fields, so one of these could be customer ID. Having this information from the start makes it very easy to migrate to a future billing/provisioning system.
NOTE: NFS is an insecure protocol. You are strongly recommended to run NFS on a completely private network, using RFC1918 address space, with a separate NIC. Keep this NFS network completely separate from any office network or other RFC1918 network you have.
# mkdir /mail1 # mount ns.t1.ws.afnog.org:/usr/mail1 /mail1 # mount
By default, NFS servers do not allow accesses from "root" - but you can perform accesses as another user. In our case, this directory is owned by 'exim' so you should be able to create files as this user.
# su exim $ vi /mail1/somefilename $ ls /mail1 $ exit #
# This optimises NFS client performance but is not essential nfs_client_enable="YES"
Note that there is currently no way for a FreeBSD client to request a lock on an NFS server. Hence it is essential only to use mechanisms that don't depend on locking (such as Maildir)
Test exercise:
# pw useradd tevie -d /mail1/12/34/tevie -u 2001 -s /nonexistent # passwd tevie Changing local password for tevie. New password: Retype new password: passwd: updating the database... passwd: done
In /usr/exim/configure:
domainlist local_domains = @ : .......... : cluster.t1.ws.afnog.org
Then hup your Exim daemon, and send a mail to tevie@cluster.t1.ws.afnog.org
This is how ns.t1.ws.afnog.org was configured:
# mkdir /usr/mail1 # chown exim:exim /usr/mail1 # mkdir -p /usr/mail1/12/34/tevie # chown 2001:2001 /usr/mail1/12/34/tevie
/usr/mail1 -network 80.248.72.0 -mask 255.255.255.128
(If you are restricting access with TCP wrappers, you have to add a rule to permit portmap from your NFS clients)
portmap : 80.248.72.0/255.255.255.128 : allow portmap : ALL : deny
portmap_enable="YES" nfs_server_enable="YES"
This ensures the correct daemons are started when the machine is next rebooted.