HTML Email: Whenever Possible, Turn It Off!

Author: Alan G. Isaac
Copyright: http://creativecommons.org/licenses/by-nd/2.0/
First posted:2002 Feb 15
Last modified:2013 May 14
Text version:https://subversion.american.edu/aisaac/notes/htmlmail.rst
HTML version:https://subversion.american.edu/aisaac/notes/htmlmail.htm

Abstract

HTML email can impose costs on the recipient. Therefore HTML mail should be an act between consenting parties. Unless you know that you are writing to someone who does not mind HTML email, send plain text. If you do use HTML mail, be aware of the security dangers.

First consider a matter of email etiquette. There are many people who prefer not to receive HTML email: some find it to be a personal inconvenience, and others must literally pay for the greater bandwidth consumed by HTML email. These considerations become particularly strong on newsgroups and mailing lists (with their great variety of participants from around the world), where the use of HTML email is often considered ignorant or rude. It is therefore a basic courtesy to refrain from sending HTML email except to those who wish to receive it.

Since HTML mail should be an act between consenting parties, we can quickly find three situations in which HTML email is acceptable.

  1. You work in a business environment that accepts HTML email for internal communications.
  2. You and an interlocutor have agreed (perhaps implicitly) that HTML email is acceptable in your communications. (Perhaps your interlocutor implies this by sending you HTML email.)
  3. You have no other way to access the character set you require to communicate with someone using another language, and yet you wish to write that person.

In all other circumstances: TURN IT OFF!

HTML Email May Look Ignorant

The norms of internet communication are that email interaction takes place by means of plain text messages, except in the situations indicated above. Sometimes these norms are explicitly stated (e.g., as posting guidelines for a newsgroup), but most of the time they are implicit. When you use HTML email, you indicate your ignorance of these norms. When you participate in email lists or newsgroups, you will quickly be informed of these norms. At that point the excuse of ignorance disappears.

If your excuse is that you have not learned how to turn off HTML in your email client, this is a different kind of ignorance that you can easily overcome: you can find online help, or you can ask a friend or colleague.

HTML Email May Look Rude

Except in the situations indicated above, HTML email disregards the preferences of your interlocutor. Such disregard is of course rude. But that leaves open the question of why your interlocutor may prefer plain text. Here are a few possibilities.

Many professional communications take place on email lists and in newsgroups. In such settings, HTML email can be especially rude. First it generally violates the norms established for such interactions, and it is always rude to enter a group conversation without respecting its norms. Second, these norms are well founded: they are designed to save the participants time (as discussed above) and money. Many people still pay per minute to download their email over slow modems, especially if the communication is international in scope.

But Where Is It Written ... ?

Dislike of HTML email is so wide-spread and the drawbacks of HTML email are so well-known that it may seem surprising that there is no RFC addressing it directly. (Indeed, that lack was my motivation for writing this document.) The core reason, I surmise, is that the acceptability of HTML email depends on the environment in which it occurs, as discussed above. The arguments are not against HTML email per se, but against sending it to a recipient whom you do not know to consider it acceptable.

Nevertheless, RFC 1855 addresses internet standards for email, and it clearly presumes that plain text is the standard for email. Especially relevant are the request to limit bandwidth use, the suggestion to restrict the character set to ASCII and, related to this, the suggestion to use ASCII symbols to connote emphasis or _underlining_ in your text. (In an appendix, I quote a few pieces of RFC 1855 to make this point. However it should be clear that the arguments supplied above apply even if we disregard this RFC.)

Security Risk

This document focuses on why you should not send HTML mail. However, you should also be very careful about accepting HTML mail.

HTML email contains commands to be executed by your email client: when you read HTML email you let the sender give commands to your computer. Some of these commands can be used maliciously to exploit common security holes. This can expose you to spam, computer viruses, and worms. On Windows machines, if you enable ActiveX, HTML email can directly infect your computer (without your opening an attachment). Bulk emailers embed web bugs to determine whether your email address is valid, making you more vulnerable to spam. [Smith-and-Martin-2001] note that HTML email can also embed JavaScript that will track forwarded HTML email and transmit any added text! This is known as "email wire tapping", and it is a very good reason for businesses to forbid the use of HTML mail. In 2006, the JTF-GNO reported that for security reasons the Department of Defense is blocking HTML email.

If you do decide to read an HTML email, make sure you do not enable ActiveX, and instruct your email client to turn off JavaScript and not to display external images. Also, do not forward the message, because you will forward any email bug or email wiretap to a recipient who may be less security conscious than you are. Good luck!

Efficient Communication

Many people who handle a lot of email assume HTML mail is spam and filter it on that assumption. Some commercial spam filtering schemes also treat HTML mail as more likely to be spam. As these practices becomes increasingly widespread, sending HTML mail becomes increasingly inefficient: your message will be postponed or even deleted because of its format.

Anyone who handles a lot of email should be concerned about efficiency in communication. Even if your own volume of email is small, you should still support efficiency in communication as a courtesy to your interlocutor. Some basic norms of email communication have evolved to promote this efficiency. Quoting practices are among these---especially selective quoting, interlineated responses, and standard quote indicators. These established practices again favor plain text over HTML mail for three reasons: many email clients cannot correctly handle HTML mail when attempting to produce standard quote indicators, plain text editors are more powerful and faster than HTML editors, and standard quote indicators are more likely to be handled intelligently by text editors.

An additional consideration is particularly important for people who handle large quantities of email and, correspondingly, for any newsgroup or email list. HTML mail reduces the useability of old mail archives. Search of text messages is much faster than search of HTML messages. (Similar considerations support selective quoting over indiscriminate quoting, since the latter leads to too many matches during archive searches.)

Selective quoting

Selective quoting is both a courtesy and a communication device. Quote only the text that is relevant to your response. The quotation should be selective and to the point. Editing for selective quoting is much easier in plain text messages. If you communicate in plain text, you can speed up your own response and assist any responding reader of your email.

Be sure to distinguish quotes from other text in your email.

Interlineated responses

When responding to more than one comment in an initial email, standard practice is to place each response after a relevant section of quoted text. This allows the reader to more easily see how the conversation is taking place: the response is adjacent to the text responded to. Editing for interlineated responses is much easier in plain text messages.

Many heavy users of email believe that anything other than interlineated responses constitutes a discourtesy. "Top posting" is especially disliked. Unfortunately some email clients promote top posting and fail to use standard quote indicators.

Standard quote indicators

The most widespread standard is to use a greater-than sign (>) for each level of quotation. For example:

Tip

>> This text was quoted in the email I am responding to.
> This was original text in the email I am responding to.
This is my response to the two quotes.

All modern email clients will correctly nest standard quote indicators in plain text messages. The level of quotation is indicated by the standard quotation scheme (i.e., by the use of >), and generally also by the color of the text, which makes it very easy to see the structure of the message. All modern email clients will colorize plain text in response to the standard quotation scheme (unless the reader prefers to turn this off). This is yet another courtesy to your reader that you can provide, if you stick to plain text and the standard quotation indicator in your email.

Users of webmail and of older email clients cannot always expect the helpful colorizing of text in response to the standard quoting practices. However the quote indicators still provide the same information about the structure of the message.

Note that when more than two people are involved in an email exchange, it is important as well to indicate the author of the quote. Standard practice is to do this immediately above the quoted text.

Tip

> On 31 January 2010, Sam wrote:
>> This text was quoted in the email I am responding to.
On 1 February 2010, Sara wrote:
> This was original text in the email I am responding to.
This is my response to the two quotes.

Proposals for incorporating such information in HTML email are often complicated and there is no widely implemented standard.

Trivial Markup

Two common non-commercial reasons for choosing to send HTML mail are

  1. the desire to include images, and
  2. the desire to add visually formatted emphasis to text.

Images can be included as attachments, and in most cases should be. (Of course the decision to send no images is often a good one too.) Visually emphasized text can be achieved by using a standard and widely accepted trivial markup: asterisks imply emphasized text, underscores imply _underlined text_, slashes imply /italicized text/, capital letters imply BOLD TEXT, quotation marks are used to indicate "quoted text", and paragraphs are separated by a blank line.

Trivial Markup is a very courteous way to markup text. If your interlocutor wishes this markup to affect the color or size of the displayed text, s/he can use an email client that supports visual formatting of trivial markup. Note that this leaves the display decision up to the reader, which is a natural courtesy.

In the rare case that an email needs mark up beyond that offered by the Trivial Markup conventions, consider reStructuredText.

Conclusion

HTML mail can be useful in specific settings. Groups of people can reasonably agree to use HTML for their email communications. However HTML email can impose costs in lost readability and accessibility, reduced convenience, higher bandwidth use, and reduced security. Therefore the use of HTML should be restricted to consenting parties, where it is understood that consent can be given implicitly as discussed above.

Resources

ASCII Ribbon Campaign

Boyd, G., Configuring Mail Clients to Send Plain ASCII Text (E.g., in Outlook you can select the menus Tools/Options/MailFormat and select "Plain Text" for your message format.)

Cottrell, Allin, 1999, Word Processors: Stupid and Inefficient (Last accessed: 2004 Nov 30)

Delorie, D.J., 1999, Why HTML Mail Is Evil

Gibson, Steve, 2004, The GRC Privacy FAQ: What's a Web Bug?

Goldberg, Jeff, 2003, MS-Word Is Not a Document Exchange Format

Hahn, Harley, How to Turn Off HTML in Your Outgoing Mail Messages

Hambridge, S., 1995, RFC 1855: Netiquette Guidelines

Isaac, Alan G., 2003, ASCII and Latin-1 Coded Character Sets

Nolte, Mike, HTML-Mail ist gefährlich und schädlich

Oakes, Chris, Word Docs with Ears? Wired, 31 Aug 2000. Last accessed: 30 Nov 2004.

Resnick, P., 2001, RFC 2822: Internet Message Format

RootsWeb, Problem Solving: Sending Messages in Plain Text

Smith, Richard M., undated, FAQ: Web Bugs Last accessed at the Privacy Foundation on 11 April 2003. (Document missing as of late 2004.)

[Smith-and-Martin-2001]http://www.privacyfoundation.org/privacywatch/report.asp?id=54 Last accessed at the Privacy Foundation on 11 April 2003. (Document missing as of late 2004.)

Tobias, Dan, Body: HTML Mail (Last access: 2007-03-30)

Vermeer, Martin, plaintext: In Praise of Practical Email Hygiene (Difficult to access as of 2004.)

Appendix: Quotes from RFC 1855

This appendix offers some quotes from RFC 1855 that suggest that email is presumed to be plain text. (This is for fun and context rather than for serious argument.)

Use symbols for emphasis. That is what I meant. Use underscores for underlining. _War and Peace_ is my favorite book.

This of course presumes ASCII mail, not HTML mail.

Do not include control characters or non-ASCII attachments in messages unless they are MIME attachments or unless your mailer encodes these.

HTML of course has no such character set restriction and cannot be presumed to.

Limit line length to fewer than 65 characters and end a line with a carriage return.

This of course presumes ASCII mail and is generally violated by HTML mail. The line length restriction may seem dated, but in fact many email clients will wrap at 70 characters and a slightly shorter line-length allows for the standard quoting conventions (e.g., >).

Remember that many people pay for connectivity by the minute, and the longer your message is, the more they pay.

As discussed above, HTML mail is always much longer than the ASCII equivalent (at least for European nations). One reason for this is that a text copy of the mail is always included with HTML email, because some email clients do not read HTML. Another reason is that the HTML codes require bandwidth. Finally, often HTML email will include graphics, which are usually unwanted by the recipient (as well as being a security risk).

'Reasonable' expectations for conduct via e-mail depend on your relationship to a person and the context of the communication. Norms learned in a particular e-mail environment may not apply in general to your e-mail communication with people across the Internet.

It is a norm on most active email lists and on most active newsgroups that only ASCII mail shall be posted.