By Julianne Chatelaine, Trellix Corporation
Reprinted from Usability Interface, Vol 4, No. 3, January 1998
This is the story of two real-life attempts to apply the research described in The Media
Equation (TME):
Reeves, Byron and Nass, Clifford, The Media Equation,. CLSI Publications, Stanford & Cambridge, University Press, 1996
In The Media Equation, Byron Reeves and Clifford Nass suggest that even text-only interfaces are experienced by humans as having some “personality”. The author reports on how she tried to apply this research in editing error messages for two different software products. Her goal was to tune product personality in ways that helped users and their perception of the software, while reducing possible negative effects.
The first set of error message revisions discussed here were completed in 1994-95 at OneSource Information Services, Inc. Although in 1994 The Media Equation had not yet been published, I heard Reeves and Nass present their research as part of the tutorial, “Seductive Interfaces: Satisfying A Mass Audience” at ACM-SIGCHI ’94 with co-authors Barry Linnett, Karen Fries, and Tim Skelly of Microsoft Corporation. Inspiration to try this method again with Trellix Corporation’s first product, Trellix 1.0, came at UPA ’97 where I had the opportunity to
purchase the book and hear Clifford Nass present the group’s research in context.
In The Media Equation, Byron Reeves and Clifford Nass sum up many of their research projects with the sentence, “People respond socially and naturally to media.” As they define media, they include everything from
radio to software. They admit that this conclusion is “counterintuitive.” (TME, page 7)
If you have participated in any group discussions of their findings, you may have heard harsher assessments. TME demands that we critically examine some long-cherished intuitions about ourselves as human beings, and not everyone enjoys that process.
Why is TME interesting to our SIG?
Intuition aside, I find TME fascinating, and of particular interest to usability advocates and technical communicators.
In studying usability, we are interested in reality: what is, rather than what should be. We’ve all had to give up (or help our teams decide to abandon) a long-cherished, well-refined design because the users just didn’t understand it. If you truly believe the users are always right, you will find TME’s descriptions of repeatable experiments easy to grasp.
In addition, TME emphasizes the power of simple text. Many TME experiments were carried out without pictures, icons, or sound, using only text on a black-and-white computer screen, plus graphical buttons to
register users’ choices. Varying that text caused significant differences in how users felt about the software. “Personality can be perceived in the most minimal of places–a simple English sentence.” (Page 89) As technical communicators we already believe text is powerful, but it’s nice to hear this validated by research.
Note that adding multimedia did not significantly change the results of TME experiments: “Voices did not make the interaction any more social than text.” (page 25) These findings implied that noticeable effects were being produced by text alone.
Why start with “error” messages?
I started applying TME findings to software text (resource) strings for two reasons.
First, I’ve spent most of my career in small software companies. My last two development groups didn’t have the funds required to ‘personalize’ their interfaces with animated characters or even multimedia. And the error messages were already scheduled to be reviewed by a writer. If TME’s findings were even partly right, I could potentially produce improvements in the way users perceived the software, with no additional expenditure.
Second, TME suggests that there are often unintended negative effects of poor text in interfaces. Before trying for any improvements, it seemed like a good idea to eliminate any negative effects we were currently causing.
Third, I wanted to follow Nass’ and Reeves’ own example and test the effect of the changes I made. I was not able to do this in the case of the first interface, but I am in the process of testing users’ reactions to messages in the second interface., and the results will be incorporated into future versions. (If you want to assess the “feel” of the messages after the editing pass described below, and you have a 32-bit Windows system, you can download and run a free trial version of Trellix 1.0 from Trellix.
TME findings about media “personality”
I tried to change my programs’ error messages to take advantage of the following TME recommendations and findings.
First of all, Reeves and Nass found it was possible to “suggest” personalities of two basic types, dominant or submissive, by making simple textual changes in an online exercise. Here is an example:
Though the language styles used for the two personalities were different, we kept the information the same. In the experiment, participants were asked to discuss items that might be useful for survival in the desert. For example, in a discussion about sunglasses, the dominant computer would say: “In the desert, the intense sunlight will clearly cause blindness by the second day. Without adequate vision, survival will become almost impossible. The sunglasses are absolutely important.” The submissive computer was more tentative: “In the desert, it seems that the intense sunlight could possibly cause blindness by the end of the second day. Without adequate vision, don’t you think that survival might become more difficult? The sunglasses might be important.” (pages 92-93)
They supported these textual changes by adding a numeric ranking of confidence, either high or low, to the textual discussions just described, and adjusting whether the computer spoke first or waited for
the user to act. However, this excerpt illustrates their approach with text.
Second, the TME studies found that social science research indicating that people prefer personalities like their own, applied to media also. “When the personality of the computer matched theirs [dominant or submissive] …participants were significantly more satisfied…they enjoyed themselves more, had more fun, and thought the whole experience was more interesting. …And it is important to remember that every participant was an experienced computer user who, after the experiment ended, denied any thoughts of a social relationship with a machine.” (page 96)
Psychologists have found that the general population contains equal amounts of the two basic personality types, dominant and submissive. If one does not have the technology to make the software assess each user’s personality and adapt to it (a provocative TME suggestion!), the next best thing is to select one personality and stick with it. “Make personalities strong and reliable. …Even though personality can be assessed with limited information, inconsistencies…diminish the purity of personality, and thereby contribute to confusion, and even dislike.” (page 84)
In other words, users are more comfortable when they can figure out what the software’s personality is – whether they like it or not. “Hydrid personalities, which inevitably emerge from uncoordinated group efforts, can be unpleasant for users. …If people can identify the personality, generally half will like it. If it’s ambiguous, almost no one will.” (page 98)
The published TME findings were underscored by Dr. Nass’ remarks at UPA ’97. He said that when a product’s textual messages were written by a variety of different people, using different styles and degrees of dominance, it made the product seem “psychotic.” A chill crept down my spine, for reasons I will explain shortly.
Case study #1: The software was grovelling
The year I first heard about TME, I was working on an integrated suite of tools with a graphical user interface (GUI). Our tools let business customers work with information from premium financial and reference databases packaged together in an easy-to-use format. The company’s support was first-rate, and we were considered a premium product in several market segments. There was only one problem: our software was grovelling (textually) before the users, in a way that, if it was done by a “person”, would be extremely tiresome.
Most notably, every message of any kind began with “please.” This had begun with a few default messages added by our GUI toolkit, all overly polite, and then the developers conscientiously copied the tone of those messages as the product grew, until the overall feel of the textual messages was extremely cloying.
My colleagues and I used TME research to rewrite all the messages in a neutral and “businesslike” voice. In addition to removing the humble “please,” we also rewrote phrases in second person (using “you”), following current industry standards for online help.
Unfortunately, we didn’t run specifically targeted before and after tests to measure the effects of these changes. However, qualitative usability reviews suggested there was a lighter feel to the interface afterwards and that using it felt like less “work” than before our edits.
Case study #2: A sane, reserved, but helpful interface
My current job involves both usability work and technical writing for a brand-new software application called Trellix. Early pre-release builds of Trellix had messages written by nine different developers, using tones
and styles that varied from bossy/directive to reserved/indirect. In addition, we had purchased specialized software libraries to help Trellix process text, images, and windows, and each of these libraries came with
its own (separate and further differentiated) messages. The resulting mix did (I realized with a chill as Dr. Nass spoke) seem somewhat “psychotic.”
TME indicates that these impressions of interface ‘personality’ are formed by the way, as users are engaged in their overt tasks. The Trellix designers hoped that users would be able to use the product in a highly productive, creative “flow” state; my guess was that in the presence of a potential psychotic, it would be difficult for them to relax sufficiently to do this.
One key implication of TME is the point that has also been made by Ursula K. LeGuin, Suzette Haden Elgin, and many others, that all native speakers have sufficient information to analyze dialogues and infer personality from them. Our team could use what we already knew to protect our software from users’ hatred and scorn, and maybe even make it more effective (if relaxation is a prerequisite for flow, or if the users came to “trust” the software and hang in longer when a problem was encountered).
With two other experienced writers, Molly Oldfield Yen and Jim Lindsay, I developed a texual “personality” and “character” for Trellix.
Here are the key elements of its personality:
- Doesn’t put itself forward; speaks only when spoken to
- When it finally speaks, it’s firm and authoritative
- Doesn’t brag
- Doesn’t grovel either – doesn’t even say Please, just states facts
- Uses language and constructions that are neutral OR, if a
choice must be made, that shade slightly towards dominant
And it has a character too
- Tells the truth
- Is consistent
- Is empathetic in a businesslike way – speaks to the users’ point of view or about their tasks, not system tasks/concerns
- When a serious problem occurs, is polite in giving a full explanation of how to remedy a serious condition, because
“…politeness takes time. In real life, most of us would choose politeness over brevity, even at work, and even in our most productive moments.” (page 30)
We found this a good personality for software aimed at business users, and tried to make Trellix messages reflect “a reliable, effective, genderless colleague, maybe a bit shy, who gets the job done smoothly and without offending most users.” One thing that helped was that as we gradually refined the product, the developers removed many messages completely, making it less possible to perform actions that resulted in error conditions. (The entire Trellix interface, which is largely wordless, also has personality given it by its design team; this topic is covered in the “advanced” section of TME, but it is outside the scope of this article.)
Examples from the second case study
A few examples cannot give the effect of the scope of the project, but here are some message pairs before and after our message rewrite, showing how we made a disparate group of messages more similar in tone.
The first two BEFORE examples show too-technical language and a system viewpoint; their AFTER versions changed quite a bit.
BEFORE – “Failed to create object.”
AFTER – “Trellix is experiencing an internal problem that is unrelated to your work.” [We then went on to state the remedy.]
BEFORE – “Create link cancelled.”
AFTER – “You have stopped linking.”
The next example was not too bad BEFORE, but we added more friendliness and clarity.
BEFORE – “The minimum dimension must not be greater than the ideal dimension.”
AFTER – “You have entered sizes that do not work together. The ideal must be greater than the minimum.”
The final example was too verbose, with a system viewpoint, but it did have the right idea in giving instructions. We changed the viewpoint and made the instructions shorter and clearer. (“Sequence” is a word from the
interface.)
BEFORE – “This is an automatically-generated list and you can’t edit between items in the list. Click to edit the contents of the list or click outside the list to edit text.”
AFTER – “You are editing a list created from a sequence. It is not possible to type in between the links or fields. You can edit any link’s contents by clicking it.”
Developers also have personality and character
During this second rewrite I inadvertently caused a new problem. The first message rewrite (case study #1) had gone swimmingly, but it extended over several months and involved long meetings with each developer. During case study #2 I had only a few days to complete the rewrite, and my consultation with developers was limited to messages that were obviously (to me) ambiguous.
One of my least favorite messages came from one of the packages we purchased and incorporated within Trellix. To avoid embarrassing the package builders, who perhaps assumed we would rewrite their messages anyway, I omit their name and area of expertise.
BEFORE – “<Subsystem Name>: Unknown Error.”
AFTER – “Trellix is experiencing problems with its management of <noun>s.”
I thought (and still think) that from the user point of view this was a fine rewrite. However, the developers who were involved in incorporating this subsystem, whom I had not consulted beforehand, took the AFTER message as an aspersion cast on their work. (Rightly, the entire team felt an intense pride of creation.) Jim Linday helped them craft a third version that would be helpful to users while honoring the quality of the software:
AFTER(2) – “Trellix cannot load this <noun>, because…”
I took away a important lesson about not rushing the rewrites!
Future work
I appreciate the way The Media Equation presents experimental findings that can immediately be applied in an industry setting, and suggests ways by which their effectiveness can be verified. As described above, I am in
the process of testing users’ reactions to Trellix version 1.0, and the results of explicit tests of the messages will be incorporated into future versions. (If you want to assess the “feel” of the messages we finally chose, and you have a 32-bit Windows system, you can download and run a free trial version from Trellix. I welcome any comments you may have!)
Another TME design suggestion is to leverage personality effects to make online help seem more effective. If users are already angry at an application, Nass and Reeves argue, the last thing they want is to interact with something that “seems like” what they’re angry at. Therefore, TME suggests that there is a role for some online help that is NOT integrated into the application, for users who would like to take a break from the app and interact with something “different.” This is not to say that help-in-the-app is bad, but to say that TME suggests that a separate help system still has value. Since most Windows applications now have two kinds of help, some integrated into the application and some in an external file, it would be interesting to test the “default” versions
against two others with different types of help selectively disabled.
Julianne Chatelain (julianne@trellix.com) has worked with online, interactive information since 1979. She can also be reached at her personal website, http://world.std.com/~jchat/.