














Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
There was one thing that most of my advisors agreed on; the writing of such an essay is bound to be a thankless task. Advisor 1: “By the time a mathematician.
Typology: Lecture notes
1 / 22
This page cannot be seen from the preview
Don't miss anything!















PAUL R. HALMOS
2 PAUL R. HALMOS
ones who are born with it are not usually born with full knowledge of all the tricks of the trade. A few essays such as this may serve to “remind” (in the sense of Plato) the ones who want to be and are destined to be the expositors of the future of the techniques found useful by the expositors of the past. The basic problem in writing mathematics is the same as in writing biology, writing a novel, or writing directions for assembling a harpsichord: the problem is to communicate an idea. To do so, and to do it clearly, you must have something to say, and you must have someone to say it to, you must organize what you want to say, and you must arrange it in the order you want it said in, you must write it, rewrite it, and re-rewrite it several times, and you must be willing to think hard about and work hard on mechanical details such as diction, notation, and punctuation. That’s all there is to it.
4 PAUL R. HALMOS
and hit something else, than to have an aim that is too inclusive or too vaguely specified and have no chance of hitting anything. Get ready, aim, and fire, and hope that you’ll hit a target: the target you were aiming at, for choice, but some target in preference to none.
HOW TO WRITE MATHEMATICS 5
impossible to patch up, and after that chaos rapidly set in. The organization of the book was tight; things were there because they were needed; the presentation had the kind of coherence which makes for ease in reading and understanding. At the same time the wires that were holding it all together were not obtrusive; they became visible only when a part of the structure was tampered with. Even the least organized authors make a coarse and perhaps unwritten outline; the subject itself is, after all, a one-concept outline of the book. If you know that you are writing about measure theory, then you have a two-word outline, and that’s something. A tentative chapter outline is something better. It might go like this: I’ll tell them about sets, and then measures, and then functions, and then integrals. At this stage you’ll want to make some decisions, which, however, may have to be rescinded later; you may for instance decide to leave probability out, but put Haar measure in. There is a sense in which the preparation of an outline can take years, or, at the very least, many weeks. For me there is usually a long time between the first joyful moment when I conceive the idea of writing a book and the first painful moment when I sit down and begin to do so. In the interim, while I continue my daily bread and butter work, I daydream about the new project, and, as ideas occur to me about it, I jot them down on loose slips of paper and put them helter-skelter in a folder. An “idea” in this sense may be a field of mathematics I feel should be included, or it may be an item of notation; it may be a proof, it may be an aptly descriptive word, or it may be a witticism that, I hope, will not fall flat but will enliven, emphasize, and exemplify what I want to say. When the painful moment finally arrives, I have the folder at least; playing solitaire with slips of paper can be a big help in preparing the outline. In the organization of a piece of writing, the question of what to put in is hardly more important than what to leave out; too much detail can be as discouraging as none. The last dotting of the last i, in the manner of the old-fashioned Cours d’Analyse in general and Bourbaki in particular, gives satisfaction to the author who understands it anyway and to the helplessly weak student who never will; for most serious-minded readers it is worse than useless. The heart of mathematics consists of concrete examples and concrete problems. Big general theories are usu- ally afterthoughts based on small but profound insights; the insights themselves come from concrete special cases. The moral is that it’s best to organize your work around the central, crucial examples and counterexamples. The observation that a proof proves something a little more general than it was invented for can frequently be left to the reader. Where the reader needs experienced guidance is in the discovery of the things the proof does not prove; what are the appropriate counterexamples and where do we go from here?
HOW TO WRITE MATHEMATICS 7
etc., etc., etc. It’s an obvious idea, and frequently an unavoidable one, but it may help a future author to know in advance what he’ll run into, and it may help him to know that the same phenomenon will occur not only for chapters, but for sections, for paragraphs, for sentences, and even for words. The first step in the process of writing, rewriting, and re-rewriting, is writing. Given the subject, the audience, and the outline (and, don’t forget, the alphabet), start writing, and let nothing stop you. There is no better incentive for writing a good book than a bad book. Once you have a first draft in hand, spiral-written, based on a subject, aimed at an audience and backed by as detailed an outline as you could scrape together, then your book is more than half done. The spiral plan accounts for most of the rewriting and re-rewriting that a book involves (most, but not all). In the first draft of each chapter I recommend that you spill your heart, write quickly, violate all rules, write with hate or with pride, be snide, confused, be “funny” if you must, be unclear, be ungrammatical — just keep on writing. When you come to rewrite, however, and however often that may be necessary, do not edit but rewrite. It is tempting to use a red pencil to indicate insertions, deletions, and permutations, but in my experience it leads to catastrophic blunders. Against human impatience, and against the all too human partiality everyone feels toward his own words, a red pencil is much too feeble a weapon. You are faced with a first draft that any reader except yourself would find all but unbearable; you must be merciless about changes of all kinds, and, especially, about wholesale omissions. Rewrite means write again — every word. I do not literally mean that, in a 10-chapter book, Chapter 1 should be written ten times, but I do mean something like three or four. The chances are that Chapter 1 should be re-written, literally, as soon as Chapter 2 is finished, and, very likely, at least once again, somewhere after Chapter 4. With luck you’ll have to write Chapter 9 only once. The description of my own practice might indicate the total amount of rewriting that I am talking about. After a spiral-written first draft I usually rewrite the whole book, and then add the mechanical but indispensable reader’s aids (such as list of prerequisites, preface, index, and table of contents). Next, I rewrite again, this time on the typewriter, or, in any event, so neatly and beautifully that a mathematically untrained typist can use this version (the third in some sense) to prepare the “final” typescript with no trouble. The rewriting in this third version is minimal; it is usually confined to changes that affect one word only, or, in the worst case, one sentence. The third version is the first that the others see. I ask friends to read it, my wife reads it, my students may read parts of it, and, best of all, an expert junior-grade, respectably paid to do good job, reads it and is encouraged not to be polite in his criticisms. The changes that become necessary in the third version can, with good luck, be effected with a red pencil; with bad luck they will cause one third of the pages to be retyped. The “final” typescript is based on the edited third version, and, once it exists, it is read, reread, proofread, and reproofread. Approximately tow years after it was started (two working years, which may be much more than two calendar years) the book is sent to the publisher. Then begins another kind of labor pain, but that is another story. Archimedes taught us that a small quantity added to itself often enough becomes a large quantity (or, in proverbial terms, every little bit helps). When it comes to accomplishing the bulk of the world’s work, and, in particular, when it comes to
8 PAUL R. HALMOS
writing a book, I believe that the converse of Archimedes’ teaching is also true: the only way to write a large book is to keep writing a small bit of it, steadily every day, with no exception, with no holiday. A good technique, to help the steadiness of your rate of production, is to stop each day by priming the pump for the next day. What will you begin with tomorrow? What is the content of the next section to be; what is its title? (I recommend that you find a possible short title for each section, before of after it’s written, even if you don’t plan to print section titles. The purpose is to test how well the section is planned: if you cannot find a title, the reason may be that the section doesn’t have a single unified subject.) Sometimes I write tomorrow’s first sentence today; some authors begin today by revising and rewriting the last page or so of yesterday’s work. In any case, end each work session on an up-beat; give your subconscious something solid to feed on between sessions. It’s surprising how well you can fool yourself that way; the pump-priming technique is enough to overcome the natural human inertia against creative work.
10 PAUL R. HALMOS
Good English style implies correct grammar, correct choice of words, correct punctuation, and, perhaps above all, common sense. There is a difference between “that” and “which”, and “less” and “fewer” are not the same, and a good math- ematical author must know such things. The reader may not be able to define the difference, but a hundred pages of colloquial misusage, or worse, has a cumu- lative abrasive effect that the author surely does not want to produce. Fowler [4], Roger [8], and Webster [10] are next to Dunford-Schwartz on my desk; they belong in a similar position on every author’s desk. It is unlikely that a single missing comma will convert a correct proof into a wrong one, but consistent mistreatment of such small things has large effects. The English language can be a beautiful and powerful instrument for interesting, clear, and completely precise information, and I have faith that the same is true for French or Japanese or Russian. It is just as important for an expositor to familiarize himself with that instrument as for a surgeon to know his tools. Euclid can be explained in bad grammar and bad diction, and a vermiform appendix can be removed with a rusty pocket knife, but the victim, even if he is unconscious of the reason for his discomfort, would surely prefer better treatment that that. All mathematicians, even very young students very near the beginning of their mathematical learning, know that mathematics has a language of its own (in fact it is one), and an author must have thorough mastery of the grammar and vocabulary of that language as well as of the vernacular. There is no Berlitz course for the language of mathematics; apparently the only way to learn it is to live with it for years. What follows is not, it cannot be, a mathematical analogue of Fowler, Roget, and Webster, but it may perhaps serve to indicate a dozen or two of the thousands of items that those analogues would contain.
HOW TO WRITE MATHEMATICS 11
you; for him the non-implication is, quite likely, unsupported. Is it obvious? (Say so.) Will a counterexample be supplied later? (Promise it now.) Is it a standard but for present purposes irrelevant part of the literature? (Give a reference.) Or, horrible dictu, do you merely mean that you have tried to derive q from p, you failed, and you don’t in fact know whether p implies q? (Confess immediately!) In any event: take the reader into your confidence. There is nothing wrong with the often derided “obvious” and “easy to see”, but there are certain minimal rules to their use. Surely when you wrote that something was obvious, you thought it was. When, a month, or two months, or six months later, you picked up the manuscript and re-read it, did you still think that something was obvious? (A few months’ ripening always improves manuscripts.) When you explained it to a friend, or to a seminar, was the something at issue accepted as obvious? (Or did someone question it and subside, muttering, when you reassured him? Did your assurance consist of demonstration or intimidation?) The obvious answers to these rhetorical questions area among the rules that should control the use of “obvious”. There is another rule, the major one, and everybody knows it, the one whose violation is the most frequent source of mathematical error: make sure that the “obvious” is true. It should go without saying that you are not setting out to hide facts from the reader; you are writing to uncover them. What I am saying now is that you should not hide the status of your statements and your attitude toward them either. Whenever you tell him something, tell him where it stands: this has been proved, that hasn’t, this will be proved, that won’t. Emphasize the important and minimize the trivial. There are many good reasons for making obvious statements every now and then; the reason for saying that they are obvious is to put them in proper perspective for the uninitiate. Even if your saying so makes an occasional reader angry at you, a good purpose is served by your telling him how you view the matter. But, of course, you must obey the rules. Don’t let the reader down; he wants to believe in you. Pretentiousness, bluff, and concealment may not get caught out immediately, but most readers will soon sense that there is something wrong, and they will blame neither the facts nor themselves, but, quite properly, the author. Complete honesty makes for greatest clarity.
HOW TO WRITE MATHEMATICS 13
such as mathematics, is the word-for-word repetition of a phrase, or even many phrases, with the purpose of emphasizing a slight change in a neighboring phrase. If you have defined something, or stated something, or proved something in Chapter 1, and if in Chapter 2 you want to treat a parallel theory or a more general one, it is a big help to the reader if you use the same words in the same order for as long as possible, and then, with a proper roll of drums, emphasize the difference. The roll of drums is important. It is not enough to list six adjectives in one definition, and re-list five of them, with a diminished sixth, in the second. That’s the thing to do, but what helps is to say, in addition: “Note that the first five conditions in the definitions of p and q are the same; what makes them different is the weakening of the sixth.” Often in order to be able to make such an emphasis in Chapter 2 you’ll have to go back to Chapter 1 and rewrite what you thought you had already written well enough, but this time so that its parallelism with the relevant part of Chapter 2 is brought out by the repetition device. This is another illustration of why the spiral plan of writing is unavoidable, and it is another aspect of what I call the organization of the material. The preceding paragraphs describe an important kind of mathematical repeti- tion, the good kind; there are two other kinds, which are bad. One sense in which repetition is frequently regarded as a device of good teaching is that the oftener you say the same thing, in exactly the same words, or else with slight differences each time, the more likely you are to drive the point home. I disagree. The second time you say something, even the vaguest reader will dimly recall that there was a first time, and he’ll wonder if what he is now learning is exactly the same as what he should have learned before, or just similar but different. (If you tell him “I am now saying exactly what I fist said on p. 3”, that helps.) Even the dimmest such wonder is bad. Anything is bad that unnecessarily frightens, irrelevantly amuses, or in any other way distracts. (Unintended double meanings are the woe of many an author’s life.) Besides, good organization, and, in particular, the spiral plan of organization discussed before is a substitute for repetition, a substitute that works much better. Another sense in which repetition is bad is summed up in the short and only partially inaccurate precept: never repeat a proof. If several steps in the proof of Theorem 2 bear a very close resemblance to parts of the Theorem 1, that’s signal that something may be less than completely understood. Other symptoms of the same disease are: “by the same technique (or method, or device, or trick) as in the proof of Theorem 1...”, or, brutally, “see the proof of Theorem 1”. When that happens the chances are very good that there is a lemma that is worth finding, formulating, and proving, a lemma which both Theorem 1 and Theorem 2 are more easily and more clearly deduced.
14 PAUL R. HALMOS
Since the best expository style is the least obtrusive one, I tend nowadays to prefer the neutral approach. That does not mean using “one” often, or ever; sen- tences like “one has thus proved that...” are awful. It does mean the complete avoidance or first person pronouns in either singular or plural. “Since p, it follows that q.” “This implies p.” “An application of p to q yields r.” Most (all ?) math- ematical writing is (should be?) factual; simple declarative sentences are the best for communicating facts. A frequently effective and time-saving device is the use of the imperative. “To find P , multiply q by r.” “Given p, put q equal to r.” (Two digressions about “given”. (1) Do not use it when it means nothing. Example: “For any given p there is a q.” (2) Remember that it comes from an active verb and resist the temptation to leave it dangling. Example: Not “Given p, there is a q”, but “Given p, find q”.) There is nothing wrong with the editorial “we”, but if you like it, do not misuse it. Let “we” mean “the author and the reader” (or “the lecturer and the audience”). Thus, it is fine to say “Using Lemma 2 we can generalize Theorem 1”, or “Lemma 3 gives us a technique for proving Theorem 4”. It is not good to say “Our work on this result was done in 1969” (unless the voice is that of two authors, or more, speaking in unison), and “We thank our wife for her help with the typing” is always bad. The use of “I”, and especially its overuse, sometimes has a repellent effect, as arrogance or ex-cathedra preaching, and, for that reason, I like to avoid it whenever possible. In short notes, obviously in personal historical remarks, and, perhaps, in essays such as this, it has it place.
16 PAUL R. HALMOS
nothing to do with the firstness of the first set, the secondness of the second, and so on; the sequence is irrelevant. The correct statement is that “the union of a countable set of measurable sets is measurable” (or, if a different emphasis is wanted, “the union of a countably infinite set of measurable sets is measurable”). The theorem that “the limit of a sequence of measurable functions is measurable” is a very different thing; there “sequence” is correctly used. If a reader knows what a sequence is, if he feels the definition in his bones, then the misuse of the word will distract him and slow his reading down, if ever so slightly; if he doesn’t really know, then the misuse will seriously postpone his ultimate understanding. “Contain” and “include” are almost always used as synonyms, often by the same people who carefully coach their students that ∈ and ⊂ are not the same thing at all. It is extremely unlikely that the interchangeable use of contain and include will lead to confusion. Still, some years ago I started an experiment, and I am still trying it: I have systematically and always, in spoken word and written, used “contain” for ∈ and “include” for ⊂. I don’t say that I have proved anything by this, but I can report that (a) it is very easy to get used to, (b) it does no harm whatever, and (c) I don’t think that anybody ever noticed it. I suspect, but that is not likely to be provable, that this kind of terminological consistency (with no fuss made about it) might nevertheless contribute to the reader’s (and listener’s) comfort. Consistency, by the way, is a major virtue and its opposite is a cardinal sin in exposition. Consistency is important in language, in notation, in references, in typography—it is important everywhere, and its absence can cause anything from mild irritation to severe misinformation. My advice about the use of words can be summed up as follows. (1) Avoid technical terms, and especially the creation of new ones, whenever possible. (2) Think hard about the new ones that you must create; consult Roget; and make them as appropriate as possible. (3) Use the old ones correctly and consistently, but with a minimum of obtrusive pedantry.
HOW TO WRITE MATHEMATICS 17
the proof will take a half line longer; it will have to begin with something like “Write ρ = limn α^1 n/n .” The repetition (of “limn α^1 n/n ”) is worth the trouble; both statement and proof read more easily and more naturally. A showy way to say “use no superfluous letters” is to say “use no letter only once”. What I am referring to here is what logicians would express by saying “leave no variable free”. In the example above, the one about continuous functions, “f ” was a free variable. The best way to eliminate is to convert it to free from bound. Most mathematicians would do that by saying “If f is a real-valued continuous function on a compact space, then f is bounded.” Some logicians would insist on pointing out that “f ” is still free in the new sentence (twice), and technically they would be right. To make it bound, it would be necessary to insert “for all f ” at some grammatically appropriate point, but the customary way mathematicians handle the problem is to refer (tacitly) to the (tacit) convention that every sentence preceded by all the universal quantifiers that are needed to convert all its variables into bound ones. The rule of never leaving a free variable in a sentence, like many of the rules I am stating, is sometimes better to break than to obey. The sentence, after all, is an arbitrary unit, and if you want a free “f ” dangling in one sentence so that you may refer to it in a later sentence in, say, the same paragraph, I don’t think you should necessarily be drummed out of the regiment. The rule is essentially sound, just the same, and while it may be bent sometimes, it does not deserve to be shattered into smithereens. There are other symbolic logical hairs that can lead to obfuscation, or, at best, temporary bewilderment, unless they are carefully split. Suppose, for an example, that somewhere you have displayed the relation
0
|f (x)|^2 dx < ∞
as, say, a theorem proved about some particular f. If, later, you run across another function g with what looks like the same property, you should resist the temptation to say “g also satisfies (1)”. That’s logical and alphabetical nonsense. Say instead “(1) remains satisfied if f is replaced by g”, or, better, give (1) a name (in this case it has a customary one) and say “g also belongs to L^2 (0, 1)”. What about “inequality (*)”, or “equation (7)”, or “formula (iii)”; should all displays be labelled or numbered? My answer is no. Reason: just as you shouldn’t mention irrelevant assumptions or name irrelevant concepts, you also shouldn’t attach irrelevant labels. Some small part of the reader’s attention is attracted to the label, and some small part of his mind will wonder why the label is there. If there is a reason, then the wonder serves a healthy purpose by way of preparation, with no fuss, for a future reference to the same idea; if there is no reason, then the attention and the wonder are wasted. It’s good to be stingy in the use of labels, but parsimony also can be carried to extremes. I do not recommend that you do what Dickson once did [2]. On p. 89 he says: “Then ... we have (1) ...”—but p. 89 is the beginning of a new chapter, and happens to contain no display at all, let alone one bearing the label (1). The display labelled (1) occurs on p. 90, overleaf, and I never thought of looking for it there. That trick gave me a helpless and bewildered five minutes. When I finally saw the light, I felt both stupid and cheated, and I have never forgiven Dickson.
HOW TO WRITE MATHEMATICS 19
we have x ∈ B”; all dissonance and all even momentary ambiguity is avoided. The same is true for “⊂” even though the verbal translation is longer, and even more true for “≤”. A sentence such as “Whenever a positive number is ≤ 3, its square is ≤ 9” is ugly. Not only paragraphs, sentences, words, letters, and mathematical symbols, but even the innocent looking symbols of standard prose can be the source of blemishes and misunderstanding; I refer to punctuation marks. A couple of examples will suffice. First: an equation, or inequality, or inclusion, or any other mathematical clause is, in its informative content, equivalent to a clause in ordinary language, and, therefore, it demands just as much to be separated from its neighbors. In other words: punctuate symbolic sentences just as would verbal ones. Second: don’t overwork a small punctuation mark such as a period or a comma. They are easy for the reader to overlook, and the oversight causes backtracking, confusion, delay. Example: “Assume that a ∈ X. X belongs to the class C,... ”. The period between the two X’s is overworked, and so is this one: “Assume that X vanishes. X belongs to the class C,... ”. A good general rule is: never start a sentence with a symbol. If you insist on starting the sentence with a mention of the thing the symbol denotes, put the appropriate word in apposition, thus: “The set X belongs to the class C,... ”. The overworked period is no worse than the overworked comma. Not “For in- vertible X, X∗^ also is invertible”, but “For invertible X, the adjoint X∗^ also is invertible”. Similarly, not “Since p 6 = 0, p ∈ U ”, but “Since p 6 = 0, it follows that p ∈ U ”. Even the ordinary “If you don’t like it, lump it” (or, rather, its mathe- matical relatives) is harder to digest that the stuffy-sounding “If you don’t like it, then lump it”; I recommend “then” with “if” in all mathematical contexts. The presence of “then”can never confuse; its absence can. A final technicality that can serve as an expository aid, and should be mentioned here, is in a sense smaller than even the punctuation marks, it is conspicuous aspect of the printed page. What I am talking about is the layout, the architecture, the appearance of the page itself, of all the pages. Experience with writing, or perhaps even with fully conscious and critical reading, should give you a feeling for how what you are now writing will look when it’s printed. If it looks like solid prose, it will have a forbidding, sermony aspect; if it looks like computational hash, with a page full of symbols, it will have a frightening, complicated aspect. The golden mean is golden. Break it up, but not too small; use prose, but not too much. Intersperse enough displays to give the eye a chance to help the brain; use symbols, but in the middle of enough prose to keep the mind from drowning in a morass of suffixes.
20 PAUL R. HALMOS
difficulties of oral presentation in mind. The reader of a book can let his attention wander, and later, when he decides to, he can pick up the thread, with nothing lost except his own time; a member of a lecture audience cannot do that. The reader can try to prove your theorems for himself, and use your exposition as a check on his work; the hearer cannot do that. The reader’s attention span is short enough; the hearer’s is much shorter. If computations are unavoidable, a reader can be subjected to them; a hearer must never be. Half the art of good writing is the art of omission; in speaking, the art of omission is nine-tenths of the trick. These differences are not large. To be sure, even a good expository paper, read out loud, would make an awful lecture—but not worse than some I have heard. The appearance of the printed page is replaced, for a lecture, by the appearance of the blackboard, and the author’s imagined audience is replaced for the lecturer by live people; these are big differences. As for the blackboard: it provides the opportunity to make something grow and come alive in a way that is not possible with the printed page. (Lecturers who prepare a blackboard, cramming it full before they start speaking, are unwise and unkind to audiences.) As for live people: they provide an immediate feedback that every author dreams about but can never have. The basic problem of all expository communication are the same; they are the ones I have been describing in this essay. Content, aim and organization, plus the vitally important details of grammar, diction, and notation—they, not showman- ship, are the essential ingredients of good lectures, as well good books.