Effects of Peer Review and Editing on the Readability
of Articles Published in Annals of Internal Medicine
(JAMA. 1994;272:119-121)
John C. Roberts, MD; Robert H. Fletcher, MD; Suzanne W.
Fletcher, MD
Objective.--To measure the effect of the peer review
and editorial processes on the readability of original articles.
Design.--Comparison of manuscripts before and after the peer
review and editorial processes.
Setting.--Annals of Internal Medicine between March 1
and November 30, 1992.
Manuscripts.--One hundred one consecutive manuscripts
reporting original research.
Measurements.--Assessment of readability by means of two
previously validated indexes: the Gunning fog index (units of
readability in the fog index roughly correlate to years of education)
and the Flesch reading ease score. Each manuscript was analyzed for
readability and length on receipt and after it had passed through the
peer review and editorial processes. Text and abstracts were analyzed
similarly but separately. Mean readability scores were compared by
two-tailed t tests for paired observations.
Results.--Mean (+/-SD) initial readability scores of
manuscripts and abstracts by the Gunning fog index were 17.16+/-1.55
and 16.65+/- 2.80, respectively. At publication, scores were
16.85+/-1.42 and 15.64+/- 2.42 (P=.0005 and
P<.0001 for before-after differences, respectively). By
comparison, studies of other print media showed scores of about 11 for
the New York Times editorial page and about 18 for a typical
legal contract. Similar changes were found for the Flesch scores. The
median length of the manuscripts increased by 2.6% and that of the
abstracts by 4.2% during the processes.
Conclusions.--The peer review and editorial processes
slightly improved the readability of original articles and their
abstracts, but both remained difficult to read at publication. Better
readability scores may improve readership.
(JAMA. 1994;272:119-121)
PEER-REVIEWED general medical journals are
intended mainly for use by clinicians, who must read and understand
journal articles if they are to use the information to improve the care
of patients.
Clinicians who read reports of original research must overcome many
obstacles.[1] With increasing specialization in medicine and research, it is difficult for researchers to communicate with a broad,
clinical audience. The concepts and language of modern clinical
research may be unfamiliar. Also, articles may not be directly relevant
to the reader's practice. In any case, readership of even the most
respected journals is not high--20% to 40% of subscribers--and may be
declining.[2]
One barrier to reading that is potentially improvable is the
readability of articles. We therefore studied the readability of
original research articles published in a large, peer-reviewed clinical
medical journal, Annals of Internal Medicine, to answer two
main questions. First, how readable are the articles submitted to
Annals? Second, do peer review and editing improve the
readability of these articles?
MATERIALS AND METHODS
This study was carried out at Annals of Internal Medicine, a
semimonthly clinical journal published by the American College of
Physicians. Its circulation is approximately 100,000, and it
receives about 2400 manuscripts a year, of which it publishes about
16%, all original research articles.
The peer review and editorial processes of Annals are similar
to those of most major journals. By the time a manuscript reaches
publication, it has been critiqued by at least one editor-in-chief, a
deputy editor, at least one associate editor, at least two reviewers, a
statistician, and at least one copy editor. Authors also revise their
manuscripts at least once in the process.
Readability is defined as "the ease of understanding or comprehension
due to the style of writing."[3] Quantitative measurements
of readability began in the 1920s, and readability formulas have been
applied to newspapers, magazines, textbooks, grant applications, and
other printed media.[3] [4]
The most commonly used and well-validated readability formulas
are those of Gunning[5] and Flesch[6] (Table 1). Both indexes are based on word and sentence
length. The Gunning fog index[5] is calculated as follows:
reading grade level=0.4x(mean sentence length+ percentage of words
more than two syllables). The Gunning fog index is scaled to reflect
the years of education of a reader who can "easily read and
comprehend the piece of writing." The Flesch reading ease
index[6] is calculated as follows: reading ease
score=206.835-[0.846x(number of syllables per 100
words)]-[1.015x(average number of words per sentence)]. The
Flesch score uses a 100-point scale, with 100 representing easiest
readability and 0, most difficult.
Although neither index is an ideal measure of readability,
comprehension, or readership, both have been studied many times for
reliability and validity (using measures of reading speed, expert
judgment, readership, and comprehension, with correlations ranging from
.62 to .90).[3,4] [7] Little research has been done regarding the validity of readability indexes in technical writing outside the
field of psychology.[4]
From March 1, 1992, to March 1, 1993, for all manuscripts of original
research ultimately published in the journal, computer disks of the
original submitted versions were obtained from the authors. After the
manuscripts passed through the entire peer review and editorial
processes, the published versions also were entered on computer disks.
Original and final versions were matched. Although associate editors,
copy editors, reviewers, and authors were aware that research studies
were being carried out, only the editors-in-chief were aware of the
goals of this study.
Using a computer program (RightWriter 4.0),[8] we compared readability scores of the original and final versions. In addition, all
versions were analyzed for average sentence and word lengths. Abstracts
and manuscripts were analyzed separately, and all bibliographies and
appendixes were removed before analysis. Data were collected on
Epi-Info 5.0 software[9] and were analyzed by
means of descriptive statistics and paired t tests on SAS, release
6.03.[10] Data are reported as means (+/-SD) except for manuscript lengths, which are reported as medians (range), since the
lengths were not normally distributed. The statistical significance of
before-after comparisons were analyzed by two-tailed t tests
for paired observations.
RESULTS
A total of 111 manuscripts were analyzed. The median length of
submitted manuscripts was 3057 words (range, 784 to 7844 words), and
the length of original abstracts was 262 words (range, 57 to 525
words). Lengths at publication were 3138 words (927 to 6412 words) and
273 words (135 to 463 words), respectively.
Readability scores are presented in
Table 2.
According to both indexes, abstracts and manuscripts were "very
difficult" to read on receipt at the editorial offices, and although
their readability did improve, they remained in the "very
difficult" range at publication.
Because medical vocabulary includes many long words and because both
the Gunning and Flesch formulas are based on sentence length and word
length, we also analyzed the abstracts and manuscripts for those
factors. Abstract sentence length was 19.30+/-5.16 words initially and
16.95+/-3.35 words at publication, a change of 2.35 words
(P<.00001) The mean number of syllables per word in the
abstracts was 1.91+/-0.15 initially and at publication. Manuscript
sentence length was 22.01+/-2.66 words initially and 21.01+/-2.38 words
at publication, a difference of 1.0 words (P<.00001). The
mean number of syllables per word was 1.85+/-0.09 initially and
1.85+/-0.09 at publication (P=.80).
COMMENT
Readability, as measured by these two well-validated indexes, improved
after abstracts and manuscripts passed through the peer review and
editorial processes. However, at publication, both abstracts and
manuscripts remained in the range of "very difficult" to read.
We could find no previous study of medical journal articles'
readability. Studies of readability in other contexts have shown
correlations between readability, as measured by these indexes, and
readership and comprehension.[3,4] Readability may be an
important factor in physicians' ability to use information in medical
journals to improve their care of patients.
These formulas test only readability. Readership, comprehension, and
retention also depend on other factors: content, format, logical
organization, redundancy, and reader motivation.[11]
Readability has been shown to correlate with the level of readership
and retention of newspaper articles. Readability also may promote
retention of information. In one experiment, a newspaper published two
editions that were identical except for articles' readability scores.
Follow-up surveys of readers showed higher levels of readership and
retention of news in the more readable edition.[12] However,
it is not certain that better readability leads to better readership
when the audience is physicians and the media are journals.
Three caveats should be made regarding our findings. First, we studied
only articles that were published, and they may have represented the
best writing among all submitted papers. Thus, the improvement might be
expected to be small. Second, the editors-in-chief were aware of the
goals of this study, and that knowledge may have caused them to exert
unusual efforts to improve readability. Even so, little effect was
seen. Third, and most important, although both indexes have been
validated in other media, no readability formula has been validated in
medical writing.[3,4] Both the Gunning and Flesch scores are
based on sentence and word length. One might then conclude that these
indexes are not valid for medical writing, since medicine's jargon
includes words from Latin and Greek (eg, "erythema") instead of
Anglo-Saxon (eg, "red skin"), as well as neologisms (eg,
"esophogastroduodenoscopy"). But it is just such characteristics
that editors criticize as detrimental to readability.[13]
Furthermore, the effect of acronyms on the readability of medical
writing may be minimized by these indexes, which treat them as
one-syllable words.
Even though these indexes have not been validated with medical
prose, authors and editors might want to use shorter and more familiar
words. Writing coaches have advised shorter words, and mass media
research demonstrates that the two most important elements of
readability are simplified vocabulary and decreased sentence
complexity: short words and short sentences
(Table 3).[4,13]
In our study, median manuscript length increased slightly during the
peer review and editing processes. Long manuscripts seem to contradict
a well-documented trend toward shorter articles and books, especially
in recent years.[13] Long manuscripts also may partly explain
the popularity of the structured abstract and such publications as
ACP Journal Club and Journal Watch.
In recent years, physicians have been receiving communications from a
rapidly growing number of media: peer-reviewed journals, other
journals, television, audiotapes, videotapes, computers, and the mass
media.[14] The burden of media on the physician
is great, and if peer-reviewed journals are to remain leaders in sustaining
excellent medical practice, they must tailor their editorial elements
to meet the needs and desires of their biggest audience: practicing
physicians. Readability is one element.
From the Editorial Offices,
Annals of Internal Medicine,
Philadelphia, Pa (Drs Roberts, R. Fletcher, and S. Fletcher); and the
Division of General Internal Medicine, Johns Hopkins Bayview Medical
Center, Baltimore, Md (Dr Roberts). Drs R. Fletcher and S. Fletcher are
now with the Division of Ambulatory Care and Prevention, Harvard
Medical School and Harvard Community Health Plan, Boston, Mass.
Presented in part at the Second International Congress on Peer Review
in Biomedical Publication, Chicago, Ill, September 9, 1993.
We gratefully acknowledge the assistance of Andrew Langman for his
technical contributions and Grace Lobb and Tori Ransome for their
efforts in collecting and maintaining the manuscripts on computer
files.
Address correspondence to Division of General Internal Medicine, B-2
North, Johns Hopkins Bayview Medical Center, 4940 Eastern Ave,
Baltimore, MD 21224 (Dr Roberts).
References
1. Fletcher SW, Fletcher RH. Responsibilities of
medical journals to readers. J Intern Med. 1992;232:223-228.
2. Media-Chek/Apex: A Study of Medical Journal
Readership and Advertising Page Exposure--American College of
Physicians. Princeton, NJ: Healthcare Communications; 1993.
3. Klare GR. The Measurement of Readability.
Ames: Iowa State University Press; 1963.
4. Severin WJ, Tankard JW. Communication Theories:
Origins, Methods, and Uses in the Mass Media. 3rd ed. New York,
NY: Longman Group Ltd; 1992.
5. Gunning R. The Technique of Clear Writing. New
York, NY: McGraw-Hill International Book Co; 1952.
6. Flesch RF. A new readability yardstick. J Appl Psychol. 1948;32:221-233.
7. Stone G. Examining Newspapers: What Research
Reveals About American Newspapers. Newbury Park, Calif: Sage
Publishers Inc; 1987.
8. RightWriter 4.0. Carmel, Ind: Que Corp; 1990.
9. Dean AG, Dean JA, Burton AH, Dicker RC. Epi-Info
5.0. Stone Mountain, Ga: USD Inc; 1990.
10. SAS Release 6.03. Cary, NC: SAS Institute
Inc; 1992.
11. Gray WS, Leary BE. What Makes a Book
Readable? Chicago, Ill: University of Chicago Press; 1935.
12. Swanson C. Readability and readership: a controlled
experiment. Journalism Q. 1948;25:339-345.
13. LaRocque P. Fuzziness: don't force readers to clear up
your writing. Quill. September 1993:36.
14. Wyatt J. Use and sources of medical journals.
Lancet. 1991;338:1368-1373.
Table of Contents