Transfer of Technology From Statistical Journals to the Biomedical
Literature
Past Trends and Future Predictions
(JAMA. 1994;272:129-132)
Douglas G. Altman, Steven N. Goodman, MD, PhD
Objective.--To investigate the speed of the transfer of
new statistical methods into the medical literature and, on the basis
of current data, to predict what methods medical journal editors should
expect to see in the next decade.
Design.--Influential statistical articles were identified
and the time pattern of citations in the medical literature was
ascertained. In addition, longitudinal studies of the statistical
content of articles in medical journals were reviewed.
Main Outcome Measures.--Cumulative number of citations in
medical journals of each article in the years after publication.
Results.--Annual citations show some evidence of decreasing
lag times between the introduction of new statistical methods and their
appearance in medical journals. Newer technical innovations still
typically take 4 to 6 years before they achieve 25 citations in the
medical literature. Few methodological advances of the 1980s seem yet
to have been widely cited in medical journals. Longitudinal studies
indicate a large increase in the use of more complex statistical
methods.
Conclusions.--Time trends suggest that technology diffusion
has speeded up during the last 30 years, although there is still a lag
of several years before medical citations begin to accrue. Journals
should expect to see more articles using increasingly sophisticated
methods. Medical journals may need to modify reviewing procedures to
deal with articles using these complex new methods.
(JAMA. 1994;272:129-132)
THE INFLUX of statistical methods into the medical
literature has increased over more than 60 years. Over the same period,
statistics itself has undergone major changes, so that not only is the
use of statistics in medical research much more common, but the methods
used have become progressively more complex. Although some of the
methods being introduced in medical research were developed in other
contexts, many statistical advances have arisen as solutions to
problems arising in medical research. Changes in the type of
statistical methods being used in medical articles have implications
for editors, referees, and readers.
We report herein a study of citations to investigate the transfer of
new statistical methods into the medical literature. We predict some
new methods that medical journal editors should expect to see in the
next decade.
METHODS
Influential statistical articles published after 1950 were
identified from two books that reprinted important statistical
articles,[1] [2] from a list of the most cited articles in
medical journals, and from personal knowledge
(Table 1). [Note: References 3-22 found in Table 1.] Several articles relate to survival analysis[6] [9] [11] [13] [14] or meta-analysis,[5] [7] two
of the strongest growth areas (in both medicine and medical statistics)
in recent years. Unfortunately, in some important areas of statistical
methods there was no key article that could be widely cited by a large
proportion of users, such as logistic regression and sample size
calculations for clinical trials. We have included some articles that
were published in medical journals (notably, cancer journals) when
these seemed to be the primary source of the new method, and also one
book.
For each article, the time pattern of citations in the medical
literature was ascertained. Citations prior to 1971 were obtained by
hand searching of printed volumes of the Science Citation
Index,[23] as were citations for a few of the later
articles with relatively few citations. Citations from 1971 to 1992
were obtained using computer searches of the SciSearch database
(Institute of Scientific Information, Philadelphia, Pa). These searches
were carried out in July and August 1993, by which time citations for
1992 should have been virtually complete. We did not search for
articles that had incorrect citations of the articles of interest. It
is our impression that the rate of incorrect citations of these
articles was about 10% (excluding errors in titles). Some minor
inconsistency between the two methods of searching may have arisen
through problems in identifying what constitutes a medical journal. For
comparison, similar citation analyses were performed for two heavily
cited expository statistical articles published in medical
journals.[21] [22]
We also sought evidence from longitudinal studies of the statistical
content of articles in medical journals to examine changes in the
methods used over time.
RESULTS
Figure 1 shows cumulative numbers of citations for
the articles listed in Table 1 divided into four decades--the 1950s,
1960s, 1970s, and 1980s. The article by [14] was excluded
because it has been cited much more often than the other articles. It
is shown in Figure 2, together with the
article by Kaplan and Meier.[6] These two articles are frequently cited
together in articles reporting the results of survival analyses. They
were published 14 years apart, and Fig 2 shows that the citations for
the earlier article have risen in parallel with those for the Cox
article, but about 14 years later in relation to the year of
publication. These are now two of the most heavily cited articles in
medical journals. The rise in citations for the article by Kaplan and
Meier[6] is especially marked given that it received only six
citations in medical journals in the first 10 years after publication.
Annual citations for the articles published in the four decades do show
some evidence of decreasing lag times between the introduction and
widespread use of new statistical methods. Newer technical innovations
still typically take 4 to 6 years before they achieve 25 citations in
the medical literature. Few methodological advances of the 1980s seem
yet to have been widely cited in medical journals. By contrast,
expository articles in medical journals can reach 500 citations within
4 to 5 years (Figure 3). Citations for one of the two expository articles[21] have leveled out, with a roughly
constant number of citations each year. Most of the
methodological articles (notably, the heavily cited
articles) have increasing numbers of citations each year.
Few authors have studied changes over time in the use of statistical
methods in one journal. Hayden[24] gave a brief summary of the rise in the use of simple statistical methods in Pediatrics
from 1952 to 1982, while Felson et al[25]
described similar changes in Arthritis and Rheumatism from 1967 to 1968
vs 1982. The most detailed information we are aware of relates to the
New England Journal of Medicine. Articles published in 1978
and 1979,[26] 1989,[27] and 1990[28] have
been reviewed using the same set of categories.[26] A large
increase was noted during this period in the use of most statistical
methods, especially the more complex methods
(Table 2). It is notable that survival analysis and logistic regression were found in almost a third of original articles
published in 1989 and 1990.
COMMENT
Citation studies are rightly criticized as a means of grading
researchers,[29] but we think they provide a valuable measure
of the impact of a new methodological development on medical
research. Figure 1 suggests that technology diffusion may have speeded
up during the last 30 to 40 years, although there is still usually a
lag of several years before medical citations begin to accrue.
We used cumulative citations rather than annual citations, as we feel
the total impact is more relevant in this context and that fluctuations
in the annual counts obscure the trends. For the purposes of
documenting technology transfer, it is not the actual number of
citations but the shape of the citation curve that is most informative.
This shape seems not to have changed greatly during four decades.
Almost all of the curves for these classic articles have a dormant
early phase followed by a somewhat dramatic takeoff. The general shape
does not seem to vary in relation to how heavily cited an article is.
There are, however, a few exceptions to this pattern, notably the
article by Hanley and McNeil[17] (Fig 1). Developments that
have probably contributed to the more rapid diffusion of statistical
methods into the medical literature are the increasing number of
statisticians working in medicine, the accessibility of powerful
desktop computers to medical researchers, and the more rapid
development and dissemination of software to implement new statistical
methods.
Our analyses took no account of the large increase in the number of
articles being published each year in medical journals (1730 journals
published in 1950, increasing in 10-year intervals to 2800, 4420, 6780,
and 9480) (Ulrich's International Serials database, Bowker Electronic
Publishing). However, this increase has been almost linear since 1970,
so adjustment for the increasing size of the literature would not
greatly alter the shapes of the curves. Furthermore, such adjustment is
not appropriate if, as seems likely, researchers today need to access
many more articles in a greater number of journals than their
predecessors. Huth[30] found a large increase between 1950
and 1985 in the number of different journals being cited in articles
published in the New England Journal of Medicine.
Independent evidence for genuine changes in the use of statistics comes
from studies that have looked at the same journals across time. The few
such studies that we are aware of have shown large increases in the use
of statistical methods and a tendency to use more complicated
methods. [24-28] Thus, there is clearly a strong component of
increased use and complexity of statistics independent of the total
journal expansion. It is relevant that the number of original articles
published per year by the New England Journal of Medicine
decreased during the period of the studies summarized in Table 2.
Cumulative citations for the methodological articles considered
generally curve upward, indicating that the annual number of citations
keeps increasing. By contrast, the two expository articles considered
show a much more rapid accrual of citations (starting in the year of
publication) but near-linear cumulative citation curves, indicating a
fairly steady annual citation rate. Expository statistical articles in
medical journals can reach 500 citations within 4 to 5 years (Fig 3).
Both articles we [21,22] described methods
previously published in statistical journals[13] [31] without
achieving many citations in medical journals. These citation figures
suggest that expository articles are valuable, especially for topics
that are not usually included in medical statistics textbooks. Indeed,
the International Committee of Medical Journal Editors guidelines
state, "References for study design and statistical methods should be
to standard works (with pages stated) when possible rather than to
papers in which the designs or methods were originally
reported."[32] Expository articles cowritten by a statistician and medical researcher may be especially helpful--a recent
example considers receiver operating characteristic
curves.[33] Unfortunately, such crossover articles require a
considerable amount of work, and such activity (being a form of
teaching) may not be helpful to the statistician's or researcher's
career in comparison with either more methodological or medical
articles.
Several complex statistical methods introduced in the 1980s are
beginning to be seen more frequently. Although it is not possible to
identify recent articles that will turn out to be major breakthroughs,
most of the newer methods are sophisticated. Journals should expect to
see growing numbers of articles using them. Methods likely to be seen
more often are described briefly in Table 3. Software is available for all of these techniques, and some are
beginning to be included in well-known statistics packages. It is worth
noting that by the time a topic reaches medical journals there may be a
large methodological literature. Ripley[38] notes that there
are already more than a dozen journals and at least 15 texts devoted to
neural networks.
The evidence of time trends within one major journal (Table 2) supports
the idea that there is an ever-increasing variety of statistical
methods appearing in medical articles. The speed with which new methods
are introduced may pose problems for statistical referees, for the
physicians who read the published work, and for the journals
themselves. Referees may not be able to judge new methods that they
have not yet learned. Physicians may feel that they have no chance of
understanding the new methods (even if they are comfortable with more
traditional methods) and will have to take the results of such studies
on faith. The journals, in whom that faith is being entrusted, may bear
an increasing burden to ensure that the methods are indeed valid, since
most of their audience will be unable to assess that for themselves.
We think that the following developments are possible and may be
desirable in the future:
- Authors using complex methods will be asked to supply additional
supporting material for referees but not for publication. This might
take the form of a formal appendix in the submitted manuscript, which
is peer reviewed (and possibly modified) but not published. It should
be supplied by authors to readers on request.
- Because statistical refereeing will be a more difficult process
(because of both the novelty and the complexity of methods), medical
journals may need to recruit panels of methodological reviewers who
specialize in specific methods.
- Editors of medical journals should encourage or actively solicit
more crossover (expository) articles on new methods, perhaps with both
medical and statistical authors.
- More postgraduate training for medical researchers should be
developed, with formal accreditation, both in basic statistical methods
and also to help those who wish to keep abreast of newer methods.
It is likely that the statistical education of physicians, already
poor,[39] [40] will in the future lag even further behind the
methods that are used in medical journals. Already the standard methods
taught in an introductory course would leave a reader unable to judge a
high percentage of articles published in the New England Journal
of Medicine, and that proportion is likely to increase with time.
From the Medical Statistics Laboratory, Imperial Cancer Research
Fund, London, England (Mr Altman), and Oncology Center, Division of
Biostatistics, The Johns Hopkins University, Baltimore, Md (Dr
Goodman).
Presented in part at the Second International Congress on Peer Review
in Biomedical Publication, Chicago, Ill, September 10, 1993.
We thank Scott Zeger, PhD, for suggesting this topic of
investigation. We are grateful to Will Russell-Edu for carrying out the
computer citation searches.
Reprint requests to Medical Statistics Laboratory, Imperial Cancer
Research Fund, PO Box 123, Lincoln's Inn Fields, London, England WC2A
3PX (Mr Altman).
References
1. Greenland S, ed. Evolution of Epidemiologic Ideas:
Annotated Readings on Concepts and Methods. Chestnut Hill, Mass:
Epidemiology Resources Inc; 1987.
2. Kotz S, Johnson N, eds. Breakthroughs in
Statistics: Volume II: Methodology and Distribution. New York, NY:
Springer Publishing Co; 1992.
[Note: References 3-22 are referred to in Table 1 or Table 3 if not otherwise
referred to in the text.]
3. Cornfield J. A method of estimating comparative rates
from clinical data: applications to cancer of the lung, breast, and
cervix. J Natl Cancer Inst. 1951;11:1269-1275.
4. Cochran WG. Some methods for strengthening the common
chi2 tests. Biometrics. 1954;10:417-451.
5. Woolf B. On estimating the relation between blood group
and disease. Ann Hum Genet. 1955;11:251-253.
6. Kaplan EL, Meier P. Nonparametric estimation from
incomplete observations. J Am Stat Assoc. 1958;53:457-481.
7. Mantel N, Haenszel W. Statistical aspects of the
analysis of data from retrospective studies of disease. J Natl
Cancer Inst. 1958;22:719-748.
8. Cohen J. A coefficient of agreement for nominal
scales. Educ Psychol Meas. 1960;20:37-46.
9. Mantel N. Chi-square tests with one degree of freedom:
extensions of the Mantel-Haenszel procedure. J Am Stat Assoc.
1963;58:690-700.
10. Box GEP, Cox DR. An analysis of transformations (with
discussion). J R Stat Soc B. 1964;26:211-252.
11. Mantel N. Evaluation of survival data and two new rank
order statistics arising in its consideration. Cancer Chemother
Rep. 1966;50:163-170.
12. Elston RC, Stewart J. A general model for the genetic
analysis of pedigree data. Hum Heredity. 1971;21:523-542.
13. Peto R, Peto J. Asymptotically efficient rank invariant
test procedures (with discussion). J R Stat Soc A.
1972;135:185-207.
14. Cox DR. Regression models and life tables (with
discussion). J R Stat Soc B. 1972;34:187-220.
15. Dempster AP, Laird N, Rubin D. Maximum likelihood from
incomplete data via the EM algorithm. J R Stat Soc B.
1977;39:1-38.
16. Efron B. Bootstrap methods: another look at the
jackknife. Ann Stat. 1979;7:1-26.
17. Hanley JA, McNeil BJ. The meaning and use of the area
under a receiver operating characteristic (ROC) curve.
Radiology. 1982;143:29-36.
18. Geman S, Geman D. Stochastic relaxation, Gibbs
distributions, and Bayesian restoration of images. IEEE Trans
Pattern Anal Machine Intell. 1984;6:721-741.
19. Breiman L, Friedman JH, Olshen RA, Stone CJ.
Classification and Regression Trees. Belmont, Calif: Wadsworth;
1984.
20. Zeger SL, Liang KY. Longitudinal data analysis for
discrete and continuous outcomes. Biometrics.
1986;42:121-130.
21. Peto R, Pike MC, Armitage P, et al. Design and analysis
of randomized clinical trials requiring prolonged observation of each
patient, II: analysis and examples. Br J Cancer.
1977;35:1-39.
22. Bland JM, Altman DG. Statistical methods for assessing
agreement between two methods of clinical measurement.
Lancet. 1986;1:307-310.
23. Institute of Scientific Information. Science
Citation Index. Philadelphia, Pa: Institute of Scientific
Information; 1951-1971.
24. Hayden GF. Biostatistical trends in
Pediatrics: implications for the future. Pediatrics.
1983;72:84-87.
25. Felson DT, Cupples LA, Meenan RF. Misuse of statistical
methods in Arthritis and Rheumatism: 1982 versus
1967-68. Arthritis Rheum. 1984;27:1018-1022.
26. Emerson JD, Colditz G. Use of statistical analysis in
the New England Journal of Medicine. N Engl J
Med. 1983;309:707-713.
27. Emerson JD, Colditz G. Use of statistical analysis in
the New England Journal of Medicine. In: Bailar JC III,
Mosteller F, eds. Medical Uses of Statistics. 2nd ed. Boston,
Mass: NEJM Books; 1992:45-57.
28. Altman DG. Statistics in medical journals: developments
in the 1980s. Stat Med. 1991;10:1897-1913.
29. Seglen PO. Citation frequency and journal impact: valid
indicators of scientific quality? J Intern Med.
1991;229:109-111.
30. Huth E. The information explosion. Bull N Y Acad
Med. 1989;65:647-661.
31. Altman DG, Bland JM. Measurement in medicine: the
analysis of method comparison studies. Statistician.
1983;32:307-317.
32. International Committee of Medical Journal Editors.
Uniform requirements for manuscripts submitted to biomedical
journals. JAMA. 1993;269:2282-2286.
33. Zweig MH, Campbell G. Receiver-operating characteristic
(ROC) plots: a fundamental tool in clinical medicine. Clin
Chem. 1993;39:561-577.
[Note: References 34-37 are referred to in Table 3 if not otherwise referred to in the text.]
34. Gilks WR, Clayton DG, Spiegelhalter DJ, et al.
Modelling complexity: applications of Gibbs sampling in medicine.
J R Stat Soc B. 1993;55:39-52.
35. Hastie T, Tibshirani R. Generalized additive
models. Stat Sci. 1986;1:297-318.
36. Ciampi A, Lawless JF, McKinney SM, Singhal K.
Regression and recursive partition strategies in the analysis of
medical and survival data. J Clin Epidemiol. 1988;41:737-748.
37. Goldstein H. Multilevel Models in Educational and
Social Research. London, England: Griffin; 1986.
38. Ripley BD. Statistical aspects of neural networks. In:
Barndorff-Nielsen OE, Jensen JL, Kendall WS, eds. Networks and
Chaos: Statistical and Probabilistic Aspects. London, England:
Chapman and Hall; 1993:40-123.
39. Wulff HR, Andersen B, Brandenhoff P, Guttler F. What do
doctors know about statistics? Stat Med. 1987;6:3-10.
40. Altman DG, Bland JM. Improving doctors' understanding
of statistics (with discussion). J R Stat Soc A.
1991;154:223-267.
Table of Contents