LINKS
Blinded Peer Review

A Citation Analysis of the Impact of Blinded Peer Review

(JAMA. 1994;272:147-149)

David N. Laband, PhD, Michael J. Piette, PhD

Objective.--To determine whether articles published in journals using blinded peer review receive significantly more or fewer citations than those published in journals using nonblinded peer review.

Design.--Drawing from a sample of 1051 full articles published in 28 economics journals during 1984, we used nonlinear regression and ordered probit techniques to estimate the impact of blinded peer review on citations of these articles in 1985 through 1989.

Outcomes.--Citations of articles.

Results.--Articles published in journals using blinded peer review were cited significantly more than articles published in journals using nonblinded peer review, controlling for a variety of author, article, and journal attributes.

Conclusions.--Nonblinded peer review apparently suffers from type I error to a greater extent than blinded peer review. That is, journals using nonblinded peer review publish a larger fraction of papers that should not have been published than do journals using blinded peer review. When reviewers know the identity of the author(s) of an article, they are able to (and evidently do) substitute particularistic criteria for universalistic criteria in their evaluative process.

(JAMA. 1994;272:147-149)


In this article, we use citation analysis to shed empirical light on the relative efficiency of double-blind vs single-blind refereeing of journal articles. A single-blind review occurs when the reviewer(s) knows the identity of the author(s), but the latter do not know the identities of the former. With a double-blind review, neither reviewers' nor authors' identities are revealed to the other party. We view the double-blind vs single-blind review process debate in terms of the relative incidence of type I errors and type II errors. Do editors using a single-blind review process systematically fail to reject papers that have little or no impact on the profession (type I error) and/or fail to accept more truly good papers (type II error) than do editors of double-blind journals? Both types of errors might characterize a single-blind review process for at least two reasons. First, reviewers with knowledge of the author's identity might economize on their refereeing costs by substituting the already-revealed value of the author's average contribution in previous articles for their evaluation and forecast of the marginal contribution contained in the manuscript under consideration. Second, personal characteristics of the author (eg, gender, institutional affiliation, friendship with the reviewer, race, and intellectual conformity with the reviewer) may weigh more heavily in a reviewer's evaluation of the publishability of a manuscript than the reviewer's forecast of the marginal contribution contained therein.

To gauge the severity of the type I and type II error problem that arguably plagues single-blind reviewing but not double-blind reviewing, one must have information regarding how the marketplace for scientific ideas responds to published articles. Without knowing the fate of manuscripts submitted but rejected for publication, one simply cannot determine the severity of the type II error problem. However, drawing from a large sample of articles published in 28 top economics journals in 1984, we were able to investigate the degree to which journals using a single-blind review process publish articles that are revealed not to have the impact that might reasonably have been expected.

DATA

We compiled information on 1051 articles published in 28 top economics journals in 1984. Specifically, we identified citations to each article, as listed in the Social Sciences Citation Index[1] for the 5 years following publication (1985 through 1989); length in American Economic Review-equivalent pages, a relative quality rating of the journal in which each article was published[2] ; and the review process used by each journal. (Our sample of 1051 articles consists only of those articles for which complete information on all variables was available. Thus, for example, if we were unable to locate an author in the American Economic Review directory in an effort to obtain age and affiliation information, the article [co]written by the individual was dropped from the data set. We concede that this may ultimately bias our results, but it is unclear a priori in which direction any such biases, if present, would run.)

Articles published in the 13 journals using double-blind review received an average of 6.7 citations per article during the immediate 5-year period following publication, while articles published in the 15 journals using single-blind review received an average of 7.3 citations per article during that same time period. Articles published in the former journals were, on average, shorter than articles published in the latter. The average quality rating (developed by Liebowitz and Palmer [2]) assigned to the set of journals using double-blind review was 38.5 (on a scale from 0 to 100), while the average rating of the journals using single-blind review was 58.6.

METHODS

We estimated the impact of the type of review process on citations to published articles, controlling for article- and journal-specific characteristics that might influence citations (Table 1). This method permitted us to evaluate whether one type of review process was superior to the other in terms of either (1) identifying articles that will attract more citations than would be predicted by the characteristics or (2) failing to identify articles that will be cited less than would be predicted by these characteristics.

The distribution of citations to articles is nonnormal. Most scholarly articles in economics are cited infrequently, if at all[3]; this finding holds across scientific disciplines generally.[4] [5] A relatively small number of articles and authors are truly influential; most offer marginal contributions to the stock of scientific knowledge. To account for this acknowledged skewness in the distribution of citations, we estimated our model using the ordered probit nonlinear regression method (LIMDEP software package, Version 6.0, Econometric Software Inc, Bellport, NY). Our dependent variable, citations in 1985 through 1989 to an article published in 1984, was distributed in such a manner that quintiles were easy to identify. Finer analysis by decile was impossible because some 21% of all articles received no citations at all during the 1985 through 1989 period.

We expected journals using a single-blind review process to be plagued by both type I and type II errors to a greater extent than journals using a double-blind review process. Thus, we expected articles published in the latter group to attract more citations than articles published in the former group. Citations should be a positive function of article length--more substantive scientific contributions will plausibly require greater elucidation than less substantive contributions. Citations also are likely to be influenced by journal of publication. Readers are attracted to a scholarly journal, at least in part, by the expected value of articles published. A citations-based rating of the relative impact of journals, like the one developed by Liebowitz and Palmer,[2] provides relevant information about the actual value of a journal's past contributions. Since the value of past contributions is probably used implicitly by readers in forecasting the expected value of current contributions in a scholarly journal, we expected citations of current articles to be influenced in a positive manner by the relative quality rating of the journal in which the article was published.

RESULTS

The estimated relationship between the dependent citations variable and the explanatory variables (double-blind review, article length, and journal quality), with asymptotic t values reported in parentheses, follows:

CITATIONS 1985-1989 =
-0.840 (-6.471) + 0.261 (3.442)
DOUBLE-BLIND + 0.087 (16.840)
length + 0.015 (9.789)
JOURNAL QUALITY

The coefficient estimate of the double-blind review dummy variable reveals what the residual impact of the review process is on citations. Articles published in journals using the double-blind review process have a significantly greater probability of receiving more citations than would be predicted on the basis of length and journal quality than articles reviewed single-blind. As expected, article length and relative quality of the publishing journal both demonstrate positive and statistically significant explanatory power with respect to the probability that an article achieves specific citation levels.

In previous work, Laband[3] argued that the distinguishing feature of top journals is their differential ability to "capture" the few high-impact articles written over time. This suggests that our understanding of differences between double-blind and single-blind reviewing might be further extended by examining the tail ends of the distribution of residuals. That is to say, does either review process outperform the other in identifying the articles that are much better than would be predicted by length, quality of journal, and citations per article? Conversely, does either review process consistently pick "losers," in the sense that the articles selected for publication do not live up to their expected impact, measured by subsequent citations? To investigate this, we report in Table 2 the actual and predicted probabilities of articles receiving citations in each of the quintiles, by review process.

The findings reported in Table 2 are revealing. Articles reviewed single-blind are less likely than those reviewed double-blind to be identified correctly as the highest-impact articles (those with nine or more citations in the ensuing 5 years). By the same token, articles reviewed single-blind are more likely than those reviewed double-blind to be misidentified as the lowest-impact papers (those with no citations in the ensuing 5 years).

CONCLUSIONS

Our findings with respect to 28 top economics journals suggest that the double-blind review process outperforms the single-blind review process. Specifically, citations to articles published in journals using a double-blind review process exceeded citations of articles published in journals using a single-blind review process, controlling for article length and the quality of the publishing journal. We conclude that the single-blind review process apparently suffers from a type I error bias to a greater extent than the double-blind review process. Can our findings be generalized to scholarly journals in the biomedical literature? We suspect that they can; our study offers a methodological basis for shedding empirical light on that question.


From the Department of Economics and Finance, Salisbury (Md) State University (Dr Laband) and Economic Research Services, Tallahassee, Fla (Dr Piette). Dr Laband is now with the Department of Economics, Auburn (Ala) University.

Presented in part at the Second International Congress on Peer Review in Biomedical Publication, Chicago, Ill, September 11, 1993.

Address correspondence to Department of Economics, Auburn University, Auburn, AL 36849 (Dr Laband).


References

1. Institute for Scientific Information. Social Sciences Citation Index. Philadelphia, Pa: Institute for Scientific Information; annual volumes 1985-1989.

2. Liebowitz SJ, Palmer JP. Assessing the relative impacts of economics journals. J Econ Lit. 1984;22:77-88.

3. Laband DN. Article popularity. Econ Inquiry. 1986;24:173-180.

4. Hamilton DP. Research papers: who's uncited now? Science. 1991;251:25.

5. Hamilton DP. Publishing by--and for?--the numbers. Science. 1990;250:1331-1332.

Table of Contents