The ''h''-index is an
author-level metric that measures both the
productivity
Productivity is the efficiency of production of goods or services expressed by some measure. Measurements of productivity are often expressed as a ratio of an aggregate output to a single input or an aggregate input used in a production proces ...
and
citation impact of the
publications, initially used for an individual
scientist or scholar. The ''h''-index correlates with obvious success indicators such as winning the
Nobel Prize
The Nobel Prizes ( ; sv, Nobelpriset ; no, Nobelprisen ) are five separate prizes that, according to Alfred Nobel's will of 1895, are awarded to "those who, during the preceding year, have conferred the greatest benefit to humankind." Alfre ...
, being accepted for research fellowships and holding positions at top universities. The index is based on the set of the scientist's most cited papers and the number of citations that they have received in other publications. The index has more recently been applied to the productivity and impact of a
scholarly journal
An academic journal or scholarly journal is a periodical publication in which scholarship relating to a particular academic discipline is published. Academic journals serve as permanent and transparent forums for the presentation, scrutiny, and d ...
as well as a group of scientists, such as a department or university or country. The index was suggested in 2005 by
Jorge E. Hirsch
Jorge Eduardo Hirsch (born 1953) is an Argentine American professor of physics at the University of California, San Diego. Hirsch received a PhD in physics from the University of Chicago in 1980 and completed his postdoctoral research at the Kavl ...
, a physicist at
UC San Diego, as a tool for determining
theoretical physicists' relative quality
and is sometimes called the Hirsch index or Hirsch number.
Definition and purpose

The ''h''-index is defined as the maximum value of ''h'' such that the given author/journal has published at least ''h'' papers that have each been cited at least ''h'' times.
The index is designed to improve upon simpler measures such as the total number of citations or publications. The index works best when comparing scholars working in the same field, since citation conventions differ widely among different fields.
Calculation
The ''h''-index is the largest number ''h'' such that ''h'' articles have at least ''h'' citations each. For example, if an author has five publications, with 9, 7, 6, 2, and 1 citations (ordered from greatest to least), then the author's ''h''-index is 3, because the author has three publications with 3 or more citations. However, the author does ''not'' have four publications with 4 or more citations.
Clearly, an author's ''h''-index can only be as great as their number of publications. For example, an author with only one publication can have a maximum ''h''-index of 1 (if their publication has 1 or more citations). On the other hand, an author with many publications, each with only 1 citation, would have a ''h''-index of 1.
Formally, if ''f'' is the function that corresponds to the number of citations for each publication, we compute the ''h''-index as follows: First we order the values of ''f'' from the largest to the lowest value. Then, we look for the last position in which ''f'' is greater than or equal to the
position (we call ''h'' this position). For example, if we have a researcher with 5 publications A, B, C, D, and E with 10, 8, 5, 4, and 3 citations, respectively, the ''h''-index is equal to 4 because the 4th publication has 4 citations and the 5th has only 3. In contrast, if the same publications have 25, 8, 5, 3, and 3 citations, then the index is 3 (i.e. the 3rd position) because the fourth paper has only 3 citations.
:''f''(A)=10, ''f''(B)=8, ''f''(C)=5, ''f''(D)=4, ''f''(E)=3 → ''h''-index=4
:''f''(A)=25, ''f''(B)=8, ''f''(C)=5, ''f''(D)=3, ''f''(E)=3 → ''h''-index=3
If we have the function ''f'' ordered in decreasing order from the largest
value to the lowest one, we can compute the ''h''-index as follows:
:''h''-index (''f'') =
The Hirsch index is analogous to the
Eddington number, an earlier metric used for evaluating cyclists.
''h''-index is also related to
Sugeno integral In mathematics, the Sugeno integral, named after M. Sugeno, is a type of integral with respect to a fuzzy measure.
Let (X,\Omega) be a measurable space and let h:X\to ,1/math> be an \Omega-measurable function.
The Sugeno integral over the cr ...
and
Ky Fan metric
KY or Ky may refer to:
Arts and entertainment
* Ky Kiske, a fictional character in the ''Guilty Gear'' video game series
* Kashiyatra, a festival in India
People
* Nguyễn Cao Kỳ (1930-2011), South Vietnamese general and prime minister
* Ky ...
. The ''h''-index serves as an alternative to more traditional journal
impact factor metrics in the evaluation of the impact of the work of a particular researcher. Because only the most highly cited articles contribute to the ''h''-index, its determination is a simpler process. Hirsch has demonstrated that ''h'' has high predictive value for whether a scientist has won honors like
National Academy
A national academy is an organizational body, usually operating with state financial support and approval, that co-ordinates scholarly research activities and standards for academic disciplines, most frequently in the sciences but also the hum ...
membership or the
Nobel Prize
The Nobel Prizes ( ; sv, Nobelpriset ; no, Nobelprisen ) are five separate prizes that, according to Alfred Nobel's will of 1895, are awarded to "those who, during the preceding year, have conferred the greatest benefit to humankind." Alfre ...
. The ''h''-index grows as citations accumulate and thus it depends on the "
academic age The academic age is the time that a scientist has been in the research field and performed active research.
The academic age of a scientist may be computed as the span of years from their first published work up until the present.
Another definiti ...
" of a researcher.
Input data
The ''h''-index can be manually determined by using citation databases or using automatic tools. Subscription-based databases such as
Scopus and the
Web of Science provide automated calculators. From July 2011
Google
Google LLC () is an American Multinational corporation, multinational technology company focusing on Search Engine, search engine technology, online advertising, cloud computing, software, computer software, quantum computing, e-commerce, ar ...
have provided an automatically calculated ''h''-index and ''i10''-index within their own
Google Scholar profile. In addition, specific databases, such as the
INSPIRE-HEP INSPIRE-HEP is an open access digital library for the field of high energy physics (HEP). It is the successor of the Stanford Physics Information Retrieval System (SPIRES) database, the main literature database for high energy physics since the 1970 ...
database can automatically calculate the ''h''-index for researchers working in
high energy physics.
Each database is likely to produce a different ''h'' for the same scholar, because of different coverage. A detailed study showed that the Web of Science has strong coverage of journal publications, but poor coverage of high impact conferences. Scopus has better coverage of conferences, but poor coverage of publications prior to 1996; Google Scholar has the best coverage of conferences and most journals (though not all), but like Scopus has limited coverage of pre-1990 publications.
[ (preprint of paper published as 'Impact of data sources on citation counts and rankings of LIS faculty: Web of Science versus Scopus and Google Scholar', in ''Journal of the American Society for Information Science and Technology'', Vol. 58, No. 13, 2007, 2105–25)] The exclusion of conference proceedings papers is a particular problem for scholars in
computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includin ...
, where conference proceedings are considered an important part of the literature. Google Scholar has been criticized for producing "phantom citations," including
gray literature in its citation counts, and failing to follow the rules of
Boolean logic when combining search terms. For example, the Meho and Yang study found that Google Scholar identified 53% more citations than Web of Science and Scopus combined, but noted that because most of the additional citations reported by Google Scholar were from low-impact journals or conference proceedings, they did not significantly alter the relative ranking of the individuals. It has been suggested that in order to deal with the sometimes wide variation in ''h'' for a single academic measured across the possible citation databases, one should assume false negatives in the databases are more problematic than false positives and take the maximum ''h'' measured for an academic.
Examples
Little systematic investigation has been done on how the ''h''-index behaves over different institutions, nations, times and academic fields. Hirsch suggested that, for physicists, a value for ''h'' of about 12 might be typical for advancement to tenure (associate professor) at major
Sresearch universities. A value of about 18 could mean a full professorship, 15–20 could mean a fellowship in the
American Physical Society, and 45 or higher could mean membership in the
United States National Academy of Sciences. Hirsch estimated that after 20 years a "successful scientist" would have an ''h''-index of 20, an "outstanding scientist" would have an ''h''-index of 40, and a "truly unique" individual would have an ''h''-index of 60.
[
For the most highly cited scientists in the period 1983–2002, Hirsch identified the top 10 in the life sciences (in order of decreasing ''h''): ]Solomon H. Snyder
Solomon Halbert Snyder (born December 26, 1938) is an American neuroscientist who has made wide-ranging contributions to neuropharmacology and neurochemistry. He studied at Georgetown University, and has conducted the majority of his research at ...
, ''h'' = 191; David Baltimore, ''h'' = 160; Robert C. Gallo
Robert Charles Gallo (; born March 23, 1937) is an American biomedical researcher. He is best known for his role in establishing the human immunodeficiency virus (HIV) as the infectious agent responsible for acquired immune deficiency syndrome ( ...
, ''h'' = 154; Pierre Chambon
Pierre Chambon (born 7 February 1931 in Mulhouse, France) was the founder of the in Strasbourg, France. He was one of the leading molecular biologists who utilized gene cloning and sequencing technology to first decipher the structure of eukaryot ...
, ''h'' = 153; Bert Vogelstein, ''h'' = 151; Salvador Moncada, ''h'' = 143; Charles A. Dinarello
Charles A. Dinarello (born April 22, 1943) is a professor of medicine at the University of Colorado at Denver. He is an expert on inflammatory cytokines, specifically Interleukin 1.
Education and career
Dinarello received his Doctor of Medicine ...
, ''h'' = 138; Tadamitsu Kishimoto
is a Japanese immunologist known for research on IgM and cytokines, most famously, interleukin 6.
He did postdoctoral work under Kimishige Ishizaka, the discoverer of IgE at Johns Hopkins University.
He is listed by the Institute for Scientifi ...
, ''h'' = 134; Ronald M. Evans, ''h'' = 127; and Ralph L. Brinster, ''h'' = 126. Among 36 new inductees in the National Academy of Sciences in biological and biomedical sciences in 2005, the median ''h''-index was 57. However, Hirsch noted that values of ''h'' will vary among disparate fields.[
Among the 22 scientific disciplines listed in the ]Essential Science Indicators
Clarivate Plc is a British-American publicly traded analytics company that operates a collection of subscription-based services, in the areas of bibliometrics and scientometrics; business / market intelligence, and competitive profiling for pha ...
citation thresholds hus excluding non-science academics">non-science.html" ;"title="hus excluding non-science">hus excluding non-science academics physics has the second most citations after space science. During the period January 1, 2000 – February 28, 2010, a physicist had to receive 2073 citations to be among the most cited 1% of physicists in the world. The threshold for space science is the highest (2236 citations), and physics is followed by clinical medicine (1390) and molecular biology & genetics
Genetics is the study of genes, genetic variation, and heredity in organisms.Hartl D, Jones E (2005) It is an important branch in biology because heredity is vital to organisms' evolution. Gregor Mendel, a Moravian Augustinian friar worki ...
(1229). Most disciplines, such as environment/ecology (390), have fewer scientists, fewer papers, and fewer citations. Therefore, these disciplines have lower citation thresholds in the Essential Science Indicators, with the lowest citation thresholds observed in social sciences (154), computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includin ...
(149), and multidisciplinary sciences (147).
Numbers are very different in social science disciplines: The ''Impact of the Social Sciences'' team at London School of Economics found that social scientists in the United Kingdom had lower average ''h''-indices. The ''h''-indices for ("full") professors, based on Google Scholar data ranged from 2.8 (in law), through 3.4 (in political science), 3.7 (in sociology), 6.5 (in geography) and 7.6 (in economics). On average across the disciplines, a professor in the social sciences had an ''h''-index about twice that of a lecturer or a senior lecturer, though the difference was the smallest in geography.
Advantages
Hirsch intended the ''h''-index to address the main disadvantages of other bibliometric indicators. The total number of papers metric does not account for the quality of scientific publications. The total number of citations metric, on the other hand, can be heavily affected by participation in a single publication of major influence (for instance, methodological papers proposing successful new techniques, methods or approximations, which can generate a large number of citations). The ''h''-index is intended to measure simultaneously the quality and quantity of scientific output.
Criticism
There are a number of situations in which ''h'' may provide misleading information about a scientist's output. Some of these failures are not exclusive to the ''h''-index but rather shared with other author-level metrics.
Misrepresentation of data
The ''h''-index does not account for the typical number of citations in different fields. Citation behavior in general is affected by field-dependent factors, which may invalidate comparisons not only across disciplines but even within different fields of research of one discipline. The ''h''-index discards the information contained in author placement in the authors' list, which in some scientific fields is significant though in others it is not. The ''h''-index is a natural number
In mathematics, the natural numbers are those numbers used for counting (as in "there are ''six'' coins on the table") and ordering (as in "this is the ''third'' largest city in the country").
Numbers used for counting are called '' cardinal ...
that reduces its discriminatory power. Ruane and Tol therefore propose a rational ''h''-index that interpolates between ''h'' and ''h'' + 1.
Prone to manipulation
Weaknesses apply to the purely quantitative calculation of scientific or academic output. Like other metrics that count citations, the ''h''-index can be manipulated by coercive citation, a practice in which an editor of a journal forces authors to add spurious citations to their own articles before the journal will agree to publish it. The ''h''-index can be manipulated through self-citations, and if based on Google Scholar output, then even computer-generated documents can be used for that purpose, e.g. using SCIgen.
Other shortcomings
The ''h''-index has been found in one study to have slightly less predictive accuracy and precision than the simpler measure of mean citations per paper. However, this finding was contradicted by another study by Hirsch. The ''h''-index does not provide a significantly more accurate measure of impact than the total number of citations for a given scholar. In particular, by modeling the distribution of citations among papers as a random integer partition and the ''h''-index as the Durfee square of the partition, Yong arrived at the formula , where ''N'' is the total number of citations, which, for mathematics members of the National Academy of Sciences, turns out to provide an accurate (with errors typically within 10–20 percent) approximation of ''h''-index in most cases.
Alternatives and modifications
Various proposals to modify the ''h''-index in order to emphasize different features have been made. As the variants have proliferated, comparative studies have become possible showing that most proposals are highly correlated with the original ''h''-index and therefore largely redundant, although alternative indexes may be important to decide between comparable CVs, as often the case in evaluation processes. These alternative metrics are applicable for author-level and journal-level rankings.
Applications
Indices similar to the ''h''-index have been applied outside of author level metrics.
The ''h''-index has been applied to Internet Media, such as YouTube channels. It is defined as the number of videos with ≥ h × 105 views. When compared with a video creator's total view count, the ''h''-index and ''g''-index better capture both productivity and impact in a single metric.
A successive Hirsch-type-index for institutions has also been devised. A scientific institution has a successive Hirsch-type-index of ''i'' when at least ''i'' researchers from that institution have an ''h''-index of at least ''i''.
See also
* Bibliometrics
*
References
Further reading
*
*
*
*
*
*
*
*
*
*
*
*
External links
Google Scholar Metrics
''H''-index for computer science and electronics
''H'' - index for computer scientists from Google Scholar
''H''-index for astronomers
{{DEFAULTSORT:H-Index
Citation metrics
Academic publishing
Index numbers