Constructing bibliometric networks: A comparison between full and fractional counting☆
Graphical abstract
Introduction
The study of bibliometric networks, such as co-authorship, bibliographic coupling, and co-citation networks, has a long history in the field of bibliometrics, with early work dating back to the 1960s and 1970s (e.g., De Solla Price, 1965; Kessler, 1963; Small, 1973). Many different methods for analyzing and visualizing bibliometric networks have been studied by bibliometricians (e.g., Börner, Chen, & Boyack, 2003; Milojević, 2014; Van Eck & Waltman, 2014; Zhao & Strotmann, 2015). However, before bibliometric networks can be analyzed and visualized, they first need to be constructed. The construction of bibliometric networks has received remarkably little attention in the literature (for important exceptions, see Batagelj & Cerinšek, 2013; Park, Yoon, & Leydesdorff, 2016). It seems that the construction of bibliometric networks is typically seen as a more or less trivial step that does not need any special consideration. In this paper, we argue that this step is far from trivial. We point out that different approaches can be taken to construct bibliometric networks. Our aim is to draw attention to the existence of different approaches for constructing bibliometric networks, to clarify the conceptual differences between these approaches, and to show that these approaches may yield very different results.
A well-known problem in the field of bibliometrics is the issue of assigning co-authored publications to individual authors. For instance, when a publication is co-authored by three researchers, how should the publication be counted for each individual researcher? In the context of the calculation of bibliometric indicators, many different approaches have been proposed to this problem (for overviews, see Gauffriau, Larsen, Maye, Roulin-Perriard, & Von Ins, 2007; Waltman, 2016). The most popular approaches are the full counting method (also known as the whole counting method) and the fractional counting method (e.g., Aksnes, Schneider, & Gunnarsson, 2012; Waltman & Van Eck, 2015). In the case of the full counting method, a publication co-authored by three researchers is assigned to each researcher with a full weight of one. On the other hand, in the case of the fractional counting method, the publication is assigned to each researcher with a fractional weight of 1/3.
In this paper, we show how the distinction between full and fractional counting, which has been studied extensively in the context of the calculation of bibliometric indicators, can be translated to the context of the construction of bibliometric networks. Consider for instance the construction of a co-authorship network. Suppose researcher X has co-authored a publication with five other researchers. In the conventional approach to the construction of bibliometric networks, this yields five co-authorship links with a weight of one for researcher X. We refer to this approach as the full counting method. An alternative approach is to assign a weight of 1/5 to each of the five co-authorship links. In this approach, which we refer to as the fractional counting method, the total weight of the co-authorship links that a researcher obtains because of co-authoring a publication equals one. This total weight of one is distributed equally over the individual co-authorship links.
To construct bibliometric networks, researchers have traditionally used the full counting method. To the best of our knowledge, the fractional counting method has hardly been used in the literature (for the only exception that we are aware of, see Newman, 2001c), although some related ideas have been proposed (Batagelj & Cerinšek, 2013; Cerinšek & Batagelj, 2015; Park et al., 2016; Persson, 1994, Persson, 2010).1 In this paper, we carefully define the full and fractional counting methods. Our focus is on three popular types of bibliometric networks, namely co-authorship, bibliographic coupling, and co-citation networks, but our ideas extend to other types of bibliometric networks as well. We also provide two examples of situations in which the choice between the full and fractional counting methods makes a big difference. One example is about co-authorship networks of universities. The other example deals with bibliographic coupling networks of journals. In both examples, we argue that the fractional counting method is preferable over the full counting method.
We note that the full and fractional counting methods are both available in the VOSviewer software (www.vosviewer.com; Van Eck and Waltman, 2010, Van Eck and Waltman, 2014) for constructing and visualizing bibliometric networks. The VOSviewer software can be used to construct bibliometric networks based on data downloaded from bibliographic databases such as Web of Science and Scopus. The software requests the user to choose between the use of the full and the fractional counting method. The information provided in this paper should help VOSviewer users in choosing the most appropriate counting method for their analyses.
This paper is organized as follows. Formal definitions of the full and fractional counting methods in the context of the construction of bibliometric networks are provided in Section 2. An empirical comparison between the two counting methods is reported in Section 3. We present our conclusions in Section 4.
Section snippets
Constructing bibliometric networks
In this section, we provide a detailed discussion of the full and fractional counting methods for constructing bibliometric networks. We first discuss in general terms the difference between full and fractional counting. We then focus specifically on co-authorship networks, followed by bibliographic coupling and co-citation networks. We focus on these three types of bibliometric networks because they seem to be the types of bibliometric networks that receive most attention in the literature.
Empirical analysis
We now present an empirical comparison of the full and fractional counting methods for constructing bibliometric networks. We will compare the results obtained using the two counting methods, but in addition we will also show why the two counting methods yield different results. Two analyses are presented. The first analysis focuses on co-authorship networks of universities. The second analysis is about bibliographic coupling networks of journals. We have selected these two analyses because
Discussion and conclusion
There is an extensive literature on the study of bibliometric networks, for instance to describe the structure and evolution of scientific collaboration, to detect research fronts, and to identify specialties within a discipline. However, with a few exceptions (Batagelj & Cerinšek, 2013; Park et al., 2016), the literature pays hardly any attention to the construction of bibliometric networks. Constructing a bibliometric network is seen as a more or less trivial step, and there seems to be an
Author contributions
Antonio Perianes-Rodriguez and Ludo Waltman – Conceived and designed the analysis, Collected the data, Performed the analysis, Wrote the paper.
Nees Jan van Eck – Conceived and designed the analysis, Collected the data, Performed the analysis.
Acknowledgments
We would like to thank Vladimir Batagelj, Giovanni Colavizza, Vincent Traag, and three referees for valuable comments on earlier versions of this paper.
Antonio Perianes-Rodriguez worked on this research project at the Centre for Science and Technology Studies of Leiden University as awardee of a José Castillejo grant (CAS15/00178) funded by the Spanish Ministry of Education.
References (37)
- et al.
Ranking national research systems by citation indicators. A comparative analysis using whole and fractionalised counting methods
Journal of Informetrics
(2012) Identifying research themes with weighted direct citation links
Journal of Informetrics
(2010)- et al.
Field-normalized citation impact indicators and the choice of an appropriate counting method
Journal of Informetrics
(2015) A review of the literature on citation impact indicators
Journal of Informetrics
(2016)- et al.
Comparing all-author and first-author co-citation analyses of information science
Journal of Informetrics
(2008) Towards all-author co-citation analysis
Information Processing & Management
(2006)- et al.
Requirements for a cocitation similarity measure, with special reference to Pearson’s correlation coefficient
Journal of the American Society for Information Science and Technology
(2003) - et al.
Visualizing knowledge domains
Annual Review of Information Science and Technology
(2003) - et al.
On bibliographic networks
Scientometrics
(2013) - et al.
Network analysis of Zentralblatt MATH data
Scientometrics
(2015)
Hyperauthorship: A postmodern perversion or evidence of a structural shift in scholarly communication practices?
Journal of the American Society for Information Science and Technology
Networks of scientific papers
Science
Publication, cooperation and productivity measures in scientific research
Scientometrics
A new methodological approach to bibliographic coupling and its application to the national, regional and institutional level
Scientometrics
Bibliographic coupling between scientific papers
American Documentation
Identifying a better measure of relatedness for mapping science
Journal of the American Society for Information Science and Technology
Mapping authors in intellectual space: A technical overview
Journal of the American Society for Information Science
Network analysis and indicators
Cited by (739)
Computational methods for biofabrication in tissue engineering and regenerative medicine - a literature review
2024, Computational and Structural Biotechnology JournalBibliometric analysis and systematic review of electrochemical methods for environmental remediation
2024, Journal of Environmental Sciences (China)A comprehensive review of nano-enhanced phase change materials on solar stills with scientometric analysis
2024, Results in EngineeringExisting industrial buildings – A review on multidisciplinary research trends and retrofit solutions
2024, Journal of Building EngineeringParkinson's disease diagnosis using deep learning: A bibliometric analysis and literature review
2024, Ageing Research ReviewsThe state of the research on circular economy in the European Union: A bibliometric review
2024, Cleaner Waste Systems
- ☆
The peer review process of this paper was handled by Vincent Larivière, Associate Editor of Journal of Informetrics.