RT Journal Article SR Electronic T1 Analysis of science journalism reveals gender and regional disparities in coverage JF bioRxiv FD Cold Spring Harbor Laboratory SP 2021.06.21.449261 DO 10.1101/2021.06.21.449261 A1 Natalie R. Davidson A1 Casey S. Greene YR 2021 UL http://biorxiv.org/content/early/2021/12/09/2021.06.21.449261.abstract AB Science journalism is a critical way in which the public can remain informed and benefit from new scientific findings. Such journalism also shapes the public’s view of the current state of scientific findings and legitimizes experts. Those covering science can only cite and quote a limited number of sources. Sources may be identified by the journalist’s research or by recommendations by other scientists. In both cases, biases may influence who is identified and ultimately included as an expert. To ensure our results are generalizable, we analyzed two journalism outlets that cater to different demogrgaphics, Nature and The Guardian. This resulted in 37,748 total articles, 15,747 science-related articles from The Guardian and 22,001 non-research articles published by Nature to quantify possible disparities. Our analysis considered three possible sources of disparity: gender, name origin, and country affiliation. To explore these sources of disparity, we extracted cited authors’ names and affiliations, as well as extracted names of quoted speakers. While citations and quotations within a piece do not reflect the entire information-gathering process, they can provide insight into the demographics of visible sources. We then used the extracted names to predict gender and name origin of the cited authors and speakers.In order to appropriately quantify the level of difference, we must identify a suitable reference set for comparison. We chose first and last authors within primary research articles in Nature and a subset of Springer Nature articles in the same time period as our comparator. In our analysis, we found a very similar skew towards male quotation in both The Guardian and Nature science journalism-related articles. However, quotation in Nature is trending toward equal representation at a faster rate than first and last authorship in academic publishing. Interestingly, we found that the gender disparity in Nature quotes was column-dependent, with the “Career Features” column reaching gender parity. Our name origin analysis found a significant over-representation of names with predicted Celtic/English origin and under-representation of names with a predicted East Asian origin. This finding was observed both in extracted quotes and journal citations, but dampened in citations. Finally, we performed an analysis to identify how countries vary in the way that they’re described in scientific journalism. We focused on two groups of countries: countries that are often mentioned in articles, but do not often have affiliated authors cited, and countries that have affiliated authors that are often cited, but the country is not typically mentioned. We found that the articles in which the less cited countries occur tend to have more agricultural, extraction-related, and political terms, whereas articles including highly cited countries have broader scientific terms. This discrepancy indicates a possible lack of regional diversity in the reporting of scientific output.Competing Interest StatementThe authors have declared no competing interest.