Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Sociobehavioural characteristics and HIV incidence in 29 sub-Saharan African countries: Unsupervised machine learning analysis

Aziza Merzouki, Janne Estill, Kali Tal, Olivia Keiser
doi: https://doi.org/10.1101/620450
Aziza Merzouki
aInstitute of Global Health, University of Geneva, Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: FatmaAziza.Merzouki@unige.ch
Janne Estill
aInstitute of Global Health, University of Geneva, Geneva, Switzerland
bInstitute of Mathematical Statistics and Actuarial Science, University of Bern, Bern, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kali Tal
cInstitute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Olivia Keiser
aInstitute of Global Health, University of Geneva, Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: olivia.keiser@unige.ch
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Objective HIV incidence varies widely between sub-Saharan African (SSA) countries. This variation coincides with a substantial sociobehavioural heterogeneity, which complicates the design of effective interventions. In this study, we investigated how socio-behavioural heterogeneity in sub-Saharan Africa could account for the variance of HIV incidence between countries.

Methods We used unsupervised machine learning to analyse data from the Demographic and Health Surveys of 29 SSA countries completed after 2010. We preselected 48 demographic, socio-economic, behavioural and HIV-related attributes to describe each country. We used Principle Component Analysis to visualize sociobehavioural similarity between countries, and to identify the variables that accounted for most sociobehavioural variance in SSA. We used hierarchical clustering to identify groups of countries with similar sociobehavioural profiles, and we compared the distribution of HIV incidence and sociobehavioural variables within each cluster.

Findings The most important characteristics, which explained 69% of sociobehavioural variance across SSA among the variables we assessed were: religion; male circumcision; number of sexual partners; literacy; uptake of HIV testing; women’s empowerment; accepting attitude toward people living with HIV/AIDS; rurality; ART coverage; and, knowledge about AIDS. Our model revealed three groups of countries, each with characteristic sociobehavioural profiles. HIV incidence was mostly similar within each cluster and different between clusters (median(IQR); 0.5/1000(0.6/1000), 1.8/1000(1.3/1000) and 5.0/1000(4.2/1000)).

Conclusion Our findings suggest that sociobehavioural factors play a key role in determining the course of the HIV epidemic, and that similar techniques can help to design and predict the effects of targeted country-specific interventions to impede HIV transmission.

Knowledge before this study We searched PubMed with the terms: “HIV”, “inequality”, “factors” and “sub-Saharan Africa” for articles published in English before February 28th, 2019. The reviewed literature was usually limited to a certain sub-population, sub-national region, or country; but some recent studies covered up to 31 sub-Saharan African countries. Based on a relatively small number of variable (5 to 13), and using descriptive statistics, regressions and concentration indices, previous works analysed the association between socio-economic inequalities, male circumcision, high-risk sexual behaviour, or HIV-related stigma, with HIV testing, uptake of treatment, ART adherence, or HIV prevalence.

Contribution of this study To our knowledge, this is the first study where unsupervised machine learning techniques (Principle Component Analysis and hierarchical clustering) were used to analyse the sociobehavioural heterogeneity in sub-Saharan Africa (SSA) and how it associates with the variability of HIV incidence in the region. We identified three distinct sociobehavioural profiles, which were associated with different geographical regions and different levels of HIV incidence in SSA. Because the association between the variability of HIV incidence across SSA and its underlying sociobehavioural factors is still not well understood, we believe that our analysis that compares 29 SSA countries based on 48 sociobehavioural characteristics brings significant value to the field. Identifying and comparing sociobehavioural profiles of countries helps to design and predict the effect of tailored country-specific interventions to impede HIV transmission.

Footnotes

  • Figure 1, left pannel revised

  • https://gitlab.com/AzizaM/dhs_ssa_countries_clustering

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted July 12, 2019.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Sociobehavioural characteristics and HIV incidence in 29 sub-Saharan African countries: Unsupervised machine learning analysis
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Sociobehavioural characteristics and HIV incidence in 29 sub-Saharan African countries: Unsupervised machine learning analysis
Aziza Merzouki, Janne Estill, Kali Tal, Olivia Keiser
bioRxiv 620450; doi: https://doi.org/10.1101/620450
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Sociobehavioural characteristics and HIV incidence in 29 sub-Saharan African countries: Unsupervised machine learning analysis
Aziza Merzouki, Janne Estill, Kali Tal, Olivia Keiser
bioRxiv 620450; doi: https://doi.org/10.1101/620450

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Animal Behavior and Cognition (4838)
  • Biochemistry (10729)
  • Bioengineering (8005)
  • Bioinformatics (27167)
  • Biophysics (13930)
  • Cancer Biology (11079)
  • Cell Biology (15977)
  • Clinical Trials (138)
  • Developmental Biology (8757)
  • Ecology (13228)
  • Epidemiology (2067)
  • Evolutionary Biology (17307)
  • Genetics (11663)
  • Genomics (15877)
  • Immunology (10986)
  • Microbiology (25979)
  • Molecular Biology (10600)
  • Neuroscience (56312)
  • Paleontology (416)
  • Pathology (1727)
  • Pharmacology and Toxicology (2998)
  • Physiology (4528)
  • Plant Biology (9583)
  • Scientific Communication and Education (1610)
  • Synthetic Biology (2668)
  • Systems Biology (6956)
  • Zoology (1507)