ABSTRACT
Objective Parkinson’s Disease (PD) is a clinically diagnosed disease that carries a reported misdiagnosis rate of 10–20%. Recent scientific discoveries have provided evidence of volatile organic compounds in sebum that are unique to patients with PD. The purpose of this study was to determine if companion dogs could be trained to distinguish between sebum samples provided by PD-positive patients and PD-negative human controls.
Methods This was a randomized, handler-blind, controlled study. Twenty-three canines of varying breeds, ages, and environmental backgrounds were included. The two-year study period reported here was the final two years of a seven-year program which started in January of 2016. This study encompassed 200 total working session days from 2021 and 2022.
Results When averaged as a group over two years, the 23 dogs were 89% sensitive and 87% specific to an olfactory distinction between PD-positive and PD-negative human donor samples. Ten of the twenty-three dogs averaged 90% or higher in both sensitivity and specificity.
In 161 separate instances, dogs were presented with both unique PD-positive and PD-negative samples (the dogs had not previously encountered any of the samples presented). For these first-time exposures, the 23 dogs collectively averaged 86% sensitivity and 89% specificity.
When comparing the sensitivity and specificity of PD-positive samples from donors who reported levodopa usage to PD-positive samples from donors who reported no levodopa usage, the dogs showed no statistical difference in sensitivity or specificity at the 95% significance level, indicating levodopa usage is not a factor in PD canine detection. Other factors investigated as part of this study included sample donor gender, canine breed, age, duration of time in the program, and training.
Conclusions Companion dogs can be trained with reward-biased detection methodologies to distinguish between PD-positive and PD-negative donor sebum samples in a controlled setting. This study provides further evidence of one or more volatile organic compounds in the sebum of PD-positive patients that can be detected by canines. Summarily, study findings support the application of trained companion dogs for the screening of PD-positive and PD-negative samples in which numbers of samples are limited and the dogs are worked in short intervals, followed by recovery training.
Box 1. Glossary of Terms
INTRODUCTION
Parkinson’s Disease (PD) is a multi-symptomatic progressive neurological disease resulting in fatal symptomatic-system failure. It is estimated that as many as 10 million people worldwide are living with PD (1). In the United States alone, the incidence of PD is estimated at the rate of 90,000 new cases a year. The only neurological disease more prevalent than PD is Alzheimer’s Disease. The greatest known risk factor of PD is aging, with over 90% of cases in the United States at age 65 or older (2). The second-greatest risk factor is gender, with the incidence rate of males developing PD at 1.5× greater than females (2).
There is no conclusive screening test for PD and the diagnosis is most often provided by a combination of clinical assessment of symptoms, dopamine transporter scan, skin biopsy, hereditary, and environmental factors. The misdiagnosis rate in PD is estimated to be between 10–20%, with a greater rate of misdiagnosis occurring within the first two years of diagnosis (3). By the time a patient receives a clinical diagnosis, the disease has progressed to the point of a 60% dopaminergic reduction in the basal ganglia of the brain. Due to this reduction in dopamine, the diagnostic hallmark symptom of tremors become present, along with other associated symptoms, such as loss of balance, cramped handwriting, sleep disturbance, and altered patterns of speech (4).
The gradual reduction of dopamine is estimated to begin in the brain 5–10 years before the presentation of the symptoms that lead to a clinical diagnosis (5). Once this diagnosis has been made, a patient is usually treated with a drug or combination of drugs, which work to cross the blood-brain barrier and deliver a chemical substitute for dopamine, helping to slow, but not stop, the progressive reduction of dopaminergic loss to the brain (6). It is believed that if drugs, along with other therapeutic interventions, could be introduced at the onset of PD, then the progression of the disease could be slowed at a much greater rate than provided through current methods of detection and treatment. This explains the urgent need for an early detectable biomarker of PD (7).
Canines, with their high ability for scent detection, could provide detection for PD-associated odor(s) i.e., a biomarker. However, canines raised in controlled kennel environments for scent detection typically cost between $15,000 and $20,000 per dog. The cost of maintaining a kennel detection environment to raise and train dogs for scent detection is typically over $100,000 per year (8). By contrast, in this study, we set out to explore whether household canine pets could be utilized effectively for the detection of a PD-associated odor, thus eliminating the need for costly, and less available, kennel-raised professional detection dogs.
In 2015, Joy Milne, a woman diagnosed with hyperosmia, a condition that provides her with an extremely sensitive sense of smell, was shown to be able to distinguish between T-shirts worn by PD-positive sample donors and T-shirts worn by human control sample donors (9). The introduction of canine olfaction as a detection tool for the identification of a PD-associated odorant as a biomarker for the disease became a field of canine olfactory study with PADs (Parkinson’s Alert Dogs) for PD located in Washington State. PADs, as an organization, was solely focused on the canine detection of PD and conducted 745 days of research in which data was collected and compiled each day. Our study encompasses the final two years of data from the PADs Program. PADs was the first program to undertake canine detection training for PD-associated odor.
In 2020, working with Joy Milne, researchers at the University of Manchester utilized paper spray ionization mass spectrometry, along with Milne’s sense of smell, to discover more than 500 organic compounds unique to the sebum of PD-positive patients (10). This discovery is being used for the development of a laboratory skin test for the screening of a biomarker for Parkinson’s (9).
Dogs can be trained to detect target odors with a sensitivity that surpasses the capabilities of most modern instruments (11). Dogs can detect odors in parts per trillion (12) and have been successfully trained to use their olfactory ability to correctly identify and distinguish diseases such as lung (13–16), breast (17,18), and colorectal cancers (19), as well as malaria (20), COVID-19 (21–24) and diabetes—each with a reported sensitivity and specificity of >80% (11–14,17,19,20,23,25). In 2022, a study by Gao et al. (26) presented proof of concept for the canine detection of PD in a controlled, laboratory setting. The three canines trained for the study were of the Belgian Malinois breed and were trained in a laboratory setting for canine detection of PD. Before the study, all three dogs had been in training for two years for PD detection. In 2023, a study by Rooney et al (27) provided further supporting evidence of canine detection of PD with findings of two trained dogs in a medical detection program in the U.K.
Household-maintained canines were successfully trained for medical detection purposes in 2008. Medical Detection Dogs, U.K., assigns dogs to handlers who maintain the dog in a home environment, as opposed to a canine group-kenneled environment. The dogs are brought from their home to the facility for training purposes and returned to their home at the end of each training day. The dogs of Medical Detection have been successfully trained for the detection of cancer, diabetes, and malaria. (8)
The importance of breeds for detection has been brought into question in recent years. A canine detection breed comparison study was conducted in which German Shepherds, a breed considered desirable for scent detection, were compared to Pugs, a brachycephalic breed considered unsuitable for scent detection due to the physical attributes of a Pug’s shortened nose. In one study, the Pugs outperformed the German Shepherds in the detection tasks set before them. A third breed, Greyhounds, were also included in the study, but the Greyhounds did not perform well in the scent detection tasks set before them, likely due to lack of interest or drive in the task (28).
Additionally, we wanted to determine if the drug, levodopa, would emerge as a factor in the sensitivity findings of the dogs. Since 1970, levodopa has been the most prescribed drug for the therapeutic treatment of PD (29), and a possibility existed that dogs could target a volatile organic compound as a byproduct of levodopa drug usage, rather than a PD-associated odor(s) caused by the disease.
The primary objective of our study was to determine if detection-trained companion dogs of varied breeds, ages, and backgrounds could distinguish between PD-positive and PD-negative samples. This study further advances the field of canine medical detection of PD by reporting successful findings when the following factors of significance were investigated:
Levodopa usage as it affects study sensitivity in PD-positive samples.
Gender differences of sample donors.
Canine memorization of samples.
Large variety of breeds, backgrounds and ages of dogs.
High number of dogs in the trials with no outliers removed.
High number of trials with no outliers removed.
Comparison of canine sensitivity between PD-sourced samples that have never been encountered by the dogs to samples that have been previously encountered.
Length of detection training over extended time.
METHODS
Study design
This was a handler-blind, randomized study that included a significant number of detection-trained companion dogs of varying breeds, exposures, and training days to evaluate sensitivity and specificity when presented with PD-positive and PD-negative samples. The study took place in a controlled laboratory setting that was located within a dedicated facility. The facility building was leased and remodeled for the purpose of the seven-year study and included a 14 x 20 ft training area, canine waiting room, research and scribe observation area, sample storage room, office and storage room, and canine exit hallway. The facility was used exclusively for the purpose of this study.
The data collected and reported in this study was compiled from the final two years of a seven-year overall nonprofit canine detection program for the detection of PD. The study period included 200 total working days—95 days in 2021 and 105 days in 2022. Twenty-three dogs participated in a total of 4553 individual trials. The average canine attendance was 8.7 dogs per daily working session. For each daily working session in which 10 or more dogs participated, a minimum of 320 data points were recorded.
Due to a 10%, or higher, potential rate of misdiagnoses in PD, the dogs were at times presented with samples from donors with questionable diagnoses. In these instances, the dogs were not reinforced for a positive indication, regardless of whether it may have been a correct response. This was to reduce the opportunity for any dog to be reinforced for an incorrect response. These samples were only categorized as PD-positive when a minimum of 8 of 10 trained dogs provided a positive indication to the sample when the sample was first introduced. Since no conclusion could be drawn before the analysis of the response of the 10 dogs, no dogs were reinforced until the outcome was determined. This form of withholding the reward carried the risk of extinction of desired response. To counteract this, the dogs underwent training recovery rounds and recovery days on previously determined PD-positive samples, so that the dogs could regain drive for continued work and maintain their expected performance levels for sensitivity and specificity.
If the assessment sample in the first round of presentation to the dogs was deemed PD-positive based on the response of 80% or higher sensitivity of 10 dogs, then the assessment sample was then presented in a second round, in a new position on the wheel (see Figure 2), and the dogs were reinforced for a correct response in that round. If the assessment sample in round one was deemed PD-negative based on the response of 79% or lower sensitivity of 10 dogs, then the dogs were presented with a different, previously assessed PD-positive sample, in the second round.
Sample kit including T-shirt and canister, with additional materials for swab sample collection.
Panel I: a) Kit container, b) Sample container, c) T-shirt sample to be worn by donor, d) Disposable gloves for caretaker (if assistance is needed), e) Research collection forms;
Panel II: a) Sterile swabs, b) Swab sample container, c) Removable cap at either end so that heads of swabs are available for sniffing without being handled by a researcher.
Sample presentation wheel. Sample canister holders were elevated above the flooring to prevent residual odor contamination to the floor. The wheel is designed to be rotated in any direction.
In instances where the dogs were not reinforced for correct sensitivity responses on PD-positive samples (i.e. assessments), the data were not included in the final statistical analysis, but specificity on the PD-negative samples was included. This was because, in assessment sessions with the assessment sample of unknown PD status, the human control sample was a known PD-negative sample.
Sample participants
PD-positive sample donors were recruited from Parkinson’s support groups, Rock Steady Boxing (a PD-participant fitness organization), physicians, and neurologists, as well as via media coverage and pull-through responses from the website, padsforparkinsons.org). Control sample donors were recruited from activity clubs, friends, and family members of program volunteers, businesses, and other events. All samples were collected from donors who provided informed consent, and personal donor sample information was blinded from all study personnel, affiliates, and media. The informed consent document used to obtain permission for sample inclusion in this research study was prepared and reviewed by legal counsel.
There were 43 total PD-positive sample donors and 31 total PD-negative sample donors whose samples were used in this study spanning 2021 and 2022 (Table 1). PD-participant donors were screened for levodopa drug usage, age, gender, onset of symptoms, type of diagnosis, date of diagnosis, and whether the diagnosis was by a physician, movement disorder specialist, or neurologist. Overall, 24 of 43 sample donors were diagnosed by a neurologist, one was diagnosed by a movement disorder specialist, and the remaining 18 sample donors were diagnosed by a physician. Control participants were screened for age and gender and were eliminated for any of the following: household partner diagnosed with PD, anosmia, bowel disorder, sleep disorder, changes in gait, speech, or handwriting. When all unique samples were presented to the dogs, the Control and PD samples were matched in age within 10 years and matched by gender in 80% or greater instances. Both PD participant and Control participant donor samples were of varied geographic origin within the United States. Table 2 lists the control sample donor characteristics for the 31 control samples used in this study.
For this two-year study, 58% of PD sample donors were selected based on confirmed diagnosis by a neurologist and the elapsed time since diagnosis and symptoms. The remaining 42% of the PD-patient sample donors were selected based on reported symptoms with clinical diagnoses. The levodopa-negative Parkinson’s sample donors comprised 17 of the 43 total sample donors. Control sample donors were of varying age, ethnicity, and gender, reporting no associative Parkinson’s symptoms.
Sample material
T-shirts that were worn overnight by all participants were chosen as sample material. This selection was based on the Joy Milne test developed by Dr. Tilo Kunath at the University of Edinburgh (9). All T-shirts provided to both PD and control participants were purchased from the same manufacturer, tank style, ribbed, 100% cotton, and washed under the same conditions in the same machine, packaged and shipped using the same materials. Along with a new, freshly washed T-shirt, sample donors were provided with a new, stainless-steel double-walled vacuum-insulated 24-ounce thermos and a new metal lunchpail for packaging the T-shirt, once worn, for the return to the program (Figure 1). The thermos was designed to hold heat for 14 hours and cold for up to 24 hours and was selected as a solution for containing volatile organic compounds within a container for shipping and storage. Donor sample participants were instructed to pack the T-shirt into the thermos with the neck of the T-shirt at the top of the thermos.
Sample storage
All thermoses containing samples were stored in a dedicated cooler set at 11 °C and presented to the dogs within 30 days of receipt. Samples on which the dogs performed with averages of 80% or higher sensitivity as indicated by 10 or more dogs in one session were used for maintenance training, recovery training, and foundation training of new dogs.
In 2022, we learned that Medical Detection Dogs, U.K., had successfully trained dogs for the detection of PD-associated odor using cotton swabs. Based on this information, in September of 2022, the PADs program changed the sample material used in canine detection from T-shirts to cotton swabs, a less expensive and more efficient sample material for collection and storage. Under the sample collection protocol for swabs, 10 cotton swab samples were collected from the upper back and neck area of each PD and control sample participant. The swab samples were collected by a gloved PADs researcher and each sample collection of 10 swabs was placed in a 4.3” height × ½” diameter metal tube that was capped at both ends. This allowed for the swab-tip end of the cotton swab to be opened for sniffing by the dogs without handling by a researcher. The tubes, containing swab samples, were stored under the same protocol, and handled under the same procedure as the T-shirt samples (Figure 1).
Canine participants
A total of 23 canines of various breeds and ages participated. The canines were selected based on age, demonstration of drive, and availability for training, with little consideration given to breed or sex. All dogs completed a minimum of eight months of prior training for PD detection, whereas some had completed as much as five years of prior training. The Parkinson’s canine detection program had been underway since 2016, so all dogs in the study were previously trained to distinguish between T-shirt samples worn by Parkinson’s patients and T-shirt samples worn by healthy human controls with an average of 75% or higher sensitivity and specificity. Breeds represented in the study included commonly utilized breeds for detection purposes, such as Labrador Retriever and Vizsla, and some of the dogs were of less commonly utilized breeds, such as Pomeranian and English Mastiff. Table 3 shows the breed, age, and sex of each dog at the beginning of the study.
Foundation detection training
Before study selection and during the study period, all dogs were exposed to operant conditioning training methods in their home environment. Owner/handlers engaged in primarily two quadrants of behavioral training by utilizing positive reinforcement and negative-punishment methods for shaping canine behavior.
All dogs were trained for PD odor detection by a single detection trainer. None of the dogs were owned or separately handled by the detection trainer. The dogs underwent foundation training through methods similar to controlled substance detection training, beginning with building hunt drive on a primary reinforcer, then followed by pairing in which the primary reinforcer was available to the dog along with the target odor. At the final stage of foundation training, the dogs worked in the absence of pairing and received a reinforcer for an indication of the target odor. During foundation training, all dogs underwent a minimum of 300 paired exposures (the primary reinforcer, i.e., food, was available to the dog at the time of exposure) at the rate of 10 exposures per training day on their target odor for 6 months. The dogs were also subjected to varied background noise in both target and control samples and were presented with distractors (i.e., onion, coffee, spices, soaps, etc.) and clear (absent of target odor) rounds. In clear rounds, where no target odor was presented, the dogs were trained to go to the exit gate of the facility and were then reinforced outside of the exit gate. During foundation training, all dogs were presented with equal numbers of human-sourced control samples and PD-positive samples so the dogs would learn to distinguish their target odor from the background noise of human scent.
Once a dog had detected the target odor, the dog was trained to hold its body position at the target odor for a minimum of three seconds before receiving a reinforcer. This allowed adequate time for the dogs to clear the control samples with most clearing of samples taking place in one second or less. The dogs were allowed to sit, stand, down, or point while holding, but an alert was not confirmed unless the dog held the position for three or more seconds (30). If a dog held position at a human-sourced control sample for three seconds and then moved onto the target odor for three seconds, the dog was reinforced for indication of the target odor but scored as 0% sensitive, 0% specific for the round (see Table 4, Scoring of the Dogs).
Once foundation training had been completed and a dog was able to demonstrate the olfactory distinction between a unique unpaired sample containing the PD-associated target odor (sensitivity) and that of a human control sample (specificity) of 80% or higher across six working sessions on varied samples, the dog was considered trained to detect PD-associated odor and suitable for study inclusion.
Sample presentation
The donor T-shirt or swab samples within the stainless-steel canisters were not handled by the researcher or any member of the research team. A gloved researcher placed each sample canister on the wheel as indicated by color-coded random selection. The selector indicated the position of each sample to be placed on the wheel, and whether the wheel was to be rotated in counterclockwise or clockwise position between rounds. The wheel was rotated into the position as indicated by the randomizer, and then the predetermined PD-negative control samples were taken from the refrigerated storage unit, canister lids were uncapped, and the control samples were placed onto the wheel (Figure 2). The trainer re-gloved and then placed the PD-positive sample last on the wheel. The canister holders on the wheel were routinely wiped with isopropanol to eliminate lingering or residual odor.
For sniffing indication by the dogs, four samples were randomly placed on a sample wheel. To minimize the potential for fringing (an alert that occurs when the dog indicates close to the target odor but not at the source) and reduce odor transfer between samples, the canisters were spaced four feet (1.23 m). The wheel, for each round of canine sniffing, held one PD-positive sample, one human-sourced control sample, and two other unworn T-shirt samples. The wheel position in the room, sample position on the wheel, and order of dogs were all placed by random selection. The dogs worked off-leash and were free roaming as they entered the sample presentation room and were permitted to work the wheel in any direction. Some of the dogs worked the wheel in a clockwise direction and some worked counterclockwise. A few of the dogs were prone to changing direction based on the scent signature in the room. The dogs worked individually, one after the other, for two or three consecutive rounds and maintained the same run order as determined for the day. The sample wheel was rotated for each round so that, for each round, the dogs encountered the samples in a different position on the wheel.
For most of the study period, there were four working days per week. Not all dogs attended each working day, but most dogs attended at least one or two working days per week. This means that, on each working day, there was a different assembly of dogs. The dogs also varied in their run order, so that the order in which the dogs encountered the samples on the wheel and were subsequently scored was random on each day attended. The dogs were further varied in their ages, breeds, environmental and genetic factors, sex, duration, and experience within the program.
Handler-blind research protocol
All handlers observed the dogs under blind conditions and were instructed to stand motionless in one position outside of the immediate training area. One trainer/researcher observed the dogs in a mirror while standing motionless in the same marked spot through all rounds and all sessions in the study (Figure 2). While the dogs sniffed samples, no eye contact was made by any researcher or trainer with the dogs. A data recorder (scribe) sat outside the training room behind a half-wall barrier and was able to observe the dogs from a distance.
Scoring of the dogs
Indications were confirmed between the trainer and scribe and determined as 100% sensitive if the dog’s first 3-second indication (hold) on a PD participant-sourced sample was considered correct, and the dog had not held a three-second alert on any other sample on the wheel. If the dog cleared the human-sourced PD-negative control sample (sniffed and did not hold position) on the wheel before indicating on the PD-positive sample, the dog was determined as 100% specific for the round. If the dog indicated the target sample as positive before encountering the human control sample on the wheel, the specificity for that round was scored as “NA” or non-applicable and did not apply to the daily specificity calculations for the dog or sample in that round. If a dog were to hold for three seconds on a human-sourced control sample at any time in the round, the dog was scored as 0% specific and 0% sensitive. In this instance, the dog was scored as 0% sensitive, even if the dog did not have the opportunity to sniff the PD-sourced sample. (See Table 4 for an explanation of scoring).
Sensitivity and specificity were both calculated as in Trevethan (31). The overall averaged specificity with all control samples included (human-sourced and non-human-sourced controls) was 91.2%. When the non-human sourced controls were eliminated from the specificity calculations, specificity was at 86.6%, a truer calculation of PD-positive human-sourced samples as compared to PD-negative human-sourced samples as per the objective of the study. In all instances, a human-sourced control sample was presented on the wheel. In this manner, the dogs were scored for sensitivity and specificity to mimic a correct or incorrect response in a simulated screening scenario in which all people, or all samples, would exude human scent signature.
Statistical methods
Sensitivity and specificity data were recorded for each dog on prepared data sheets as the dog exited the training area. These daily rounds data sheets were then manually summarized into daily summaries, and then into monthly canine performance reports. The data from the daily sheets were also entered into individual daily spreadsheets. Daily spreadsheets were then combined and summarized to match the manually summarized reports, then verified against the manually entered canine performance reports, and discrepancies were corrected to match the original recordings made during each working session.
The verified spreadsheet data sets were imported into Microsoft Access database tables, where queries were developed to select and summarize the results. Where confidence limits are presented, the Clopper-Pearson (exact) confidence interval method (32) for binomial trials was used through the confidence interval for a proportion (33). Where comparisons of proportions are presented, the N-1 Chi-squared test as recommended by Campbell (34) and Richardson (35) was employed in MedCalc Software Ltd. For the comparison of proportion calculators (36) of levodopa usage results, a two-proportion z-test was used (37).
RESULTS
Average sensitivity/specificity
Table 5 summarizes the averaged sensitivity and specificity by dog for the two years of the study (4959 total dog-sample encounters). Assessment days were included if the sample was found to be PD-positive. During assessment days, sensitivity was not recorded, but specificity was. The dogs are ranked by percent sensitivity, excluding assessment days. There were 10 “top-tier” dogs (90% or higher in both sensitivity and specificity), although three of those dogs had less than 100 rounds.
For all 23 dogs in the program, the combined overall average sensitivity was 89.0% (4053 correct / 4553 total encounters), and overall specificity was 86.6% (2592 passed-up human control / 2993 total human control encounters). Specificity includes a lower number of encounters because sometimes dogs would alert on the PD sample before encountering the human control sample.
Average sensitivity/specificity for top-tier dogs
For dogs that achieved 90% or higher for both sensitivity and specificity (10 dogs: Sasha, Bendy, Velvet, Penny, Jaden, Hudson, Quil, Russell, River, Scarlett), the overall sensitivity was 93.5% (2163 correct / 2313 total encounters) and the overall specificity was 93.3% (1452 passed up human control / 1556 total human control encounters).
Average sensitivity and specificity when encountering all unique PD samples and control donor samples
The average sensitivity for first-time encounters (first round only with no warm-up round, so all first encounters were cold runs) with a unique PD sample and unique human control sample was 86.3% (139 correct / 161 total encounters). The average specificity for first-time encounters (first round only) with a unique PD sample and unique human control sample was 89.0% (121 passed-up human control / 136 total human control encounters). Dogs encountered the unique samples in random order for each round, and the samples were in randomized positions in the room for each round.
Comparing the sensitivity and specificity for first-time encounters with a unique PD sample and unique human control sample to the overall sensitivity and specificity for first-round encounters (with unique sample rounds removed) showed no significant difference (p = 0.95) as shown in Table 6. Specificity was higher for those rounds with unique samples but did not exceed the 90% confidence level. This indicates that it is not likely that the dogs are indicating PD-positive samples by memory.
Average sensitivity/specificity for top-tier dogs when encountering all unique PD and control donor samples
The combined sensitivity for the 10 top-tier dogs (90% overall sensitivity and specificity for the entire study period) when they first encountered a unique PD and unique human control sample (first round only) was 87.1% (74 correct / 85 total encounters). The combined specificity for the same group and samples was 90.4% (66 passed-up human control / 73 total human control encounters).
Averaged sensitivity/specificity for rounds 1 and 3
The overall combined average sensitivity for all dogs in round 1 of each daily session (where both round 1 and round 3 were attended) was 86.2% (1174 correct / 1362 total encounters), whereas the overall combined average sensitivity for all dogs in round 3 of each daily session (where both round 1 and round 3 were attended) was 91.1% (1241 correct / 1362 total encounters).
The overall combined specificity for all dogs in round 1 of each daily session was 83.6% (766 passed up human control / 916 total human control encounters), and the overall combined specificity for all dogs in round 3 of each daily session was 88.1% (749 passed up human control / 850 total human control encounters).
In both sensitivity and specificity, there was a statistically significant increase of 4.9% (p = 0.0001) and 4.5% (p = 0.007), respectively, between round 1 and round 3 (Table 7). This could indicate that the dogs were becoming more familiar with the PD odor, or that they were becoming more familiar with the procedure of the trials throughout the day.
The location of the PD-positive sample was randomized between each round, so the increase between rounds 1 and 3 was not likely due to memorizing the location of the PD sample. This further denotes that the dogs were not identifying PD-positive samples by memorization.
When comparing sensitivity and specificity for the first round of all unique samples to the final round of the day for all unique samples, the differences were not significant at the 95% level (p = 0.3 and p = 0.96, respectively), as shown in Table 8.
Tables 7 and 8 suggest that the dogs were not memorizing target odor by sample or position of sample regardless of whether they were encountering all samples on the wheel for the first time or had encountered them in a prior round.
Average sensitivity/specificity for the dogs by breed
To show variations in breed groups represented, dogs were categorized as shown in Table 9. These groupings are based on AKC-recognized breed standards (38).
The combined average sensitivity and specificity for the breed groups are reported in Table 9. In most cases, there are not enough dogs in any breed group to make any inferences about particular breed sensitivity or specificity trends. In addition, each dog was raised in a different environment, and most breed groups represented in the study consist of dogs of dissimilar ages with varied days and duration of program attendance.
Figure 3 represents the overall sensitivity for each dog grouped by breed, labeled with dog age. Figure 4 represents the overall specificity for each dog grouped by breed, labeled with dog age. Both Figures 3 and 4 show that breed did not significantly influence outcomes for sensitivity and specificity for this study.
Overall sensitivity for each dog by age and breed.
Overall specificity for each dog grouped by age and breed.
Average sensitivity/specificity compared to total exposures
Figure 5 plots the average sensitivity for each dog during the 2020–2021 study period against the total exposures that each dog had to PD-Positive odor over the life of the PADs program. While there was one dog with low total exposures and low average percent sensitivity, this is likely a special case related to the individual dog. Overall, more total PD-Positive exposures do not produce a higher average sensitivity for any given dog (R2 = 0.043, p = 0.344).
Average percent sensitivity vs. total program exposures for each dog.
Figure 6 plots the average specificity for each dog during the 2020–2021 study period against the total exposures that each dog had to PD-positive odor over the life of the PADs program. Overall, more total PD-positive exposures do not produce a higher average specificity for any given dog (R2 = 0.007, p = 0.711).
Average percent specificity vs. total program exposures for each dog.
Sensitivity/specificity of dogs on drug-naïve PD-positive sample participants vs. participating sample donors who report levodopa usage
Overall sensitivity for participating PD-positive sample donors who reported levodopa usage at the time of sample collection was 89.0% (3020 of 3395, 95% CI 87.9% to 90.0%), and sensitivity for participating PD-positive sample donors who reported no levodopa usage at the time of sample collection was 88.25% (954 of 1081, 95% CI 86.18% to 90.11%). Using a two-proportion z-test these percentages are not shown to be significantly different (z = 0.64, two-tailed p-value = 0.52).
Comparing specificity between participating PD-positive sample donors who reported levodopa usage at the time of sample collection vs. those who did not use levodopa showed similar non-significant results. Overall specificity for PD-positive levodopa users was 85.6% (1575 of 1839, 95% CI 84.0% to 87.2%), and specificity for PD-positive non-levodopa users was 87.1% (600 of 689, 95% CI 84.4% to 89.5%). Using a two-proportion z-test, these percentages are not significantly different from each other (z = - 0.69, two-tailed p-value = 0.49). This shows that the dogs were not indicating based on levodopa.
Comparison of sensitivity/specificity between male PD-positive donors to female PD-positive donors
Table 10 shows the overall sensitivity and specificity for male PD-positive sample donors compared to those of female PD-positive sample donors. For both sensitivity and specificity, there were no significant differences between male PD-positive donors and female PD-positive donors.
Sensitivity/specificity showing the transition from T-shirt samples to swab samples
During September 2022, the samples presented to the dogs transitioned from T-shirts to cotton swabs that had been rubbed over the skin near the back of the neck for both human control and PD-positive donors. Figures 7 and 8 show the daily average sensitivity and specificity trends for the transition period, August 1, 2022, to November 17, 2022.
Sensitivity trend during the period of transition from t-shirt samples to swab samples, August 1, 2022, to November 17, 2022. Comparing the seven days prior to the start of the transition from t-shirt samples (blue circles) to swab samples (orange squares) to the seven days after the transition was complete, average percent sensitivity increased from 82.4 (n = 204) to 91.1% (n=180), p = 0.0124.
Specificity trends during the period of transition from t-shirt samples to swab samples, August 1, 2022, to November 17, 2022. Comparing the seven days prior to the start of the transition from t-shirt samples (blue circles) to swab samples (orange squares) to the seven days after the transition was complete, average percent sensitivity increased from 81.7 (n = 180) to 88.4% (n=112), p = 0.1256.
During the introduction of cotton swabs only, we saw a significant decline in sensitivity and specificity during September 2022. Once paired with a small piece of T-shirt from the same donor, the results were comparable to the T-shirt-only sensitivity. By November 2022, the dogs showed confidence and proficiency in sensitivity using cotton swabs alone. The decline in sensitivity of September 2022 was included in all reported data. Specificity showed no decline during this period, suggesting that the dogs were not challenged by the absence of PD-associated odor in the sample material. This suggests that for the dogs, the absence of their target odor was understood by them, regardless of the difference in sample material.
DISCUSSION
In this study, we were able to determine that most household companion dogs when trained through classical detection methods could be used to distinguish between PD-positive and PD-negative samples with a sensitivity of 85% or greater. The top-tier group of dogs was able to distinguish and select PD-positive samples with a sensitivity of 94%. This indicates the selection of top-tier (highly motivated for detection purposes independently of breed) canines would be better suited for the detection of PD-associated odor.
An aspect of canine medical detection training that could be called into question is that of memorization of sample odor so that it could be surmised that target samples that are repeatedly presented to the dogs would realize an increase in sensitivity. In this study, we purposely presented the dogs with samples never encountered and compared these outcomes with samples previously encountered. We repeated this in a high number of instances and under varied conditions. For all the dogs, in all instances, there was less than a 4% difference in sensitivity and specificity between presenting the dogs with “unique” or previously never encountered samples, and samples that had been previously encountered. Findings were consistent even when all the samples presented in the round were unique. This would suggest that the dogs, as a group, were primarily reliant upon their olfactory sensitivity as opposed to any prior familiarization with a specific sample. We further suggest that in determining the sensitivity of canine detection, it would be of value to present findings of the dogs when they encounter all unique donor samples.
An additional aspect of canine detection of PD that is frequently called into question is how drug usage of sample donors affects sensitivity findings. In our study, we found that levodopa usage (the most prescribed drug in PD), did not significantly affect the sensitivity or specificity of the dogs. In Trivedi, et. al, a study that investigated levodopa in sebum secretions, it was determined that changes in sebum from PD-positive patients were not associated with this medication (39). In both Gao (26) and Rooney (27), it was determined that sensitivity and specificity findings were not influenced by drug usage of sample donors.
We did not find that sensitivity and specificity averages for the dogs increased with time spent in training beyond 300 exposures. The study data showed that once the dogs had been in training for one to two years and had achieved between 300 and 600 exposures during that time, the dogs did not increase in their sensitivity and would level off with some slight increases and decreases. Two of the dogs that had been in the program for more than five years showed a marked decrease in sensitivity as they aged, but both dogs had physical disorders—one dog had repeated seizures and the other had a malignant tumor.
Findings over time showed that very few of the dogs were able to recover on the same day from rounds in which reinforcement was not delivered. This was especially true of instances in which the dogs were likely (80% or higher probability) correct in their indication of PD-associated odor and did not receive a reinforcer. These were samples where the sample donor presented with symptoms, but the diagnosis was not by a neurologist, or the diagnosis was more recent than two years. In these instances, the dogs were simply observed in their indication of all unique samples on the wheel in the first round, and not reinforced for their indication unless a total of eight or more of ten dogs had presented with indication on the PD-possible sample.
The decision to not reinforce the dogs for samples in this situation was made based on the science and principles of operant conditioning. Operant conditioning behaviors that are reinforced are more likely to be repeated than behaviors that are not reinforced. For this reason, the dogs, as a group, were then presented with several days of recovery training in which they were presented with samples of known status. This was necessary to rebuild canine confidence that may have been diminished through a lack of reinforcement for a correct response. In the absence of recovery training, the dogs as a group were observed to trend downward in sensitivity. This is likely because companion dogs that are trained with operant conditioning and subjected to increased pressure are more likely to offer behavior leading to false positives in training. In the Hackner study (16), which simulated diagnostic screening conditions for dogs for lung cancer, the specificity of the dogs dropped to 34% under screening conditions, suggesting a high incidence of false positives. The dogs in the study had been trained and were performing at a sensitivity rate of 90% or higher before being subjected to a simulated diagnostic screening test (16,40).
For this reason, we would not recommend the use of household companion dogs for production-style diagnostic screening for PD. Since there is no readily available diagnostic test for the presence of the PD odor, it would not be feasible to reinforce the dogs for every positive indication with confidence in canine accuracy. It requires a high number of dogs to provide statistical significance for sensitivity and, until statistical probability is determined, the dogs cannot be reinforced without risking reinforcement of incorrect behavior. This limits the number of samples the dogs can be used to screen since reinforceable recovery training is needed between the presentation of questionable samples. However, the dogs could be used to effectively pre-select low numbers of samples, working in tandem with complimentary scientific methods to determine if the PD odor is present. This has been useful for the development of electronic or bioelectronic noses (41), for further research into the nature of PD-specific odor, and possibly to understand the evolution of odor as it relates to disease progression.
In Gao et al. (26) it was noted that the dogs in the study from China did not indicate on three separate samples provided by sample providers with a type of PD that is ascribed to a genetic mutation. In our study, it was also noted that none of the dogs indicated on a sample that was provided by a PD-positive donor who reported a GBA gene mutation. We also noted that the dogs in this study did not indicate on a prior confirmed canine-assessed sample from a sample donor who died within two weeks of providing an additional sample to the program. Samples from this donor had been provided to the dogs in prior instances and, in all previous cases, the dogs had indicated, as a group, that the sample was likely PD-positive. For the sample provided two weeks before death, none of the ten dogs in the round indicated a behavioral response to the sample. This would suggest that the target odor for this sample was not present immediately before the death of this sample donor. This could be due to the absence of target odor or qualitative difference in odor volume. Both possibilities require further investigation.
In Gao et al. (26) the researchers used a determination of two of the three dogs providing a sensitive indication to a PD-donor sourced sample to determine the sample as “PD-positive” to assess the sensitivity accuracy of the dogs. In our study, no weighting was done based on the performance of the dogs. In cases where a sample donor had not been diagnosed by a neurologist or had been diagnosed for less than two years, the researchers required the sensitivity of 80% of the dogs in the first daily working session for that sample to receive a determination of PD-positive.
To the best of our knowledge, this is the first detection program and most comprehensive study to investigate the sensitivity of household companion dogs for the detection of PD. This study systematically considered variables that could affect canine detection outcomes including breed, training duration, attendance days, age, and environmental background. Despite these variables, the dogs, as a group, were consistent in their ability to distinguish between PD-positive and PD-negative samples with a sensitivity rate of 89% or higher.
This study serves to advance the field of canine detection of PD as follows:
Household-raised companion dogs of varying breeds and backgrounds can be used for PD detection, thus eliminating the need for the stewardship responsibility and expense of canine ownership by a detection program.
Household-raised companion dogs can be used to assess small sample sizes for PD odor detection provided they are given recovery training periods.
This study provides additional substantiating evidence for the presence of one or more volatile organic compounds obtained from sebum samples of PD-positive patients.
This study provides additional evidence for the viability of canine detection of PD.
This study demonstrates that companion dogs can detect a Parkinson’s-associated target odor, which likely exists as one or more volatile organic compound(s). If better understood, this odor could be an important biomarker for the early and non-invasive detection of prodromal Parkinson’s disease. If a biomarker can be isolated and reproduced in the form of a singular training aid for dogs, companion dogs could then become a useful, cost-effective, and widespread method of early detection of Parkinson’s disease.
Competing Interest Statement
The authors have declared that no competing interests exist. This includes any submissions related to patents, patent applications or product development.
Funding Statement
The author(s) Lisa Holt and Samuel V Johnston received no specific funding for the work of publishing the manuscript.
Lisa Holt was a founding member of the PADs organization, and her salary was paid for through generous contributions of donors.
PADs for Parkinson’s, a 501 (c) (3) nonprofit organization, with the mission of training dogs for the detection of Parkinson’s, funded 5-1/2 years of the seven years of research which included this study. No donors to PADs played a role in the research, authorship, study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Ethics Statements
This research includes non-human subjects. This research received an IRB exemption (NHS Exemption-Holt (03-14-2023), WCG IRB Work Order #1-1641720-1) under the exclusionary guidelines that the research conducted was for the purpose of canine training for sample selection of PD-associated odor.
About the Authors
Lisa Holt
ROLES: PADs for Parkinson’s co-founder, program director, canine detection trainer, study design, writing, original draft, editing, program funding
AFFILIATIONS AND QUALIFICATIONS: Animal Behavior Certified Trainer and Animal Behavior Certified Mentor Trainer, Animal Behavior College; Certified Nose Work Instructor, National Association of Canine Scent Work; Therapy Pet Evaluator, Therapy Pets Unlimited; Certified American Treibball Trainer and Coach, American Treibball Association; Canine Good Citizenship Evaluator, American Kennel Club ORCID ID: https://orcid.org/0000-0002-8212-3375
Samuel V Johnston
ROLES: Data curation, formal analysis, validation, writing, statistical designer, analyzer, and data presentation
AFFILIATIONS AND QUALIFICATIONS: BS, Fisheries, Cal Poly at Humboldt; Principal Investigator, retired at Innovasea Systems, Inc. ORCID ID: https://orcid.org/0009-0004-1514-4223
Acknowledgements
ADDITIONAL CONTRIBUTORS: Kim Kyllo-Corson PADs for Parkinson’s research coordinator, sample collection, public relations, data entry, verification, citation references, reviewer; Sandra K Johnston, data entry, verification, citation references, reviewer; Natalie R Johnston, data entry, verification, reviewer; Craig Layman Ph.D., Literature search, editing, publication consultation
PRE-SUBMISSION REVIEWERS: Bryan Bartley Ph.D.; Andy Bary M.S.; Jack P. Bell Ph.D.; Robert P Hawkins Ph.D.; Craig Layman Ph.D.; Peter Ledochowitsch Ph.D.
PADS BOARD MEMBERS: Richard Lind; Sherry Smith Bell; Mark Hopkins; Lori Stokes; Katy Barsamian; Barbara Wright
ADVISORS TO THE BOARD: Jack P. Bell Ph.D.; Carolyn Haugen; Nancy Jones, Co-Founder of PADs
LEGAL COUNSEL: special projects, Markham Quehrn
DOGS AND HANDLER TEAMS: John and Allison Moalli - ‘Bendy’; Leah McConnell - ‘Bubba’; Katy Barsamian - ‘Ella’; Dr. Tim Kopet and Judith Okulitch - ‘Hudson’; Sandy and Natalie Johnston - ‘Indy’; Jack P. Bell Ph.D. - ‘Jaden’; Barbara Wright - ‘Mia and Sasha’; Dick Coffey - ‘Penny’; Bill Moore - ‘Rudi’; Kat Rose - ‘Scarlett’; Amber Chenoweth - ‘Shugga’; Liz Pillow - ‘Suki’; Holly Harbors - ‘Topper’