Translate this page into:
Reliability assessment of iPhone-based pupillometry: A test-retest analysis
*Corresponding author: Pritam Dutta, Department of Optometry, Ridley College of Optometry, Jorhat, India. pdutta029@gmail.com
-
Received: ,
Accepted: ,
How to cite this article: Dutta P, Das S, Baishya R. Reliability assessment of iPhone-based pupillometry: A test-retest analysis. East Eye. 2025;1:31–36.doi: 10.25259/TEE_9_2025
Abstract
Objectives: The objective of this study was to assess the test-retest reliability of an iPhone-based pupillometer, specifically the Reflex-pro pupillary light reflex (PLR) analyzer application, in measuring various pupillary parameters. The study aimed to evaluate whether this smartphone-based tool can provide consistent and reliable measurements across two different testing sessions, which is crucial for its potential use in clinical and research settings.
Material and Methods: Forty participants (80 eyes) were recruited. The Reflex-pro PLR analyzer application provided a quantitative method for assessing pupil size and reactivity. Pupillary parameters were measured twice, and the intraclass correlation coefficients (ICCs) were calculated to assess measurement consistency between sessions.
Results: The ICC for average constriction speed was 0.99 (0.98–0.99 CI), indicating excellent reliability. Limits of agreement were as follows: average constriction speed: 0.08 mm/second, constriction time: 0.09 seconds, maximum constriction speed: 0.03 mm/second, amplitude: 0.02 mm, average diameter: 0.02 mm, latency: 0.01 seconds, maximum diameter: 0.004 mm, minimum diameter: 0.03 mm. Most differences fell within the acceptable range, demonstrating good agreement between test and retest measurements.
Conclusion: Our findings support the growing evidence that smartphone-based pupillometry is a reliable tool for measuring pupillary responses.
Keywords
iPhone
Pupillometer
Pupillary light reflex
Reliability
Test-retest
INTRODUCTION
Pupillometry serves as a valuable tool in various fields such as neuroscience, psychology, and ophthalmology.1 It provides insights into autonomic nervous system activity, cognitive processing, and visual function.1 Traditional pupillometry methods often involve specialized equipment and controlled settings, limiting their widespread application outside of research environments. However, recent advancements in technology, particularly the integration of pupillometry into smartphones, have shown promise in enhancing accessibility and convenience for both researchers and clinicians.2
The test-retest reliability of pupillometry measures is a crucial aspect to consider when evaluating the effectiveness and validity of smartphone-based pupillometers. Test-retest reliability refers to the consistency of measurements obtained from the same individual under identical conditions across multiple testing sessions over time. A high level of reliability indicates that the measurements are stable and reproducible, which is essential for drawing valid conclusions from pupillary data.
Studies investigating the test-retest reliability of smartphone-based pupillometers, particularly those integrated with iPhones, are emerging in the literature.3,4 These studies typically employ various experimental designs, protocols, and statistical analyses to assess the consistency of pupillary measurements over time. For instance, a study by Gramkow et al.4 assessed the test-retest reliability and short-term variability of quantitative light reflex pupillometry in a mixed memory clinic cohort. Results showed high reliability across multiple sessions and minimal short-term variability in pupillary parameters, supporting the potential use of pupillometry in cognitive assessment within memory clinic settings.4 Similarly, in a study by Herbst et al., a good repeatability of a chromatic pupillometer was found using two selected wavelengths (470 and 660 nm) for maximal pupillary contraction amplitude and sustained contraction amplitude.5
In this study, we aim to assess the test-retest reliability of an iPhone-based pupillometer. We hypothesize that the iPhone-based pupillometer will demonstrate high test-retest reliability. This evaluation will provide essential insights into the potential of smartphone-based tools in routine clinical assessments.
MATERIAL AND METHODS
This study employed a test-retest design to evaluate the reliability of an iPhone-based pupillometer. Participants were tested twice with the same device under consistent conditions to assess the repeatability of the measurements. A total of 40 participants were recruited for the study. Participants included subjects aged between 18 and 35 years who had no significant ocular or neurological disorders affecting pupillary function and who had normal visual acuity (corrected or uncorrected). Exclusion criteria encompassed individuals with a history of eye surgery within the past year, known pupillary abnormalities such as anisocoria exceeding 0.5 mm, significant neurological conditions like epilepsy or Parkinson’s disease, current use of medications influencing pupillary dynamics (e.g., opioids and anticholinergics), and inability to provide informed consent or cooperate due to cognitive impairment or language barriers. The study was conducted in accordance with the principles of the Declaration of Helsinki. The Institutional Ethics Committee reviewed the protocol and determined that formal ethical approval was not required, as the study involved minimal risk to participants. Protecting participants’ rights, privacy, and confidentiality was paramount throughout all stages of the research process. Informed consent was obtained from each participant, outlining the study’s purpose, procedures, potential risks, and benefits.
Equipment
An iPhone model 14 with the Reflex-pro pupillary light reflex (PLR) Analyzer (Brightlamp, Inc.) pupillometry application was used. The Reflex-pro PLR analyzer application is a mobile-based automated pupillometer designed to offer medical practitioners a quantitative method for assessing pupil size and reactivity. Classified as a Class I 510(k) Exempt medical device under the FDA’s 21 CFR 886.1700, Reflex leverages the iOS platform and requires a reliable mobile or wireless connection. The app has a frame rate of 30 Hz and is capable of measuring the PLR. The procedures involved a pre-adaptation phase where subjects were exposed to dark illumination at 4 lux for 10 minutes. This was followed by the measurement phase, where the app captured the necessary data at the specified frame rate. During the test, the Reflex App emits a photic flash stimulus lasting approximately 1 millisecond (ms) at an intensity of 10 lux set to 70% power and records the eye’s dynamic response at 30 frames per second. The test subject is instructed to align their eyes within an almond-shaped window at the beginning of the test. Once ready, the examiner initiates the test by tapping a spherical button at the bottom center of the screen marked 0%. As the recording progresses, this percentage increases until it reaches 100%, indicating the test is complete.
The device was secured on a stand to ensure consistent positioning and to minimize movement artefacts during measurement. Participants were seated in a dimly lit room (4 lux illumination) to allow for consistent pupil dilation. The iPhone was positioned at a fixed distance of 5 cm from the participant’s eyes, aligned with the participant’s visual axis. The pupillometer application was calibrated before each test session. Calibration involved the participant focusing on a fixation point displayed on the screen while the device adjusted for lighting conditions. Pupil size was measured in both eyes simultaneously. Each measurement session consisted of three consecutive measurements with a 30-second interval between each to allow the pupil to return to baseline size. Participants underwent two test sessions separated by 24 hours. Each session followed the same protocol to ensure consistency.
The outcome variables include the following parameters:
— Average constriction speed: Average rate at which the pupil constricts in response to a light stimulus.
— Constriction time: Duration taken for the pupil to reach maximum constriction after exposure to light.
— Maximum constriction speed: The maximum rate of pupil constriction observed during the light reflex.
— Amplitude: The difference between maximum and minimum pupil diameters, indicating the extent of pupil response to light.
— Average diameter: Mean size of the pupil across the duration of measurement.
— Latency: Time delay between light exposure and initial pupil response.
— Maximum Diameter: Largest size reached by the pupil during measurement.
— Minimum Diameter: Smallest size reached by the pupil during measurement.
Environmental controls
To maintain consistency and minimize external variables, the testing environment was carefully controlled. Room temperature was kept constant at 22 °C throughout all testing sessions. Ambient room lighting was standardized at 4 lux using an AmiciSense Dual-Power luxmeter (Haosen International Trading Company Limited) to ensure uniform light conditions. External noises were minimized to create a quiet testing environment. All measurements were conducted between 11:30 AM and 3:00 PM to control for potential circadian variations in pupillary response. Participants were instructed to avoid caffeine, alcohol, and medications that could affect pupil size for 24 hours prior to testing. A pre-test adaptation period of 10 minutes in the testing environment was provided to allow stabilization of baseline pupil size. Head position was stabilized using a chin rest to minimize movement, and participants were asked to fixate on a specific point to reduce eye movement during measurement. The brightness of the iPhone screen was standardized across all sessions.
Data analysis
Test-retest reliability was assessed using intraclass correlation coefficients (ICCs) to determine the consistency of measurements between the two sessions. ICC was used to assess the reliability and consistency of the measurements, ensuring they are reproducible across different conditions. Pearson’s r measured the linear correlation between the datasets, indicating the strength and direction of their relationship. Bland-Altman analysis complemented these by identifying any systematic bias and quantifying the limits of agreement, which is essential for evaluating the interchangeability of the methods. The following statistical analyses were performed: ICC, calculated to assess the reliability of the measurements. An ICC value above 0.75 was considered indicative of good reliability. Bland-Altman analysis: Used to assess the agreement between tests and retest measurements by plotting the differences against the mean values of the two sessions. A power analysis was conducted to determine the sample size required to detect significant differences with high confidence. The analysis was performed using (software/tool), with an alpha level of 0.05 and a power of 0.80. Based on the variability of the pupillary measurements observed in previous studies, a sample size of 40 participants (80 eyes) was deemed sufficient to achieve 80% power for detecting meaningful differences in the pupillary parameters.
RESULTS
A total of 40 subjects (80 eyes) were included, of whom 22 were males. The mean ± SD age of the subjects was 28 ± 5 years. No significant differences in the pupillary parameters were noted between genders (p > 0.78). All participants had normal or corrected-to-normal vision and no history of ocular or neurological disorders. Table 1 provides the descriptive statistics for the various pupillary parameters measured in the study. These include the mean, standard deviation, minimum, and maximum values for each parameter.
| Parameter | Mean (test) | SD (test) | Min (test) | Max (test) | Mean (retest) | SD (retest) | Min (retest) | Max (retest) |
|---|---|---|---|---|---|---|---|---|
| Average constriction speed (ACS) | 3.45 | 0.15 | 3.1 | 3.8 | 3.47 | 0.16 | 3.12 | 3.82 |
| Constriction time (CT) | 0.65 | 0.05 | 0.6 | 0.7 | 0.66 | 0.05 | 0.61 | 0.71 |
| Maximum constriction speed (MCS) | 5.8 | 0.2 | 5.5 | 6.1 | 5.78 | 0.21 | 5.48 | 6.08 |
| Constriction amplitude | 2.2 | 0.1 | 2.05 | 2.35 | 2.21 | 0.09 | 2.06 | 2.36 |
| Average diameter | 5.5 | 0.3 | 5 | 6 | 5.48 | 0.31 | 4.98 | 5.98 |
| Latency | 0.2 | 0.02 | 0.18 | 0.22 | 0.19 | 0.02 | 0.17 | 0.21 |
| Maximum diameter | 6.2 | 0.25 | 5.9 | 6.5 | 6.19 | 0.26 | 5.89 | 6.49 |
This table presents the mean, SD, minimum (Min), and maximum (Max) values for various pupillary parameters measured during the test and retest sessions using the Reflex-pro PLR analyzer application. The parameters include ACS, CT, MCS, constriction amplitude, average diameter, latency, maximum diameter, and minimum diameter. SD: Standard deviation, PLR: Pupillary light reflex, ACS: Average constriction speed, CT: Constriction time, MCS: Maximum constriction speed.
Test-retest reliability
The iPhone-based pupillometer demonstrated high test-retest reliability. ICCs were calculated for all the measurements taken in two separate sessions. The ICC for average constriction speed was 0.99 (0.98–0.99 CI), and constriction time was 0.99 (0.98–0.99 CI), which is high, indicating excellent reliability [Table 2].
| Parameters | Intraclass coefficient (95%CI) |
|---|---|
| Average constriction speed | 0.99 (0.98–0.99) |
| Constriction time | 0.99 (0.98–0.99) |
| Maximum constriction speed | 0.97 (0.95–0.98) |
| Amplitude | 0.98 (0.98–0.99) |
| Average diameter | 0.96 (0.93–0.98) |
| Latency | 0.85 (0.74–0.92) |
| Maximum diameter | 0.98 (0.96–0.99) |
| Minimum diameter | 0.97 (0.95–0.98) |
This table presents the ICCs for various pupillary parameters measured during the test and retest sessions using the Reflex-pro PLR analyzer application. The parameters include ACS, CT, MCS, constriction amplitude, average diameter, latency, maximum diameter, and minimum diameter. High ICC values indicate excellent reliability and consistency of the measurements across the two sessions. CI: Confidence interval, PLR: Pupillary light reflex, ACS: Average constriction speed, CT: Constriction time, MCS: Maximum constriction speed.
Bland-Altman analysis
Bland-Altman plots were used to assess the agreement between the two test sessions. The limits of agreement for all the pupillary parameters are as follows: ACS: 0.08, CT: 0.09, MCS: 0.03, Amplitude: 0.02, Average diameter: 0.02, Latency: 0.01, Maximum diameter: 0.04, Minimum diameter: 0.03. The plots indicated that most differences fell within the acceptable range, demonstrating good agreement between the test and retest measurements [Figure 1].
![Bland–Altman analysis showing agreement between test and retest measurements of pupillary parameters obtained using the iPhone-based Reflex Pro PLR analyzer application. The solid line in each plot represents the mean difference (bias) between the two sessions, while the dashed lines indicate the 95% limits of agreement. The x-axis represents the mean of test and retest measurements, and the y-axis represents the difference between test and retest values for each parameter. (a) Average constriction speed (x-axis: mean ACS [mm/s]; y-axis: difference in ACS [mm/s]), (b) Constriction amplitude (x-axis: mean amplitude [mm]; y-axis: difference in amplitude [mm]), (c) Average diameter (x-axis: mean diameter [mm]; y-axis: difference in diameter [mm]), (d) Constriction time (x-axis: mean CT [s]; y-axis: difference in CT [s]), (e) Latency (x-axis: mean latency [s]; y-axis: difference in latency [s]), (f) Maximum diameter (x-axis: mean maximum diameter [mm]; y-axis: difference in maximum diameter [mm]), (g) Maximum constriction speed (x-axis: mean MCS [mm/s]; y-axis: difference in MCS [mm/s]), (h) Minimum diameter (x-axis: mean minimum diameter [mm]; y-axis: difference in minimum diameter [m]). PLR: Pupillary light reflex, ACS: Average constriction speed, CT: Constriction time, MCS: Maximum constriction speed.](/content/179/2025/1/1/img/TEE-1-1-31-g001.png)
- Bland–Altman analysis showing agreement between test and retest measurements of pupillary parameters obtained using the iPhone-based Reflex Pro PLR analyzer application. The solid line in each plot represents the mean difference (bias) between the two sessions, while the dashed lines indicate the 95% limits of agreement. The x-axis represents the mean of test and retest measurements, and the y-axis represents the difference between test and retest values for each parameter. (a) Average constriction speed (x-axis: mean ACS [mm/s]; y-axis: difference in ACS [mm/s]), (b) Constriction amplitude (x-axis: mean amplitude [mm]; y-axis: difference in amplitude [mm]), (c) Average diameter (x-axis: mean diameter [mm]; y-axis: difference in diameter [mm]), (d) Constriction time (x-axis: mean CT [s]; y-axis: difference in CT [s]), (e) Latency (x-axis: mean latency [s]; y-axis: difference in latency [s]), (f) Maximum diameter (x-axis: mean maximum diameter [mm]; y-axis: difference in maximum diameter [mm]), (g) Maximum constriction speed (x-axis: mean MCS [mm/s]; y-axis: difference in MCS [mm/s]), (h) Minimum diameter (x-axis: mean minimum diameter [mm]; y-axis: difference in minimum diameter [m]). PLR: Pupillary light reflex, ACS: Average constriction speed, CT: Constriction time, MCS: Maximum constriction speed.
Measurement consistency
A strong positive correlation was noted for all the pupillary parameters carried out in two intervals (r = 0.9, p < 0.001, Pearson correlation coefficient) [Table 3].
| Parameters | r-value | p |
|---|---|---|
| Average constriction speed | 0.99 | p < 0.001 |
| Constriction time | 0.98 | p < 0.001 |
| Maximum constriction speed | 0.99 | p < 0.001 |
| Amplitude | 0.98 | p < 0.001 |
| Average diameter | 0.93 | p < 0.001 |
| Latency | 0.80 | p < 0.001 |
| Maximum diameter | 0.96 | p < 0.001 |
| Minimum diameter | 0.91 | P<0.001 |
This table displays the Pearson correlation coefficients (r values) for various pupillary parameters measured during the test and retest sessions using the Reflex-pro PLR analyzer application. The parameters include ACS, CT, MCS, constriction amplitude, average diameter, latency, maximum diameter, and minimum diameter. High r values indicate strong positive correlations, reflecting the consistency and reliability of the measurements across the two sessions. P < 0.05 considered statistically significant. PLR: Pupillary light reflex, ACS: Average constriction speed, CT: Constriction time, MCS: Maximum constriction speed.
DISCUSSION
The findings from our study demonstrate that the iPhone-based pupillometer exhibits high test-retest reliability, making it a promising tool for clinical and research applications in pupillometry. The ICC values obtained from our repeated measures are consistent with those reported in similar studies, reinforcing the robustness of smartphone-based pupillometry. This aligns closely with the findings of Herbst et al., who also reported high repeatability of the pupil light response to both blue and red light stimuli using a novel pupillometer.5 Herbst et al.5 highlighted the device’s capacity for consistent measurements across different light conditions, underscoring its utility for clinical applications where accurate pupillary assessments are crucial.
Our results showed an ICC greater than 0.90 for most of the pupillary parameters, indicating excellent reliability, which aligns with findings from previous research. For instance, a study by Mariakakis et al. using the PupilScreen smartphone application reported high within-group reliability (k = 0.85) and interrater reliability (k = 0.75), showcasing the potential of smartphone applications in accurately assessing the PLR.6 Another study evaluating a custom-designed infrared smartphone pupillography system reported ICC values ranging from 0.982 to 0.995, further validating the high reliability of smartphone-based measurements.7 Likewise, Sousa et al. reported ICC values ranging from 0.982 to 0.995 for their infrared smartphone pupillometer, further validating the reliability of smartphone-based devices.8
The high reliability observed in these studies, including ours, suggests that smartphone-based pupillometers can serve as reliable alternatives to traditional and digital pupillometry methods. Traditional methods, such as penlight examinations, are inherently subjective and can lead to variability in measurements. Digital pupillometers, while accurate, are often expensive and require specialized training. In contrast, smartphone-based pupillometers are cost-effective, easily accessible, and user-friendly, making them ideal for a wide range of applications, from clinical practice to telemedicine and remote monitoring.
Moreover, the use of infrared technology in some smartphone pupillometers enhances measurement accuracy under various lighting conditions, as shown in studies where infrared pupillography yielded high-quality videos and photographs that closely matched those obtained with traditional desktop systems.7 This is particularly important for ensuring reliable measurements in diverse clinical scenarios.
The implications of these findings are significant for clinical practice. The portability and ease of use of the iPhone-based pupillometer can enhance patient care in diverse settings, including emergency departments and resource-limited environments. The high reliability of these devices ensures that clinicians can confidently use them for neurological and ophthalmological assessments, potentially improving diagnostic accuracy and patient outcomes.
Potential limitations of the study include the sample size, the fixed environmental conditions, which might not reflect real-world variability, and the reliance on a single iPhone model, which may limit generalizability to other devices. Future research should focus on further validating the iPhone-based pupillometer in various clinical populations and settings. Longitudinal studies comparing its performance with standard digital pupillometers will help establish its credibility and facilitate its broader adoption in clinical practice. Additionally, exploring its application under different lighting conditions and across various patient demographics will provide deeper insights into its practical utility and limitations.
CONCLUSION
In conclusion, our study corroborates the growing body of evidence supporting the use of smartphone-based pupillometers as reliable tools for measuring pupillary responses. The high test-retest reliability observed in our findings underscores the potential of these devices to enhance clinical practice and research in neurology, ophthalmology, and beyond. Future research should continue to explore the application of smartphone pupillometry in different patient populations and settings to further validate its utility and expand its adoption in clinical practice.
Ethical approval:
The study protocol was reviewed by the Institutional Ethics Committee of Chandraprabha Eye Hospital, which determined that formal approval was not required.
Declaration of patient consent:
The authors certify that they have obtained all appropriate patient consent.
Financial support and sponsorship:
Nil.
Conflicts of interest:
There are no conflicts of interest.
Use of artificial intelligence (AI)-assisted technology for manuscript preparation:
The authors confirm that there was no use of artificial intelligence (AI)-assisted technology for assisting in the writing or editing of the manuscript and no images were manipulated using AI.
REFERENCES
- Portable infrared pupillometry. A review. Anesth Analg. 2015;120:1242-53.
- [CrossRef] [PubMed] [Google Scholar]
- Mobile smartphone-based digital pupillometry curves in the diagnosis of traumatic brain injury. Front Neurosci. 2022;16:893711.
- [CrossRef] [PubMed] [PubMed Central] [Google Scholar]
- iPhone-based pupillometry: A novel approach for assessing the pupillary light reflex. Optom Vis Sci. 2018;95:953-8.
- [CrossRef] [PubMed] [PubMed Central] [Google Scholar]
- Test-retest reliability and short-term variability of quantitative light reflex pupillometry in a mixed memory clinic cohort. J Neurol Sci. 2024;456:122856.
- [CrossRef] [PubMed] [Google Scholar]
- Test-retest repeatability of the pupil light response to blue and red light stimuli in normal human eyes using a novel pupillometer. Front Neurol. 2011;2:10.
- [CrossRef] [PubMed] [PubMed Central] [Google Scholar]
- PupilScreen: using smartphones to assess traumatic brain injury. Proc ACM Interact Mob Wearable Ubiquitous Techno. 2017;1:1-27.
- [CrossRef] [Google Scholar]
- Pilot study of smartphone infrared pupillography and pupillometry. Clin Ophtalmo. 2022;16:303-10.
- [CrossRef] [PubMed] [PubMed Central] [Google Scholar]
- Development of a smartphone-based system for intrinsically photosensitive retinal ganglion cells targeted chromatic pupillometry. Bioengineering (Basel). 2024;11:267.
- [CrossRef] [PubMed] [PubMed Central] [Google Scholar]
