TOI Clinical Research

Transdermal Optical Imaging (TOI):Clinical Accuracy & Reliability Summary

A comprehensive analysis of TOI technology for measuring physiological and emotional parameters through non-invasive facial scanning.

Introduction

Overview of Transdermal Optical Imaging (TOI) technology and its applications

This document provides an overview of Transdermal Optical Imaging (TOI) Data Quality Tests. TOI is a non-invasive technology that uses conventional cameras to extract blood flow information from the face to measure various physiological and emotional parameters.

Heart Rate & Mental Stress Index

Tests on the data quality of Heart Rate (HR) and Mental Stress Index (MSI) were carried out in a normal and indoor environment. These measurements provide insights into cardiovascular function and stress levels.

Breathing

Tests on the data quality of breathing were carried out in a normal and indoor environment. Breathing rate is an important vital sign that can indicate respiratory health.

Blood Pressure

Tests on the data quality of blood pressure (BP) were carried out in a controlled indoor environment. Blood pressure is a critical vital sign for cardiovascular health assessment.

Cardiovascular Disease Risk Prediction

This tool calculates an individual's probability of having a first heart attack, first stroke, or first heart attack or stroke (which we term 'cardiovascular disease') within a specified number of years.

Emotions

Tests on the data quality of emotions (happiness, sadness, fear, disgust, surprise, and anger) were carried out using videos of individuals expressing various genuine and posed emotions.

Methods

Detailed methodology for testing TOI technology across different parameters

Heart Rate & Mental Stress Index

Tests on the data quality of HR and MSI were conducted in Hangzhou, China and Toronto, Canada. All tests were conducted by researchers with experience in data collection, signal processing, and machine learning.

Figure 1: Heart Rate & Mental Stress Index Test Setup

Figure 1: Heart Rate & Mental Stress Index Test Setup
Testing environment with controlled LED light, RGB video recorder, and 3-lead electrocardiograph (ECG)

Test Population: Heart Rate, Mental Stress Index

Over 10,000 test subjects took part in the test. The subjects were recruited from the Affiliated Hospital of Hangzhou Normal University in China and the University of Toronto in Canada. There were no exclusion criteria when recruiting participants, although most participants were healthy adults above 18 years of age.

Table 3: Characteristics of population for HR and MSI

Participants CompositionValue
Standard deviation of age15.2

Test Setup: Heart Rate & Mental Stress Index

Testing environment is a standard setting across all studies. The setting includes a dark room illuminated with a controlled LED light, a conventional RGB video recorder (e.g., iPhone camera, Go Pro, Basler) and a 3-lead electrocardiograph (ECG). Figure 1 displays the set up of materials and the way in which subjects were positioned during the test.

Test Procedure: Heart Rate & Mental Stress Index

Subjects were asked to remain still and face the camera during the test for 2 minutes. Video and ECG recordings were synchronized to start and stop at the same time.

Analysis of Accuracy & Reliability: Heart Rate & Mental Stress Index

Extraction of facial blood flow is done for each video using TOI. When predicting values for HR and MSI, they are compared against ground truth ECG data collected by the BIOPAC to evaluate the accuracy of the signal.

Figure 4: Labeled ECG Wave

Figure 4: Labeled ECG Wave
Components of an ECG recording showing P, QRS, and T waves

Figure 5: R-R Interval

Figure 5: R-R Interval
Two adjacent ECG waves showing an R-R interval that is used for heart rate computation

The figures above show the details of an ECG recording components. In healthy individuals, each Heartbeat recorded by an ECG device is determined by the R waves (peaks). As a result, the number of R waves per minute represent the rate at which a heart is beating.

TOI heart rate is determined for each individual, which is compared to ECG HR. Equation 1 is used to calculate the accuracy of TOI heart rate.

1 - (1/N) ∑ |Predicted HR - Actual HR| / |Actual HR| × 100%

Equation 1: Calculation of TOI heart rate accuracy

Stress is then derived based the heart rate variability over a period of time based on the inter-beat intervals (RRI) of the ECG recording or TOI signals (Figure 8).

1 - (1/N) ∑ |Predicted MRRI - Actual MRRI| / |Actual MRRI| × 100%

Equation 2: Calculation of TOI stress accuracy

Breathing

Tests on the data quality of breathing was carried out in a normal and indoor environment. All tests were conducted by researchers with experience in data collection, signal processing, and machine learning.

Figure 2: Breathing Test Setup

Figure 2: Breathing Test Setup
Testing environment with breathing belt attached around the thoracic region of subjects

Test Population

Over 10,000 test subjects took part in the test. The subjects were recruited from the Affiliated Hospital of Hangzhou Normal University in China and the University of Toronto in Canada. There were no exclusion criteria when recruiting participants, although most participants were healthy adults above 18 years of age.

Table 4: Characteristics of Population for Breathing

Participants CompositionValue
Standard deviation of age15.2

Test Setup: Breathing

Testing environment is a standard setting across all studies. The setting includes a dark room illuminated with a controlled LED light, a conventional RGB video recorder (e.g., iPhone camera, Go Pro, Basler) A breathing belt was attached around the thoracic region of subjects and connected to the Biopac device.

Test Procedure: Breathing

Subjects were asked to remain still and face the camera during the test for 2 minutes. Video and ECG recordings were synchronized to start and stop at the same time.

Analysis of Accuracy & Reliability: Breathing

Extraction of facial blood flow is done for each video using TOI. When predicting values for breathing, they are compared against ground truth data collected by the BIOPAC to evaluate the accuracy of the signal.

Blood Pressure

Tests on the data quality of blood pressure (BP) was carried out in a controlled indoor environment. The blood pressure of recruited subjects was first predicted by TOI using the Anura app, then measured using a mercury sphygmomanometer by an experienced nurse.

Figure 3: Blood Pressure Test Setup

Figure 3: Blood Pressure Test Setup
Subject seated with BP cuff attached to arm and facing iPhone camera with Anura application

Test Population: Blood Pressure

A sample of 85 adults were recruited. The subjects were recruited from the General Clinic of the Zhejiang Elderly Guanhai Hospital and the ECG Clinic of the Affiliated Hospital of Hangzhou Normal University, Hangzhou, China.

Table 5: Characteristics of population for BP

Participants CompositionValue
Standard deviation of age14.1

Test Setup: Blood Pressure

The testing environment is a standard setting. The setting includes a quiet room illuminated with a controlled LED light, an iPhone with the Anura application, and a mercury sphygmomanometer. Figure 3 displays the set up of materials and the way in which subjects were positioned during the test.

Test Procedure: Blood Pressure

Blood pressure was first measured using the Anura application on the iPhone followed by a measurement using the mercury sphygmomanometer as a reference. Specifically, blood pressure was measured on the Anura test device by triggering a 60 second BP measurement with a button on the screen. The subject was asked to face the iPhone and remain perfectly still for the duration of the recording. The nurse then immediately measured a reference BP via auscultation with the mercury sphygmomanometer. Systolic BP (SBP) was measured as the pressure corresponding to the onset of the 'tapping' sound and Diastolic BP (DBP) as the pressure corresponding to the disappearance of the last audible Korotkoff sound.

The first measurement pair was designated as a practice measure and discarded. Three additional measurement pairs were then taken. The SBP and DBP for each auscultation were recorded to the nearest 2mmHg. If Korotkoff sounds were difficult to distinguish or the subject moved during a measurement, then that measurement pair was discarded and another one was taken. Measurement pairs continued (up to a maximum of eight) until three valid pairs were obtained. Subjects with arrhythmia were excluded, where three satisfactory measurement pairs could not be obtained, or where the SBP or DBP range was more than 12mmHg or 8 mmHg, respectively.

Analysis of Accuracy & Reliability: Blood Pressure

Blood pressure data was analyzed according to two AAMI criteria. For criterion 1, the mean difference (bias) between the reference measurement and the Anura measurement (reference – Anura) and its standard deviation across all 255 measurement pairs for both SBP and DBP. The device fulfils criteria 1 if the mean error is within 5 mmHg of zero and its standard deviation is less than or equal to 8 mmHg. For criterion 2, we calculated the average mean difference for each subject, and then calculated the mean and standard deviation of this value across all 85 subjects. The test device fulfils criteria 2 if its standard deviation is below a given cut-off (based on the mean difference) listed in a table in the ANSI/AAMI/ISO 81060-2:2013 guidelines.

Cardiovascular Disease Risk Prediction

This tool calculates an individual's probability of having a first heart attack, first stroke, or first heart attack or stroke (which we term 'cardiovascular disease') within a specified number of years.

Current Method: Cox Regression-based Model

Cardiovascular disease risk prediction is performed using two methods:

Current method: Cox regression-based model
Predicts the following:

  • 10-year risk of cardiovascular disease (first heart attack or first stroke)
  • 10-year risk of first heart attack
  • 10-year risk of first stroke

Cox regression is the same methodological approach used in deriving the Framingham Heart Study risk prediction equations. It is also the same approach used in deriving the China-PAR equation, which is the gold standard for predicting cardiovascular disease risk in the Chinese population.

We derived two versions of each equation: one using a minimal set of risk factors and another using a full set, depending on how much information the user provides. These sets are listed in Tables 1 and 2.

Table 1: Minimal Set of Risk Factors

Risk Factor Set: Minimal
Age (years)
Body Mass Index (kg/m²)
Systolic blood pressure (mmHg)

Table 2: Full Set of Risk Factors

Risk Factor Set: Full set
Age (years)
Body Mass Index (kg/m²)
Systolic blood pressure (mmHg)
Are you currently a smoker? (yes/no)

Upcoming Method: Machine Learning-based Model

Upcoming method: Machine learning-based model

This model predicts the likelihood of an individual having a first heart attack or stroke within a specified number of years (from 2 to 20 years). It was derived using advanced machine learning techniques.

Cardiovascular disease is defined as: heart attack, stroke, or CHD death.

Test Population: Cardiovascular Disease Risk Prediction

Current method: Cox regression-based model
Models were both derived and tested on a sample of 4500 Chinese adults.

Upcoming method: Machine learning-based model
Models were derived and validated using a multi-racial dataset of over 30,000 adults from the United States, obtained over a period of 20 years. Training and validation occurred on separate portions of the dataset to minimize potential for 'overfitting.'

Test Procedure: Cardiovascular Disease Risk Prediction

Participants answered a survey with relevant questions about risk factors and cardiovascular disease status. The systolic blood pressure risk factor was measured (taken as the average of 3 measurements).

Emotions

Tests on the data quality of emotions (happiness, sadness, fear, disgust, surprise, and anger) were carried out using videos from people naturalistically expressing various emotions on their face.

Test Population: Emotions

A sample of adults were analyzed for their genuinely expressed emotions shown in videos. Tests on the data quality of emotions were conducted in Toronto, Canada by using videos from people naturalistically expressing various emotions (happiness, sadness, fear, disgust, surprise, and anger) on their face. All tests were conducted by researchers with experience in facial expression decoding and machine learning.

Table 6: Characteristics of Population for Emotion Expression

Subjects in videos CompositionValue
Standard deviation of age10.5

Test Setup: Emotions

The test involved videos taken under various natural settings.

Test Procedure: Emotions

Videos of people expressing different genuine and posed emotions were downloaded from websites that offer non-licensed video datasets for research or machine learning validation purposes. The criteria of videos included a front view of a person with an emotion being expressed.

To produce genuine emotional expressions, individuals were asked to view various stimuli (e.g., a giggling baby, a dirty toilet bowl) that were known to elicit specific emotions. Their natural reactions were video recorded.

To produce posed emotional expressions, actors were coached to manipulate various facial muscles to mimic specific emotions as instructed by the researcher, which was video recorded.

Analysis of Accuracy & Reliability: Emotions

As for emotion expression, each video undergoes facial extraction second by second and then a prediction of each expression for each second is made by TOI. Each expression is given a percentage to which emotion it might be.

We decoded the likelihood ratios of all emotions on each video second by second and compared the decoded emotion against the actual emotion expressed in each video to obtain the accuracy (%) of whether the two were matched.

Note that 100% reliability was obtained by reanalyzing the same videos multiple times using the same algorithms with the initial frame.

Results

Summary of findings from TOI technology testing across different parameters

Heart Rate & Mental Stress Index

Heart Rate Accuracy

99%

Correlation between TOI and ECG measurements: r=1.0 when SNR > 0

Heart Rate Reliability

100%

Consistent results with repeated analysis of the same videos

Mental Stress Index Accuracy

92%

Correlation with ECG measurements: r=0.94 (120s), r=0.88 (30s)

Table 7: Accuracy and Reliability of TOI Heart Rate and Mental Stress Index Data

Physiological MeasureAccuracy (%)Reliability (%)
Heart Rate99100
Mental Stress Index92100
Note that 100% reliability was obtained by reanalyzing the same videos multiple times using the same algorithms with the initial frame.

Accuracy & Reliability of Heart Rate

Table 8: Statistical Measure of TOI Heart Rate Accuracy

HR AccuracyValue
Mean ECG HR74.4
Mean TOI HR75.1
Mean Error0.63
Standard Deviation of Error1.21
Mean of Absolute Error0.79
Note that 100% reliability was obtained by reanalyzing the same videos multiple times using the same algorithms with the initial frame.

The correlation between HR obtained from TOI and ECG was calculated. There was a positive correlation between the measurements of HR obtained from TOI and ECG, r=1.0 when SNR > 0. Figure 9 demonstrates the points of heart rate (BPM) as obtained from TOI and ECG with a line of best fit drawn through the points to illustrate the positive correlation.

Figure 9: Correlation between predicted values of heart rate from TOI and ECG

Figure 9: Correlation between predicted values of heart rate from TOI and ECG
Correlation between heart rate measurements from TOI and ECG showing strong positive correlation

Accuracy & Reliability of Mental Stress Index

Table 9: Statistical Measure of TOI MSI Accuracy

MSI AccuracyValue
Mean ECG MSI3.87
Mean TOI MSI3.20
Mean Error-0.67
Standard Deviation of Error0.49
Mean of Absolute Error0.73
Note that 100% reliability was obtained by reanalyzing the same videos multiple times using the same algorithms with the initial frame.

The correlation was calculated between stress measurements obtained from TOI and ECG during a 120 second time frame. There was a positive correlation, r = 0.94 when SNR > 0. Figure 10 demonstrates the points of stress as obtained from TOI and ECG with a line of best fit drawn through the points to illustrate the positive correlation.

Figure 10: Correlation between predicted values of stress from TOI and ECG over a period of 120 seconds

Figure 10: Correlation between predicted values of stress from TOI and ECG over a period of 120 seconds
Positive correlation (r=0.94) between stress measurements from TOI and ECG over 120 seconds

Figure 11: Correlation between predicted values of stress from TOI and ECG over a period of 30 seconds

Figure 11: Correlation between predicted values of stress from TOI and ECG over a period of 30 seconds
Positive correlation (r=0.88) between stress measurements from TOI and ECG over 30 seconds

The correlation was also calculated between stress measurements obtained from TOI and ECG during a 30 second time frame. There was a positive correlation, r = 0.88 when SNR > 0. Figure 11 demonstrates the points of stress as obtained from TOI and ECG with a line of best fit drawn through the points to illustrate the positive correlation.

Breathing

Breathing Accuracy

>90%

Correlation with BIOPAC measurements: r=0.9 when SNR > 0

Breathing Reliability

99%

Consistent results with repeated analysis of the same videos

Table 10: Accuracy and Reliability of Predicted Breathing

Physiological MeasureAccuracy (%)Reliability (%)
Breathing>9099
Note that 99% reliability was obtained by reanalyzing the same videos multiple times using the same algorithms with the initial frame.

The correlation was calculated between breathing measurements obtained from TOI and BIOPAC. There was a positive correlation, r = 0.9 when SNR > 0.

Blood Pressure

Table 11: Blood Pressure Validation Results

Anura BPmean±SD (min–max)
SBP (mmHg)–0.4±6.7 (94–176)
SBP (mmHg)–0.4±5.8 (97–169)
SD threshold: 6.84mmHg

The mean difference between BP measurements from the mercury sphygmomanometer and Anura are presented in Table 11. The model was evaluated according to two AAMI criteria.

Figure 13: Standardized Bland–Altman scatterplot of differences between device (Anura) and observer measurements

Figure 13: Standardized Bland–Altman scatterplot of differences between device (Anura) and observer measurements
Standardized Bland–Altman scatterplot of differences between device (Anura) and observer measurements for (a) systolic blood pressure and (b) diastolic blood pressure

The model performance was visualized using Bland–Altman plots in Figure 13. Predictions for SBP are best for reference pressures between 110 and 150mmHg. Predictions for DBP are best for reference pressures between 75 and 85 mmHg.

Cardiovascular Disease Risk Prediction

Current: Cox-based Regression

Internal validation: AUC overall (n=4500) = 0.72

Figure 14: Validation against China-PAR Equation

Figure 14: Validation against China-PAR Equation
Validation against China-PAR equation; (a) Males and females, (b) Males only, and (c) Females only

Upcoming: Machine Learning-based Model

Table 12: Accurate Predictions for Time Windows

Time WindowAUC
2-year0.82
5-year0.81
10-year0.80
15-year0.79
20-year0.78
As a point of comparison, the 10-year AUC for the Pooled Cohort Equation is 79-80%.

Figure 15: 10-year Risk Predictions

Figure 15: 10-year Risk Predictions
10-year risk predictions for three different subjects: a young subject, a middle-aged subject and a typical older subject

Emotions in Adults

Table 13: Accuracy of Genuinely Expressed Emotions

Emotional ExpressionsAccuracy (%)Reliability (%)
Fear100100
Sadness100100
Happiness93100
Anger89100
Disgust86100
Surprise80100

Table 14: Accuracy of Posed Emotions

Emotional ExpressionsAccuracy (%)Reliability (%)
Fear80100
Sadness90100
Happiness95100
Anger89100
Disgust94100
Surprise89100

Emotion Valence Validation in Adults

The multiple emotions that people express are classified into 3 categories, also referred to as emotion valences: positive, negative, and neutral.

Table 15: TOI Accuracy and Reliability of Predicting Emotion Valences

Emotional valenceAccuracy (%)Reliability (%)
Positive93100
Negative94100
Neutral86100
Note that 100% reliability was obtained by reanalyzing the same videos multiple times using the same algorithms with the initial frame.

Figure 21: Changes in Valence Prediction

Figure 21: Changes in Valence Prediction
Changes in valence prediction based on changes in facial expression from neutral to happiness