Methodology & Objectives

To identify the characteristics of the voice that are related to chronic diseases

Colive Voice aims to better understand how your voice can be used to monitor your health. More precisely, we are looking to identify vocal biomarkers: combinations of characteristics of the voice signal that can be associated with a symptom, a disease or the effect of a treatment.

Our study searches for vocal biomarkers of various chronic diseases and common health symptoms:

Cancer
Diabetes
Stress
Anxiety
Fatigue
Depression
Covid-19
Multiple Sclerosis
Inflammatory Bowel Disease

The extracted audio features are used to train machine learning and deep learning models to identify selected vocal biomarkers or related symptoms.

Take a look at the following representations of audio waves and notice how different they are: the first shows the absence of a symptom, while the second shows its presence (here, fatigue in patients with Covid-19).

Fig.1 Audio waveplot – Covid-19 patient with no fatigue symptom

Fig.2 Audio waveplot – Covid-19 patient with fatigue

Impact on the future of healthcare

In the future, vocal biomarkers could be used to predict disease severity, for diagnosis purposes or for remote patient monitoring using digital technologies. However, at this stage, the main aim of this study is to identify candidates for vocal biomarkers and study the feasibility of using the voice to monitor health.

How does Colive Voice work?

Colive Voice aims to collect data from participants worldwide, in various languages (French, English, German and Spanish; other languages will be added later).

We simultaneously collect voice recordings and clinical, epidemiological and patient-reported outcome (PRO) data through an anonymous survey on the Colive Voice web app.

Participants first answer a detailed questionnaire on their health status and then make 5 different short voice recordings.

Participation in the study is voluntary. All data is gathered in a single session that lasts about 20 minutes and is accessible online. You can participate directly from a smartphone, a tablet or a laptop equipped with a microphone.

Adults and adolescents above 15 years of age, regardless of their health status, can participate in the study. We hope to enroll more than 50,000 participants from all around the world.

What are we going to do with the collected answers and audio recordings?

Voice recordings must be preprocessed before the data can be analyzed. Preprocessing includes steps such as resampling, normalization, noise reduction, framing and windowing, as described in the figure below, which represents the typical pathway to identify a vocal biomarker.
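To make these steps more concrete, here is a minimal sketch in Python of resampling, normalization, framing and windowing. It is an illustration only, not the Colive Voice pipeline: the function names, the 16 kHz target rate, the 25 ms frame length and the 10 ms hop are all assumptions chosen for the example, linear interpolation stands in for a proper resampler, and noise reduction is left out.

```python
import numpy as np

def resample_linear(signal, orig_sr, target_sr):
    """Resample by linear interpolation (a simple stand-in for a real resampler)."""
    n_out = int(round(len(signal) / orig_sr * target_sr))
    t_in = np.arange(len(signal)) / orig_sr   # sample times of the input
    t_out = np.arange(n_out) / target_sr      # sample times of the output
    return np.interp(t_out, t_in, signal)

def peak_normalize(signal, eps=1e-12):
    """Scale the signal so that its largest absolute value is (almost exactly) 1."""
    return signal / (np.max(np.abs(signal)) + eps)

def frame_and_window(signal, frame_len, hop_len):
    """Cut the signal into overlapping frames and apply a Hann window to each."""
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    n_frames = 1 + (len(signal) - frame_len) // hop_len
    # Row i holds the sample indices of frame i.
    idx = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    return signal[idx] * np.hanning(frame_len)

# Hypothetical input: one second of a 220 Hz tone sampled at 44.1 kHz,
# standing in for a real voice recording.
sr = 44100
t = np.arange(sr) / sr
tone = 0.5 * np.sin(2 * np.pi * 220 * t)

x = peak_normalize(resample_linear(tone, sr, 16000))
frames = frame_and_window(x, frame_len=400, hop_len=160)  # 25 ms frames, 10 ms hop
print(frames.shape)
```

Each row of `frames` is one short, windowed segment of the recording, which is the form in which features are usually computed.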

Features are then extracted from the audio signals. These are characteristics that will be used to train machine learning algorithms to automatically predict or classify a clinical, medical or epidemiological feature of interest, alone or in combination with other health-related data.
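As a rough illustration of what feature extraction can look like, the sketch below computes three common per-frame descriptors: root-mean-square energy, zero-crossing rate and spectral centroid. The choice of these three features and the function name `frame_features` are assumptions made for the example; the study does not state here which features it uses.

```python
import numpy as np

def frame_features(frames, sr):
    """Return one row per frame: [RMS energy, zero-crossing rate, spectral centroid in Hz]."""
    # RMS energy: overall loudness of the frame.
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    # Zero-crossing rate: fraction of adjacent sample pairs whose sign differs.
    signs = np.signbit(frames)
    zcr = np.mean(signs[:, 1:] != signs[:, :-1], axis=1)
    # Spectral centroid: magnitude-weighted mean frequency of the frame.
    mag = np.abs(np.fft.rfft(frames, axis=1))
    freqs = np.fft.rfftfreq(frames.shape[1], d=1.0 / sr)
    centroid = (mag @ freqs) / (mag.sum(axis=1) + 1e-12)
    return np.column_stack([rms, zcr, centroid])

# Hypothetical input: 10 identical frames of a 1000 Hz tone at 16 kHz,
# standing in for frames cut from a real voice recording.
sr = 16000
n = np.arange(400)
frame = np.sin(2 * np.pi * 1000 * n / sr)
frames = np.tile(frame, (10, 1))

features = frame_features(frames, sr)
print(features.shape)  # one feature vector per frame
```

A table like `features`, possibly summarized over the whole recording and combined with questionnaire answers, is the kind of input a machine learning model would be trained on.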

The figure below shows audio preprocessing and feature extraction in more detail.

What topics are covered in our questionnaire?

A detailed health questionnaire is associated with the voice recording and addresses the following aspects:

Basic characteristics: language, age, gender, weight, lifestyle factors, quality of life, alcohol, smoking habits

Symptoms: stress, anxiety, constipation, pain, sleep disorders, respiratory quality of life, cough, fatigue, fever...

Current treatments: for pain, cholesterol, diabetes, hypertension, anticoagulants, antidepressants, anti-reflux, hormonal treatments...

Diseases: chronic diseases (diabetes, CVD...), cancer, endocrine diseases, mental health (depression, stress...), neurological diseases, communicable diseases (HIV, Covid-19, influenza, tuberculosis, malaria, Zika)...

To see the complete questionnaire, click here
