Harmonise questionnaire items with Harmony.

Harmony is a tool for retrospective harmonisation of questionnaire items.

Do you need to compare questionnaire items across studies? Do you want to find the best match for a set of items? Are there different versions of the same questionnaire floating around and you want to make sure how compatible they are? Are the questionnaires written in different languages that you would like to compare?

The Harmony project is a data harmonisation project that uses Natural Language Processing to help researchers make better use of existing data from different studies by supporting them with the harmonisation of various measures and items used in different studies. Harmony is a collaboration project between Ulster University, University College London, the Universidade Federal de Santa Maria, and Fast Data Science. Harmony has been funded by the Economic and Social Research Council (ESRC) and by Wellcome as part of the Wellcome Data Prize in Mental Health.

In 2024, we published a paper validating Harmony on real world mental health data:

1. McElroy, Wood, Bond, Mulvenna, Shevlin, Ploubidis, Scopel Hoffmann, Moltrecht, Using natural language processing to facilitate the harmonisation of mental health questionnaires: a validation study using real-world data. BMC Psychiatry 24, 530 (2024).

Harmony Discovery

Find out about the next phase of the Harmony development roadmap

Contribute

Harmony is in active development and you can contribute.

Discord

Join our Discord server.

Raise NLP issue

Raise an issue with Harmony's natural language understanding.

Try Harmony Now! Newsletter signup

Harmony

Introduction to Harmony

Item harmonisation with natural language processing

Use Harmony in the browser

You can use Harmony on the web.

Try Harmony App

Use Harmony in Python

We’ve released Harmony as an easy-to-use Python library on Pypi, so you can incorporate Harmony into your data harmonisation workflow.

Python package on Pypi Source Code on Github Google Colab notebook Windows installation video Linux (Ubuntu) installation video Example scripts and Jupyter notebooks

pip install harmonydata
import harmony
harmony.download_models()
instruments = harmony.example_instruments["CES_D English"], 
harmony.example_instruments["GAD-7 Portuguese"]
questions, similarity, query_similarity, _ = harmony.match_instruments
(instruments) 
# How to load a PDF, Excel or Word into an instrument
harmony.load_instruments_from_local_file("gad-7.pdf")

Use Harmony in R

R users can now import Harmony as an R package for data harmonisation. By default, Harmony R connects to the Harmony API but you can also point it to Harmony API running on Docker and then you can run Harmony R on a computer without internet (see below).

R package on Github Documentation Google Colab notebook R markdown notebook for R Studio CRAN Windows installation video Example notebooks for RStudio

# install.packages("devtools") # If you don't have devtools installed already or CRAN is down.
# library(devtools)
# devtools::install_github("harmonydata/harmony_r")
install.packages("harmonydata")
library(harmonydata)
instrument = load_instruments_from_file(path = "examples/GAD-7.pdf")
instrument_2 = load_instruments_from_file("https://medfam.umontreal.ca/wp-content/uploads/sites/16/GAD-7-fran%C3%A7ais.pdf") 
instruments = append(instrument, instrument_2)
match = match_instruments(instruments)
names(match)

Harmony Docker container/API

We’ve released Harmony as a Docker container on Dockerhub, running a REST API using FastAPI. Check Dockerhub for the latest tag. You can run these commands in Windows Command Prompt or Terminal on Mac/Linux.

Dockerhub REST API REST API Documentation (PDF/Swagger)

docker run -p 8000:80 harmonydata/harmonyapi

Our Partners

Researcher at Kings College London
on using Harmony the first times

Our tool, Harmony, allows researchers to upload a set of mental health questionnaires in PDF or Excel format, such as the GAD-7 anxiety questionnaire. It identifies which questions among questionnaires are identical, similar in meaning, or antonyms of each other, and generates a network graph. This allows researchers to harmonise datasets.

Uniquely, Harmony relies on Transformer neural network architectures and is not dependent on a dictionary approach or word list. This allows for multilingual data harmonisation (English and Portuguese are our languages of focus), and Harmony is able to correctly map the GAD-7 used in the UK to the GAD-7 used in Brazil, despite the Brazilian questionnaire being in Brazilian Portuguese.

Using Harmony, our team was able to harmonise multilingual datasets and conduct groundbreaking research into social isolation and anxiety with NLP supplying a quantitative measure of the equivalence of the different mental health datasets.

HARMONY

A global platform for contextual harmonisation, translation and cooperation in mental health research

We aim to promote better integration of data harmonisation in mental health research through a natural language processing harmonisation tool (Harmony), allowing researchers to compare data from existing studies to investigate the active ingredients of mental health. Led by Dr Eoin McElroy from Ulster University and Dr Bettina Moltrecht from University College London, the team will develop Harmony to answer research questions around human connection and its influence on the development of depression and anxiety in young people.

More ways to connect

AI Challenges

Contribute to real-world AI and coding challenges.

Learn more

Discord Community

Join discussions with fellow researchers and developers.

Join

GitHub

Access our latest code, tools, and contributions. Become a certified contributor