ai-in-research

Competition to train a Large Language Model for Harmony on DOXA AI

Competition to train a Large Language Model for Harmony on DOXA AI

Harmony on DOXA AI: Train your own Large Language Model and win up to £500 in vouchers!

Join a competition to train a Large Language Model for mental health data. You don’t need to have trained a Large Language Model before.

Register on DOXA AI

Enter the competition on DOXA AI by fine tuning your own large language model and improve Harmony!

Join our Discord

Join the Harmony Discord server. Check out the 🏅「matching-challenge」 channel!

We would like to improve Harmony’s matching algorithm. Sometimes, Harmony mistakenly thinks that sentences are similar when a psychologist would consider them dissimilar, or vice versa. We have evaluated Harmony’s performance in this blog post.

Harmony vs Human

Harmony is sometimes misaligned with human evaluators

We would like to improve Harmony with a fine tuned large language model. We have teamed up with DOXA AI and made an online competition where you can improve on the off-the-shelf LLMs which we are currently using. You can win up to £500 in vouchers! Click here to join the Harmony matching competition on DOXA AI.

Register on DOXA AI

Enter the competition on DOXA AI by fine tuning your own large language model and improve Harmony!

Join our Discord

Join the Harmony Discord server. Check out the 🏅「matching-challenge」 channel!

Webinar recording

We had a livestreamed webinar/onboarding session to launch the competion on Wednesday 30 October at 5pm UK time.

What about data?

We have gathered training data for you to use to fine tune your model, and there is unseen validation data which we will use to score the model.

More information is available on the DOXA AI page.

How can I get started?

First, create an account on DOXA AI and enroll in the competition and download the code examples and training data.

Hugging Face has an excellent guide to fine tuning a large language model. We recommend using Hugging Face because we’ve already designed Harmony around the Hugging Face Python library, and it’s the best known framework for running LLMs in Python.

Prizes

The prize for the winner of the competition is £500 in vouchers and the runner up will get £250 in vouchers.

More information about how Harmony’s matching compares to human evaluators

The Harmony team has recently published a paper in BMC Psychiatry showing that there is a correlation between Harmony’s cosine similarity values and human evaluators, but this could be improved:

Register on DOXA AI

Enter the competition on DOXA AI by fine tuning your own large language model and improve Harmony!

Join our Discord

Join the Harmony Discord server. Check out the 🏅「matching-challenge」 channel!

Tutorial on fine tuning your own LLM. Download the notebook used in this tutorial

See other events

Register on DOXA AI

Enter the competition on DOXA AI by fine tuning your own large language model and improve Harmony!

Related Posts

Improving Harmony's PDF extraction with user testing

Improving Harmony's PDF extraction with user testing

Since we built Harmony, a common complaint has been that it frequently identifies the wrong questions in PDFs. The original algorithm for finding questions in PDFs was a mixture of rule based heuristics and some hand coded logic to look for e.g. lines in the document which begin with numbers. This was very fragile and worked fine on short questionnaires such as the GAD-7, but failed on larger documents. We decided to run a competition with our partner DOXA AI where members of the public could train their own model to extract questions from PDFs.
Harmony at MQ and DataMind Data Science Workshop

Harmony at MQ and DataMind Data Science Workshop

Harmony at MQ and Datamind Data Science workshop On 2 May 2025, Dr Eoin McElroy demonstrated Harmony at the MQ and Datamind Data Science workshop in Deutsche Bank. Eoin’s presentation focused on “Maximising the use of existing survey data: facilitating cross-study research using retrospective harmonization.” The workshop brought together researchers interested in applying novel harmonisation techniques to existing datasets. Eoin explained traditional harmonisation processes and presented a user-friendly guide to the Harmony tool, demonstrating how natural language processing can streamline the harmonisation process.

Signup to our newsletter

The latest news on data harmonisation project.

Please select all the ways you would like to hear from Harmony project:

You can unsubscribe at any time by clicking the link in the footer of our emails. For information about our privacy practices, please visit our website. We use Mailchimp as our marketing platform. By clicking below to subscribe, you acknowledge that your information will be transferred to Mailchimp for processing. Learn more about Mailchimp's privacy practices.