Update: you can download the slides from the presentation here
Topic: NLP and generative models for psychology research
Thomas Wood will present our work on Harmony, harmonydata.ac.uk, which is a free online tool that uses generative AI and LLMs to help psychologists analyse datasets. It uses Python, Pandas and HuggingFace Sentence Transformers to find similarities between questionnaires.
Psychologists and social scientists often have to match items in different questionnaires, such as “I often feel anxious” and “Feeling nervous, anxious or afraid”.
This is called harmonisation.
Harmonisation is a time consuming and subjective process.
Going through long PDFs of questionnaires and putting the questions into Excel is no fun.
We’ve been working on an open source Python library and free web tool called Harmony which uses natural language processing and generative AI models to help researchers harmonise questionnaire items, even in different languages.
📅 Date: 2 July 2024
🕘 Time: 7pm
🏢 Where: EC4R 3AD
[Beta mode: we are currently testing this extension] We have developed a browser extension for Harmony called “Send to Harmony” which lets you send selected text to Harmony with a right-click. For PDFs, use the popup to paste your selected text. Send to Harmony enables users to send selected text to the Harmony Data Harmonization (https://harmonydata.ac.uk/) platform for analysis. This plugin provides a right-click or context menu item which allows users to easily bring text from into their harmonisations, making it easier to compare and analyze different measurement scales across research studies.
We have a number of exciting updates to Harmony including: some improvements to the R library which have been asked for by researchers around the world who have been using Harmony on studies in lots of different topics as well as making our own fine tuned large language model available in the web UI, which is José’s winning model from the DOXA challenge which ended on 10 January 2025. Harmony has its own Large Language Model!