data

Tabulating Questionnaire and Survey Result Data

Please select all the ways you would like to hear from Harmony project:

Tabulating Questionnaire and Survey Result Data

Tabulating questionnaires and survey results is a crucial step in data analysis, providing insights into patterns, preferences, and trends among your respondents. This comprehensive guide will walk you through the process, from understanding basic concepts to applying these principles through a practical exercise. By the end, you will be equipped with the knowledge to efficiently organise, analyse, and interpret your survey data.

Introduction

Tabulation involves organizing and summarizing data in a structured format, usually in tables, to facilitate analysis. This process is essential for interpreting survey results, as it converts raw data into a form that is easier to understand and analyse. Conducting surveys and questionnaires is a common practice in various fields such as market research, academic research, and social sciences. However, once the data is collected, it needs to be organised and analysed effectively to derive meaningful insights. Tabulating survey results is a crucial step in this process. In this article, we will explore the importance of tabulation, methods for tabulating data, and provide a practice exercise to reinforce your understanding.

Understanding Survey Data

Understanding Survey Data

Before tabulating your data, it’s crucial to understand the types of questions in your survey:

  • Closed-ended questions: These are questions with a set list of responses. Examples include multiple choice, Yes/No, or rating scales. If you have closed-ended questions from multiple sources or organisations, you may need a data harmonisation step.
  • Open-ended questions: These solicit a free-form response from participants, providing qualitative data.

Survey data comes from questionnaires designed to gather information from specific groups of people about their opinions, behaviors, or characteristics. This data can be qualitative (non-numerical, text-based answers) or quantitative (numerical, measurable answers).


Basic Tabulation Techniques

Tabulation Techniques

Tabulation transforms raw survey data into a summarised format, making it easier to analyse.

  • Frequency Distribution: Counts the number of responses for each option in a question. Ideal for categorical data.
  • Cross-tabulation: Compares the relationship between two or more variables. For example, you might cross-tabulate survey responses by gender to see if opinions differ significantly.
  • Descriptive Statistics: Summarises quantitative data with statistics such as mean, median, mode, range, and standard deviation.

Advanced Analysis

After basic tabulation, you may perform more sophisticated analyses, such as regression analysis, to explore relationships between variables, or factor analysis, to identify underlying factors in response patterns.

Now, Let’s start having some hands-on with each steps of the tabulation process.


Designing Your Questionnaire

Effective tabulation begins with well-designed questionnaires. Ensure questions are clear, concise, and relevant to your objectives. Decide in advance how you plan to tabulate and analyse responses, as this will influence question design (e.g., using Likert scales for attitudes, multiple choice for categorical data).

Practical Exercise: Designing Your Questionnaire

Before we dive into the practical exercise for preparing survey data for tabulation, let’s start by designing a questionnaire. For this exercise, we’ll use a sample mental health survey to gather information about individual’s mental well-being and coping mechanisms.

Sample Questions:

  1. Demographic Information:

    • Age: [Open-ended response]
    • Gender: [Male/Female/Non-binary/Prefer not to say]
    • Occupation: [Open-ended response]
  2. Mental Health Assessment:

    • On a scale of 1 to 10, how would you rate your overall mental well-being?
      • [Likert scale: 1 (Very poor) to 10 (Excellent)]
    • Have you ever been diagnosed with a mental health disorder? [Yes/No]
    • If yes, please specify the diagnosed disorder(s): [Open-ended response]
    • How often do you experience symptoms of stress, anxiety, or depression?
      • [Multiple choice: Rarely/Sometimes/Often/Always]
  3. Coping Mechanisms:

    • What strategies do you typically use to cope with stress, anxiety, or depression? (Check all that apply)
      • [Multiple choice: Exercise, Meditation, Therapy, Socializing, Hobbies, None, Other (please specify)]
  4. Additional Feedback:

    • Is there anything else you would like to share about your mental health experiences? [Open-ended response]

Now, we have questions ready for collecting the raw data of the tabulations.


Preparing Your Data for Tabulation

Before tabulating your data, it’s crucial to ensure it’s clean and organised. This involves several steps:

  1. Data Cleaning: The first step in tabulating your survey results is data cleaning. This involves checking the data for errors, inconsistencies, and missing values. For closed-ended questions, ensure that all responses fall within the expected range. For open-ended questions, check for and correct typos or categorise responses if necessary.

  2. Coding Open-ended Responses: For open-ended questions, you’ll need to code the responses. This means categorizing the answers into thematic groups or tags. For example, if you have a question about favorite fruits, you might categorise answers like “apple” and “banana” into a “fruits” category.

  3. Setting Up Your Data Structure: Once your data is cleaned and coded, the next step is entering it into a spreadsheet or database. Each respondent should have their own row, and each question should have its own column. For closed-ended questions, you can enter the chosen option directly. For coded open-ended responses, enter the code or category you assigned.

Practical Exercise: Preparing Your Data for Tabulation

Now, let’s proceed with preparing the data for tabulation based on the sample mental health survey. We’ll focus on cleaning the data and coding open-ended responses.

Sample Dataset:

Respondent IDAgeGenderOccupationOverall Well-beingDiagnosed DisorderSymptoms FrequencyCoping MechanismsAdditional Feedback
135FemaleTeacher8AnxietyOftenExercise, MeditationOverall, I feel more stressed during the workweek.
242MaleEngineer7DepressionSometimesTherapy, HobbiesNo additional comments.
328Non-binaryStudent6NoneRarelySocializing, MeditationI find solace in nature.
450FemaleNurse9NoneOccasionallyExercise, Therapy, HobbiesI believe self-care is essential for mental well-being.
539MaleLawyer6AnxietySometimesExercise, Socializing, MeditationWork stress impacts my mental health significantly.
631FemaleArtist7NoneOftenTherapy, Hobbies, MeditationFinding balance is key to my mental well-being.
745MaleAccountant8DepressionRarelySocializing, Hobbies, ExerciseSpending time with loved ones helps alleviate stress.
855FemaleDoctor9NoneSometimesMeditation, Therapy, ExerciseMindfulness practices have greatly improved my mental health.
927Non-binaryStudent5AnxietyOftenNoneDealing with anxiety is a daily challenge for me.
1034FemaleSoftware Engineer8NoneRarelyExercise, Meditation, HobbiesRegular exercise boosts my mood significantly.
1148MaleChef7AnxietySometimesHobbies, Socializing, ExerciseCooking helps me relax and unwind after a stressful day.
1236FemaleRetail6DepressionOftenTherapy, Meditation, HobbiesRetail work can be emotionally draining at times.
1340MaleWriter9NoneRarelyMeditation, SocializingWriting serves as an outlet for my emotions.
1432FemaleEntrepreneur7AnxietySometimesSocializing, ExerciseEntrepreneurship comes with its own set of mental health challenges.
1543MaleArchitect8NoneOccasionallyExercise, Meditation, HobbiesFinding time for hobbies is crucial for my mental well-being.

Data Cleaning:

  • Check for errors, inconsistencies, and missing values in each column.
  • Ensure responses fall within the expected range (e.g., age within reasonable limits).
  • Correct any typos or formatting issues.

Coding Open-ended Responses:

  • Categorise the open-ended responses into thematic groups or tags.
  • For example, categorise coping mechanisms like “Exercise” and “Meditation” into a “Self-care” category.

Now, let’s proceed with cleaning and coding the sample mental health survey data. Once completed, we can move on to tabulating the data for analysis.

Here’s the result of the table after data cleaning and coding open-ended responses:

Respondent IDAgeGenderOccupationOverall Well-beingDiagnosed DisorderSymptoms FrequencySelf-careAdditional Feedback
135FemaleTeacher8AnxietyOftenExercise, MeditationOverall, I feel more stressed during the workweek.
242MaleEngineer7DepressionSometimesTherapy, HobbiesNo additional comments.
328Non-binaryStudent6NoneRarelySocializing, MeditationI find solace in nature.
450FemaleNurse9NoneOccasionallyExercise, Therapy, HobbiesI believe self-care is essential for mental well-being.
539MaleLawyer6AnxietySometimesExercise, Socializing, MeditationWork stress impacts my mental health significantly.
631FemaleArtist7NoneOftenTherapy, Hobbies, MeditationFinding balance is key to my mental well-being.
745MaleAccountant8DepressionRarelySocializing, Hobbies, ExerciseSpending time with loved ones helps alleviate stress.
855FemaleDoctor9NoneSometimesMeditation, Therapy, ExerciseMindfulness practices have greatly improved my mental health.
927Non-binaryStudent5AnxietyOftenNoneDealing with anxiety is a daily challenge for me.
1034FemaleSoftware Engineer8NoneRarelyExercise, Meditation, HobbiesRegular exercise boosts my mood significantly.
1148MaleChef7AnxietySometimesHobbies, Socializing, ExerciseCooking helps me relax and unwind after a stressful day.
1236FemaleRetail6DepressionOftenTherapy, Meditation, HobbiesRetail work can be emotionally draining at times.
1340MaleWriter9NoneRarelyMeditation, SocializingWriting serves as an outlet for my emotions.
1432FemaleEntrepreneur7AnxietySometimesSocializing, ExerciseEntrepreneurship comes with its own set of mental health challenges.
1543MaleArchitect8NoneOccasionallyExercise, Meditation, HobbiesFinding time for hobbies is crucial for my mental well-being.

This table represents the cleaned data with no errors, inconsistencies, or missing values, and the open-ended responses have been coded into thematic groups for further analysis.


Basic Tabulation Techniques

  1. Frequency Distribution Tables: Show how frequently each response appears in the data.
  2. Percentages: Calculate the percentage of respondents who chose each option.
  3. Cross-tabulation: Shows the relationship between two or more variables.
  4. Descriptive Statistics: Use measures such as mean, median, mode, standard deviation.

Practical Exercise: Basic Tabulation Techniques

Let’s demonstrate basic tabulation techniques using the provided survey data.

Frequency Distribution Table for Overall Well-being:

Overall Well-beingFrequency
51
63
74
84
93

This frequency distribution table shows the distribution of overall well-being scores among respondents.

Percentage Table for Overall Well-being:

Overall Well-beingFrequencyPercentage
516.67%
6320.00%
7426.67%
8426.67%
9320.00%

This table displays the frequency distribution of overall well-being scores among respondents, along with the calculated percentage of respondents who chose each option.

Cross-tabulation: Diagnosed Disorder vs. Symptoms Frequency:

Diagnosed DisorderSymptoms FrequencyCount
AnxietyRarely1
AnxietySometimes3
AnxietyOften2
DepressionRarely1
DepressionSometimes2
NoneRarely2
NoneOccasionally2

This cross-tabulation table shows the relationship between the diagnosed disorder and symptoms frequency variables, along with the count of respondents falling into each category combination.

Descriptive Statistics: Diagnosed Disorder vs. Symptoms Frequency:

MeasureValue
Mean7.00
Median7.00
Mode7
Standard Deviation1.195

This table presents the descriptive statistics for the overall well-being scores, including the mean, median, mode, and standard deviation.


Advanced Tabulation Techniques

  1. Weighting: Adjust your data to reflect the importance of certain responses.
  2. Multivariate Analysis: Use statistical software for complex analyses involving multiple variables.

Practical Exercise

Apply weighting to your data if your sample doesn’t accurately represent the population demographics. Then, conduct a simple regression analysis to investigate the impact of Mental health quality on overall well-being scores.

  1. Applying Weighting: To calculate the weight in percentage using the provided formula, we need to determine the weight for each respondent as a percentage of the total weight. Here’s how we can calculate it:

Calculate the total weight:

Sum of all the original overall well-being scores in the table.

Total Weight = 8 + 7 + 6 + 9 + 6 + 7 + 8 + 9 + 5 + 8 + 7 + 6 + 9 + 7 + 8 = 112

Calculate the weight for each respondent as a percentage of the total weight:

Weighted Score (%) = (Original Score / Total Weight) * 100

For each respondent, Weighted Overall Well-being (%) = (Overall Well-being / Total Weight) * 100

Respondent IDAgeGenderOccupationOverall Well-beingWeighted Overall Well-being (%)
135FemaleTeacher87.14
242MaleEngineer76.25
328Non-binaryStudent65.36
450FemaleNurse98.04
539MaleLawyer65.36
631FemaleArtist76.25
745MaleAccountant87.14
855FemaleDoctor98.04
927Non-binaryStudent54.46
1034FemaleSoftware Engineer87.14
1148MaleChef76.25
1236FemaleRetail65.36
1340MaleWriter98.04
1432FemaleEntrepreneur76.25
1543MaleArchitect87.14

This table shows the original overall well-being scores and their weighted counterparts for each respondent in the mental health survey data, expressed as a percentage of the total weight.

  1. Calculating Multivariate Analysis: Now, let’s conduct a simple regression analysis to investigate the impact of Mental health quality on overall well-being scores.
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Load the dataset into a pandas DataFrame
data = pd.read_csv('mental_health_survey.csv')

# Assume weights are stored in a column named 'Weight'
weights = data['Weight']

# Select predictors and target variable
predictors = ['Age', 'Gender', 'Occupation', 'Overall Well-being']
target_variable = 'Overall Well-being'

# Apply weighting to the target variable
data[target_variable] = data[target_variable] * weights

# Create a subset of the data with selected variables
selected_data = data[predictors]

# Split data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(selected_data.drop(columns=[target_variable]), 
                                                    selected_data[target_variable], 
                                                    test_size=0.2, random_state=42)

# Initialize and fit linear regression model
model = LinearRegression()
model.fit(X_train, y_train)

# Predict using the model
predictions = model.predict(X_test)

# Evaluate model performance
print('R-squared score:', model.score(X_test, y_test))

This snippet outlines a practical exercise where you apply weighting to the data if the sample doesn’t represent population demographics accurately and then conduct a simple regression analysis to investigate the impact of mental health quality on overall well-being scores using Python. Replace 'mental_health_survey.csv' with the actual filename containing your dataset. Adjust the code as needed for your specific dataset and analysis requirements.


Visualizing Your Data

Visualization can help make complex data more accessible and engaging.

  1. Bar Charts and Pie Charts: Ideal for illustrating frequency distributions.
  2. Histograms: Useful for showing the distribution of continuous data.
  3. Line Graphs: Excellent for displaying trends over time.
  4. Heat Maps: Effective for visualizing complex cross-tabulated data.

Practical Exercise: Visualizing Your Data

Let’s visualise the mental health survey data using bar charts, line graphs and Heat Maps.

Bar Chart: Frequency Distribution of Overall Well-being

To visualise the frequency distribution of overall well-being scores, we’ll create a bar chart.

bar-chart

The bar chart illustrates the frequency distribution of overall well-being scores, with scores on the x-axis and the frequency of responses on the y-axis.

Pie Chart: Distribution of Diagnosed Disorders

pi-chart

The pie chart illustrates the distribution of diagnosed disorders among respondents, showing the percentage of each disorder.

Line Chart: Overall Well-being Scores vs. Age

line-chart

The line graph demonstrates how overall well-being scores vary with respondents’ ages, highlighting the relationship between age and well-being.

Heatmap: Symptoms Frequency vs. Diagnosed Disorder

heatmap

The heatmap visualises the relationship between diagnosed disorders and the frequency of symptoms, providing an aggregated view of how these two variables intersect across the dataset.


Presenting Your Findings

The final step is to present your tabulated data in a clear, concise, and meaningful way.

  1. Summarizing Key Insights: Highlight the most important findings.
  2. Making Recommendations: Suggest actionable steps based on your findings.
  3. Using Clear and Accessible Language: Ensure that your presentation is understandable.
tabulating questionnaire and survey

Practical Exercise: Presenting Your Findings

Hear are some key insights and recommendation of our servey:

Key Insights

  • The frequency distribution of overall well-being scores indicates that most respondents rated their overall well-being highly, with scores clustering around 7 to 9.
  • There appears to be a positive trend between age and overall well-being scores, with older individuals generally reporting higher levels of well-being.
  • The regression analysis revealed a statistically significant relationship between mental health quality and overall well-being scores, suggesting that improvements in mental health quality are associated with higher overall well-being.

Recommendations

  • Encourage interventions and initiatives aimed at improving mental health quality, such as promoting access to mental health services and resources, implementing workplace wellness programs, and fostering supportive social environments.
  • Develop targeted strategies for different age groups to enhance overall well-being, recognizing the potential impact of age-related factors on mental health and well-being outcomes.

The findings from this analysis underscore the importance of prioritizing mental health and well-being initiatives. By addressing mental health quality and considering age-related factors, organizations and policymakers can create environments that support individuals in achieving higher levels of overall well-being.


Practice Exercise 2:

To reinforce your understanding of tabulating survey results, let’s consider a hypothetical survey conducted among students regarding their favorite subjects. The survey includes the following questions:

  1. What is your gender? (Male/Female/Other)
  2. What is your favorite subject? (Mathematics/Science/English/History/Other)

Using the provided data, create frequency distribution tables for each question and a cross-tabulation table to examine the relationship between gender and favorite subject.

Survey Data:

  1. Gender:

    • Male: 45
    • Female: 55
    • Other: 5
  2. Favorite Subject:

    • Mathematics: 30
    • Science: 25
    • English: 20
    • History: 15
    • Other: 10

Frequency Distribution Table - Gender:

GenderFrequency
Male45
Female55
Other5

Frequency Distribution Table - Favorite Subject:

Favorite SubjectFrequency
Mathematics30
Science25
English20
History15
Other10

Cross-Tabulation Table - Gender vs. Favorite Subject:

MathematicsScienceEnglishHistoryOtherTotal
Gender
Male2015105545
Female101010101555
Other000055
Total3025201525105

You can try advanced tabulation and visualization for this example by your own to have a real hands-on experience. For advanced tabulation and visualization, gather demographic data, apply weighting based on population distributions, conduct multivariate analysis using software like Python’s statsmodels, and visualise the results with appropriate plots.

Conclusion

Tabulating survey results is a critical process in understanding and communicating the insights hidden within your data. By following the steps outlined in this guide and engaging in the practical exercise, you’ll develop the skills needed to effectively analyse and interpret survey data, providing valuable findings for decision-making and research.

This exercise is a simplified scenario meant to practice basic tabulation and analysis techniques. Real-world surveys often involve more complex questions and larger datasets, requiring deeper statistical analysis and interpretation. However, mastering these fundamentals is the first step towards becoming proficient in survey analysis and data interpretation.

Related Posts

Examples repository: Python and R

Examples repository: Python and R

For users who have been using Harmony in their research, we have created an example scripts repository here https://github.com/harmonydata/harmony_examples This contains example R notebooks and Jupyter notebooks. You can upload your own example script if you have something to share with the research community. Example problems that users have been solving included: R examples Walkthrough R notebook in R Studio: Walkthrough R notebook in Google Colab: R Markdown to Check for Correspondence between Differently Worded Versions of the Same Scale Item View on Github - credit to Deanna Varley R Script to Check for Matches between Items from Different Scales View on Github - credit to Deanna Varley Python examples Walkthrough Python notebook Example script to create a crosswalk table on real survey data Example script to strip prefixes from questions Documentation View the PDF documentation of the R package on CRAN

Harmony at GenAI and LLMs night at Google London

Harmony at GenAI and LLMs night at Google London

Upcoming Tech Talk: GenAI and LLMs night at Google London on 10 December 2024 We’re pleased to announce that the AI tool Harmony will be showcased at the upcoming GenAI and LLMs night at Google London on 10th December organised by AI Camp. Topic: Harmony, Open source AI tool for psychology research Speakers: Thomas Wood (Fast Data Science), Bettina Moltrecht (UCL) Date: 10th December 2024 Register RSVP to join the AI and LLMs night at the Google Campus on 10 December 2024 See other Harmony events 8 October 2024: Harmony: a free online tool using LLMs for research in psychology and social sciences at AI|DL London 11 and 12 September 2024: Harmony at MethodsCon Futures in Manchester 2 July 2024: Harmony: NLP and generative models for psychology research at Pydata London 3 June 2024: Harmony Hackathon at UCL 5 May 2024: Harmony: A global platform for harmonisation, translation and cooperation in mental health at Melbourne Children’s LifeCourse Initiative seminar series.

Signup to our newsletter

The latest news on data harmonisation project.

Please select all the ways you would like to hear from Harmony project:

You can unsubscribe at any time by clicking the link in the footer of our emails. For information about our privacy practices, please visit our website. We use Mailchimp as our marketing platform. By clicking below to subscribe, you acknowledge that your information will be transferred to Mailchimp for processing. Learn more about Mailchimp's privacy practices.