methods of research in social work

Social Work Research Methods That Drive the Practice

A social worker surveys a community member.

Social workers advocate for the well-being of individuals, families and communities. But how do social workers know what interventions are needed to help an individual? How do they assess whether a treatment plan is working? What do social workers use to write evidence-based policy?

Social work involves research-informed practice and practice-informed research. At every level, social workers need to know objective facts about the populations they serve, the efficacy of their interventions and the likelihood that their policies will improve lives. A variety of social work research methods make that possible.

Data-Driven Work

Data is a collection of facts used for reference and analysis. In a field as broad as social work, data comes in many forms.

Quantitative vs. Qualitative

As with any research, social work research involves both quantitative and qualitative studies.

Quantitative Research

Answers to questions like these can help social workers know about the populations they serve — or hope to serve in the future.

How many students currently receive reduced-price school lunches in the local school district?
How many hours per week does a specific individual consume digital media?
How frequently did community members access a specific medical service last year?

Quantitative data — facts that can be measured and expressed numerically — are crucial for social work.

Quantitative research has advantages for social scientists. Such research can be more generalizable to large populations, as it uses specific sampling methods and lends itself to large datasets. It can provide important descriptive statistics about a specific population. Furthermore, by operationalizing variables, it can help social workers easily compare similar datasets with one another.

Qualitative Research

Qualitative data — facts that cannot be measured or expressed in terms of mere numbers or counts — offer rich insights into individuals, groups and societies. It can be collected via interviews and observations.

What attitudes do students have toward the reduced-price school lunch program?
What strategies do individuals use to moderate their weekly digital media consumption?
What factors made community members more or less likely to access a specific medical service last year?

Qualitative research can thereby provide a textured view of social contexts and systems that may not have been possible with quantitative methods. Plus, it may even suggest new lines of inquiry for social work research.

Mixed Methods Research

Combining quantitative and qualitative methods into a single study is known as mixed methods research. This form of research has gained popularity in the study of social sciences, according to a 2019 report in the academic journal Theory and Society. Since quantitative and qualitative methods answer different questions, merging them into a single study can balance the limitations of each and potentially produce more in-depth findings.

However, mixed methods research is not without its drawbacks. Combining research methods increases the complexity of a study and generally requires a higher level of expertise to collect, analyze and interpret the data. It also requires a greater level of effort, time and often money.

The Importance of Research Design

Data-driven practice plays an essential role in social work. Unlike philanthropists and altruistic volunteers, social workers are obligated to operate from a scientific knowledge base.

To know whether their programs are effective, social workers must conduct research to determine results, aggregate those results into comprehensible data, analyze and interpret their findings, and use evidence to justify next steps.

Employing the proper design ensures that any evidence obtained during research enables social workers to reliably answer their research questions.

Research Methods in Social Work

The various social work research methods have specific benefits and limitations determined by context. Common research methods include surveys, program evaluations, needs assessments, randomized controlled trials, descriptive studies and single-system designs.

Surveys involve a hypothesis and a series of questions in order to test that hypothesis. Social work researchers will send out a survey, receive responses, aggregate the results, analyze the data, and form conclusions based on trends.

Surveys are one of the most common research methods social workers use — and for good reason. They tend to be relatively simple and are usually affordable. However, surveys generally require large participant groups, and self-reports from survey respondents are not always reliable.

Program Evaluations

Social workers ally with all sorts of programs: after-school programs, government initiatives, nonprofit projects and private programs, for example.

Crucially, social workers must evaluate a program’s effectiveness in order to determine whether the program is meeting its goals and what improvements can be made to better serve the program’s target population.

Evidence-based programming helps everyone save money and time, and comparing programs with one another can help social workers make decisions about how to structure new initiatives. Evaluating programs becomes complicated, however, when programs have multiple goal metrics, some of which may be vague or difficult to assess (e.g., “we aim to promote the well-being of our community”).

Needs Assessments

Social workers use needs assessments to identify services and necessities that a population lacks access to.

Common social work populations that researchers may perform needs assessments on include:

People in a specific income group
Everyone in a specific geographic region
A specific ethnic group
People in a specific age group

In the field, a social worker may use a combination of methods (e.g., surveys and descriptive studies) to learn more about a specific population or program. Social workers look for gaps between the actual context and a population’s or individual’s “wants” or desires.

For example, a social worker could conduct a needs assessment with an individual with cancer trying to navigate the complex medical-industrial system. The social worker may ask the client questions about the number of hours they spend scheduling doctor’s appointments, commuting and managing their many medications. After learning more about the specific client needs, the social worker can identify opportunities for improvements in an updated care plan.

In policy and program development, social workers conduct needs assessments to determine where and how to effect change on a much larger scale. Integral to social work at all levels, needs assessments reveal crucial information about a population’s needs to researchers, policymakers and other stakeholders. Needs assessments may fall short, however, in revealing the root causes of those needs (e.g., structural racism).

Randomized Controlled Trials

Randomized controlled trials are studies in which a randomly selected group is subjected to a variable (e.g., a specific stimulus or treatment) and a control group is not. Social workers then measure and compare the results of the randomized group with the control group in order to glean insights about the effectiveness of a particular intervention or treatment.

Randomized controlled trials are easily reproducible and highly measurable. They’re useful when results are easily quantifiable. However, this method is less helpful when results are not easily quantifiable (i.e., when rich data such as narratives and on-the-ground observations are needed).

Descriptive Studies

Descriptive studies immerse the researcher in another context or culture to study specific participant practices or ways of living. Descriptive studies, including descriptive ethnographic studies, may overlap with and include other research methods:

Informant interviews
Census data
Observation

By using descriptive studies, researchers may glean a richer, deeper understanding of a nuanced culture or group on-site. The main limitations of this research method are that it tends to be time-consuming and expensive.

Single-System Designs

Unlike most medical studies, which involve testing a drug or treatment on two groups — an experimental group that receives the drug/treatment and a control group that does not — single-system designs allow researchers to study just one group (e.g., an individual or family).

Single-system designs typically entail studying a single group over a long period of time and may involve assessing the group’s response to multiple variables.

For example, consider a study on how media consumption affects a person’s mood. One way to test a hypothesis that consuming media correlates with low mood would be to observe two groups: a control group (no media) and an experimental group (two hours of media per day). When employing a single-system design, however, researchers would observe a single participant as they watch two hours of media per day for one week and then four hours per day of media the next week.

These designs allow researchers to test multiple variables over a longer period of time. However, similar to descriptive studies, single-system designs can be fairly time-consuming and costly.

Learn More About Social Work Research Methods

Social workers have the opportunity to improve the social environment by advocating for the vulnerable — including children, older adults and people with disabilities — and facilitating and developing resources and programs.

Learn more about how you can earn your Master of Social Work online at Virginia Commonwealth University . The highest-ranking school of social work in Virginia, VCU has a wide range of courses online. That means students can earn their degrees with the flexibility of learning at home. Learn more about how you can take your career in social work further with VCU.

From M.S.W. to LCSW: Understanding Your Career Path as a Social Worker

How Palliative Care Social Workers Support Patients With Terminal Illnesses

How to Become a Social Worker in Health Care

Gov.uk, Mixed Methods Study

MVS Open Press, Foundations of Social Work Research

Open Social Work Education, Scientific Inquiry in Social Work

Open Social Work, Graduate Research Methods in Social Work: A Project-Based Approach

Routledge, Research for Social Workers: An Introduction to Methods

SAGE Publications, Research Methods for Social Work: A Problem-Based Approach

Theory and Society, Mixed Methods Research: What It Is and What It Could Be

READY TO GET STARTED WITH OUR ONLINE M.S.W. PROGRAM FORMAT?

Bachelor’s degree is required.

VCU Program Helper

This AI chatbot provides automated responses, which may not always be accurate. By continuing with this conversation, you agree that the contents of this chat session may be transcribed and retained. You also consent that this chat session and your interactions, including cookie usage, are subject to our privacy policy .

Subject List
Take a Tour
For Authors
Subscriber Services
Publications
African American Studies
African Studies
American Literature
Anthropology
Architecture Planning and Preservation
Art History
Atlantic History
Biblical Studies
British and Irish Literature
Childhood Studies
Chinese Studies
Cinema and Media Studies
Communication
Criminology
Environmental Science
Evolutionary Biology
International Law
International Relations
Islamic Studies
Jewish Studies
Latin American Studies
Latino Studies
Linguistics
Literary and Critical Theory
Medieval Studies
Military History
Political Science
Public Health
Renaissance and Reformation

Social Work

Urban Studies
Victorian Literature
Browse All Subjects

How to Subscribe

Free Trials

In This Article Expand or collapse the "in this article" section Social Work Research Methods

Introduction.

History of Social Work Research Methods
Feasibility Issues Influencing the Research Process
Measurement Methods
Existing Scales
Group Experimental and Quasi-Experimental Designs for Evaluating Outcome
Single-System Designs for Evaluating Outcome
Program Evaluation
Surveys and Sampling
Introductory Statistics Texts
Advanced Aspects of Inferential Statistics
Qualitative Research Methods
Qualitative Data Analysis
Historical Research Methods
Meta-Analysis and Systematic Reviews
Research Ethics
Culturally Competent Research Methods
Teaching Social Work Research Methods

Other Subject Areas

Forthcoming articles expand or collapse the "forthcoming articles" section.

Abolitionist Perspectives in Social Work
Randomized Controlled Trials in Social Work
Social Work Practice with Transgender and Gender Expansive Youth
Find more forthcoming articles...
Export Citations
Share This Facebook LinkedIn Twitter

Social Work Research Methods by Allen Rubin LAST REVIEWED: 14 December 2009 LAST MODIFIED: 14 December 2009 DOI: 10.1093/obo/9780195389678-0008

Social work research means conducting an investigation in accordance with the scientific method. The aim of social work research is to build the social work knowledge base in order to solve practical problems in social work practice or social policy. Investigating phenomena in accordance with the scientific method requires maximal adherence to empirical principles, such as basing conclusions on observations that have been gathered in a systematic, comprehensive, and objective fashion. The resources in this entry discuss how to do that as well as how to utilize and teach research methods in social work. Other professions and disciplines commonly produce applied research that can guide social policy or social work practice. Yet no commonly accepted distinction exists at this time between social work research methods and research methods in allied fields relevant to social work. Consequently useful references pertaining to research methods in allied fields that can be applied to social work research are included in this entry.

This section includes basic textbooks that are used in courses on social work research methods. Considerable variation exists between textbooks on the broad topic of social work research methods. Some are comprehensive and delve into topics deeply and at a more advanced level than others. That variation is due in part to the different needs of instructors at the undergraduate and graduate levels of social work education. Most instructors at the undergraduate level prefer shorter and relatively simplified texts; however, some instructors teaching introductory master’s courses on research prefer such texts too. The texts in this section that might best fit their preferences are by Yegidis and Weinbach 2009 and Rubin and Babbie 2007 . The remaining books might fit the needs of instructors at both levels who prefer a more comprehensive and deeper coverage of research methods. Among them Rubin and Babbie 2008 is perhaps the most extensive and is often used at the doctoral level as well as the master’s and undergraduate levels. Also extensive are Drake and Jonson-Reid 2007 , Grinnell and Unrau 2007 , Kreuger and Neuman 2006 , and Thyer 2001 . What distinguishes Drake and Jonson-Reid 2007 is its heavy inclusion of statistical and Statistical Package for the Social Sciences (SPSS) content integrated with each chapter. Grinnell and Unrau 2007 and Thyer 2001 are unique in that they are edited volumes with different authors for each chapter. Kreuger and Neuman 2006 takes Neuman’s social sciences research text and adapts it to social work. The Practitioner’s Guide to Using Research for Evidence-based Practice ( Rubin 2007 ) emphasizes the critical appraisal of research, covering basic research methods content in a relatively simplified format for instructors who want to teach research methods as part of the evidence-based practice process instead of with the aim of teaching students how to produce research.

Drake, Brett, and Melissa Jonson-Reid. 2007. Social work research methods: From conceptualization to dissemination . Boston: Allyn and Bacon.

This introductory text is distinguished by its use of many evidence-based practice examples and its heavy coverage of statistical and computer analysis of data.

Grinnell, Richard M., and Yvonne A. Unrau, eds. 2007. Social work research and evaluation: Quantitative and qualitative approaches . 8th ed. New York: Oxford Univ. Press.

Contains chapters written by different authors, each focusing on a comprehensive range of social work research topics.

Kreuger, Larry W., and W. Lawrence Neuman. 2006. Social work research methods: Qualitative and quantitative applications . Boston: Pearson, Allyn, and Bacon.

An adaptation to social work of Neuman's social sciences research methods text. Its framework emphasizes comparing quantitative and qualitative approaches. Despite its title, quantitative methods receive more attention than qualitative methods, although it does contain considerable qualitative content.

Rubin, Allen. 2007. Practitioner’s guide to using research for evidence-based practice . Hoboken, NJ: Wiley.

This text focuses on understanding quantitative and qualitative research methods and designs for the purpose of appraising research as part of the evidence-based practice process. It also includes chapters on instruments for assessment and monitoring practice outcomes. It can be used at the graduate or undergraduate level.

Rubin, Allen, and Earl R. Babbie. 2007. Essential research methods for social work . Belmont, CA: Thomson Brooks Cole.

This is a shorter and less advanced version of Rubin and Babbie 2008 . It can be used for research methods courses at the undergraduate or master's levels of social work education.

Rubin, Allen, and Earl R. Babbie. Research Methods for Social Work . 6th ed. Belmont, CA: Thomson Brooks Cole, 2008.

This comprehensive text focuses on producing quantitative and qualitative research as well as utilizing such research as part of the evidence-based practice process. It is widely used for teaching research methods courses at the undergraduate, master’s, and doctoral levels of social work education.

Thyer, Bruce A., ed. 2001 The handbook of social work research methods . Thousand Oaks, CA: Sage.

This comprehensive compendium includes twenty-nine chapters written by esteemed leaders in social work research. It covers quantitative and qualitative methods as well as general issues.

Yegidis, Bonnie L., and Robert W. Weinbach. 2009. Research methods for social workers . 6th ed. Boston: Allyn and Bacon.

This introductory paperback text covers a broad range of social work research methods and does so in a briefer fashion than most lengthier, hardcover introductory research methods texts.

Users without a subscription are not able to see the full content on this page. Please subscribe or login .

Oxford Bibliographies Online is available by subscription and perpetual access to institutions. For more information or to contact an Oxford Sales Representative click here .

About Social Work »
Meet the Editorial Board »
Adolescent Depression
Adolescent Pregnancy
Adolescents
Adoption Home Study Assessments
Adult Protective Services in the United States
African Americans
Aging out of foster care
Aging, Physical Health and
Alcohol and Drug Abuse Problems
Alcohol and Drug Problems, Prevention of Adolescent and Yo...
Alcohol Problems: Practice Interventions
Alcohol Use Disorder
Alzheimer's Disease and Other Dementias
Anti-Oppressive Practice
Asian Americans
Asian-American Youth
Autism Spectrum Disorders
Baccalaureate Social Workers
Behavioral Health
Behavioral Social Work Practice
Bereavement Practice
Bisexuality
Brief Therapies in Social Work: Task-Centered Model and So...
Bullying and Social Work Intervention
Canadian Social Welfare, History of
Case Management in Mental Health in the United States
Central American Migration to the United States
Child Maltreatment Prevention
Child Neglect and Emotional Maltreatment
Child Poverty
Child Sexual Abuse
Child Welfare
Child Welfare and Child Protection in Europe, History of
Child Welfare and Parents with Intellectual and/or Develop...
Child Welfare Effectiveness
Child Welfare, Immigration and
Child Welfare Practice with LGBTQ Youth and Families
Children of Incarcerated Parents
Christianity and Social Work
Chronic Illness
Clinical Social Work Practice with Adult Lesbians
Clinical Social Work Practice with Males
Cognitive Behavior Therapies with Diverse and Stressed Pop...
Cognitive Processing Therapy
Cognitive-Behavioral Therapy
Community Development
Community Policing
Community-Needs Assessment
Comparative Social Work
Computational Social Welfare: Applying Data Science in Soc...
Conflict Resolution
Council on Social Work Education
Counseling Female Offenders
Criminal Justice
Crisis Interventions
Cultural Competence and Ethnic Sensitive Practice
Culture, Ethnicity, Substance Use, and Substance Use Disor...
Dementia Care
Dementia Care, Ethical Aspects of
Depression and Cancer
Development and Infancy (Birth to Age Three)
Differential Response in Child Welfare
Digital Storytelling for Social Work Interventions
Direct Practice in Social Work
Disabilities
Disability and Disability Culture
Domestic Violence Among Immigrants
Early Pregnancy and Parenthood Among Child Welfare–Involve...
Eating Disorders
Ecological Framework
Elder Mistreatment
End-of-Life Decisions
Epigenetics for Social Workers
Ethical Issues in Social Work and Technology
Ethics and Values in Social Work
European Institutions and Social Work
European Union, Justice and Home Affairs in the
Evidence-based Social Work Practice: Issues, Controversies...
Families with Gay, Lesbian, or Bisexual Parents
Family Caregiving
Family Group Conferencing
Family Policy
Family Services
Family Therapy
Family Violence
Fathering Among Families Served By Child Welfare
Fetal Alcohol Spectrum Disorders
Field Education
Financial Literacy and Social Work
Financing Health-Care Delivery in the United States
Forensic Social Work
Foster Care
Foster care and siblings
Gender, Violence, and Trauma in Immigration Detention in t...
Generalist Practice and Advanced Generalist Practice
Grounded Theory
Group Work across Populations, Challenges, and Settings
Group Work, Research, Best Practices, and Evidence-based
Harm Reduction
Health Care Reform
Health Disparities
Health Social Work
History of Social Work and Social Welfare, 1900–1950
History of Social Work and Social Welfare, 1950-1980
History of Social Work and Social Welfare, pre-1900
History of Social Work from 1980-2014
History of Social Work in China
History of Social Work in Northern Ireland
History of Social Work in the Republic of Ireland
History of Social Work in the United Kingdom
HIV/AIDS and Children
HIV/AIDS Prevention with Adolescents
Homelessness
Homelessness: Ending Homelessness as a Grand Challenge
Homelessness Outside the United States
Human Needs
Human Trafficking, Victims of
Immigrant Integration in the United States
Immigrant Policy in the United States
Immigrants and Refugees
Immigrants and Refugees: Evidence-based Social Work Practi...
Immigration and Health Disparities
Immigration and Intimate Partner Violence
Immigration and Poverty
Immigration and Spirituality
Immigration and Substance Use
Immigration and Trauma
Impaired Professionals
Indigenous Peoples
Individual Placement and Support (IPS) Supported Employmen...
In-home Child Welfare Services
Intergenerational Transmission of Maltreatment
International Human Trafficking
International Social Welfare
International Social Work
International Social Work and Education
International Social Work and Social Welfare in Southern A...
Internet and Video Game Addiction
Interpersonal Psychotherapy
Intervention with Traumatized Populations
Intimate-Partner Violence
Juvenile Justice
Kinship Care
Korean Americans
Latinos and Latinas
Law, Social Work and the
LGBTQ Populations and Social Work
Mainland European Social Work, History of
Major Depressive Disorder
Management and Administration in Social Work
Maternal Mental Health
Medical Illness
Men: Health and Mental Health Care
Mental Health
Mental Health Diagnosis and the Addictive Substance Disord...
Mental Health Needs of Older People, Assessing the
Mental Health Services from 1990 to 2023
Mental Illness: Children
Mental Illness: Elders
Microskills
Middle East and North Africa, International Social Work an...
Military Social Work
Mixed Methods Research
Moral distress and injury in social work
Motivational Interviewing
Multiculturalism
Native Americans
Native Hawaiians and Pacific Islanders
Neighborhood Social Cohesion
Neuroscience and Social Work
Nicotine Dependence
Organizational Development and Change
Pain Management
Palliative Care
Palliative Care: Evolution and Scope of Practice
Pandemics and Social Work
Parent Training
Personalization
Person-in-Environment
Philosophy of Science and Social Work
Physical Disabilities
Podcasts and Social Work
Police Social Work
Political Social Work in the United States
Positive Youth Development
Postsecondary Education Experiences and Attainment Among Y...
Post-Traumatic Stress Disorder (PTSD)
Practice Interventions and Aging
Practice Interventions with Adolescents
Practice Research
Primary Prevention in the 21st Century
Productive Engagement of Older Adults
Profession, Social Work
Program Development and Grant Writing
Promoting Smart Decarceration as a Grand Challenge
Psychiatric Rehabilitation
Psychoanalysis and Psychodynamic Theory
Psychoeducation
Psychometrics
Psychopathology and Social Work Practice
Psychopharmacology and Social Work Practice
Psychosocial Framework
Psychosocial Intervention with Women
Psychotherapy and Social Work
Race and Racism
Readmission Policies in Europe
Redefining Police Interactions with People Experiencing Me...
Refugee Children, Unaccompanied Immigrant and
Rehabilitation
Religiously Affiliated Agencies
Reproductive Health
Restorative Justice
Risk Assessment in Child Protection Services
Risk Management in Social Work
Rural Social Work in China
Rural Social Work Practice
School Social Work
School Violence
School-Based Delinquency Prevention
Services and Programs for Pregnant and Parenting Youth
Severe and Persistent Mental Illness: Adults
Sexual and Gender Minority Immigrants, Refugees, and Asylu...
Sexual Assault
Single-System Research Designs
Social and Economic Impact of US Immigration Policies on U...
Social Development
Social Insurance and Social Justice
Social Justice and Social Work
Social Movements
Social Planning
Social Policy
Social Policy in Denmark
Social Security in the United States (OASDHI)
Social Work and Islam
Social Work and Social Welfare in East, West, and Central ...
Social Work and Social Welfare in Europe
Social Work Education and Research
Social Work Leadership
Social Work Luminaries: Luminaries Contributing to the Cla...
Social Work Luminaries: Luminaries contributing to the fou...
Social Work Luminaries: Luminaries Who Contributed to Soci...
Social Work Practice, Rare and Orphan Diseases and
Social Work Regulation
Social Work Research Methods
Social Work with Interpreters
Solution-Focused Therapy
Strategic Planning
Strengths Perspective
Strengths-Based Models in Social Work
Supplemental Security Income
Survey Research
Sustainability: Creating Social Responses to a Changing En...
Syrian Refugees in Turkey
Task-Centered Practice
Technology Adoption in Social Work Education
Technology, Human Relationships, and Human Interaction
Technology in Social Work
Terminal Illness
The Impact of Systemic Racism on Latinxs’ Experiences with...
Transdisciplinary Science
Translational Science and Social Work
Transnational Perspectives in Social Work
Transtheoretical Model of Change
Trauma-Informed Care
Triangulation
Tribal child welfare practice in the United States
United States, History of Social Welfare in the
Universal Basic Income
Veteran Services
Vicarious Trauma and Resilience in Social Work Practice wi...
Vicarious Trauma Redefining PTSD
Victim Services
Virtual Reality and Social Work
Welfare State Reform in France
Welfare State Theory
Women and Macro Social Work Practice
Women's Health Care
Work and Family in the German Welfare State
Workforce Development of Social Workers Pre- and Post-Empl...
Working with Non-Voluntary and Mandated Clients
Young and Adolescent Lesbians
Youth at Risk
Youth Services
Privacy Policy
Cookie Policy
Legal Notice
Accessibility

[81.177.182.154]
81.177.182.154

Scientific Inquiry in Social Work

(9 reviews)

Matthew DeCarlo, Radford University

ISBN 13: 9781975033729

Publisher: Open Social Work Education

Language: English

Formats Available

Conditions of use.

Learn more about reviews.

Reviewed by Shannon Blajeski, Assistant Professor, Portland State University on 3/10/23

Comprehensiveness rating: 5 see less

This book provides an introduction to research and inquiry in social work with an applied focus geared for the MSW student. The text covers 16 chapters, including several dedicated to understanding how to begin the research process, a chapter on ethics, and then eight chapters dedicated to research methods. The subchapters (1-5 per chapter) are concise and focused while also being tied to current knowledge and events so as to hold the reader's attention. It is comprehensive, but some of the later chapters covering research methods as well as the final chapter seem a bit scant and could be expanded. The glossary at the end of each chapter is helpful as is the index that is always accessible from the left-hand drop-down menu.

Content Accuracy rating: 4

The author pulls in relevant current and recent public events to illustrate important points about social research throughout the book. Each sub-chapter reads as accurate. I did not come across any inaccuracies in the text, however I would recommend a change in the title of Chapter 15 as "real world research" certainly encompasses more than program evaluation, single-subject designs, and action research.

Relevance/Longevity rating: 5

Another major strength of this book is that it adds currency to engage the reader while also maintaining its relevance to research methods. None of the current events/recent events that are described seem dated nor will they fade from relevance in a number of years. In addition, the concise nature of the modules should make them easy to update when needed to maintain relevancy in future editions.

Clarity rating: 5

Clarity is a major strength of this textbook. As described in the interface section, this book is written to be clear and concise, without unnecessary extra text that detracts from the concise content provided in each chapter. Any lengthy excerpts are also very engaging which lends itself to a clear presentation of content for the reader.

Consistency rating: 5

The text and content seems to be presented consistently throughout the book. Terminology and frameworks are balanced with real-world examples and current events.

Modularity rating: 5

The chapters of this textbook are appropriately spaced and easily digestible, particularly for readers with time constraints. Each chapter contains 3-5 sub-chapters that build upon each other in a scaffolding style. This makes it simple for the instructor to assign each chapter (sometimes two) per weekly session as well as add in additional assigned readings to complement the text.

Organization/Structure/Flow rating: 5

The overall organization of the chapters flow well. The book begins with a typical introduction to research aimed at social work practitioners or new students in social work. It then moves into a set of chapters on beginning a research project, reviewing literature, and asking research questions, followed by a chapter on ethics. Next, the text transitions to three chapters covering constructs, measurement, and sampling, followed by five chapters covering research methods, and a closing chapter on dissemination of research. This is one of the more logically-organized research methods texts that I have used as an instructor.

Interface rating: 5

The modular chapters are easy to navigate and the interface of each chapter follows a standard presentation style with the reading followed by a short vocabulary glossary and references. This presentation lends itself to a familiarity for students that helps them become more efficient with completing reading assignments, even in short bursts of time. This is particularly important for online and returning learners who may juggle their assignment time with family and work obligations.

Grammatical Errors rating: 5

No grammatical errors were noted.

Cultural Relevance rating: 4

At first glance at the table of contents, the book doesn't seem to be overtly committed to cultural representation, however, upon reading the chapters, it becomes clear that the author does try to represent and reference marginalized groups (e.g., women, individuals with disabilities, racial/ethnic/gender intersectionality) within the examples used. I also am very appreciative that the bottom of each introduction page for each chapter contains content trigger warnings for any possible topics that could be upsetting, e.g., substance abuse, violence.

As the author likely knows, social work students are eager to engage in learning that is current and relevant to their social causes. This book is written in a way that engages a non-researcher social worker into reading about research by weaving research information into topics that they might find compelling. It also does this in a concise way where short bits of pertinent information are presented, making the text accessible without needing to sustain long periods of attention. This is particularly important for online and returning learners who may need to sit with their readings in short bursts due to attending school while juggling work and family obligations.

Reviewed by Lynn Goerdt, Associate Professor, University of Wisconsin - Superior on 9/17/21

Text appears to be comprehensive in covering steps for typical SWK research class, taking students from the introduction of the purpose and importance of research to how to design and analyze research. Author covers the multitude of ways that social workers engage in research as way of building knowledge and ways that social work practitioners conduct research to evaluate their practice, including outcome evaluation, single subject design, and action research. I particularly appreciated the last section on reporting research, which should be very practical.

Overall, content appears mostly accurate which few errors. Definitions and citations are mostly thorough and clear. Author does cite Wikipedia in at least one occasion which could be credible, depending on the source of the Wikipedia content. There were a few references within the text to comic or stories but the referenced material was not always apparent.

Relevance/Longevity rating: 4

The content of Scientific Inquiry for Social Work is relevant, as the field of social work research methods does not appear to change quickly, although there are innovations. The author referenced examples which appear to be recent and likely relatable to interests of current students. Primary area of innovation is in using technology for the collection and analysis of data, which could be expanded, particularly using social media for soliciting research participants.

Style is personable and content appears to be accessible, which is a unique attribute for a research textbook. Author uses first person in many instances, particularly in the beginning to present the content as relatable.

Format appears to be consistent in format and relative length. Each chapter includes learning objectives, content advisory (if applicable), key takeaways and glossary. Author uses color and text boxes to draw attention to these sections.

Modularity rating: 4

Text is divided into modules which could easily be assigned and reviewed in a class. The text modules could also be re-structured if desired to fit curricular uniqueness’s. Author uses images to illuminate the concepts of the module or chapter, but they often take about 1/3 of the page, which extends the size of the textbook quite a bit. Unclear if benefit of images outweighs additional cost if PDF version is printed.

Textbook is organized in a very logical and clear fashion. Each section appears to be approximately 6-10 pages in length which seems to be an optimal length for student attention and comprehension.

Interface rating: 4

There were some distortions of the text (size and visibility) but they were a fairly minor distraction and did not appear to reduce access to the content. Otherwise text was easy to navigate.

Grammatical Errors rating: 4

No grammatical errors were noted but hyperlinks to outside content were referenced but not always visible which occasionally resulted in an awkward read. Specific link may be in resources section of each chapter but occasionally they were also included in the text.

I did not recognize any text which was culturally insensitive or offensive. Images used which depicted people, appeared to represent diverse experiences, cultures, settings and persons. Did notice image depicting homelessness appeared to be stereotypical person sleeping on sidewalk, which can perpetuate a common perception of homelessness. Would encourage author to consider images representing a wider range of experiences of a social phenomena. Content advisories are used for each section, which is not necessarily cultural relevance but is respectful and recognizes the diversity of experiences and triggers that the readers may have.

Overall, I was very impressed and encouraged with the well organized content and thoughtful flow of this important textbook for social work students and instructors. The length and readability of each chapter would likely be appreciated by instructors as well as students, increasing the extent that the learning outcomes would be achieved. Teaching research is very challenging because the content and application can feel very intimidating. The author also has provided access to supplemental resources such as presentations and assignments.

Reviewed by elaine gatewood, Adjunct Faculty, Bridgewater State University on 6/15/21

The book provides concrete and clear information on using research as consumers, It provides a comprehensive review of each step to take to develop a research project from beginning to completion, with examples. read more

Content Accuracy rating: 5

From my perspective, content is highly accurate in the field of learning research method and unbiased. It's all there!

The content is highly relevant and up-to-date in the field. The book is written and arranged in a way that its easy to follow along with adding updates.

The book is written in clear and concise. The book provides appropriate context for any jargon/technical terminology used along with examples which readers are able to follow along and understand.

The contents of the book flow quite well. The framework in the book is consistent.

The text appears easily adaptable for readers and the author also provides accompanying PowerPoint presentations; these are a good foundation tools for readers to use and implement.

Organization/Structure/Flow rating: 4

The contents of the book flow very well. Readers would be able to put into practice the key reading strategies shared in the book ) because its organization is laid out nicely

Interface rating: 3

The interface is generally good, but I was only able to download the .pdf. This may present issues for some student readers.

There are no grammatical errors.

The text was culturally relevant and provided diverse research and practice examples. The text could have benefited from sexamples of intersectional and anti-oppressive lenses for students to consider in their practice.

This text is a comprehensive introduction to research that can be easily adapted for a BSW/MSW research course.

Reviewed by Taylor Hall, Assistant Professor, Bridgewater State University on 6/30/20

This text is more comprehensive than the text I currently use in my Research Methods in Social Work course, which students have to pay for. This text not only covers both qualitative and quantitative research methods, but also all parts of the research process from thinking about research ideas to questions all the way to evaluation after social work programs/policies have been employed.

Not much to say here- with research methods, things are black and white; it is or isn't. This content is accurate. I also like to way the content is explained in light of social work values and ethics. This is something our students can struggle with, and this is helpful in terms of showing why social work needs to pay attention to research.

There are upcoming changes to CSWE's competencies, therefore lots of text materials are going to need to be updated soon. Otherwise, case examples are pertinent and timely.

Clarity rating: 4

I think that research methods for social workers is a difficult field of study. Many go into the field to be clinicians, and few understand (off the bat) the importance of understanding methods of research. I think this textbook makes it clear to me, but hard to rate a 5 as I know from a student's perspective, lots of the terminology is so new.

Appears to be so- I was able to follow, seems consistent.

Yes- and I think this is a strong point of this text. This was easy to follow and read, and I could see myself easily divvying up different sections for students to work on in groups.

Yes- makes sense to me and the way I teach this course. I like the 30,000 ft view then honing in on specific types of research, all along the way explaining the different pieces of the research process and in writing a research paper.

I sometimes struggle with online platforms versus in person texts to read, and this OER is visually appealing- there is not too much text on the pages, it is spaced in a way that makes it easier to read. Colors are used well to highlight pertinent information.

Not something I found in this text.

Cultural Relevance rating: 3

This is a place where I feel the text could use some work. A nod to past wrongdoings in research methods on oppressed groups, and more of a discussion on social work's role in social justice with an eye towards righting the wrongs of the past. Updated language re: person first language, more diverse examples, etc.

This is a very useful text, and I am going to recommend my department check it out for future use, especially as many of our students are first gen and working class and would love to save money on textbooks where possible.

Reviewed by Olubunmi Oyewuwo-Gassikia, Assistant Professor, Northeastern Illinois University on 5/5/20

This text is an appropriate and comprehensive introduction to research methods for BSW students. It guides the reader through each stage of the research project, including identifying a research question, conducting and writing a literature review, research ethics, theory, research design, methodology, sampling, and dissemination. The author explains complex concepts - such as paradigms, epistemology, and ontology - in clear, simple terms and through the use of practical, social work examples for the reader. I especially appreciated the balanced attention to quantitative and qualitative methods, including the explanation of data collection and basic analysis techniques for both. The text could benefit from the inclusion of an explanation of research design notations.

The text is accurate and unbiased. Additionally, the author effectively incorporates referenced sources, including sources one can use for further learning.

The content is relevant and timely. The author incorporates real, recent research examples that reflects the applicability of research at each level of social practice (micro, meso, and macro) throughout the text. The text will benefit from regular updates in research examples.

The text is written in a clear, approachable manner. The chapters are a reasonable length without sacrificing appropriate depth into the subject manner.

The text is consistent throughout. The author is effective in reintroducing previously explained terms from previous chapters.

The text appears easily adaptable. The instructions provided by the author on how to adapt the text for one's course are helpful to one who would like to use the text but not in its entirety. The author also provides accompanying PowerPoint presentations; these are a good foundation but will likely require tailoring based on the teaching style of the instructor.

Generally, the text flows well. However, chapter 5 (Ethics) should come earlier, preferably before chapter 3 (Reviewing & Evaluating the Literature). It is important that students understand research ethics as ethical concerns are an important aspect of evaluating the quality of research studies. Chapter 15 (Real-World Research) should also come earlier in the text, most suitably before or after chapter 7 (Design and Causality).

The interface is generally good, but I was only able to download the .pdf. The setup of the .pdf is difficult to navigate, especially if one wants to jump from chapter to chapter. This may present issues for the student reader.

The text was culturally relevant and provided diverse research and practice examples. The text could have benefited from more critical research examples, such as examples of research studies that incorporate intersectional and anti-oppressive lenses.

This text is a comprehensive introduction to research that can be easily adapted for a BSW level research course.

Reviewed by Smita Dewan, Assistant Professor, New York City College of Technology, Department of Human Services on 12/6/19

Comprehensiveness rating: 4 see less

This is a very good introductory research methodology textbook for undergraduate students of social work or human services. For students who might be intimidated by social research, the text provides assurance that by learning basic concepts of research methodology, students will be better scholars and social work or human service practitioners. The content and flow of the text book supports a basic assignment of most research methodology courses which is to develop a research proposal or a research project. Each stage of research is explained well with many examples from social work practice that has the potential to keep the student engaged.

The glossary at the end of each chapter is very comprehensive but does not include the page number/s where the content is located. The glossary at the end of the book also lacks page numbers which might make it cumbersome for students seeking a quick reference.

The content is accurate and unbiased. Suggested exercises and prompts for students to engage in critical thinking and to identify biases in research that informs practice may help students understand the complexities of social research.

Content is up-to-date and concepts of research methodology presented is unlikely to be obsolete in the coming years. However, recent trends in research such as data mining, using algorithms for social policy and practice implications, privacy concerns, role of social media are topics that could be considered for inclusion in the forthcoming editions.

Content is presented very clearly for undergraduate students. Key takeaways and glossary for each section of the chapter is very useful for students.

Presentation of content, format and organization is consistent throughout the book.

Subsections within each chapter is very helpful for the students who might be assigned readings just in parts for the class.

Students would benefit from reading about research ethics right after the introductory chapter. I would also move Chapter 8 to right after the literature review which might inform creating and refining the research question. Content on evaluation research could also be moved up to follow the chapter on experimental designs. Regardless of the organization, the course instructors can assign chapters according to the course requirements.

PDF version of the book is very easy to use especially as students can save a copy on their computers and do not have to be online. Charts and tables are well presented but some of the images/photographs do not necessarily serve to enhance learning. Image attributions could be provided at the end of the chapter instead of being listed under the glossary. Students might also find it useful to be able to highlight the content and make annotations. This requires that students sign-in. Students should be able to highlight and annotate a downloaded version through Adobe Reader.

I did not find any grammatical errors.

Cultural Relevance rating: 5

Content is not insensitive or offensive in any way. Supporting examples in chapters are very diverse. Students would benefit from some examples of international research (both positive and negative examples) of protection of human subjects.

Reviewed by Jill Hoffman, Assistant Professor, Portland State University on 10/29/19

This text includes 16 chapters that cover content related to the process of conducting research. From identifying a topic and reviewing the literature, to formulating a question, designing a study, and disseminating findings, the text includes research basics that most other introductory social work research texts include. Content on ethics, theory, and to a lesser extent evaluation, single-subject design, and action research are also included. There is a glossary at the end of the text that includes information on the location of the terms. There is a practice behaviors index, but not an index in the traditional sense. If using the text electronically, search functions make it easy to find necessary information despite not having an index. If using a printed version, this would be more difficult. The text includes examples to illustrate concepts that are relevant to settings in which social workers might work. As most other introductory social work research texts, this book appears to come from a mainly positivist view. I would have appreciated more of a discussion related to power, privilege, and oppression and the role these play in the research topics that get studied and who benefits, along with anti-oppressive research. Related to evaluations, a quick mention of logic models would be helpful.

The information appears to be accurate and error free. The language in the text seems to emphasize "right/wrong" choices/decisions instead of highlighting the complexities of research and practice. Using gender-neutral pronouns would also make the language more inclusive.

Content appears to be up-to-date and relevant. Any updating would be straightforward to carry out. I found at least one link that did not work (e.g., NREPP) so if you use this text it will be important to check and make sure things are updated.

The content is clearly written, using examples to illustrate various concepts. I appreciated prompts for questions throughout each chapter in order to engage students in the content. Key terms are bolded, which helps to easily identify important points.

Information is presented in a consistent manner throughout the text.

Each chapter is divided into subsections that help with readability. It is easy to pick and choose various pieces of the text for your course if you're not using the entire thing.

There are many ways you can organize a social work research text. Personally, I prefer to talk about ethics and theory early on, so that students have this as a framework as they read about other's studies and design their own. In the case of this text, I'd put those two chapters right after chapter 1. As others have suggested, I'd also move up the content on research questions, perhaps after chapter 4.

In the online version, no significant interface issues arose. The only thing that would be helpful is to have chapter titles clearly presented when navigating through the text in the online version. For example, when you click through to a new chapter, the title simply says "6.0 Chapter introduction." In order to see the chapter title you have to click into the contents tab. Not a huge issue but could help with navigating the online version. In the pdf version, the links in the table of contents allowed me to navigate through to various sections. I did notice that some of the external links were not complete (e.g., on page 290, the URL is linked as "http://baby-").

Cultural representation in the text is similar to many other introductory social work research texts. There's more of an emphasis on white, western, cis-gendered individuals, particularly in the images. In examples, it appeared that only male/female pronouns were used.

Reviewed by Monica Roth Day, Associate Professor, Social Work, Metropolitan State University (Saint Paul, Minnesota) on 12/26/18

The book provides concrete and clear information on using research as consumers, then developing research as producers of knowledge. It provides a comprehensive review of each step to take to develop a research project from beginning to completion, with appropriate examples. More specific social work links would be helpful as students learn more about the field and the uses of research.

The book is accurate and communicates information and largely without bias. Numerous examples are provided from varied sources, which are then used to discuss potential for bias in research. The addition of critical race theory concepts would add to this discussion, to ground students in the importance of understanding implicit bias as researchers and ways to develop their own awareness.

The book is highly relevant. It provides historical and current examples of research which communicate concepts using accessible language that is current to social work. The text is written so that updates should be easy. Links need to be updated on a regular basis.

The book is accessible for students at it uses common language to communicate concepts while helping students build their research vocabulary. Terminology is communicate both within the text and in glossaries, and technical terms are minimally used.

The book is consistent in its use of terminology and framework. It follows a pattern of development, from consuming research to producing research. The steps are predictable and walk students through appropriate actions to take.

The book is easily readable. Each chapter is divided in sections that are easy to navigate and understand. Pictures and tables are used to support text.

Chapters are in logical order and follow a common pattern.

When reading the book online, the text was largely free of interface issues. As a PDF, there were issues with formatting. Be aware that students who may wish to download the book into a Kindle or other book reader may experience issues.

The text was grammatically correct with no misspellings.

While the book is culturally relevant, it lacks the application of critical race theory. While students will learn about bias in research, critical race theory would ground students in the importance of understanding implicit bias as researchers and ways to develop their own awareness. It would also help students understand why the background of researchers is important in relation to the ways of knowing.

Reviewed by Jennifer Wareham, Associate Professor, Wayne State University on 11/30/18

The book provides a comprehensive introduction to research methods from the perspective of the discipline of Social Work. The book borrows heavily from Amy Blackstone’s Principles of Sociological Inquiry – Qualitative and Quantitative Methods open textbook. The book is divided into 16 chapters, covering: differences in reasoning and scientific thought, starting a research project, writing a literature review, ethics in social science research, how theory relates to research, research design, causality, measurement, sampling, survey research, experimental design, qualitative interviews and focus groups, evaluation research, and reporting research. Some of the more advanced concepts and topics are only covered at superficial level, which limits the intended population of readers to high school students, undergraduate students, or those with no background in research methods. Since the book is geared toward Social Work undergraduate students, the chapters and content address methodologies commonly used in this field, but ignore methodologies that may be more popular in other social science fields. For example, the material on qualitative methods is narrow and focuses on commonly used qualitative methods in Social Work. In addition, the chapter on evaluation is limited to a general overview of evaluation research, which could be improved with more in-depth discussion of different types of evaluation (e.g., needs assessment, evaluability assessment, process evaluation, impact/outcomes evaluation) and real-world examples of different types of evaluation implemented in Social Work. Overall, the author provides examples that are easy for practitioners in Social Work to understand, which are also easily relatable for students in similar disciplines such as criminal justice. The book provides a glossary of key terms. There is no index; however, users can search for terms using the find (Ctrl-F) function in the PDF version of the book.

Overall, the content inside this book is accurate, error-free, and unbiased. However, the content is limited to the Social Work perspective, which may be considered somewhat biased or inaccurate from the perspective of others in different disciplines.

The book describes classic examples used in most texts on social science research methods. It also includes contemporary and relevant examples. Some of the content (such as web addresses and contemporary news pieces) will need to be updated every few years. The text is written and arranged in such a way that any necessary updates should be relatively easy and straightforward to implement.

The book is written in clear and accessible prose. The book provides appropriate context for any jargon/technical terminology used. Readers from any social science discipline should be able to understand the content and context of the material presented in the book.

The framework and use of terminology in the book are consistent.

This book is highly modular. The author has even improved upon the modularity of the book from Blackstone’s open text (which serves as the basis of the present text). Each chapter is divided into short, related subsections. The design of the chapters and their subsections make it easy to divide the material into units of study across a semester or quarter of instruction.

Generally, the book is organized in a similar manner as other texts on social science research methods. However, the organization could be improved slightly. Chapters 2 through 4 describe the process of beginning a research project and conducting a literature review. Chapter 8 describes refining a research question. This chapter could be moved to follow the Chapter 4. Chapter 12 describes experimental design, while Chapter 15 provides a description and examples of evaluation research. Since evaluation research tends to rely on experimental and quasi-experimental design, this chapter should follow the experimental design chapter.

For the online version of the book, there were no interface issues. The images and charts were clear and readable. The hyperlinks to sources mentioned in the text worked. The Contents menu allowed for easy and quick access to any section of the book. For the PDF version of the book, there were interface issues. The images and charts were clear and readable. However, the URLs and hyperlinks were not active in the PDF version. Furthermore, the PDF version was not bookmarked, which made it more difficult to access specific sections of the book.

I did not find grammatical errors in the book.

Overall, the cultural relevance and sensitivity were consistent with other social science research methods texts. The author does a good job of using both female and male pronouns in the prose. While there are pictures of people of color, there could be more. Most of the pictures are of white people. Also, the context is generally U.S.-centric.

Chapter 1: Introduction to research
Chapter 2: Beginning a research project
Chapter 3: Reading and evaluating literature
Chapter 4: Conducting a literature review
Chapter 5: Ethics in social work research
Chapter 6: Linking methods with theory
Chapter 7: Design and causality
Chapter 8: Creating and refining a research question
Chapter 9: Defining and measuring concepts
Chapter 10: Sampling
Chapter 11: Survey research
Chapter 12: Experimental design
Chapter 13: Interviews and focus groups
Chapter 14: Unobtrusive research: Qualitative and quantitative approaches
Chapter 15: Real-world research: Evaluation, single-subjects, and action research
Chapter 16: Reporting and reading research

Ancillary Material

Open Social Work Education

About the Book

As an introductory textbook for social work students studying research methods, this book guides students through the process of creating a research project. Students will learn how to discover a researchable topic that is interesting to them, examine scholarly literature, formulate a proper research question, design a quantitative or qualitative study to answer their question, carry out the design, interpret quantitative or qualitative results, and disseminate their findings to a variety of audiences. Examples are drawn from the author's practice and research experience, as well as topical articles from the literature.

There are ancillary materials available for this book.

About the Contributors

Matt DeCarlo earned his PhD in social work at Virginia Commonwealth University and is an Assistant Professor of Social Work at Radford University. He earned an MSW from George Mason University in 2010 and a BA in Psychology from the College of William and Mary in 2007. His research interests include open educational resources, self-directed Medicaid supports, and basic income. Matt is an Open Textbook Network Campus Leader for Radford University. He is the founder of Open Social Work Education, a non-profit collaborative advancing OER in social work education.

Get Citation

Research for Social Workers has built a strong reputation as an accessible guide to the key research methods and approaches used in the discipline. Ideal for beginners, the book outlines the importance of social work research, its guiding principles and explains how to choose a topic area, develop research questions together with describing the key steps in the research process. The authors outline the principles of sampling, systematic reviews and surveys and interviews, provide guidance on evaluation and statistical analysis and explain how research can influence policy and practice. This new edition includes: â€¢ an expanded discussion of rigour in qualitative research â€¢ more detailed analysis of systematic reviews â€¢ a new section on on-line surveys â€¢ enhanced examination of action research including recent examples of action research programs and â€¢ an expanded section on evidence-based practice. Featuring practical examples and end-of-chapter exercises and questions, and using non-technical language throughout, this is a vital reference tool for both students and practicing social workers.

Part i | 109 pages, beginning social work research, chapter 1 | 36 pages, social work research, chapter 2 | 14 pages, choosing your topic area, chapter 3 | 28 pages, developing research questions, chapter 4 | 29 pages, steps in the research process, part ii | 114 pages, research methods for social work, chapter 5 | 20 pages, chapter 6 | 27 pages, systematic reviews, chapter 7 | 38 pages, surveys and interviews, chapter 8 | 27 pages, assessing community needs and strengths, part iii | 116 pages, chapter 9 | 25 pages, how do i evaluate my program, chapter 10 | 20 pages, action research, chapter 11 | 25 pages, evidence-based practice and best practice evaluation, chapter 12 | 23 pages, research in post-disaster recovery and other crisis situations: community-based rapid appraisals, chapter 13 | 21 pages, other methods, part iv | 110 pages, statistical analysis, chapter 14 | 24 pages, producing results: qualitative research, chapter 15 | 18 pages, producing results: quantitative research, chapter 16 | 33 pages, statistics for social workers: analysis of a single variable, chapter 17 | 33 pages, statistics for social workers: two or more variables, part v | 44 pages, bringing it all together, chapter 18 | 24 pages, influencing policy and practice, chapter 19 | 18 pages, developing a research proposal.

Privacy Policy
Terms & Conditions
Cookie Policy
Taylor & Francis Online
Taylor & Francis Group
Students/Researchers
Librarians/Institutions

Connect with us

Request new password
Create a new account

Research Methods for Social Work: A Problem-Based Approach

Student resources, welcome to the sage edge site for research methods for social work , 1e.

Research Methods for Social Work: A Problem-Based Approach is a comprehensive introduction to methods instruction that engages students innovatively and interactively. Using a case study and problem-based learning (PBL) approach, authors Antoinette Y. Farmer and G. Lawrence Farmer utilize case examples to achieve a level of application that builds readers’ confidence in methodology and reinforces their understanding of research across all levels of social work practice. These real-case examples, along with critical thinking questions, research tips, and step-by-step problem-solving methods, will improve student mastery and help them see why research is relevant. With the guidance of this new and noteworthy textbook, readers will transform into both knowledgeable consumers of research and skilled practitioners who can effectively address the needs of their clients through research.

This site features an array of free resources you can access anytime, anywhere.

Acknowledgments

We gratefully acknowledge Antoinette Y. Farmer and G. Lawrence Farmer for writing an excellent text. Special thanks are also due to Kryss Shane for developing the resources on this site.

For instructors

Access resources that are only available to Faculty and Administrative Staff.

Want to explore the book further?

Order Review Copy

USC Libraries
Research Guides
SOWK 546: The Science of Social Work
Research Designs and Methods

SOWK 546: The Science of Social Work: Research Designs and Methods

Finding Scholarly Articles
Developing a Search Strategy
Evaluating Research Effectively
Appraising Research
Evidence-based Practice Resources
Understanding Journal Impact Factors
Policy and Legislation Resources
Demographics, Data & Statistics for Social Work
Tests & Measurements
Background on Theories/Reference Sources
APA 7th Edition

Research Methods Map

Explore the methods map below from SAGE Research Methods online to learn more about various research methods and find definitions of research terms. Click on the image of the map to interact with the map online.

Research Design and Design Notation Guide

Resources for Research Methods

Use the resources below to get more background and information on various research designs and methods.

Independent and Dependent Variables

The following information and examples are from the Encyclopedia of Research Design cited and linked above:

Independent Variables and Dependent Variables

In research design, independent variables are those that a researcher can manipulate, whereas dependent variables are the responses to the effects of independent variables (Salkind, 2010).

Independent variables are predetermined by researchers before an experiment is started. They are carefully controlled in controlled experiments or selected in observational studies (i.e., they are manipulated by the researcher according to the purpose of a study).

The dependent variable is the effect to be observed and is the primary interest of the study (Salkind, 2010).

Consider a study on the relationship between physical inactivity and obesity in young children: The parameter(s) that measures physical inactivity, such as the hours spent on watching television and playing video games, and the means of transportation to and from daycares/schools is the independent variable. These are chosen by the researcher based on his or her preliminary research or on other reports in literature on the same subject prior to the study. The parameter(s) that measure obesity, such as the body mass index, is (are) the dependent variable (Salkind, 2010)

*Salkind, N. J. (2010). Encyclopedia of research design. Thousand Oaks, CA: SAGE Publications, Inc. doi: 10.4135/9781412961288

Internal and External Validity

Types of validity , internal validity .

refers to the accuracy of statements made about the causal relationship between two variables, namely, the manipulated (treatment or independent) variable and the measured variable (dependent)
internal validity claims are based on the procedures and operations used to conduct a research study, including the choice of design and measurement of variables.

*From Salkind, N. J. (2010). Encyclopedia of research design. Thousand Oaks, CA: SAGE Publications, Inc. doi: 10.4135/9781412961288

External Validity

refers to the degree to which the relations among variables observed in one sample of observations in one population will hold for other samples of observations within the same population or in other populations. i.e. how general are your results?

*From Frey, B. (2018). The SAGE encyclopedia of educational research, measurement, and evaluation (Vols. 1-4). Thousand Oaks,, CA: SAGE Publications, Inc. doi: 10.4135/9781506326139

Quick guide available from USC School of Social Work: Threats to Internal Validity quick guide

<< Previous: Developing a Search Strategy
Next: Evaluating Research Effectively >>
Last Updated: Jul 29, 2024 2:13 PM
URL: https://libguides.usc.edu/sowk546

Social Work 3500: Methods of Social Work Research

What is empirical research, sections in empirical research.

What to Look For
Is this an Empirical article?
Finding Empirical Articles
AYSPS Online Students This link opens in a new window
Library Services
Citing Your Sources This link opens in a new window
Contact Me / Online Drop-in Hours

Empirical Research is defined as research based on observed and measured phenomena. It is research that derives knowledge from actual experience rather than from theory or belief.

Requests for "Empirical" articles are usually from instructors in Education or Psychology; most other disciplines will ask for Scholarly, Peer Reviewed or Primary literature.

See if the article mentions a study, an observation, an analysis or a number of participants or subjects. Was data collected, a survey or questionnaire administered, an assessment or measurement used, an interview conducted? All of these terms indicate possible methodologies used in empirical research.

Empirical articles often contain these sections:

Introduction
Literature review
Methodology

The sections may be combined, and may have different headings or no headings at all; however, the information that would fall within these sections should be present in an empirical article.

Abstract: A report of an empirical study includes an abstract that provides a very brief summary of the research.

Introduction: The introduction sets the research in a context, which provides a review of related research and develops the hypotheses for the research.

Method: The method section is a description of how the research was conducted, including who the participants were, the design of the study, what the participants did, and what measures were used.

Results: The results section describes the outcomes of the measures of the study.

Discussion : The discussion section contains the interpretations and implications of the study.

General Discussion: There may be more than one study in the report; in this case, there are usually separate Methods and Results sections for each study followed by a general discussion that ties all the research together.

References: A references section contains information about the articles and books cited in the report.

Length of Article: Empirical research articles are usually substantial (more than 1 or 2 pages) and include a bibliography or cited references section (usually at the end of the article).

Type of Publication: Empirical research articles are published in scholarly or academic journals . These publications are sometimes referred to as “peer-reviewed,” “academic” or “refereed” publications. Examples of such publications include: Social Work Research, Mental Health Practice, and Journal of Substance Abuse.

<< Previous: Welcome
Next: What to Look For >>
Last Updated: May 9, 2024 10:26 AM
URL: https://research.library.gsu.edu/SW3500

Want to create or adapt books like this? Learn more about how Pressbooks supports open publishing practices.

Part 3: Using quantitative methods

13. Experimental design

Chapter outline.

What is an experiment and when should you use one? (8 minute read)
True experimental designs (7 minute read)
Quasi-experimental designs (8 minute read)
Non-experimental designs (5 minute read)
Critical, ethical, and critical considerations (5 minute read)

Content warning : examples in this chapter contain references to non-consensual research in Western history, including experiments conducted during the Holocaust and on African Americans (section 13.6).

13.1 What is an experiment and when should you use one?

Learning objectives.

Learners will be able to…

Identify the characteristics of a basic experiment
Describe causality in experimental design
Discuss the relationship between dependent and independent variables in experiments
Explain the links between experiments and generalizability of results
Describe advantages and disadvantages of experimental designs

The basics of experiments

The first experiment I can remember using was for my fourth grade science fair. I wondered if latex- or oil-based paint would hold up to sunlight better. So, I went to the hardware store and got a few small cans of paint and two sets of wooden paint sticks. I painted one with oil-based paint and the other with latex-based paint of different colors and put them in a sunny spot in the back yard. My hypothesis was that the oil-based paint would fade the most and that more fading would happen the longer I left the paint sticks out. (I know, it’s obvious, but I was only 10.)

I checked in on the paint sticks every few days for a month and wrote down my observations. The first part of my hypothesis ended up being wrong—it was actually the latex-based paint that faded the most. But the second part was right, and the paint faded more and more over time. This is a simple example, of course—experiments get a heck of a lot more complex than this when we’re talking about real research.

Merriam-Webster defines an experiment as “an operation or procedure carried out under controlled conditions in order to discover an unknown effect or law, to test or establish a hypothesis, or to illustrate a known law.” Each of these three components of the definition will come in handy as we go through the different types of experimental design in this chapter. Most of us probably think of the physical sciences when we think of experiments, and for good reason—these experiments can be pretty flashy! But social science and psychological research follow the same scientific methods, as we’ve discussed in this book.

As the video discusses, experiments can be used in social sciences just like they can in physical sciences. It makes sense to use an experiment when you want to determine the cause of a phenomenon with as much accuracy as possible. Some types of experimental designs do this more precisely than others, as we’ll see throughout the chapter. If you’ll remember back to Chapter 11 and the discussion of validity, experiments are the best way to ensure internal validity, or the extent to which a change in your independent variable causes a change in your dependent variable.

Experimental designs for research projects are most appropriate when trying to uncover or test a hypothesis about the cause of a phenomenon, so they are best for explanatory research questions. As we’ll learn throughout this chapter, different circumstances are appropriate for different types of experimental designs. Each type of experimental design has advantages and disadvantages, and some are better at controlling the effect of extraneous variables —those variables and characteristics that have an effect on your dependent variable, but aren’t the primary variable whose influence you’re interested in testing. For example, in a study that tries to determine whether aspirin lowers a person’s risk of a fatal heart attack, a person’s race would likely be an extraneous variable because you primarily want to know the effect of aspirin.

In practice, many types of experimental designs can be logistically challenging and resource-intensive. As practitioners, the likelihood that we will be involved in some of the types of experimental designs discussed in this chapter is fairly low. However, it’s important to learn about these methods, even if we might not ever use them, so that we can be thoughtful consumers of research that uses experimental designs.

While we might not use all of these types of experimental designs, many of us will engage in evidence-based practice during our time as social workers. A lot of research developing evidence-based practice, which has a strong emphasis on generalizability, will use experimental designs. You’ve undoubtedly seen one or two in your literature search so far.

The logic of experimental design

How do we know that one phenomenon causes another? The complexity of the social world in which we practice and conduct research means that causes of social problems are rarely cut and dry. Uncovering explanations for social problems is key to helping clients address them, and experimental research designs are one road to finding answers.

As you read about in Chapter 8 (and as we’ll discuss again in Chapter 15 ), just because two phenomena are related in some way doesn’t mean that one causes the other. Ice cream sales increase in the summer, and so does the rate of violent crime; does that mean that eating ice cream is going to make me murder someone? Obviously not, because ice cream is great. The reality of that relationship is far more complex—it could be that hot weather makes people more irritable and, at times, violent, while also making people want ice cream. More likely, though, there are other social factors not accounted for in the way we just described this relationship.

Experimental designs can help clear up at least some of this fog by allowing researchers to isolate the effect of interventions on dependent variables by controlling extraneous variables . In true experimental design (discussed in the next section) and some quasi-experimental designs, researchers accomplish this w ith the control group and the experimental group . (The experimental group is sometimes called the “treatment group,” but we will call it the experimental group in this chapter.) The control group does not receive the intervention you are testing (they may receive no intervention or what is known as “treatment as usual”), while the experimental group does. (You will hopefully remember our earlier discussion of control variables in Chapter 8 —conceptually, the use of the word “control” here is the same.)

In a well-designed experiment, your control group should look almost identical to your experimental group in terms of demographics and other relevant factors. What if we want to know the effect of CBT on social anxiety, but we have learned in prior research that men tend to have a more difficult time overcoming social anxiety? We would want our control and experimental groups to have a similar gender mix because it would limit the effect of gender on our results, since ostensibly, both groups’ results would be affected by gender in the same way. If your control group has 5 women, 6 men, and 4 non-binary people, then your experimental group should be made up of roughly the same gender balance to help control for the influence of gender on the outcome of your intervention. (In reality, the groups should be similar along other dimensions, as well, and your group will likely be much larger.) The researcher will use the same outcome measures for both groups and compare them, and assuming the experiment was designed correctly, get a pretty good answer about whether the intervention had an effect on social anxiety.

You will also hear people talk about comparison groups , which are similar to control groups. The primary difference between the two is that a control group is populated using random assignment, but a comparison group is not. Random assignment entails using a random process to decide which participants are put into the control or experimental group (which participants receive an intervention and which do not). By randomly assigning participants to a group, you can reduce the effect of extraneous variables on your research because there won’t be a systematic difference between the groups.

Do not confuse random assignment with random sampling. Random sampling is a method for selecting a sample from a population, and is rarely used in psychological research. Random assignment is a method for assigning participants in a sample to the different conditions, and it is an important element of all experimental research in psychology and other related fields. Random sampling also helps a great deal with generalizability , whereas random assignment increases internal validity .

We have already learned about internal validity in Chapter 11 . The use of an experimental design will bolster internal validity since it works to isolate causal relationships. As we will see in the coming sections, some types of experimental design do this more effectively than others. It’s also worth considering that true experiments, which most effectively show causality , are often difficult and expensive to implement. Although other experimental designs aren’t perfect, they still produce useful, valid evidence and may be more feasible to carry out.

Key Takeaways

Experimental designs are useful for establishing causality, but some types of experimental design do this better than others.
Experiments help researchers isolate the effect of the independent variable on the dependent variable by controlling for the effect of extraneous variables .
Experiments use a control/comparison group and an experimental group to test the effects of interventions. These groups should be as similar to each other as possible in terms of demographics and other relevant factors.
True experiments have control groups with randomly assigned participants, while other types of experiments have comparison groups to which participants are not randomly assigned.
Think about the research project you’ve been designing so far. How might you use a basic experiment to answer your question? If your question isn’t explanatory, try to formulate a new explanatory question and consider the usefulness of an experiment.
Why is establishing a simple relationship between two variables not indicative of one causing the other?

13.2 True experimental design

Describe a true experimental design in social work research
Understand the different types of true experimental designs
Determine what kinds of research questions true experimental designs are suited for
Discuss advantages and disadvantages of true experimental designs

True experimental design , often considered to be the “gold standard” in research designs, is thought of as one of the most rigorous of all research designs. In this design, one or more independent variables are manipulated by the researcher (as treatments), subjects are randomly assigned to different treatment levels (random assignment), and the results of the treatments on outcomes (dependent variables) are observed. The unique strength of experimental research is its internal validity and its ability to establish ( causality ) through treatment manipulation, while controlling for the effects of extraneous variable. Sometimes the treatment level is no treatment, while other times it is simply a different treatment than that which we are trying to evaluate. For example, we might have a control group that is made up of people who will not receive any treatment for a particular condition. Or, a control group could consist of people who consent to treatment with DBT when we are testing the effectiveness of CBT.

As we discussed in the previous section, a true experiment has a control group with participants randomly assigned , and an experimental group . This is the most basic element of a true experiment. The next decision a researcher must make is when they need to gather data during their experiment. Do they take a baseline measurement and then a measurement after treatment, or just a measurement after treatment, or do they handle measurement another way? Below, we’ll discuss the three main types of true experimental designs. There are sub-types of each of these designs, but here, we just want to get you started with some of the basics.

Using a true experiment in social work research is often pretty difficult, since as I mentioned earlier, true experiments can be quite resource intensive. True experiments work best with relatively large sample sizes, and random assignment, a key criterion for a true experimental design, is hard (and unethical) to execute in practice when you have people in dire need of an intervention. Nonetheless, some of the strongest evidence bases are built on true experiments.

For the purposes of this section, let’s bring back the example of CBT for the treatment of social anxiety. We have a group of 500 individuals who have agreed to participate in our study, and we have randomly assigned them to the control and experimental groups. The folks in the experimental group will receive CBT, while the folks in the control group will receive more unstructured, basic talk therapy. These designs, as we talked about above, are best suited for explanatory research questions.

Before we get started, take a look at the table below. When explaining experimental research designs, we often use diagrams with abbreviations to visually represent the experiment. Table 13.1 starts us off by laying out what each of the abbreviations mean.

Table 13.1 Experimental research design notations
R	Randomly assigned group (control/comparison or experimental)
O	Observation/measurement taken of dependent variable
X	Intervention or treatment
X	Experimental or new intervention
X	Typical intervention/treatment as usual
A, B, C, etc.	Denotes different groups (control/comparison and experimental)

Pretest and post-test control group design

In pretest and post-test control group design , participants are given a pretest of some kind to measure their baseline state before their participation in an intervention. In our social anxiety experiment, we would have participants in both the experimental and control groups complete some measure of social anxiety—most likely an established scale and/or a structured interview—before they start their treatment. As part of the experiment, we would have a defined time period during which the treatment would take place (let’s say 12 weeks, just for illustration). At the end of 12 weeks, we would give both groups the same measure as a post-test .

In the diagram, RA (random assignment group A) is the experimental group and RB is the control group. O 1 denotes the pre-test, X e denotes the experimental intervention, and O 2 denotes the post-test. Let’s look at this diagram another way, using the example of CBT for social anxiety that we’ve been talking about.

In a situation where the control group received treatment as usual instead of no intervention, the diagram would look this way, with X i denoting treatment as usual (Figure 13.3).

Hopefully, these diagrams provide you a visualization of how this type of experiment establishes time order , a key component of a causal relationship. Did the change occur after the intervention? Assuming there is a change in the scores between the pretest and post-test, we would be able to say that yes, the change did occur after the intervention. Causality can’t exist if the change happened before the intervention—this would mean that something else led to the change, not our intervention.

Post-test only control group design

Post-test only control group design involves only giving participants a post-test, just like it sounds (Figure 13.4).

But why would you use this design instead of using a pretest/post-test design? One reason could be the testing effect that can happen when research participants take a pretest. In research, the testing effect refers to “measurement error related to how a test is given; the conditions of the testing, including environmental conditions; and acclimation to the test itself” (Engel & Schutt, 2017, p. 444) [1] (When we say “measurement error,” all we mean is the accuracy of the way we measure the dependent variable.) Figure 13.4 is a visualization of this type of experiment. The testing effect isn’t always bad in practice—our initial assessments might help clients identify or put into words feelings or experiences they are having when they haven’t been able to do that before. In research, however, we might want to control its effects to isolate a cleaner causal relationship between intervention and outcome.

Going back to our CBT for social anxiety example, we might be concerned that participants would learn about social anxiety symptoms by virtue of taking a pretest. They might then identify that they have those symptoms on the post-test, even though they are not new symptoms for them. That could make our intervention look less effective than it actually is.

However, without a baseline measurement establishing causality can be more difficult. If we don’t know someone’s state of mind before our intervention, how do we know our intervention did anything at all? Establishing time order is thus a little more difficult. You must balance this consideration with the benefits of this type of design.

Solomon four group design

One way we can possibly measure how much the testing effect might change the results of the experiment is with the Solomon four group design. Basically, as part of this experiment, you have two control groups and two experimental groups. The first pair of groups receives both a pretest and a post-test. The other pair of groups receives only a post-test (Figure 13.5). This design helps address the problem of establishing time order in post-test only control group designs.

For our CBT project, we would randomly assign people to four different groups instead of just two. Groups A and B would take our pretest measures and our post-test measures, and groups C and D would take only our post-test measures. We could then compare the results among these groups and see if they’re significantly different between the folks in A and B, and C and D. If they are, we may have identified some kind of testing effect, which enables us to put our results into full context. We don’t want to draw a strong causal conclusion about our intervention when we have major concerns about testing effects without trying to determine the extent of those effects.

Solomon four group designs are less common in social work research, primarily because of the logistics and resource needs involved. Nonetheless, this is an important experimental design to consider when we want to address major concerns about testing effects.

True experimental design is best suited for explanatory research questions.
True experiments require random assignment of participants to control and experimental groups.
Pretest/post-test research design involves two points of measurement—one pre-intervention and one post-intervention.
Post-test only research design involves only one point of measurement—post-intervention. It is a useful design to minimize the effect of testing effects on our results.
Solomon four group research design involves both of the above types of designs, using 2 pairs of control and experimental groups. One group receives both a pretest and a post-test, while the other receives only a post-test. This can help uncover the influence of testing effects.
Think about a true experiment you might conduct for your research project. Which design would be best for your research, and why?
What challenges or limitations might make it unrealistic (or at least very complicated!) for you to carry your true experimental design in the real-world as a student researcher?
What hypothesis(es) would you test using this true experiment?

13.4 Quasi-experimental designs

Describe a quasi-experimental design in social work research
Understand the different types of quasi-experimental designs
Determine what kinds of research questions quasi-experimental designs are suited for
Discuss advantages and disadvantages of quasi-experimental designs

Quasi-experimental designs are a lot more common in social work research than true experimental designs. Although quasi-experiments don’t do as good a job of giving us robust proof of causality , they still allow us to establish time order , which is a key element of causality. The prefix quasi means “resembling,” so quasi-experimental research is research that resembles experimental research, but is not true experimental research. Nonetheless, given proper research design, quasi-experiments can still provide extremely rigorous and useful results.

There are a few key differences between true experimental and quasi-experimental research. The primary difference between quasi-experimental research and true experimental research is that quasi-experimental research does not involve random assignment to control and experimental groups. Instead, we talk about comparison groups in quasi-experimental research instead. As a result, these types of experiments don’t control the effect of extraneous variables as well as a true experiment.

Quasi-experiments are most likely to be conducted in field settings in which random assignment is difficult or impossible. They are often conducted to evaluate the effectiveness of a treatment—perhaps a type of psychotherapy or an educational intervention. We’re able to eliminate some threats to internal validity, but we can’t do this as effectively as we can with a true experiment. Realistically, our CBT-social anxiety project is likely to be a quasi experiment, based on the resources and participant pool we’re likely to have available.

It’s important to note that not all quasi-experimental designs have a comparison group. There are many different kinds of quasi-experiments, but we will discuss the three main types below: nonequivalent comparison group designs, time series designs, and ex post facto comparison group designs.

Nonequivalent comparison group design

You will notice that this type of design looks extremely similar to the pretest/post-test design that we discussed in section 13.3. But instead of random assignment to control and experimental groups, researchers use other methods to construct their comparison and experimental groups. A diagram of this design will also look very similar to pretest/post-test design, but you’ll notice we’ve removed the “R” from our groups, since they are not randomly assigned (Figure 13.6).

Researchers using this design select a comparison group that’s as close as possible based on relevant factors to their experimental group. Engel and Schutt (2017) [2] identify two different selection methods:

Individual matching : Researchers take the time to match individual cases in the experimental group to similar cases in the comparison group. It can be difficult, however, to match participants on all the variables you want to control for.
Aggregate matching : Instead of trying to match individual participants to each other, researchers try to match the population profile of the comparison and experimental groups. For example, researchers would try to match the groups on average age, gender balance, or median income. This is a less resource-intensive matching method, but researchers have to ensure that participants aren’t choosing which group (comparison or experimental) they are a part of.

As we’ve already talked about, this kind of design provides weaker evidence that the intervention itself leads to a change in outcome. Nonetheless, we are still able to establish time order using this method, and can thereby show an association between the intervention and the outcome. Like true experimental designs, this type of quasi-experimental design is useful for explanatory research questions.

What might this look like in a practice setting? Let’s say you’re working at an agency that provides CBT and other types of interventions, and you have identified a group of clients who are seeking help for social anxiety, as in our earlier example. Once you’ve obtained consent from your clients, you can create a comparison group using one of the matching methods we just discussed. If the group is small, you might match using individual matching, but if it’s larger, you’ll probably sort people by demographics to try to get similar population profiles. (You can do aggregate matching more easily when your agency has some kind of electronic records or database, but it’s still possible to do manually.)

Time series design

Another type of quasi-experimental design is a time series design. Unlike other types of experimental design, time series designs do not have a comparison group. A time series is a set of measurements taken at intervals over a period of time (Figure 13.7). Proper time series design should include at least three pre- and post-intervention measurement points. While there are a few types of time series designs, we’re going to focus on the most common: interrupted time series design.

But why use this method? Here’s an example. Let’s think about elementary student behavior throughout the school year. As anyone with children or who is a teacher knows, kids get very excited and animated around holidays, days off, or even just on a Friday afternoon. This fact might mean that around those times of year, there are more reports of disruptive behavior in classrooms. What if we took our one and only measurement in mid-December? It’s possible we’d see a higher-than-average rate of disruptive behavior reports, which could bias our results if our next measurement is around a time of year students are in a different, less excitable frame of mind. When we take multiple measurements throughout the first half of the school year, we can establish a more accurate baseline for the rate of these reports by looking at the trend over time.

We may want to test the effect of extended recess times in elementary school on reports of disruptive behavior in classrooms. When students come back after the winter break, the school extends recess by 10 minutes each day (the intervention), and the researchers start tracking the monthly reports of disruptive behavior again. These reports could be subject to the same fluctuations as the pre-intervention reports, and so we once again take multiple measurements over time to try to control for those fluctuations.

This method improves the extent to which we can establish causality because we are accounting for a major extraneous variable in the equation—the passage of time. On its own, it does not allow us to account for other extraneous variables, but it does establish time order and association between the intervention and the trend in reports of disruptive behavior. Finding a stable condition before the treatment that changes after the treatment is evidence for causality between treatment and outcome.

Ex post facto comparison group design

Ex post facto (Latin for “after the fact”) designs are extremely similar to nonequivalent comparison group designs. There are still comparison and experimental groups, pretest and post-test measurements, and an intervention. But in ex post facto designs, participants are assigned to the comparison and experimental groups once the intervention has already happened. This type of design often occurs when interventions are already up and running at an agency and the agency wants to assess effectiveness based on people who have already completed treatment.

In most clinical agency environments, social workers conduct both initial and exit assessments, so there are usually some kind of pretest and post-test measures available. We also typically collect demographic information about our clients, which could allow us to try to use some kind of matching to construct comparison and experimental groups.

In terms of internal validity and establishing causality, ex post facto designs are a bit of a mixed bag. The ability to establish causality depends partially on the ability to construct comparison and experimental groups that are demographically similar so we can control for these extraneous variables .

Quasi-experimental designs are common in social work intervention research because, when designed correctly, they balance the intense resource needs of true experiments with the realities of research in practice. They still offer researchers tools to gather robust evidence about whether interventions are having positive effects for clients.

Quasi-experimental designs are similar to true experiments, but do not require random assignment to experimental and control groups.
In quasi-experimental projects, the group not receiving the treatment is called the comparison group, not the control group.
Nonequivalent comparison group design is nearly identical to pretest/post-test experimental design, but participants are not randomly assigned to the experimental and control groups. As a result, this design provides slightly less robust evidence for causality.
Nonequivalent groups can be constructed by individual matching or aggregate matching .
Time series design does not have a control or experimental group, and instead compares the condition of participants before and after the intervention by measuring relevant factors at multiple points in time. This allows researchers to mitigate the error introduced by the passage of time.
Ex post facto comparison group designs are also similar to true experiments, but experimental and comparison groups are constructed after the intervention is over. This makes it more difficult to control for the effect of extraneous variables, but still provides useful evidence for causality because it maintains the time order of the experiment.
Think back to the experiment you considered for your research project in Section 13.3. Now that you know more about quasi-experimental designs, do you still think it’s a true experiment? Why or why not?
What should you consider when deciding whether an experimental or quasi-experimental design would be more feasible or fit your research question better?

13.5 Non-experimental designs

Describe non-experimental designs in social work research
Discuss how non-experimental research differs from true and quasi-experimental research
Demonstrate an understanding the different types of non-experimental designs
Determine what kinds of research questions non-experimental designs are suited for
Discuss advantages and disadvantages of non-experimental designs

The previous sections have laid out the basics of some rigorous approaches to establish that an intervention is responsible for changes we observe in research participants. This type of evidence is extremely important to build an evidence base for social work interventions, but it’s not the only type of evidence to consider. We will discuss qualitative methods, which provide us with rich, contextual information, in Part 4 of this text. The designs we’ll talk about in this section are sometimes used in qualitative research but in keeping with our discussion of experimental design so far, we’re going to stay in the quantitative research realm for now. Non-experimental is also often a stepping stone for more rigorous experimental design in the future, as it can help test the feasibility of your research.

In general, non-experimental designs do not strongly support causality and don’t address threats to internal validity. However, that’s not really what they’re intended for. Non-experimental designs are useful for a few different types of research, including explanatory questions in program evaluation. Certain types of non-experimental design are also helpful for researchers when they are trying to develop a new assessment or scale. Other times, researchers or agency staff did not get a chance to gather any assessment information before an intervention began, so a pretest/post-test design is not possible.

A genderqueer person sitting on a couch, talking to a therapist in a brightly-lit room

A significant benefit of these types of designs is that they’re pretty easy to execute in a practice or agency setting. They don’t require a comparison or control group, and as Engel and Schutt (2017) [3] point out, they “flow from a typical practice model of assessment, intervention, and evaluating the impact of the intervention” (p. 177). Thus, these designs are fairly intuitive for social workers, even when they aren’t expert researchers. Below, we will go into some detail about the different types of non-experimental design.

One group pretest/post-test design

Also known as a before-after one-group design, this type of research design does not have a comparison group and everyone who participates in the research receives the intervention (Figure 13.8). This is a common type of design in program evaluation in the practice world. Controlling for extraneous variables is difficult or impossible in this design, but given that it is still possible to establish some measure of time order, it does provide weak support for causality.

Imagine, for example, a researcher who is interested in the effectiveness of an anti-drug education program on elementary school students’ attitudes toward illegal drugs. The researcher could assess students’ attitudes about illegal drugs (O 1 ), implement the anti-drug program (X), and then immediately after the program ends, the researcher could once again measure students’ attitudes toward illegal drugs (O 2 ). You can see how this would be relatively simple to do in practice, and have probably been involved in this type of research design yourself, even if informally. But hopefully, you can also see that this design would not provide us with much evidence for causality because we have no way of controlling for the effect of extraneous variables. A lot of things could have affected any change in students’ attitudes—maybe girls already had different attitudes about illegal drugs than children of other genders, and when we look at the class’s results as a whole, we couldn’t account for that influence using this design.

All of that doesn’t mean these results aren’t useful, however. If we find that children’s attitudes didn’t change at all after the drug education program, then we need to think seriously about how to make it more effective or whether we should be using it at all. (This immediate, practical application of our results highlights a key difference between program evaluation and research, which we will discuss in Chapter 23 .)

After-only design

As the name suggests, this type of non-experimental design involves measurement only after an intervention. There is no comparison or control group, and everyone receives the intervention. I have seen this design repeatedly in my time as a program evaluation consultant for nonprofit organizations, because often these organizations realize too late that they would like to or need to have some sort of measure of what effect their programs are having.

Because there is no pretest and no comparison group, this design is not useful for supporting causality since we can’t establish the time order and we can’t control for extraneous variables. However, that doesn’t mean it’s not useful at all! Sometimes, agencies need to gather information about how their programs are functioning. A classic example of this design is satisfaction surveys—realistically, these can only be administered after a program or intervention. Questions regarding satisfaction, ease of use or engagement, or other questions that don’t involve comparisons are best suited for this type of design.

Static-group design

A final type of non-experimental research is the static-group design. In this type of research, there are both comparison and experimental groups, which are not randomly assigned. There is no pretest, only a post-test, and the comparison group has to be constructed by the researcher. Sometimes, researchers will use matching techniques to construct the groups, but often, the groups are constructed by convenience of who is being served at the agency.

Non-experimental research designs are easy to execute in practice, but we must be cautious about drawing causal conclusions from the results. A positive result may still suggest that we should continue using a particular intervention (and no result or a negative result should make us reconsider whether we should use that intervention at all). You have likely seen non-experimental research in your daily life or at your agency, and knowing the basics of how to structure such a project will help you ensure you are providing clients with the best care possible.

Non-experimental designs are useful for describing phenomena, but cannot demonstrate causality.
After-only designs are often used in agency and practice settings because practitioners are often not able to set up pre-test/post-test designs.
Non-experimental designs are useful for explanatory questions in program evaluation and are helpful for researchers when they are trying to develop a new assessment or scale.
Non-experimental designs are well-suited to qualitative methods.
If you were to use a non-experimental design for your research project, which would you choose? Why?
Have you conducted non-experimental research in your practice or professional life? Which type of non-experimental design was it?

13.6 Critical, ethical, and cultural considerations

Describe critiques of experimental design
Identify ethical issues in the design and execution of experiments
Identify cultural considerations in experimental design

As I said at the outset, experiments, and especially true experiments, have long been seen as the gold standard to gather scientific evidence. When it comes to research in the biomedical field and other physical sciences, true experiments are subject to far less nuance than experiments in the social world. This doesn’t mean they are easier—just subject to different forces. However, as a society, we have placed the most value on quantitative evidence obtained through empirical observation and especially experimentation.

Major critiques of experimental designs tend to focus on true experiments, especially randomized controlled trials (RCTs), but many of these critiques can be applied to quasi-experimental designs, too. Some researchers, even in the biomedical sciences, question the view that RCTs are inherently superior to other types of quantitative research designs. RCTs are far less flexible and have much more stringent requirements than other types of research. One seemingly small issue, like incorrect information about a research participant, can derail an entire RCT. RCTs also cost a great deal of money to implement and don’t reflect “real world” conditions. The cost of true experimental research or RCTs also means that some communities are unlikely to ever have access to these research methods. It is then easy for people to dismiss their research findings because their methods are seen as “not rigorous.”

Obviously, controlling outside influences is important for researchers to draw strong conclusions, but what if those outside influences are actually important for how an intervention works? Are we missing really important information by focusing solely on control in our research? Is a treatment going to work the same for white women as it does for indigenous women? With the myriad effects of our societal structures, you should be very careful ever assuming this will be the case. This doesn’t mean that cultural differences will negate the effect of an intervention; instead, it means that you should remember to practice cultural humility implementing all interventions, even when we “know” they work.

How we build evidence through experimental research reveals a lot about our values and biases, and historically, much experimental research has been conducted on white people, and especially white men. [4] This makes sense when we consider the extent to which the sciences and academia have historically been dominated by white patriarchy. This is especially important for marginalized groups that have long been ignored in research literature, meaning they have also been ignored in the development of interventions and treatments that are accepted as “effective.” There are examples of marginalized groups being experimented on without their consent, like the Tuskegee Experiment or Nazi experiments on Jewish people during World War II. We cannot ignore the collective consciousness situations like this can create about experimental research for marginalized groups.

None of this is to say that experimental research is inherently bad or that you shouldn’t use it. Quite the opposite—use it when you can, because there are a lot of benefits, as we learned throughout this chapter. As a social work researcher, you are uniquely positioned to conduct experimental research while applying social work values and ethics to the process and be a leader for others to conduct research in the same framework. It can conflict with our professional ethics, especially respect for persons and beneficence, if we do not engage in experimental research with our eyes wide open. We also have the benefit of a great deal of practice knowledge that researchers in other fields have not had the opportunity to get. As with all your research, always be sure you are fully exploring the limitations of the research.

While true experimental research gathers strong evidence, it can also be inflexible, expensive, and overly simplistic in terms of important social forces that affect the resources.
Marginalized communities’ past experiences with experimental research can affect how they respond to research participation.
Social work researchers should use both their values and ethics, and their practice experiences, to inform research and push other researchers to do the same.
Think back to the true experiment you sketched out in the exercises for Section 13.3. Are there cultural or historical considerations you hadn’t thought of with your participant group? What are they? Does this change the type of experiment you would want to do?
How can you as a social work researcher encourage researchers in other fields to consider social work ethics and values in their experimental research?

Media Attributions

Being kinder to yourself © Evgenia Makarova is licensed under a CC BY-NC-ND (Attribution NonCommercial NoDerivatives) license
Original by author is licensed under a CC BY-NC-SA (Attribution NonCommercial ShareAlike) license
Original by author. is licensed under a CC BY-NC-SA (Attribution NonCommercial ShareAlike) license
Orginal by author. is licensed under a CC BY-NC-SA (Attribution NonCommercial ShareAlike) license
therapist © Zackary Drucker is licensed under a CC BY-NC-ND (Attribution NonCommercial NoDerivatives) license
nonexper-pretest-posttest is licensed under a CC BY-NC-SA (Attribution NonCommercial ShareAlike) license
Engel, R. & Schutt, R. (2016). The practice of research in social work. Thousand Oaks, CA: SAGE Publications, Inc. ↵
Sullivan, G. M. (2011). Getting off the “gold standard”: Randomized controlled trials and education research. Journal of Graduate Medical Education , 3 (3), 285-289. ↵

an operation or procedure carried out under controlled conditions in order to discover an unknown effect or law, to test or establish a hypothesis, or to illustrate a known law.

explains why particular phenomena work in the way that they do; answers “why” questions

variables and characteristics that have an effect on your outcome, but aren't the primary variable whose influence you're interested in testing.

the group of participants in our study who do not receive the intervention we are researching in experiments with random assignment

in experimental design, the group of participants in our study who do receive the intervention we are researching

the group of participants in our study who do not receive the intervention we are researching in experiments without random assignment

using a random process to decide which participants are tested in which conditions

The ability to apply research findings beyond the study sample to some broader population,

Ability to say that one variable "causes" something to happen to another variable. Very important to assess when thinking about studies that examine causation such as experimental or quasi-experimental designs.

the idea that one event, behavior, or belief will result in the occurrence of another, subsequent event, behavior, or belief

An experimental design in which one or more independent variables are manipulated by the researcher (as treatments), subjects are randomly assigned to different treatment levels (random assignment), and the results of the treatments on outcomes (dependent variables) are observed

a type of experimental design in which participants are randomly assigned to control and experimental groups, one group receives an intervention, and both groups receive pre- and post-test assessments

A measure of a participant's condition before they receive an intervention or treatment.

A measure of a participant's condition after an intervention or, if they are part of the control/comparison group, at the end of an experiment.

A demonstration that a change occurred after an intervention. An important criterion for establishing causality.

an experimental design in which participants are randomly assigned to control and treatment groups, one group receives an intervention, and both groups receive only a post-test assessment

The measurement error related to how a test is given; the conditions of the testing, including environmental conditions; and acclimation to the test itself

a subtype of experimental design that is similar to a true experiment, but does not have randomly assigned control and treatment groups

In nonequivalent comparison group designs, the process by which researchers match individual cases in the experimental group to similar cases in the comparison group.

In nonequivalent comparison group designs, the process in which researchers match the population profile of the comparison and experimental groups.

a set of measurements taken at intervals over a period of time

Research that involves the use of data that represents human expression through words, pictures, movies, performance and other artifacts.

Graduate research methods in social work Copyright © 2021 by Matthew DeCarlo, Cory Cummings, Kate Agnelli is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License , except where otherwise noted.

Share This Book

Fayetteville State University

2020-2021 Undergraduate Catalog [ARCHIVED CATALOG]

SWRK 320 - Social Work Research Methods: Practice Oriented

The Use and Value of Mixed Methods Research in Social Work

Josphine Chaumba Troy University

Author Biography

Josphine chaumba, troy university, additional files, make a submission, information.

For Readers
For Authors
For Librarians

Current Issue

Advances in Social Work ISSN: 1527-8565 eISSN: 2331-4125

Land Acknowledgement. We acknowledge the Indiana University School of Social Work is located on the ancestral lands of Indigenous Peoples from time immemorial. Indiana is the traditional lands of Potawatomi, Illini, Miami, Kickapoo, Lenape/Delaware, Wea, Piankashaw, Shawnee, Nanticoke, and Wyandot. We are dedicated to amplifying Indigenous voices and perspectives, improving community relationships, correcting the narratives, and making the Indiana University School of Social Work supportive and inclusive places for Native and Indigenous students, faculty, and staff. With humility and respect, we at Indiana University School of Social Work recognize and honor all Indigenous Peoples, their histories, their political rights and sovereignty and their sacred ties to the land and waters.

More information about the publishing system, Platform and Workflow by OJS/PKP.

Search Menu

Sign in through your institution

Advance articles
Editor's Choice
Author Guidelines
Submission Site
Open Access
About The British Journal of Social Work
About the British Association of Social Workers
Editorial Board
Advertising and Corporate Services
Journals Career Network
Self-Archiving Policy
Dispatch Dates
Journals on Oxford Academic
Books on Oxford Academic

< Previous

Changing Trends in Child Welfare Inequalities in Northern Ireland

Article contents
Figures & tables
Supplementary Data

Lisa Bunting, Nicole Gleghorne, Aideen Maguire, Sarah McKenna, Dermot O’Reilly, Changing Trends in Child Welfare Inequalities in Northern Ireland, The British Journal of Social Work , Volume 54, Issue 5, July 2024, Pages 1809–1829, https://doi.org/10.1093/bjsw/bcad259

Permissions Icon Permissions

Longitudinal research in England and Wales has identified increasing inequality in child welfare interventions, particularly with respect to children in the poorest areas coming into care. Although previous cross-sectional research has shown associations between area level deprivation and child welfare interventions to be weakest in Northern Ireland (NI), it remains unknown if this reflects wider trends over time. This study uses longitudinal administrative data to investigate the relationship between area level deprivation and the (1) referral, (2) investigation, (3) registration and (4) looked after stages of children’s contact with child and family social work from 2010 to 2017 (stages 1–3) and 2020 (stage 4). Both relative and absolute measures of inequality (Ratio of Inequality, Slope Index of Inequality and Relative Index of Inequality) were calculated to examine trends. The results highlight a clear and increasing social gradient in child welfare interventions in NI over time, particularly at the higher levels of intervention and those involving children aged 0–4 years. Routine analysis of children’s social care caseloads by deprivation is highlighted as a means of focusing attention on poverty and material inequality, prompting practitioners, managers and policy makers to consider the drivers of such inequality and how this might be addressed.

Personal account

Sign in with email/username & password
Get email alerts
Save searches
Purchase content
Activate your purchase/trial code
Add your ORCID iD

Institutional access

Sign in with username/password
Recommend to your librarian
Institutional account management
Get help with access

Access to content on Oxford Academic is often provided through institutional subscriptions and purchases. If you are a member of an institution with an active account, you may be able to access content in one of the following ways:

IP based access

Typically, access is provided across an institutional network to a range of IP addresses. This authentication occurs automatically, and it is not possible to sign out of an IP authenticated account.

Choose this option to get remote access when outside your institution. Shibboleth/Open Athens technology is used to provide single sign-on between your institutionâ€™s website and Oxford Academic.

Click Sign in through your institution.
Select your institution from the list provided, which will take you to your institution's website to sign in.
When on the institution site, please use the credentials provided by your institution. Do not use an Oxford Academic personal account.
Following successful sign in, you will be returned to Oxford Academic.

If your institution is not listed or you cannot sign in to your institutionâ€™s website, please contact your librarian or administrator.

Enter your library card number to sign in. If you cannot sign in, please contact your librarian.

Society Members

Society member access to a journal is achieved in one of the following ways:

Sign in through society site

Many societies offer single sign-on between the society website and Oxford Academic. If you see â€˜Sign in through society siteâ€™ in the sign in pane within a journal:

Click Sign in through society site.
When on the society site, please use the credentials provided by that society. Do not use an Oxford Academic personal account.

If you do not have a society account or have forgotten your username or password, please contact your society.

Sign in using a personal account

Some societies use Oxford Academic personal accounts to provide access to their members. See below.

A personal account can be used to get email alerts, save searches, purchase content, and activate subscriptions.

Some societies use Oxford Academic personal accounts to provide access to their members.

Viewing your signed in accounts

Click the account icon in the top right to:

View your signed in personal account and access account management features.
View the institutional accounts that are providing access.

Signed in but can't access content

Oxford Academic is home to a wide variety of products. The institutional subscription may not cover the content that you are trying to access. If you believe you should have access to that content, please contact your librarian.

For librarians and administrators, your personal account also provides access to institutional account management. Here you will find options to view and activate subscriptions, manage institutional settings and access options, access usage statistics, and more.

Short-term Access

To purchase short-term access, please sign in to your personal account above.

Don't already have a personal account? Register

Month:	Total Views:
December 2023	4
January 2024	32
February 2024	22
March 2024	34
April 2024	19
May 2024	17
June 2024	5
July 2024	13
August 2024	15

Email alerts

Citing articles via.

Recommend to your Library

Affiliations

Online ISSN 1468-263X
Print ISSN 0045-3102
Copyright © 2024 British Association of Social Workers
About Oxford Academic
Publish journals with us
University press partners
What we publish
New features
Open access
Rights and permissions
Accessibility
Advertising
Media enquiries
Oxford University Press
Oxford Languages
University of Oxford

Oxford University Press is a department of the University of Oxford. It furthers the University's objective of excellence in research, scholarship, and education by publishing worldwide

Copyright © 2024 Oxford University Press
Cookie settings
Cookie policy
Privacy policy
Legal notice

This Feature Is Available To Subscribers Only

This PDF is available to Subscribers Only

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

Open access
Published: 16 August 2024

A theory-informed deep learning approach to extracting and characterizing substance use-related stigma in social media

David Roesler 1 ,
Shana Johnny 2 ,
Mike Conway 3 &
Annie T. Chen 1

BMC Digital Health volume 2 , Article number: 60 ( 2024 ) Cite this article

34 Accesses

Metrics details

Stigma surrounding substance use can result in severe consequences for physical and mental health. Identifying situations in which stigma occurs and characterizing its impact could be a critical step toward improving outcomes for individuals experiencing stigma. As part of a larger research project with the goal of informing the development of interventions for substance use disorder, this study leverages natural language processing methods and a theory-informed approach to identify and characterize manifestations of substance use stigma in social media data.

We harvested social media data, creating an annotated corpus of 2,214 Reddit posts from subreddits relating to substance use. We trained a set of binary classifiers; each classifier detected one of three stigma types: Internalized Stigma, Anticipated Stigma, and Enacted Stigma, from the Stigma Framework. We evaluated hybrid models that combine contextual embeddings with features derived from extant lexicons and handcrafted lexicons based on stigma theory, and assessed the performance of these models. Then, using the trained and evaluated classifiers, we performed a mixed-methods analysis to quantify the presence and type of stigma in a corpus of 161,448 unprocessed posts derived from subreddits relating to substance use.

For all stigma types, we identified hybrid models (RoBERTa combined with handcrafted stigma features) that significantly outperformed RoBERTa-only baselines. In the model’s predictions on our unseen data, we observed that Internalized Stigma was the most prevalent stigma type for alcohol and cannabis, but in the case of opioids, Anticipated Stigma was the most frequent. Feature analysis indicated that language conveying Internalized Stigma was predominantly characterized by emotional content, with a focus on shame, self-blame, and despair. In contrast, Enacted Stigma and Anticipated involved a complex interplay of emotional, social, and behavioral features.

Our main contributions are demonstrating a theory-based approach to extracting and comparing different types of stigma in a social media dataset, and employing patterns in word usage to explore and characterize its manifestations. The insights from this study highlight the need to consider the impacts of stigma differently by mechanism (internalized, anticipated, and enacted), and enhance our current understandings of how each stigma mechanism manifests within language in particular cognitive, emotional, social, and behavioral aspects.

Peer Review reports

Persons with substance use disorders (SUDs) can experience stigma in various forms, including stereotypes, prejudice, and discrimination, and this stigma can have far-ranging consequences for their health, employment, housing, and relationships [ 1 ]. Individuals experiencing stigma may internalize these negative beliefs and feelings, have diminished self-esteem and recovery capital [ 2 , 3 ], and be reluctant to seek treatment [ 4 ].

Interventions focused on stigma reduction in the context of substance use have been limited, and these have tended to focus on structural stigma (e.g., education of professionals that work with persons with SUDs) as opposed to social or self-stigma [ 5 ]. There is also awareness of the bias in words used to describe SUDs, and the need to consider word choice [ 6 , 7 ]. However, despite the potential harms of substance use stigma, our knowledge of how different types of stigma affect persons within the context of SUDs remains limited [ 5 , 8 , 9 , 10 , 11 , 12 ].

In this article, we demonstrate a stigma theory-informed deep learning approach to the task of identifying examples of substance use stigma in a large dataset. To ensure that we capture stigma in the diverse forms in which it occurs, we employ the Stigma Framework [ 13 ], which defines three stigma mechanisms for those who experience stigma: Internalized Stigma , Anticipated Stigma , and Enacted Stigma . The Stigma Framework has been used to characterize stigma processes in various health-related contexts, including problematic substance use [ 11 ] and HIV [ 13 ], and extant literature has sought to develop instruments to assess the experience of these three types of stigma [ 11 ]. To our knowledge, however, prior work has not explored how the three stigma mechanisms are conveyed by the language used in social media. We examine stigma as expressed in social media for two main reasons: 1) previous literature has shown that stigma relating to mental health is endemic in social media [ 14 , 15 ]; and 2) social media can serve an important role in understanding and promoting public health [ 16 , 17 ].

This current study aims to answer the research question: How do the three stigma mechanisms in the Stigma Framework manifest differently in terms of distribution and nature in social media? We take the following approach:

We develop classifiers to identify three stigma mechanisms in an annotated social media dataset and evaluate the performance of these classifiers.

To gain a deeper understanding of the prevalence of the three stigma mechanisms in social media at large, we analyze how each stigma mechanism is distributed in the predictions made by the classifiers on the unseen portion of our data.

To better understand the linguistic expression of the different stigma mechanisms in social media, we identify the highest-ranking features associated with each mechanism and offer illustrative examples.

Related work

Conceptualizations of stigma.

Goffman [ 18 ] influentially defined stigma as “an attribute that is deeply discrediting”, and which reduces the stigmatized “from a whole and usual person to a tainted, discounted one” (p. 3). Goffman described stigma as a product of interactions, and stated that “a language of relations, not attributes, is really needed to describe stigma” [ 18 ] (p.3). The relational nature of stigma was emphasized by subsequent stigma theory [ 19 , 20 ] that characterized stigma as a social process situated in a social context, with Link and Phelan [ 19 ] conceptualizing stigma as a convergence of labeling, stereotyping, separation, status loss, and discrimination, all within a power structure.

To complement existing societal-level conceptualizations of stigma with individual-level ones and create a more comprehensive theory of stigma and its impact, Earnshaw and Chaudoir [ 13 ] proposed the Stigma Framework. In this framework, which draws on stigma theory from a variety of domains [ 19 , 20 , 21 , 22 , 23 ], attention is given to both the mechanisms of stigma employed by those with power, and also the ways that stigma is experienced or adopted by stigmatized individuals. Earnshaw and Chaudoir distinguish three mechanisms employed by those who distance themselves from the “mark” of stigma: prejudice, stereotyping, and discrimination; and three mechanisms (hereafter primarily called “types”) for those who experience stigma: Internalized Stigma , Anticipated Stigma , and Enacted Stigma . Table 1 provides definitions and examples of each of the three types of experienced stigma, in the context of substance use, as defined in Smith et al. [ 11 ]. The stigma mechanisms identified by the Stigma Framework have been assessed in various health-related contexts and have been associated with physical, mental, and behavioral outcomes for those that experience stigma [ 11 , 24 , 25 ].

Despite the existence of different conceptualizations of stigma, there is much that we do not yet understand about stigma processes. In particular, there is a recognized need to more clearly define and characterize the nature of stigma [ 9 , 26 ]; to identify societal and individual-level factors affecting stereotyping, prejudice, and discrimination [ 12 ]; and to develop a more nuanced understanding of how different stigma mechanisms may affect substance use recovery [ 11 ]. In this study, we develop models to identify stigma in a large social media dataset for subsequent qualitative analysis intended to enhance our understanding of the complex interplay of the effects of stigma on the individual within their embedded contexts.

Computational models of stigma detection

Although a multitude of computational models for the detection of abusive language and hate speech in social media texts has been proposed [ 27 , 28 ], the computational detection of social stigma has been less extensively explored. Whereas hate speech is commonly defined as a communicative act of disparagement of a person or group [ 29 ], the arguably broader concept of stigma can include, in addition to direct antagonism, more subtle and systematic forms of discrimination and distancing, of both others and the self [ 1 , 18 , 19 , 30 ]. Research on stigma detection in a variety of specific domains has been conducted, with works on the detection of depression stigma [ 14 ], mental health stigma [ 31 , 32 ], stigmatizing language in healthcare discussions [ 33 ], Alzheimer’s Disease stigma [ 34 ], schizophrenia stigma [ 35 ], and obesity stigma [ 36 ].

Li et al. [ 14 ] produce models for the detection of depression stigma in Mandarin Chinese Weibo posts. In their data, they find only 6% of the posts contain stigmatizing content; however, when training their model, the authors create a balanced corpus of texts (stigmatizing vs. non-stigmatizing). The researchers test logistic regression, multi-layer perceptron (MLP), support vector machine, and random forest classifiers trained in conjunction with a simplified Chinese version of Linguistic Inquiry and Word Count (LIWC) features [ 37 ]. The trained models detect stigmatizing posts and also classify each stigma-positive instance as an instance of one of three depression stigma sub-narratives (‘unpredictability’, ‘weakness’, or ‘false illness’), with the researchers finding best results when using random forest models.

Straton et al. [ 33 ] build a model for the detection of stigmatizing language in Facebook healthcare discussions around the topic of vaccination. In their annotated corpus of postings from anti-vaccination message walls, they find language stigmatizing government organizations and institutions, and in pro-vaccination message walls, they find language stigmatizing the anti-vaccination movement. Using a balanced dataset, the researchers use term frequency-inverse document frequency (TF-IDF) weighted n-grams and LIWC psychological features to train a variety of classifiers, with a convolutional neural network model resulting in the best performance.

Gottipati et al. [ 32 ] perform mental disorder stigma detection on a corpus of mental health-related news articles published by Singapore’s largest media organizations. The authors create an (approximately) balanced dataset of stigmatizing and non-stigmatizing news article titles paired with a sentence from the same article. The researchers create features from TF-IDF weighted n-grams and compare a variety of machine learning classifiers, finding best performance with XGBoost [ 38 ].

To develop a model for detecting stigmatizing language related to mental health, Lee and Kyung [ 31 ] create a corpus of 240 sentence pairs (stigmatizing and non-stigmatizing), entitled the Mental Health Stigma Corpus. The authors fine-tune a BERT-base model [ 39 ] to classify sentences as stigma-positive or stigma-negative and achieve promising results, though the synthetic nature of their dataset may raise questions with regard its ability to generalize to real-world data. We summarize the results of the four stigma detection studies described here in Table 2 .

Although research on health-related stigma detection has been performed in a variety of domains, to our knowledge, all have treated stigma as a single monolithic concept. In this work, we incorporate the three stigma mechanisms (Internalized, Anticipated, and Enacted Stigma) of the Stigma Framework [ 13 ] to better differentiate between different types of stigma experiences, including identifying linguistic features which are most characteristic of each stigma type. For instance, the social media examples that we observed included stigmatizing language (“my sister is a hopeless alcoholic”), reports of stigmatization (“my husband took away the kids and said I’d never get clean”), and the experience of stigma (“I feel so much shame that I can’t tell anyone”).

Based on the effectiveness of BERT contextual embeddings, TF-IDF-weighted n-grams, and LIWC features for the purpose of stigmatizing language detection [ 14 , 31 , 33 ], we experiment with combinations of these resources. Given the prevalence of affect types such as sadness, anxiety, and fear in social media posts discussing experiences of substance use [ 40 ] and prior literature arguing that emotion regulation can be a factor in stigma coping [ 41 , 42 ], we also experiment with count-based features derived from extant affect lexicons and our own handcrafted stigma lexicons. These handcrafted lexicons incorporate affective, social, and behavioral concepts based on stigma theory, including anxiety, depression, and secretive behavior [ 5 , 9 ].

In this study, we employ classifiers to identify three different types of stigma in a social media dataset. We train and evaluate a set of models for each stigma type and then perform a mixed-methods analysis of the data identified by these models. A flowchart overview of our project is depicted in Fig. 1 .

Project overview flowchart

Dataset creation

Harvesting data.

To create our dataset, approximately 160 thousand English-language Reddit posts authored between January 1, 2013 and December 31, 2019 were collected using Pushshift.io [ 43 ]. To capture diverse manifestations of substance use stigma and stigma-related behaviors (including navigation of legality for users), we focused on three substances for this analysis: alcohol, cannabis, and opioids. We selected subreddits related to the three substances of interest (e.g., ‘r/stopdrinking’, ‘r/marijuana’, and ‘r/opiates’) and sampled only thread-initiating posts, as these posts often contain richer descriptions of Redditor’s experiences [ 44 ]. In our previous research [ 40 , 45 ], we found these subreddits contained detailed accounts of both substance use and SUD recovery. Table 3 provides a breakdown of post counts for each subreddit in the harvested Reddit data. Subreddits that allude to or mention recovery or support in subreddit titles, descriptions or rules are labeled with checkmarks.

Sampling for annotation

We observed that posts containing explicit references to stigma were relatively uncommon. To increase the volume of relevant data for annotation and to support subsequent natural language processing, we employed the keyword sampling method used in Chen et al. [ 40 ] to build our annotated corpus. Only the posts that matched a regular expression containing a keyword list were sampled to increase the probability of sampling stigma-related content. The theory-informed keyword list, derived from stigma literature [ 10 , 11 , 24 , 25 ], includes terms with stigma-related connotations (such as ‘shame’, ‘disappoint’, and ‘untrustworthy’) and terms referring to the actors who may be involved in stigma-related experiences (‘family’, ‘co-worker’, ‘husband’). Over the course of the annotation process, this list of keywords was iteratively refined to increase the prevalence of stigma in samples. The final set of sampling keywords is listed in Table 4 . Additionally, subreddits that produced low yields for stigma content (e.g., r/alcohol, r/Petioles, r/trees) were removed from the candidates for annotation sampling. Table 5 shows the breakdown of post counts for each of the subreddits and the distribution of the three stigma types in the annotated dataset.

Annotation process

Three annotators with expertise in informatics, natural language processing, nursing, and public health annotated a total of 2,214 Reddit posts at the span-level for three stigma types based on the Stigma Framework [ 13 ]: Internalized Stigma, Anticipated Stigma, and Enacted Stigma. We developed an annotation guide including definitions, synthetic examples, and instructions for identifying and distinguishing these three stigma types based on extant literature [ 11 , 46 ]. A detailed description of our annotation guidelines is provided as Additional file 1 .

Annotators independently identified passages containing stigma in the posts before discussing and reconciling the annotations. In addition to labeling stigma spans, annotators also labeled posts for substance type and the author’s recovery outlook (positive, neutral, or negative), and identified spans containing mentions of social isolation and labels (e.g., ‘addict’). Table 6 lists pairwise inter-annotator agreement for the three annotators at post level, prior to reconciliation, measured using Cohen’s Kappa [ 47 ]. Overall, pair-wise agreement on the stigma mechanisms reflected moderate agreement [ 48 ], with the highest agreement being for Internalized Stigma. Pair-wise agreement scores on all annotation types varied between 0.66 and 0.71, indicating substantial agreement.

Text segmentation

In the annotated corpus, we observed that Reddit posts ranged in length from 28 characters to 25,743 characters, with a mean length of 1,816 characters (Fig. 2 ). As many posts exceed the 512-token input length limit of the RoBERTa encoder [ 49 ] that we use in our detection model, we opt to chunk posts into text segments. We use the term ‘segment’ to refer to the chunks of text used as inputs to our classifiers, and we use ‘span’ to refer to passages of text within posts labeled by annotators. We map the annotated span labels onto the segments, and then use the labeled segments to train our models. When the trained models make predictions, they first make predictions on individual segments before we map these predictions back to the post level, where, if any segment within a post is predicted as stigma-positive, the entire post is then predicted to be stigma-positive.

Architecture of the hybrid model

Although segmenting posts solves the input limitation issue, this also increases class imbalance in our dataset. In our annotated corpus, we find that within individual posts, the stigma-positive spans can be infrequent, with multi-paragraph posts sometimes only containing a few stigma-positive words. As a result, when we split the Reddit posts into smaller units (such as sentences), we produce far more negative examples than positive ones, and the portion of stigma-positive texts in our corpus decreases (Table 7 ). When splitting posts down to the level of sentences, we see severe class imbalance, with only 1.69% of the data containing Enacted Stigma.

Class imbalance can result in classifiers which perform well for the majority class, but poorly for the minority class [ 50 , 51 ]. To mitigate class imbalance, we experimented with a variety of segmentation lengths, and found the best performing length to be approximately 600 characters. At this length, text segments seem to be short enough to mitigate the amount of irrelevant information (features unrelated to stigma), but they also remain lengthy enough to keep the imbalance of classes from becoming severe.

To build segments from our post data, we begin by splitting all posts into sentences using Natural Language Toolkit (NLTK) 3.5 [ 52 ]. We then join the resulting sentences in the order they appear in the post until the threshold value of 600 characters in length is reached, after which, a new segment is started. We do not split sentences, and thus segments vary in length. After segmenting texts, labels are assigned to segments by checking for overlap between segment spans and annotation spans. The texts are then pre-processed by removing URLs, hyperlinks, and other HTML-related text residue.

Substance use stigma detection model

To identify Reddit posts in the harvested data that have a high probability of containing reports and instances of substance use stigma, we create binary classifiers for each stigma type: Internalized Stigma, Anticipated Stigma, and Enacted Stigma. Because each segment of input text may be stigma-positive for multiple stigma types, we treat this classification task as a set of independent binary classification tasks rather than a single multi-class classification task.

We utilize a RoBERTa encoder [ 49 ] as the main component of the classifier, and also make use of n-gram features, features derived from affective and psychological lexicons, and handcrafted features to enrich the model with external knowledge relevant to the task. To integrate RoBERTa embeddings with the additional features, we use a hybrid model (Fig. 3 ) based on Prakash et al. [ 53 ], where the first stage is MLP pre-training. The MLP is pre-trained on a concatenated vector of TF-IDF weighted n-grams, features derived from the NRC Footnote 1 Emotional Intensity Lexicon [ 54 ], features derived from Wordnet-Affect [ 55 ], features generated from the LIWC 2015 lexicon [ 37 ], and handcrafted substance use stigma features.

Histogram of post character length

After pre-training is complete, the trained MLP weights are used along with a pre-trained RoBERTa encoder in the fine-tuning process. The < s > token output of the RoBERTa encoder and the MLP output are normalized and then concatenated before being passed to an MLP classifier head, which outputs the probability that a given sequence of text contains the current type of substance use stigma.

Feature vector construction

When building input to the MLP component of the classifier, we create the following feature sets:

TF-IDF weighted n-grams (TF-IDF)

To create TF-IDF features, we remove English stop words from the text using the NLTK 3.5 package, and then use Scikit-learn 1.8 [ 56 ] to create TF-IDF weighted n-grams in the range (2, 6) with a dimensionality of 10,000.

NRC affective intensity features (NRC)

We include NRC features [ 54 ] to take advantage of the scaled emotional intensity scores that the NRC lexicon provides. We use the NRC Emotional Intensity Lexicon to generate 10-dimensional intensity-scaled affect features (with each dimension corresponding to one of the concepts listed in Table 8 ). To produce feature vectors, we follow the method of Babanejad et al. [ 57 ], who create ‘EAISe’ representations (Emotion Affective Intensity with Sentiment Features) for their sarcasm detection model.

Wordnet Affect features (WNA)

Wordnet-Affect [ 1 ], developed based on Wordnet 1.6 [ 58 ], enabled us to incorporate finer-grained affect types. Based on literature relating to substance use, stigma, and emotion and an examination of our Reddit corpus, we identified 13 Wordnet-Affect concepts that were relevant to substance use stigma (Table 8 ) and constructed lexical sets around each of the 13 Wordnet-Affect concepts using Wordnet. Using these sets, we generate 13-dimensional feature vectors using the same method that we use to build our NRC vectors.

LIWC features

Linguistic, grammatical, and psychological features are generated using LIWC 2015 software [ 37 ]. We remove the ‘word count’ feature and retain all others, resulting in a 92-dimensional vector.

Substance use stigma features (INT / ANT / ENA)

We create handcrafted lexicons (identified as ‘INT’, ‘ANT’, and ‘ENA’) to capture affective, behavioral, and social concepts related to each stigma type. These lexicons were developed through examination of TF-IDF weighted n-gram chi-square rankings for the training data, identification of recurring concepts in the stigma-positive examples of the training data that corresponded to concepts from stigma literature and survey instruments [ 10 , 11 , 24 , 25 , 46 , 59 ], and iterative building and evaluation of lexical sets for each concept using a validation set. For Anticipated Stigma, an associated behavior such as concealment [ 25 ] is included in the ‘secrecy’ concept through keywords such as ‘sneak’, ‘hid’, or ‘throwaway’ (used in mentions of ‘throwaway’ Reddit accounts created to preserve anonymity). The six concepts included in each feature set is listed here in Table 8 , and the complete list of keywords included in each concept is listed in Additional file 2 . To create 6-dimensional feature vectors, we start with a vector of zeros. We then search text segments for each of the words in our lexical sets. If a lexicon word is present, we add ‘1’ to the concept dimension associated with the word.

After building all feature vectors, we separately normalize each set of features, then concatenate them to form a 10,121-dimensional input vector.

Data handling

Training sets are sampled from our segment-level data and contain a mixture of stigma-positive and stigma-negative texts. In development, the best results for MLP and hybrid models were found when using a training set with a negative to positive rate of 3:1, and we use this rate to train our final hybrid models. Our validation and test sets are randomly sampled from 10% of the post-level data. After a set of Reddit posts is sampled, the constituent segments are retrieved and used as the evaluation set.

Hyperparameters

We train all models on a single Tesla A100 GPU on the Google Colab platform. Training is implemented using Pytorch 1.12 [ 60 ] and the Huggingface library [ 61 ]. We pre-train our MLP for 30 epochs using the AdamW optimizer with a learning rate of 5.e-5 (controlled by a learning rate scheduler) and a batch size of 32. We determine the optimal threshold for positive class F1 after each training epoch using a precision-recall curve on the validation set. The best model is checkpointed based on positive class F1 performance.

During fine-tuning, we fine-tune cased RoBERTa-base (123 million parameters) for 10 epochs with a learning rate of 5.e-5 and batch size of 32. We also experiment with the cased RoBERTa-large encoder (354 million parameters), and when fine-tuning RoBERTa-large, we train for 10 epochs with a learning rate of 7.e-6 and a batch size of 32. Less than 15 min of GPU time were required to train a single hybrid model.

Model evaluation

As we sought to identify the stigma-positive Reddit posts within the unseen harvested Reddit data, we evaluate each model’s predictions at the post-level by mapping segment predictions to each post. We compare the performance of models by reporting the mean macro F1 score of five runs on the same data, using different random seeds. We list results from variations of hybrid models utilizing different sets of features. As a baseline for comparison to the hybrid models, we list results using RoBERTa-base and RoBERTa-large with a simple classifier head, trained on a balanced training set (via undersampling), and using the same threshold moving method as used in our hybrid model.

Improvements over the RoBERTa-only baselines are considered significant at a significance level (α) of 0.05 according to McNemar’s test [ 62 ] with false discovery rate (FDR) correction [ 63 ]. McNemar’s significance test has been considered appropriate for binary classification tasks [ 64 ]; thus, we employ it on the predictions of the paired models. Because we make multiple hypothesis tests in our comparisons, FDR correction is applied to p -values.

To explore each feature set’s potential for use in stigma detection, we also considered the results of MLP evaluation on single feature sets and set combinations. We use an MLP for this comparison rather than a hybrid model since in the hybrid models, redundancies in the information encoded by feature set combinations and the information encoded by RoBERTa can make the relative performance contribution of each feature set difficult to disentangle. We also perform exploratory feature ranking of all features using the chi-square measure to explore the strength of association between each feature and its relevant stigma type. The feature selection tools of the Scikit-learn package were used to implement this experiment [ 56 ].

Last, we perform an error analysis of the hybrid model’s predictions. This evaluation not only informs future improvements on our approach, but also provides insights into difficulties that arise in the perception and experience of stigma.

Mixed-methods analysis

Mixed-methods research can facilitate research that cannot be answered using a single method. Though there is controversy concerning what constitutes mixed-methods research, integrating quantitative and qualitative approaches is considered increasingly important, and extant literature has observed and demonstrated that the definition of mixed-methods research will continue to grow [ 65 , 66 ]. In this study, we leverage both quantitative and qualitative methods for various affordances identified by Doyle et al. [ 66 ] including: triangulation, completeness, and illustration of data.

We performed a mixed-methods analysis to: 1) estimate the amount of stigma in the larger social media data store; and 2) characterize the nature of the different stigma mechanisms. First, we characterized the presence of stigma in the unseen portion of the harvested Reddit data by examining patterns in the distribution of stigma predictions with respect to substance and subreddit, and the correlations between stigma type predictions. We employ chi-square tests to compare the presence of the stigma mechanisms in the three substances. As a chi-square test of independence on its own merely shows that there is an association between two nominal variables and does not show which cells are contributing to the lack of fit [ 67 , 68 ], we calculated standardized Pearson residuals. A standardized Pearson residual exceeding two in absolute value in a given cell indicates a lack of fit [ 67 , 68 ]. Second, we considered the feature rankings and the instances of predicted stigma in the test data in concert to illustrate how the three types of stigma concretely manifest in cognitive and emotional processes, social interactions, and behaviors in everyday life. To protect the identities of the posters, we employ synthetic quotes in our illustration [ 69 ].

Results and discussion

Model performance and evaluation, overall model performance.

Table 9 lists the results of post-level stigma detection for the three stigma types. For all three stigma types, we found hybrid models that significantly outperformed their respective RoBERTa-only baselines, with the largest gain observed for the Anticipated Stigma RoBERTa-large hybrid model using only the handcrafted stigma features (+ 7.08 F1). These results provide evidence that n-gram, affective, behavioral, and social features can be combined with contextual embeddings to improve substance use stigma detection.

In the results of MLP evaluation (Table 10 ), the handcrafted lexicons (STIG) appeared to be relatively effective resources for the task of stigma detection, and the other feature sets (NRC, WNA, and LIWC) also appear to be viable resources (to varying degrees). For individual feature sets, the handcrafted stigma lexicons appeared to provide the best results for Internalized Stigma and Anticipated Stigma, whereas LIWC provided best results for Enacted Stigma. For feature set combinations, adding additional feature sets usually led to improvement for MLP models (with some exceptions), although the combination of all features only outperformed the handcrafted stigma lexicons for the case of Enacted Stigma.

Comparing performance by stigma mechanisms and contributing features

The results in Tables 9 and 10 show that, overall, scores for Internalized Stigma are higher than for the other stigma types; Internalized Stigma was the most frequent of the three stigma types in the annotated corpus (making it the stigma type with the greatest number of examples). When performing exploratory feature ranking of all features (Table 11 ), count-based features had stronger associations (higher chi-square scores) with Internalized Stigma than they did with the other stigma types. Affective concepts such as ‘shame’ and ‘guilt’ had strong relationships with Internalized Stigma, which likely benefitted performance.

Overall performance for Anticipated and Enacted Stigma was weaker than for Internalized Stigma. There may be a number of reasons for this. First, Anticipated and Enacted Stigma had fewer examples and relatively weaker associations with count-based features in comparison with Internalized Stigma. For Enacted Stigma, the highest-ranking features were labels such as ‘alcoholic’ and ‘junkie’, which were fairly common in the entire corpus. Labeling terms such as ‘alcoholic’ may be used to enact stigma, but they may also be used to express membership in recovery groups and are a part of ‘recovery dialects’ used within such groups [ 2 ]. Moreover, labeling terms may also be appropriated by members of stigmatized groups to increase perceptions of power for the stigmatized individual or group [ 70 ]. The variety of motivations behind the uses of such labeling terms such as ‘junkie’ may be a limiting factor to their viability as features for stigma detection.

Another potential factor for the weaker performance for Anticipated and Enacted Stigma may be their social nature. Whereas Internalized Stigma frequently focus on a single entity (the post author), with feature rankings showing strong relationships with inward features (n-grams such as ‘i ashamed’), both Anticipated and Enacted Stigma involved other actors. With Anticipated Stigma, the highest ranking features involved concealment of use (ANT_secrecy) and other actors (ANT_social), as post authors were concerned about concealing their use from others. With Enacted Stigma, there was a wide variety of actors involved in the relationships between the stigmatizer and the person(s) being stigmatized (e.g. ‘family to partner’, ‘partner to society’, ‘co-workers to society’). Further, while Internalized Stigma frequently focused on the act of shaming oneself, Enacted Stigma involved a more diverse set of verbs/actions through which stigma was performed (e.g., disapproving looks, expressions of distrust, arrests, searches, evictions, insults, generalizations, coerced drug tests, denial of healthcare services, termination of employment, termination of personal relationships). Many of the verbs related to these stigmatizing actions were included in the ENA_stigmatizing_actions and ENA_trust features, which ranked second and third, respectively, in the feature ranking.

Model performance by stigma type followed a similar pattern to that of inter-annotator agreement across stigma types (Table 6 ), in which annotators found highest agreement on Internalized Stigma and less agreement on Anticipated and Enacted Stigma. The complexities involved in identifying these two stigma types seemed to be a challenge for both human annotators as well as the models.

Error analysis

We provide an error analysis of the Anticipated and Enacted Stigma models to gain insights into the challenges involved in detecting these stigma types. We give synthetic quotes based on our data to demonstrate error types, with features typical of Anticipate or Enacted Stigma texts bolded.

Temporal errors

We observed that both the Anticipated and Enacted Stigma hybrid models produced false positives for texts which do not match the temporal requirements of their respective stigma type (future for Anticipated Stigma, present or past for Enacted Stigma). The following example (a false positive for Enacted Stigma due to temporal mismatch), is representative of this error type:

If I come clean, my family will disown me – that isn’t even an option.

For the RoBERTa-only baseline models, this error type was noticeably less frequent. This may be a limitation of the use of count-based features in the hybrid models, as the model may weighting keywords such as disown more heavily than the tense-related syntactic information that has been shown to be encoded by BERT [ 71 ].

Stigmatizing quitters

During annotation, we observed that individuals abstaining from substance use were pressured by persons who engaged in substance use, often in the context of alcohol use when it is normalized in home or work-related settings. Though this behavior was not annotated as stigma, when it appeared in texts, it led to false positive predictions by both the baseline and hybrid models, and is exemplified by the following excerpt:

I told my mother I quit drinking and she laughed at me. I quit in May and have avoided telling my family because they drink a lot and I didn't want to put up with the questions or judgement .

In examples like this, the model seems to leverage features relevant to stigma ( she laughed at me , judgement ) while failing to learn cues that indicate the mother is an alcohol user critical of another user’s abstinence.

Motivations

Both the baseline and hybrid models for Anticipated and Enacted Stigma were prone to produce false positives for texts where typical features of stigma are present, but the motivation behind an action potentially construed as stigmatizing is unrelated to stereotyping, prejudice, or discrimination. In the following example, a partner appears to terminate a relationship due to apathetic behavior rather than stigma, and thus should be labeled as stigma-negative:

I struggled for a long time with the sadness that comes with addiction, so the feelings of apathy that followed it seemed like a relief. Eventually, they resulted in my partner breaking up with me.

Although BERT models have been demonstrated to encode information that can be leveraged to make predictions about causality [ 72 ], interpreting the motivations behind the actions described in texts can be a difficult task even for human judgement. We further discuss this issue in our limitations section.

Characterizing the presence of stigma in the unexplored data

To better understand how the three stigma mechanisms outlined in the Stigma Framework manifest within our social media dataset, we employed the classifiers to identify instances of the stigma types in the previously unexplored portion of our collected Reddit data ( n = 161,448). The distribution of stigma predictions across subreddits is presented in Table 12 . Overall, the portion of stigma-positive predictions for each type were noticeably lower than the portions seen in the annotated data (Table 5 ). This outcome aligns with expectations, given that: 1) keyword sampling was used to increase the proportion of stigma in the annotated data; and 2) in the unexplored data, a larger portion of posts originated from subreddits focused on general substance use, rather than on support or recovery. In both the predictions and annotations, we observed that, for all three substance types, the estimated stigma proportion was highest for support-focused subreddits, where posters often described challenging experiences relating to their attempts at recovery.

With respect to alcohol and cannabis, Internalized Stigma appeared to be the most common of the three stigma types. The focus on the self makes intuitive sense given the first-person viewpoint of social media narratives, and the prominent features of Internalized Stigma (Table 11 ) suggest that these data could serve as a rich source for future research on how individuals may seek to internally reconcile the cognitive and emotional aspects of shame and guilt that accompany Internalized Stigma.

However, in the case of opioids, we observed a higher frequency of Anticipated Stigma compared to Internalized Stigma. Chi-square tests examining the presence of the three stigma mechanisms in the three substances, with the standardized Pearson residual for Anticipated Stigma x Opioids, also confirm that the observed presence of Anticipated Stigma exceeds the expected in that case (see Additional file 3 ).

Co-occurrence of the three stigma mechanisms

We also explored the extent to which the stigma mechanisms co-occurred in the data. Figure 4 shows a Pearson correlation matrix between stigma labels for text segments in the annotated data (left) and also for the predictions on the unseen data (right). The largest correlation score is a value of 0.11 between Internalized and Anticipated Stigma (in the annotated data), indicating that text segments with multiple stigma labels are relatively infrequent in the annotated data. Although we observed some concepts were shared across stigma types in the feature rankings, such as labeling terms (e.g., ‘addict’), the relatively low correlation between paired stigma types illustrates the utility of developing separate models for each stigma type. Furthermore, this underscores the potential utility of the three stigma types distinguished by the Stigma Framework [ 13 ] for future research in clarifying the mechanisms by which stigma can affect persons with SUDs.

Pearson correlation between stigma types for text segments in the annotated dataset (left) and the predictions on the unseen data (right)

Exploring the relationship between language and stigma experience

To characterize the nature of stigma as manifested in social media, we consider the feature rankings associated with each stigma type, along with the instances of stigma in the test data. Figure 5 depicts the concepts from the handcrafted stigma lexicons that were among the highest-ranking features for each stigma type, along with synthetic examples. Among the posts associated with Internalized Stigma, we observed an abundance of affective content (shame, self-blame, and despair). Our examination of the test data further uncovers that posts containing shame and self-blame also often involved the poster using self-deprecating language (in the form of pejoratives) and labels to describe themselves, and express feelings of weakness and perceptions of failure.

Conceptual differentiation of stigma types. All examples are synthetic quotes that resemble the phenomena and sentiment observed in the data

For Anticipated and Enacted Stigma, emotion was still important, but social and behavioral features were also prominent (i.e., ANT_social, ENA_stigmatizing_actions). The ‘ANT_social’ lexicon includes possible members of a user’s social circle (e.g., ‘parents’, ‘partner’, ‘friend’). Since, by definition, Internalized Stigma is focused on the self, Anticipated Stigma is focused on one’s expectation of how they are perceived by others, and Enacted stigma, by stigmatizing behavior, these associations make intuitive sense. The social media data highlights additional features tied to Anticipated Stigma, such as secretive behavior, concern over how one is perceived, and a fear of disappointing others. Notably, the theme of concealment, especially from close relations like family members, partners, or employers, is prominent in the Anticipated Stigma texts (as exemplified in the examples 3–5 in Fig. 5 ).

Enacted Stigma often involved the use of labels to describe another person, and as seen in the final two examples of Fig. 5 , the usage of these terms can be descriptive (‘He is always drunk’) or may have judgmental motivations in their usage (‘down-and-out junkies’). Stigmatizing actions related to judging, disparaging, or confronting others figured prominently in terms of this type of stigma, and could involve many different pairs of stigmatizer and stigmatized persons (e.g., parent–child, child-parent, friends, partners, co-workers, and the poster feeling stigmatized by the public, people, or society at large). Features related to trust also ranked highly for Enacted Stigma, corresponding to previous stigma research which identified ‘untrustworthiness’ as a common stereotype espoused by user’s family members [ 24 ].

Other phenomena to consider were instances in which multiple stigma types were present. The third text in Fig. 5 exemplifies a common scenario for the pairing of Internalized Stigma and Anticipated Stigma, with posters expressing reticence to interact with others due to their own shame. Text segments containing all three stigma types were relatively rare in the annotated corpus (0.78% of all stigma-positive segments), though the fifth example in Fig. 5 illustrates an instance where an author appears to negatively judge persons experiencing SUDs, describe concealment of their own use, and express internal guilt for their use, all within a relatively brief sequence of text.

Similar to Straton et al. [ 33 ], we observed that the LIWC categories for emotional tone and clout showed fairly strong relationships with stigma; however, we observed a limited relation to stigma for the remaining 90 LIWC categories. The clout feature, derived from ratios of personal pronoun frequencies, is based on Kacewicz et al. [ 73 ], who found that high-status authors consistently used more 1st person plural (e.g., ‘we’, ‘our’) and 2nd person singular (‘you’) pronouns, whereas low-status authors were more frequently self-focused and used more 1st person singular pronouns (‘I’, ‘me’). This may explain the effectiveness of the clout feature for predicting Internalized Stigma (low clout scores appeared to be indicative of Internalized Stigma), which is heavily focused on inner experiences, with heavy use of 1st person pronouns. The LIWC emotional tone feature [ 74 ] calculates the difference between positive emotion word count and negative emotion word count, with higher scores indicating greater overall positivity. The generally negative emotional content of stigma-positive texts is a likely factor for the high ranking of the tone feature for all three stigma types.

Discussion and limitations

In this study, our objective was to investigate how the three different stigma mechanisms in the Stigma Framework manifest differently in terms of distribution and nature in a social media dataset. Through an analysis of feature rankings, the distribution of predictions, and specific instances of stigma in our data, we discerned distinct patterns across Internalized, Anticipated, and Enacted Stigma. Furthermore, we characterized the language used to convey and describe each of these three mechanisms.

In terms of the distributions of the three stigma mechanisms, we observed that Internalized Stigma was the most prevalent stigma type with respect to alcohol and cannabis. However, in the case of opioids, Anticipated Stigma was more frequent than Internalized Stigma. Though these patterns were only observed in a single dataset and further exploration of the presence of different stigma mechanisms in other data is needed, it is worthwhile to consider these findings in the context of the larger societal concern about opioid use. Extant literature emphasizes that great care must be used in crafting public health messaging concerning opioid addiction due to the potential for increased stigmatization of those who use opioids [ 75 ]. The social environment surrounding opioid use appears to lead to greater anticipation of stigma and a tendency to conceal behavior, compared to the environments surrounding cannabis and alcohol. Thus, it may be important to focus on the portrayal of opioid use, anonymous forms of support, and an emphasis on support for interpersonal interactions in the context of opioid use.

Additionally, our study considered the nature of language used to express stigma as it manifests in social media. This exploration not only confirms that language is a powerful vehicle for expressing stigma, as established in prior literature [ 2 ], but also illuminates the nuanced relationship between word usage and specific stigma types, and the pivotal roles of affect, social perceptions, personal interactions, and behavior in the expression of stigma, in social media. In the social media data, we found that Internalized Stigma is predominantly characterized by emotional content, with a focus on shame, self-blame, and despair. In contrast, Enacted Stigma and Anticipated involve a complex interplay of emotional, social, and behavioral features. The former encompasses stigmatizing behaviors and issues of trust, while the latter centers on expectations of external perceptions and the fear of disappointing others. For Anticipated Stigma, the feature analysis demonstrated that issues of concealment were prominent, along with the presence of close interpersonal relationships.

Insights from this study can serve as priorities in the design of stigma reduction interventions. For example, the high-ranking features from the Enacted Stigma lexicon include both stigmatizing actions such as confronting and blaming, as well as indicators of trust (e.g., expressed as disappointment, suspicion, or a lack of respect for privacy). In future intervention development, the integration of components addressing these core issues is critical.

Overall, our findings improve our understanding of stigma mechanisms in social media discourse and could also inform the development of targeted interventions that address the challenges of those affected by stigma. Furthermore, the adaptability of our lexicons to stigma research in other contexts, such as HIV/AIDS or disordered eating, where similar emotions, behaviors (e.g., hiding, concealment), and attitudinal constructs such as trust [ 24 , 76 ] are at play, hold promise for broader applications beyond substance use.

Limitations

Although the purposive sampling used in this study allowed us to develop a sufficient corpus of stigma-positive texts within a reasonable amount of time, our sampling method may also be viewed as one of its limitations. By sampling from a limited set of subreddits focused on substance use, we realize that our detection model may not generalize to other types of texts. Additionally, since keyword matching enrichment was used during the sampling process, the distribution of texts in our corpus differs from that of the substance recovery subreddits which they were sampled from. When making predictions on random samples, our models may have faced performance issues due to the increased imbalance between stigma-positive and stigma-negative texts.

To facilitate the aims of this research, we sought to identify stigma and accounts of stigma within social media narratives. In many of the possible instances of stigma that appear, the motivations behind the potentially stigmatizing actions are unclear or unstated. For posts containing sequences such as ‘my parents kicked me out of the house’, it may be difficult to determine whether the parents’ actions are motivated by stigma or by other factors. Causal ambiguity can lead our models to produce errors, and also lead to disagreement among our annotators. Collection and triangulation of data collected through other means, such as interview, survey, or diary data, could perhaps complement insights from social media.

In this study, we performed an examination of stigma surrounding substance use within the realm of social media. Our approach encompassed data collection, corpus annotation, and the development of binary classifiers tailored to detect three different stigma mechanisms. By synergizing contextual embeddings with count-based features, we achieved models that exhibited superior performance across all three stigma categories compared to RoBERTa-only baselines. Through a mixed-methods analysis of the model's predictions, we unraveled critical insights into the relations of word usage to the expression of different types of stigma. Affective, social, and behavioral features emerged as pivotal components in the expression of substance use stigma.

Our main contributions include: demonstrating a theory-based approach to extracting and comparing different types of stigma in a large social media dataset, and employing patterns in word usage to explore and characterize its manifestations. The insights from this study highlight the need to consider the impacts of stigma differently by mechanism (internalized, anticipated, and enacted), and enhance our current understandings of how the stigma mechanisms manifest within language in particular cognitive, emotional, social, and behavioral aspects. Moving forward, we envisage further analysis of stigma instances in our dataset to glean insights into how individuals navigate the challenges they encounter, informing the development of more effective stigma reduction strategies. Furthermore, the concepts encapsulated in our handcrafted lexicons hold promise for future stigma research in diverse contexts, extending the applicability of our findings beyond substance use disorders.

Availability of data and materials

Stigma datasets and models trained to detect stigma could potentially be used by bad actors to target vulnerable individuals. In order to reduce the risk of any potential harms to the authors of the sensitive posts examined in our research, we do not share our models or annotated dataset publicly.

National Research Council Canada.

Abbreviations

Substance use disorder

Multi-layer perceptron

Linguistic inquiry and word count

Term frequency-inverse document frequency

Bidirectional encoder representations from transformers

Robustly optimized bidirectional encoder representations from transformers

National research council Canada

Natural language toolkit

Wordnet-affect

Internalized stigma feature lexicon

Anticipated stigma feature lexicon

Enacted stigma feature lexicon

Kulesza M, Ramsey S, Brown R, Larimer M. Stigma among Individuals with Substance Use Disorders: Does it Predict Substance Use, and Does it Diminish with Treatment? J Addict Behav Ther Rehabil. 2014;3(1):1000115. https://doi.org/10.4172/2324-9005.1000115 .

Article CAS PubMed PubMed Central Google Scholar

Ashford RD, Brown AM, Ashford A, Curtis B. Recovery dialects: A pilot study of stigmatizing and nonstigmatizing label use by individuals in recovery from substance use disorders. Exp Clin Psychopharmacol. 2019;27(6):530–5. https://doi.org/10.1037/pha0000286 .

Article PubMed PubMed Central Google Scholar

Bozdağ N, Çuhadar D. Internalized stigma, self-efficacy and treatment motivation in patients with substance use disorders. J Subst Use. 2022;27(2):174–80. https://doi.org/10.1080/14659891.2021.1916846 .

Article Google Scholar

Hammarlund R, Crapanzano KA, Luce L, Mulligan L, Ward KM. Review of the effects of self-stigma and perceived social stigma on the treatment-seeking decisions of individuals with drug- and alcohol-use disorders. Subst Abuse Rehabil. 2018;9:115–36. https://doi.org/10.2147/SAR.S183256 .

Livingston JD, Milne T, Fang ML, Amari E. The effectiveness of interventions for reducing stigma related to substance use disorders: a systematic review. Addiction. 2012;107(1):39–50. https://doi.org/10.1111/j.1360-0443.2011.03601.x .

Ashford RD, Brown AM, Curtis B. Substance use, recovery, and linguistics: The impact of word choice on explicit and implicit bias. Drug Alcohol Depend. 2018;189:131–8. https://doi.org/10.1080/07347324.2019.1585216 .

Volkow ND, Gordon JA, Koob GF. Choosing appropriate language to reduce the stigma around mental illness and substance use disorders. Neuropsychopharmacol. 2021;46(13):2230–2. https://doi.org/10.1038/s41386-021-01069-4 .

Brown SA. Standardized measures for substance use stigma. Drug Alcohol Depend. 2011;116(1):137–41. https://doi.org/10.1016/j.drugalcdep.2010.12.005 .

Article PubMed Google Scholar

Kulesza M, Larimer ME, Rao D. Substance Use Related Stigma: What we Know and the Way Forward. J Addict Behav Ther Rehabil. 2013;2(2). https://doi.org/10.4172/2324-9005.1000106 .

Kulesza M, Watkins KE, Ober AJ, Osilla KC, Ewing B. Internalized stigma as an independent risk factor for substance use problems among primary care patients: Rationale and preliminary support. Drug Alcohol Depend. 2017;180:52–5. https://doi.org/10.1016/j.drugalcdep.2017.08.002 .

Smith LR, Earnshaw VA, Copenhaver MM, Cunningham CO. Substance use stigma: Reliability and validity of a theory-based scale for substance-using populations. Drug Alcohol Depend. 2016;162:34–43. https://doi.org/10.1016/j.drugalcdep.2016.02.019 .

Corrigan P, Schomerus G, Shuman V, Kraus D, Perlick D, Harnish A, et al. Developing a research agenda for understanding the stigma of addictions Part I: Lessons from the Mental Health Stigma Literature. Am J Addict. 2017;26(1):59–66. https://doi.org/10.1111/ajad.12458 .

Earnshaw VA, Chaudoir SR. From Conceptualizing to Measuring HIV Stigma: A Review of HIV Stigma Mechanism Measures. AIDS Behav. 2009;13(6):1160–77. https://doi.org/10.1007/s10461-009-9593-3 .

Li A, Jiao D, Zhu T. Detecting depression stigma on social media: A linguistic analysis. J Affect Disord. 2018;232:358–62. https://doi.org/10.1016/j.jad.2018.02.087 .

Article CAS PubMed Google Scholar

Li A, Jiao D, Liu X, Zhu T. A Comparison of the Psycholinguistic Styles of Schizophrenia-Related Stigma and Depression-Related Stigma on Social Media: Content Analysis. J Med Internet Res. 2020;22(4): e16470. https://doi.org/10.2196/16470 .

Clark O, Lee MM, Jingree ML, O’Dwyer E, Yue Y, Marrero A, et al. Weight Stigma and Social Media: Evidence and Public Health Solutions. Front Nutr. 2021;8:739056.

Dredze M. How Social Media Will Change Public Health. IEEE Intell Syst. 2012;27(4):81–4. https://doi.org/10.1109/MIS.2012.76 .

Goffman E. Stigma: Notes on the management of spoiled identity. New York: Simon and Schuster; 2009. p. 52, 65.

Link BG, Phelan JC. Conceptualizing Stigma. Annu Rev Sociol. 2001;27(1):363–85. https://doi.org/10.1146/annurev.soc.27.1.363 .

Parker R, Aggleton P. HIV and AIDS-related stigma and discrimination: a conceptual framework and implications for action. Soc Sci Med. 2003;57(1):13–24. https://doi.org/10.1016/S0277-9536(02)00304-0 .

Brewer MB, Brown RJ. Intergroup relations. In: Gilbert DT, Fiske ST, Lindzey G, editors. The handbook of social psychology. 4. New York: Oxford University Press; 1998.

Google Scholar

Meyer IH. Prejudice, Social Stress, and Mental Health in Lesbian, Gay, and Bisexual Populations: Conceptual Issues and Research Evidence. Psychol Bull. 2003;129(5):674–97. https://doi.org/10.1037/0033-2909.129.5.674 .

Phelan J, Link BG, Dovidio JF. Stigma and Prejudice: One Animal or Two? Soc Sci Med. 2008;67(3):358–67. https://doi.org/10.1016/j.socscimed.2008.03.022 .

Earnshaw V, Smith L, Copenhaver M. Drug Addiction Stigma in the Context of Methadone Maintenance Therapy: An Investigation into Understudied Sources of Stigma. Int J Ment Health Addict. 2013;11(1):110–22. https://doi.org/10.1007/s11469-012-9402-5 .

Quinn DM, Earnshaw VA. Concealable Stigmatized Identities and Psychological Well-Being. Soc Personal Psychol Compass. 2013;7(1):40–51 ( https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3664915/ ).

Crapanzano KA, Hammarlund R, Ahmad B, Hunsinger N, Kullar R. The association between perceived stigma and substance use disorder treatment outcomes: a review. Subst Abuse Rehabil. 2018;10:1–12. https://doi.org/10.2147/SAR.S183252 .

Schmidt A, Wiegand M. A Survey on Hate Speech Detection using Natural Language Processing. In: Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media. Valencia, Spain: Association for Computational Linguistics; 2017. p. 1–10. https://doi.org/10.18653/v1/W17-1101 .

Chapter Google Scholar

Yin W, Zubiaga A. Towards generalisable hate speech detection: a review on obstacles and solutions. PeerJ Comput Sci. 2021;7:e598. https://doi.org/10.7717/peerj-cs.598 .

Nockleby JT, Levy LW, Karst KL, Mahoney DJ. Encyclopedia of the American constitution. Detroit, MI: Macmillan Reference; 2000. p. 1277–9.

Allport GW, Clark K, Pettigrew T. The Nature of Prejudice. 1954.

Lee MH, Kyung R. Mental Health Stigma and Natural Language Processing: Two Enigmas Through the Lens of a Limited Corpus. In: 2022 IEEE World AI IoT Congress (AIIoT). 2022. p. 688–91. https://doi.org/10.1109/AIIoT54504.2022.9817362 .

Gottipati S, Chong M, Kiat A, Kawidiredjo B. Exploring Media Portrayals of People with Mental Disorders using NLP: In: Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies. p. 708–15. https://doi.org/10.5220/0010380007080715 .

Straton N, Jang H, Ng R. Stigma Annotation Scheme and Stigmatized Language Detection in Health-Care Discussions on Social Media. In: Proceedings of the 12th Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association; 2020. p. 1178–90 https://aclanthology.org/2020.lrec-1.148 .

Oscar N, Fox PA, Croucher R, Wernick R, Keune J, Hooker K. Machine Learning, Sentiment Analysis, and Tweets: An Examination of Alzheimer’s Disease Stigma on Twitter. J Gerontol B. 2017;72(5):742–51. https://doi.org/10.1093/geronb/gbx014 .

Jilka S, Odoi CM, van Bilsen J, Morris D, Erturk S, Cummins N, et al. Identifying schizophrenia stigma on Twitter: a proof of principle model using service user supervised machine learning. Schizophr. 2022;8(1):1–8. https://doi.org/10.1038/s41537-021-00197-6 .

Pollack CC, Emond JA, O’Malley AJ, Byrd A, Green P, Miller KE, et al. Characterizing the Prevalence of Obesity Misinformation, Factual Content, Stigma, and Positivity on the Social Media Platform Reddit Between 2011 and 2019: Infodemiology Study. Journal of Medical Internet Research. 2022;24(12):e36729. Available from: https://www.jmir.org/2022/12/e36729 .

Pennebaker JW, Boyd RL, Jordan K, Blackburn K. The Development and Psychometric Properties of LIWC2015. 2015. https://repositories.lib.utexas.edu/handle/2152/31333 .

Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. p. 785–94. Available from: http://arxiv.org/abs/1603.02754 . Accessed 14 Jan 2022.

Devlin J, Chang MW, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv:181004805. 2019. http://arxiv.org/abs/1810.04805 . Accessed 11 May 2020.

Chen AT, Johnny S, Conway M. Examining stigma relating to substance use and contextual factors in social media discussions. Drug and Alcohol Dependence Reports. 2022;3:100061. https://doi.org/10.1016/j.dadr.2022.100061 .

Hatzenbuehler ML, Nolen-Hoeksema S, Dovidio J. How Does Stigma “Get Under the Skin”?: The Mediating Role of Emotion Regulation. Psychol Sci. 2009;20(10):1282–9. https://doi.org/10.1111/j.1467-9280.2009.02441.x .

Wang K, Burton CL, Pachankis JE. Depression and Substance Use: Towards the Development of an Emotion Regulation Model of Stigma Coping. Subst Use Misuse. 2018;53(5):859–66. https://doi.org/10.1080/10826084.2017.1391011 .

Baumgartner J, Zannettou S, Keegan B, Squire M, Blackburn J. The Pushshift Reddit Dataset. Proc Int AAAI Conf Web Soc Media. 2020;14:830–9.

MacLean D, Gupta S, Lembke A, Manning C, Heer J. Forum77: An Analysis of an Online Health Forum Dedicated to Addiction Recovery. In: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing. New York, NY, USA: Association for Computing Machinery; 2015. p. 1511–26. (CSCW ’15). https://doi.org/10.1007/BF02295996 .

Benson R, Hu M, Chen AT, Zhu SH, Conway M. Examining Cannabis, Tobacco, and Vaping Discourse on Reddit: An Exploratory Approach Using Natural Language Processing. Front Public Health. 2022. Available from: https://www.frontiersin.org/article/10.3389/fpubh.2021.738513 .

Palamar JJ, Kiang MV, Halkitis PN. Development and Psychometric Evaluation of Scales that Assess Stigma Associated With Illicit Drug Users. Subst Use Misuse. 2011;46(12):1457–67. https://doi.org/10.3109/10826084.2011.596606 .

Cohen J. A Coefficient of Agreement for Nominal Scales. Educ Psychol Measur. 1960;20(1):37–46. https://doi.org/10.1177/001316446002000104 .

Viera AJ, Garrett JM. Understanding interobserver agreement: the kappa statistic. Fam Med. 2005;37(5):360–3.

PubMed Google Scholar

Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv. 2019. https://doi.org/10.48550/arXiv.1907.11692 .

He H, Garcia EA. Learning from Imbalanced Data. IEEE Trans Knowl Data Eng. 2009;21(9):1263–84. https://doi.org/10.1109/TKDE.2008.239 .

Haixiang G, Yijing L, Shang J, Mingyun G, Yuanyue H, Bing G. Learning from class-imbalanced data: Review of methods and applications. Expert Syst Appl. 2017;73:220–39. https://doi.org/10.1016/j.eswa.2016.12.035 .

Bird S, Klein E, Loper E. Natural language processing with Python: analyzing text with the natural language toolkit. O’Reilly Media, Inc. 2009.

Prakash A, Tayyar Madabushi H. Incorporating Count-Based Features into Pre-Trained Models for Improved Stance Detection. In: Proceedings of the 3rd NLP4IF Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda. Barcelona, Spain: International Committee on Computational Linguistics (ICCL); 2020. p. 22–32 https://aclanthology.org/2020.nlp4if-1.3 .

Mohammad S. Word Affect Intensities. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan: European Language Resources Association (ELRA); 2018. https://aclanthology.org/L18-1027 .

Strapparava C, Valitutti A. WordNet-Affect: An Affective Extension of WordNet. In: Proceedings of the 4th International Conference on Language Resources and Evaluation (LREC 2004). Lisbon: European Language Resources Association (ELRA); 2004. p. 1083–6.

Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine Learning in Python. J Mach Learn Res. 2011;12:2825–30.

Babanejad N, Davoudi H, An A, Papagelis M. Affective and Contextual Embedding for Sarcasm Detection. In: Proceedings of the 28th International Conference on Computational Linguistics. Barcelona, Spain: International Committee on Computational Linguistics; 2020. p. 225–43. https://doi.org/10.18653/v1/2020.coling-main.20 .

Miller GA. WordNet: a lexical database for English. Commun ACM. 1995;38(11):39–41. https://doi.org/10.1145/219717.219748 .

Brown-Johnson CG, Cataldo PhD JK, Orozco N, Lisha NE, Hickman N, Prochaska JJ. Validity and Reliability of the Internalized Stigma of Smoking Inventory: An Exploration of Shame, Isolation, and Discrimination in Smokers with Mental Health Diagnoses. Am J Addict. 2015;24(5):410–8. https://doi.org/10.1111/ajad.12215 .

Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In: Wallach H, Larochelle H, Beygelzimer A, Alché-Buc F d’, Fox E, Garnett R, editors. Advances in Neural Information Processing Systems 32. Curran Associates, Inc.; 2019. p. 8024–35. http://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf .

Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, et al. HuggingFace’s Transformers: State-of-the-art Natural Language Processing. 2019. https://doi.org/10.48550/arXiv.1910.03771 .

Book Google Scholar

McNemar Q. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika. 1947;12(2):153–7. https://doi.org/10.1007/BF02295996 .

Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J Roy Stat Soc: Ser B (Methodol). 1995;57(1):289–300.

Dror R, Baumer G, Shlomov S, Reichart R. The Hitchhiker’s Guide to Testing Statistical Significance in Natural Language Processing. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Melbourne, Australia: Association for Computational Linguistics; 2018. p. 1383–92. https://doi.org/10.18653/v1/P18-1128 .

Creswell JW, Clark VLP. Designing and Conducting Mixed Methods Research. SAGE Publications; 2017.

Doyle L, Brady AM, Byrne G. An overview of mixed methods research. J Res Nurs. 2009;14(2):175–85. https://doi.org/10.1177/1744987108093962 .

Agresti A. Categorical data analysis, Second Edition. New York: Wiley; 2002.

Sharpe D. Chi-square test is statistically significant: Now what? Pract Assess Res Eval. 2015;20(1):8.

Moreno MA, Goniu N, Moreno PS, Diekema D. Ethics of Social Media Research: Common Concerns and Practical Considerations. Cyberpsychol Behav Soc Netw. 2013;16(9):708–13. https://doi.org/10.1089/cyber.2012.0334 .

Galinsky AD, Wang CS, Whitson JA, Anicich EM, Hugenberg K, Bodenhausen GV. The Reappropriation of Stigmatizing Labels: The Reciprocal Relationship Between Power and Self-Labeling. Psychol Sci. 2013;24(10):2020–9. https://doi.org/10.1177/0956797613482943 .

Jawahar G, Sagot B, Seddah D. What Does BERT Learn about the Structure of Language? In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy: Association for Computational Linguistics; 2019. p. 3651–7. https://doi.org/10.18653/v1/P19-1356 .

Khetan V, Ramnani R, Anand M, Sengupta S, Fano AE. Causal BERT: Language Models for Causality Detection Between Events Expressed in Text. In: Arai K, editor. Intelligent Computing. Cham: Springer International Publishing; 2022. p. 965–80. (Lecture Notes in Networks and Systems). https://doi.org/10.1007/978-3-030-80119-9_64 .

Kacewicz E, Pennebaker JW, Davis M, Jeon M, Graesser AC. Pronoun Use Reflects Standings in Social Hierarchies. J Lang Soc Psychol. 2014;33(2):125–43. https://doi.org/10.1177/0261927X13502654 .

Cohn MA, Mehl MR, Pennebaker JW. Linguistic Markers of Psychological Change Surrounding September 11, 2001. Psychol Sci. 2004;15(10):687–93. https://doi.org/10.1111/j.0956-7976.2004.00741.x .

Moore MD, Ali S, Burnich-Line D, Gonzales W, Stanton MV. Stigma, Opioids, and Public Health Messaging: The Need to Disentangle Behavior From Identity. Am J Public Health. 2020;110(6):807–10. https://doi.org/10.2105/AJPH.2020.305628 .

O’Connor C, McNamara N, O’Hara L, McNicholas M, McNicholas F. How do people with eating disorders experience the stigma associated with their condition? A mixed-methods systematic review. J Ment Health. 2021;30(4):454–69. https://doi.org/10.1080/09638237.2019.1685081 .

Download references

Acknowledgements

Not applicable.

Research reported in this publication was supported by the National Institute On Drug Abuse of the National Institutes of Health under Award Number R21DA056684. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

Authors and affiliations.

University of Washington School of Medicine, Biomedical Informatics and Medical Education, 850 Republican St., Box 358047, Seattle, WA, 98109, USA

David Roesler & Annie T. Chen

School of Nursing, University of Washington, Box 357260, Seattle, WA, 98195, USA

Shana Johnny

School of Computing and Information Systems, University of Melbourne, Parkville, VIC, 3052, Australia

Mike Conway

You can also search for this author in PubMed Google Scholar

Contributions

ATC and DR conceptualized the study. All authors performed data curation, and DR and ATC performed data analysis. DR drafted the initial manuscript and iteratively revised with ATC. All authors reviewed and approved the final manuscript.

Corresponding authors

Correspondence to Mike Conway or Annie T. Chen .

Ethics declarations

Ethics approval and consent to participate.

The work performed in this study was determined as non-human subjects research by the Human Subjects Division at the University of Washington (STUDY00015737), and approved by the Office of Research Ethics and Integrity of the University of Melbourne (reference no. 2022–25512-34338–4). All methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

Competing interests.

The authors declare no competing interests.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1..

A detailed description of our annotation guidelines.

Additional file 2.

A complete list of keywords included in each of the handcrafted stigma lexicons.

Additional file 3.

Results of chi-square tests examining the distribution of stigma labels for each substance.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Roesler, D., Johnny, S., Conway, M. et al. A theory-informed deep learning approach to extracting and characterizing substance use-related stigma in social media. BMC Digit Health 2 , 60 (2024). https://doi.org/10.1186/s44247-024-00065-0

Download citation

Received : 19 May 2023

Accepted : 26 January 2024

Published : 16 August 2024

DOI : https://doi.org/10.1186/s44247-024-00065-0

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Substance use
Social media
Machine learning
Information extraction

BMC Digital Health

ISSN: 2731-684X

Submission enquiries: [email protected]
General enquiries: [email protected]

Log in using your username and password

Search More Search for this keyword Advanced search
Latest content
Current issue
BMJ Journals

Rebecca Payne 1 ,
Aileen Clarke 1 ,
Nadia Swann 1 ,
Jackie van Dael 1 ,
Natassia Brenman 1 ,
Rebecca Rosen 2 ,
Adam Mackridge 3 ,
Lucy Moore 1 ,
Asli Kalin 1 ,
Emma Ladds 1 ,
Nina Hemmings 2 ,
Sarah Rybczynska-Bunt 4 ,
Stuart Faulkner 1 ,
Isabel Hanson 1 ,
Sophie Spitters 5 ,
http://orcid.org/0000-0002-7758-8493 Sietse Wieringa 1 , 6 ,
Francesca H Dakin 1 ,
Sara E Shaw 1 ,
Joseph Wherton 1 ,
Richard Byng 4 ,
Laiba Husain 1 ,
http://orcid.org/0000-0003-2369-8088 Trisha Greenhalgh 1
1 Nuffield Department of Primary Care Health Sciences , University of Oxford , Oxford , UK
2 Nuffield Trust , London , UK
3 Betsi Cadwaladr University Health Board , Bangor , UK
4 Peninsula Schools of Medicine and Dentistry , University of Plymouth , Plymouth , UK
5 Wolfson Institute of Population Health , Queen Mary University of London , London , UK
6 Sustainable Health Unit , University of Oslo , Oslo , Norway
Correspondence to Professor Trisha Greenhalgh; trish.greenhalgh{at}phc.ox.ac.uk

Background Triage and clinical consultations increasingly occur remotely. We aimed to learn why safety incidents occur in remote encounters and how to prevent them.

Setting and sample UK primary care. 95 safety incidents (complaints, settled indemnity claims and reports) involving remote interactions. Separately, 12 general practices followed 2021–2023.

Methods Multimethod qualitative study. We explored causes of real safety incidents retrospectively (‘Safety I’ analysis). In a prospective longitudinal study, we used interviews and ethnographic observation to produce individual, organisational and system-level explanations for why safety and near-miss incidents (rarely) occurred and why they did not occur more often (‘Safety II’ analysis). Data were analysed thematically. An interpretive synthesis of why safety incidents occur, and why they do not occur more often, was refined following member checking with safety experts and lived experience experts.

Results Safety incidents were characterised by inappropriate modality, poor rapport building, inadequate information gathering, limited clinical assessment, inappropriate pathway (eg, wrong algorithm) and inadequate attention to social circumstances. These resulted in missed, inaccurate or delayed diagnoses, underestimation of severity or urgency, delayed referral, incorrect or delayed treatment, poor safety netting and inadequate follow-up. Patients with complex pre-existing conditions, cardiac or abdominal emergencies, vague or generalised symptoms, safeguarding issues, failure to respond to previous treatment or difficulty communicating seemed especially vulnerable. General practices were facing resource constraints, understaffing and high demand. Triage and care pathways were complex, hard to navigate and involved multiple staff. In this context, patient safety often depended on individual staff taking initiative, speaking up or personalising solutions.

Conclusion While safety incidents are extremely rare in remote primary care, deaths and serious harms have resulted. We offer suggestions for patient, staff and system-level mitigations.

Primary care
Diagnostic errors
Safety culture
Qualitative research
Prehospital care

Data availability statement

Data are available upon reasonable request. Details of real safety incidents are not available for patient confidentiality reasons. Requests for data on other aspects of the study from other researchers will be considered.

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/ .

https://doi.org/10.1136/bmjqs-2023-016674

Statistics from Altmetric.com

Request permissions.

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

WHAT IS ALREADY KNOWN ON THIS TOPIC

Safety incidents are extremely rare in primary care but they do happen. Concerns have been raised about the safety of remote triage and remote consultations.

WHAT THIS STUDY ADDS

Rare safety incidents (involving death or serious harm) in remote encounters can be traced back to various clinical, communicative, technical and logistical causes. Telephone and video encounters in general practice are occurring in a high-risk (extremely busy and sometimes understaffed) context in which remote workflows may not be optimised. Front-line staff use creativity and judgement to help make care safer.

HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

As remote modalities become mainstreamed in primary care, staff should be trained in the upstream causes of safety incidents and how they can be mitigated. The subtle and creative ways in which front-line staff already contribute to safety culture should be recognised and supported.

Introduction

In early 2020, remote triage and remote consultations (together, ‘remote encounters’), in which the patient is in a different physical location from the clinician or support staff member, were rapidly expanded as a safety measure in many countries because they eliminated the risk of transmitting COVID-19. 1–4 But by mid-2021, remote encounters had begun to be depicted as potentially unsafe because they had come to be associated with stories of patient harm, including avoidable deaths and missed cancers. 5–8

Providing triage and clinical care remotely is sometimes depicted as a partial solution to the system pressures facing primary healthcare in many countries, 9–11 including rising levels of need or demand, the ongoing impact of the COVID-19 pandemic and workforce challenges (especially short-term or longer-term understaffing). In this context, remote encounters may be an important component of a mixed-modality health service when used appropriately alongside in-person contacts. 12 13 But this begs the question of what ‘appropriate’ and ‘safe’ use of remote modalities in a primary care context is. Safety incidents (defined as ‘any unintended or unexpected incident which could have, or did, lead to harm for one or more patients receiving healthcare 14 ’) are extremely rare in primary healthcare consultations generally, 15 16 in-hours general practice telephone triage 17 and out-of-hours primary care. 18 But the recent widespread expansion of remote triage and remote consulting in primary care means that a wider range of patients and conditions are managed remotely, making it imperative to re-examine where the risks lie.

Theoretical approaches to safety in healthcare fall broadly into two traditions. 19 ‘Safety I’ studies focus on what went wrong. Incident reports are analysed to identify ‘root causes’ and ‘safety gaps’, and recommendations are made to reduce the chance that further similar incidents will happen in the future. 20 Such studies, undertaken in isolation, tend to lead to a tightening of rules, procedures and protocols. ‘Safety II’ studies focus on why, most of the time, things do not go wrong. Ethnography and other qualitative methods are employed to study how humans respond creatively to unique and unforeseen situations, thereby preventing safety incidents most of the time. 19 Such studies tend to show that actions which achieve safety are highly context specific, may entail judiciously breaking the rules and require human qualities such as courage, initiative and adaptability. 21 Few previous studies have combined both approaches.

In this study, we aimed to use Safety I methods to learn why safety incidents occur (although rarely) in remote primary care encounters and also apply Safety II methods to examine the kinds of creative actions taken by front-line staff that contribute to a safety culture and thereby prevent such incidents.

Study design and origins

Multimethod qualitative study across UK, including incident analysis, longitudinal ethnography and national stakeholder interviews.

The idea for this safety study began during a longitudinal ethnographic study of 12 general practices across England, Scotland and Wales as they introduced (and, in some cases, subsequently withdrew) various remote and digital modalities. Practices were selected for maximum diversity in geographical location, population served and digital maturity and followed from mid-2021 to end 2023 using staff and patient interviews and in-person ethnographic visits. The study protocol, 22 baseline findings 23 and a training needs analysis 24 have been published. To provide context for our ethnography, we interviewed a sample of national stakeholders in remote and digital primary care, including out-of-hours providers running telephone-led services, and held four online multistakeholder workshops, one of which was on the theme of safety, for policymakers, clinicians, patients and other parties. Early data from this detailed qualitative work revealed staff and patient concerns about the safety of remote encounters but no actual examples of harm.

To explore the safety theme further, we decided to take a dual approach. First, following Safety I methodology for the study of rare harms, 20 we set out to identify and analyse a sample of safety incidents involving remote encounters. These were sourced from arm’s-length bodies (NHS England, NHS Resolution, Healthcare Safety Investigation Branch) and providers of healthcare at scale (health boards, integrated care systems and telephone advice services), since our own small sample had not identified any of these rare occurrences. Second, we extended our longitudinal ethnographic design to more explicitly incorporate Safety II methodology, 19 allowing us to examine safety culture and safety practices in our 12 participating general practices, especially the adaptive work done by staff to avert potential safety incidents.

Data sources and management

Table 1 summarises the data sources.

View inline

Summary of data sources

The Safety I dataset (rows 2-5) consisted of 95 specific incident reports, including complaints submitted to the main arm’s-length NHS body in England, NHS England, between 2020 and 2023 (n=69), closed indemnity claims that had been submitted to a national indemnity body, NHS Resolution, between 2015 and 2023 (n=16), reports from an urgent care telephone service in Wales (NHS 111 Wales) between 2020 and 2023 (n=6) and a report on an investigation of telephone advice during the COVID-19 crisis between 2020 and 2022 7 (n=4). These 95 incidents were organised using Microsoft Excel spreadsheets.

The Safety II dataset (rows 6-10) consisted of extracts from fieldnotes, workshop transcripts and interviews collected over 2 years, stored and coded on NVivo qualitative software. These were identified by searching for text words and codes (e.g. ‘risk’, ‘safety’, ‘incident’) and by asking researchers-in-residence, who were closely familiar with practices, to highlight safety incidents involving harm and examples of safety-conscious work practices. This dataset included over 100 formal interviews and numerous on-the-job interviews with practice staff, plus interviews with a sample of 10 GP (general practitioner) trainers and 10 GP trainees (penultimate row of table 1 ) and with six clinical safety experts identified through purposive sampling from government, arm’s-length bodies and health boards (bottom row of table 1 ).

Data analysis

We analysed incident reports, interview data and ethnographic fieldnotes using thematic analysis as described by Braun and Clarke. 25 These authors define a theme as an important, broad pattern in a set of qualitative data, which can (where necessary) be further refined using coding.

Themes in the incident dataset were identified by five steps. First, two researchers (both medically qualified) read each source repeatedly to gain familiarity. Second, those researchers worked independently using Braun and Clarke’s criterion (‘whether it captures something important in relation to the overall research question’—p 82 25 ) to identify themes. Third, they discussed their initial interpretations with each other and resolved differences through discussion. Fourth, they extracted evidence from the data sources to illustrate and refine each theme. Finally, they presented their list of themes along with illustrative examples to the wider team. Cases used to illustrate themes were systematically fictionalised by changing age, randomly allocating gender and altering clinical details. 26 For example, an acute appendicitis could be changed to acute diverticulitis if the issue was a missed acute abdomen.

These safety themes were then used to sensitise us to seek relevant (confirming and disconfirming) material from our ethnographic and interview datasets. For example, the theme ‘poor communication’ (and subthemes such as ‘failure to seek further clarification’ within this) promoted us to look for examples in our stakeholder interviews of poor communication offered as a cause of safety incidents and examples in our ethnographic notes of good communication (including someone seeking clarification). We used these wider data to add nuance to the initial list of themes.

As a final sense-checking step, the draft findings from this study were shown to each of the six safety experts in our sample and refined in the light of their comments (in some cases, for example, they considered the case to have been overfictionalised, thereby losing key clinical messages; they also gave additional examples to illustrate some of the themes we had identified, which underlined the importance of those themes).

Overview of dataset

The dataset ( table 1 ) consisted of 95 incident reports (see fictionalised examples in box 1 ), plus approximately 400 pages of extracts from interviews, ethnographic fieldnotes and workshop discussions, including situated safety practices (see examples in box 2 ), plus strategic insights relating to policy, organisation and planning of services. Notably, almost all incidents related to telephone calls.

Examples of safety incidents involving death or serious harm in remote encounters

All these cases have been systematically fictionalised as explained in the text.

Case 1 (death)

A woman in her 70s experiencing sudden breathlessness called her GP (general practitioner) surgery. The receptionist answered the phone and informed her that she would place her on the doctor’s list for an emergency call-back. The receptionist was distracted by a patient in the waiting room and did not do so. The patient deteriorated and died at home that afternoon.—NHS Resolution case, pre-2020

Case 2 (death)

An elderly woman contacted her GP after a telephone contact with the out-of-hours service, where constipation had been diagnosed. The GP prescribed laxatives without seeing the patient. The patient self-presented to the emergency department (ED) the following day in obstruction secondary to an incarcerated hernia and died in the operating theatre.—NHS Resolution case, pre-2020

Case 3 (risk to vulnerable patients)

A daughter complained that her elderly father was unable to access his GP surgery as he could not navigate the online triage system. When he phoned the surgery directly, he was directed back to the online system and told to get a relative to complete the form for him.—Complaint to NHS England, 2021

Case 4 (harm)

A woman in her first pregnancy at 28 weeks’ gestation experiencing urinary incontinence called NHS 111. She was taken down by a ‘urinary problems’ algorithm. Both the call handler and the subsequent clinician failed to recognise that she had experienced premature rupture of membranes. She later presented to the maternity department in active labour, and the opportunity to give early steroids to the premature infant was missed.—NHS Resolution case, pre-2020

Case 5 (death)

A doctor called about a 16-year-old girl with lethargy, shaking, fever and poor oral intake who had been unwell for 5 days. The doctor spoke to her older sister and advised that the child had likely glandular fever and should rest. When the parents arrived home, they called an ambulance but the child died of sepsis in the ED.—NHS Resolution case, pre-2020

Case 6 (death)

A 40-year-old woman, 6 weeks after caesarean section, contacted her GP due to shortness of breath, increased heart rate and dry cough. She was advised to get a COVID test and to dial 111 if she developed a productive cough, fever or pain. The following day she collapsed and died at home. The postmortem revealed a large pulmonary embolus. On reviewing the case, her GP surgery felt that had she been seen face to face, her oxygen saturations would have been measured and may have led to suspicion of the diagnosis.—NHS Resolution case, 2020

Case 7 (death)

A son complained that his father with diabetes and chronic kidney disease did not receive any in-person appointments over a period of 1 year. His father went on to die following a leg amputation arising from a complication of his diabetes.—Complaint to NHS England, 2021

Case 8 (death)

A 73-year-old diabetic woman with throat pain and fatigue called the surgery. She was diagnosed with a viral illness and given self-care advice. Over the next few days, she developed worsening breathlessness and was advised to do a COVID test and was given a pulse oximeter. She was found dead at home 4 days later. Postmortem found a blocked coronary artery and a large amount of pulmonary oedema. The cause of death was myocardial infarction and heart failure.—NHS Resolution case, pre-2020

Case 9 (harm)

A patient with a history of successfully treated cervical cancer developed vaginal bleeding. A diagnosis of fibroids was made and the patient received routine care by telephone over the next few months until a scan revealed a local recurrence of the original cancer.—Complaint to NHS England, 2020

Case 10 (death)

A 65-year-old female smoker with chronic cough and breathlessness presented to her GP. She was diagnosed with chronic obstructive pulmonary disease (COPD) and monitored via telephone. She did not respond to inhalers or antibiotics but continued to receive telephone monitoring without further investigation. Her symptoms continued to worsen and she called an ambulance. In the ED, she was diagnosed with heart failure and died soon after.—Complaint to NHS England, 2021

Case 11 (harm)

A 30-year-old woman presented with intermittent episodes of severe dysuria over a period of 2 years. She was given repeated courses of antibiotics but no urine was sent for culture and she was not examined. After 4 months of symptoms, she saw a private GP and was diagnosed with genital herpes.—Complaint to NHS England, 2021

Case 12 (harm)

There were repeated telephone consultations about a baby whose parents were concerned that the child was having a funny colour when feeding or crying. The 6-week check was done by telephone and at no stage was the child seen in person. Photos were sent in, but the child’s dark skin colour meant that cyanosis was not easily apparent to the reviewing clinician. The child was subsequently admitted by emergency ambulance where a significant congenital cardiac abnormality was found.—Complaint to NHS England, 2020 1

Case 13 (harm)

A 35-year-old woman in her third trimester of pregnancy had a telephone appointment with her GP about a breast lump. She was informed that this was likely due to antenatal breast changes and was not offered an in-person appointment. She attended after delivery and was referred to a breast clinic where a cancer was diagnosed.—Complaint to NHS England, 2020

Case 14 (harm)

A 63-year-old woman with a variety of physical symptoms including diarrhoea, hip girdle pain, palpitations, light-headedness and insomnia called her surgery on multiple occasions. She was told her symptoms were likely due to anxiety, but was diagnosed with stage 4 ovarian cancer and died soon after.—Complaint to NHS England, 2021

Case 15 (death)

A man with COPD with worsening shortness of breath called his GP surgery. The staff asked him if it was an emergency, and when the patient said no, scheduled him for 2 weeks later. The patient died before the appointment.—Complaint to NHS England, 2021

Examples of safety practices

Case 16 (safety incident averted by switching to video call for a sick child)

‘I’ve remembered one father that called up. Really didn’t seem to be too concerned. And was very much under-playing it and then when I did a video call, you know this child… had intercostal recession… looked really, really poorly. And it was quite scary actually that, you know, you’d had the conversation and if you’d just listened to what Dad was saying, actually, you probably wouldn’t be concerned.’—GP (general practitioner) interview 2022

Case 17 (‘red flag’ spotted by support staff member)

A receptionist was processing routine ‘administrative’ encounters sent in by patients using AccuRx (text messaging software). She became concerned about a sick note renewal request from a patient with a mental health condition. The free text included a reference to feeling suicidal, so the receptionist moved the request to the ‘red’ (urgent call-back) list. In interviews with staff, it became apparent that there had recently been heated discussion in the practice about whether support staff were adding ‘too many’ patients to the red list. After discussing cases, the doctors concluded that it should be them, not the support staff, who should absorb the risk in uncertain cases. The receptionist said that they had been told: ‘if in doubt, put it down as urgent and then the duty doctor can make a decision.’—Ethnographic fieldnotes from general practice 2023

Case 18 (‘check-in’ phone call added on busy day)

A duty doctor was working through a very busy Monday morning ‘urgent’ list. One patient had acute abdominal pain, which would normally have triggered an in-person appointment, but there were no slots and hard decisions were being made. This patient had had the pain already for a week, so the doctor judged that the general rule of in-person examination could probably be over-ridden. But instead of simply allocating to a call-back, the doctor asked a support staff member to phone the patient, ask ‘are you OK to wait until tomorrow?’ and offer basic safety-netting advice.—Ethnographic fieldnotes from general practice 2023

Case 19 (receptionist advocating on behalf of ‘angry’ walk-in patient)

A young Afghan man with limited English walked into a GP surgery on a very busy day, ignoring the prevailing policy of ‘total triage’ (make contact by phone or online in the first instance). He indicated that he wanted a same-day in-person appointment for a problem he perceived as urgent. A heated exchange occurred with the first receptionist, and the patient accused her of ‘racism’. A second receptionist of non-white ethnicity herself noted the man’s distress and suspected that there may indeed be an urgent problem. She asked the first receptionist to leave the scene, saying she wanted to ‘have a chat’ with the patient (‘the colour of my skin probably calmed him down more than anything’). Through talking to the patient and looking through his record, she ascertained that he had an acute infection that likely needed prompt attention. She tried to ‘bend the rules’ and persuade the duty doctor to see the patient, conveying the clinical information but deliberately omitting the altercation. But the first receptionist complained to the doctor (‘he called us racists’) and the doctor decided that the patient would not therefore be offered a same-day appointment. The second receptionist challenged the doctor (‘that’s not a reason to block him from getting care’). At this point, the patient cried and the second receptionist also became upset (‘this must be serious, you know’). On this occasion, despite her advocacy the patient was not given an immediate appointment.—Ethnographic fieldnotes from general practice 2022

Case 20 (long-term condition nurse visits ‘unengaged’ patients at home)

An advanced nurse practitioner talks of two older patients, each with a long-term condition, who are ‘unengaged’ and lacking a telephone. In this practice, all long-term condition reviews are routinely done by phone. She reflects that some people ‘choose not to have avenues of communication’ (ie, are deliberately not contactable), and that there may be reasons for this (‘maybe health anxiety or just old’). She has, on occasion, ‘turned up’ unannounced at the patient’s home and asked to come in and do the review, including bloods and other tests. She reflects that while most patients engage well with the service, ‘half my job is these patients who don’t engage very well.’—Ethnographic fieldnotes from digitally advanced general practice 2022

Case 21 (doctor over-riding patient’s request for telephone prescribing)

A GP trainee described a case of a 53-year-old first-generation immigrant from Pakistan, a known smoker with hypertension and diabetes. He had booked a telephone call for vomiting and sinus pain. There was no interpreter available but the man spoke some English. He said he had awoken in the night with pain in his sinuses and vomiting. All he wanted was painkillers for his sinuses. The story did not quite make sense, and the man ‘sounded unwell’. The GP told him he needed to come in and be examined. The patient initially resisted but was persuaded to come in. When the GP went to call him in, the man was visibly unwell and lying down in the waiting room. When seen in person, he admitted to shoulder pain. The GP sent him to accident and emergency (A&E) where a myocardial infarction was diagnosed.—Trainee interview 2023

Below, we describe the main themes that were evident in the safety incidents: a challenging organisational and system context, poor communication compounded by remote modalities, limited clinical information, patient and carer burden and inadequate training. Many safety incidents illustrated multiple themes—for example, poor communication and failures of clinical assessment or judgement and patient complexity and system pressures. In the detailed findings below, we illustrate why safety incidents occasionally occur and why they are usually avoided.

The context for remote consultations: system and operational challenges

Introduction of remote triage and expansion of remote consultations in UK primary care occurred at a time of unprecedented system stress (an understaffed and chronically under-resourced primary care sector, attempting to cope with a pandemic). 23 Many organisations had insufficient telephone lines or call handlers, so patients struggled to access services (eg, half of all calls to the emergency COVID-19 telephone service in March 2020 were never answered 7 ). Most remote consultations were by telephone. 27

Our safety incident dataset included examples of technically complex access routes which patients found difficult or impossible to navigate (case 3 in box 1 ) and which required non-clinical staff to make clinical or clinically related judgements (cases 4 and 15). Our ethnographic dataset contained examples of inflexible application of triage rules (eg, no face-to-face consultation unless the patient had already had a telephone call), though in other practices these rules could be over-ridden by staff using their judgement or asking colleagues. Some practices had a high rate of failed telephone call-backs (patient unobtainable).

High demand, staff shortages and high turnover of clinical and support staff made the context for remote encounters inherently risky. Several incidents were linked to a busy staff member becoming distracted (case 1). Telephone consultations, which tend to be shorter, were sometimes used in the hope of improving efficiency. Some safety incidents suggested perfunctory and transactional telephone consultations, with flawed decisions made on the basis of incomplete information (eg, case 2).

Many practices had shifted—at least to some extent—from a demand-driven system (in which every request for an appointment was met) to a capacity-driven one (in which, if a set capacity was exceeded, patients were advised to seek care elsewhere), though the latter was often used flexibly rather than rigidly with an expectation that some patients would be ‘squeezed in’. In some practices, capacity limits had been introduced to respond to escalation of demand linked to overuse of triage templates (eg, to inquire about minor symptoms).

As a result of task redistribution and new staff roles, a single episode of care for one problem often involved multiple encounters or tasks distributed among clinical and non-clinical staff (often in different locations and sometimes also across in-hours and out-of-hours providers). Capacity constraints in onward services placed pressure on primary care to manage risk in the community, leading in some cases to failure to escalate care appropriately (case 6).

Some safety incidents were linked to organisational routines that had not adapted sufficiently to remote—for example, a prescription might be issued but (for various reasons) it could not be transmitted electronically to the pharmacy. Certain urgent referrals were delayed if the consultation occurred remotely (a referral for suspected colon cancer, for example, would not be accepted without a faecal immunochemical test).

Training, supervising and inducting staff was more difficult when many were working remotely. If teams saw each other less frequently, relationship-building encounters and ‘corridor’ conversations were reduced, with knock-on impacts for individual and team learning and patient care. Those supervising trainees or allied professionals reported loss of non-verbal cues (eg, more difficult to assess how confident or distressed the trainee was).

Clinical and support staff regularly used initiative and situated judgement to compensate for an overall lack of system resilience ( box 1 ). Many practices had introduced additional safety measures such as lists of patients who, while not obviously urgent, needed timely review by a clinician. Case 17 illustrates how a rule of thumb ‘if in doubt, put it down as urgent’ was introduced and then applied to avert a potentially serious mental health outcome. Case 18 illustrates how, in the context of insufficient in-person slots to accommodate all high-risk cases, a unique safety-netting measure was customised for a patient.

Poor communication is compounded by remote modalities

Because sense data (eg, sight, touch, smell) are missing, 28 remote consultations rely heavily on the history. Many safety incidents were characterised by insufficient or inaccurate information for various reasons. Sometimes (cases 2, 5, 6, 8, 9, 10 and 11), the telephone consultation was too short to do justice to the problem; the clinician asked few or no questions to build rapport, obtain a full history, probe the patient’s answers for additional detail, confirm or exclude associated symptoms and inquire about comorbidities and medication. Video provided some visual cues but these were often limited to head and shoulders, and photographs were sometimes of poor quality.

Cases 2, 4, 5 and 9 illustrate the dangers of relying on information provided by a third party (another staff member or a relative). A key omission (eg, in case 5) was failing to ask why the patient was unable to come to the phone or answer questions directly.

Some remote triage conversations were conducted using an inappropriate algorithm. In case 4, for example, the call handler accepted a pregnant patient’s assumption that leaking fluid was urine when the problem was actually ruptured membranes. The wrong pathway was selected; vital questions remained unasked; and a skewed history was passed to (and accepted by) the clinician. In case 8, the patient’s complaint of ‘throat’ pain was taken literally and led to ‘viral illness’ advice, overlooking a myocardial infarction.

The cases in box 2 illustrate how staff compensated for communication challenges. In case 16, a GP plays a hunch that a father’s account of his child’s asthma may be inaccurate and converts a phone encounter to video, revealing the child’s respiratory distress. In case 19 (an in-person encounter but relevant because the altercation occurs partly because remote triage is the default modality), one receptionist correctly surmises that the patient’s angry demeanour may indicate urgency and uses her initiative and interpersonal skills to obtain additional clinical information. In case 20, a long-term condition nurse develops a labour-intensive workaround to overcome her elderly patients’ ‘lack of engagement’. More generally, we observed numerous examples of staff using both formal tools (eg, see ‘red list’ in case 17) and informal measures (eg, corridor chats) to pass on what they believed to be crucial information.

Remote consulting can provide limited clinical information

Cases 2 and 4–14 all describe serious conditions including congenital cyanotic heart disease, pulmonary oedema, sepsis, cancer and diabetic foot which would likely have been readily diagnosed with an in-person examination. While patients often uploaded still images of skin lesions, these were not always of sufficient quality to make a confident diagnosis.

Several safety incidents involved clinicians assuming that a diagnosis made on a remote consultation was definitive rather than provisional. Especially when subsequent consultations were remote, such errors could become ingrained, leading to diagnostic overshadowing and missed or delayed diagnosis (cases 2, 8, 9, 10, 11 and 13). Patients with pre-existing conditions (especially if multiple or progressive), the very young and the elderly were particularly difficult to assess by telephone (cases 1, 2, 8, 10, 12 and 16). Clinical conditions difficult to assess remotely included possible cardiac pain (case 8), acute abdomen (case 2), breathing difficulties (cases 1, 6 and 10), vague and generalised symptoms (cases 5 and 14) and symptoms which progressed despite treatment (cases 9, 10 and 11). All these categories came up repeatedly in interviews and workshops as clinically risky.

Subtle aspects of the consultation which may have contributed to safety incidents in a telephone consultation included the inability to fully appraise the patient’s overall health and well-being (including indicators relevant to mental health such as affect, eye contact, personal hygiene and evidence of self-harm), general demeanour, level of agitation and concern, and clues such as walking speed and gait (cases 2, 5, 6, 7, 8, 10, 12 and 14). Our interviews included stories of missed cases of new-onset frailty and dementia in elderly patients assessed by telephone.

In most practices we studied, most long-term condition management was undertaken by telephone. This may be appropriate (and indeed welcome) when the patient is well and confident and a physical examination is not needed. But diabetes reviews, for example, require foot examination. Case 7 describes the deterioration and death of a patient with diabetes whose routine check-ups had been entirely by telephone. We also heard stories of delayed diagnosis of new diabetes in children when an initial telephone assessment failed to pick up lethargy, weight loss and smell of ketones, and point-of-care tests of blood or urine were not possible.

Nurses observed that remote consultations limit opportunities for demonstrating or checking the patient’s technique in using a device for monitoring or treating their condition such as an inhaler, oximeter or blood pressure machine.

Safety netting was inadequate in many remote safety incidents, even when provided by a clinician (cases 2, 5, 6, 8, 10, 12 and 13) but especially when conveyed by a non-clinician (case 15). Expert interviewees identified that making life-changing diagnoses remotely and starting patients on long-term medication without an in-person appointment was also risky.

Our ethnographic data showed that various measures were used to compensate for limited clinical information, including converting a phone consultation to video (case 16), asking the patient if they felt they could wait until an in-person slot was available (case 18), visiting the patient at home (case 20) and enacting a ‘if the history doesn’t make sense, bring the patient in for an in-person assessment’ rule of thumb (case 21). Out-of-hours providers added examples of rules of thumb that their services had developed over years of providing remote services, including ‘see a child face-to-face if the parent rings back’, ‘be cautious about third-party histories’, ‘visit a palliative care patient before starting a syringe driver’ and ‘do not assess abdominal pain remotely’.

Remote modalities place additional burdens on patients and carers

Given the greater importance of the history in remote consultations, patients who lacked the ability to communicate and respond in line with clinicians’ expectations were at a significant disadvantage. Several safety incidents were linked to patients’ limited fluency in the language and culture of the clinician or to specific vulnerabilities such as learning disability, cognitive impairment, hearing impairment or neurodiversity. Those with complex medical histories and comorbidities, and those with inadequate technical set-up and skills (case 3), faced additional challenges.

In many practices, in-person appointments were strictly limited according to more or less rigid triage criteria. Some patients were unable to answer the question ‘is this an emergency?’ correctly, leading to their condition being deprioritised (case 15). Some had learnt to ‘game’ the triage system (eg, online templates 29 ) by adapting their story to obtain the in-person appointment they felt they needed. This could create distrust and lead to inaccurate information on the patient record.

Our ethnographic dataset contained many examples of clinical and support staff using initiative to compensate for vulnerable patients’ inability or unwillingness to take on the additional burden of remote modalities (cases 19 and 20 in Box 2 30 31 ).

Training for remote encounters is often inadequate

Safety incidents highlighted various training needs for support staff members (eg, customer care skills, risks of making clinical judgements) and clinicians (eg, limitations of different modalities, risks of diagnostic overshadowing). Whereas out-of-hours providers gave thorough training to novice GPs (covering such things as attentiveness, rapport building, history taking, probing, attending to contextual cues and safety netting) in telephone consultations, 32–34 many in-hours clinicians had never been formally taught to consult by telephone. Case 17 illustrates how on-the-job training based on acknowledgement of contextual pressures and judicious use of rules of thumb may be very effective in averting safety incidents.

Statement of principal findings

An important overall finding from this study is that examples of deaths or serious harms associated with remote encounters in primary care were extremely rare, amounting to fewer than 100 despite an extensive search going back several years.

Analysis of these 95 safety incidents, drawn from multiple complementary sources, along with rich qualitative data from ethnography, interviews and workshops has clarified where the key risks lie in remote primary care. Remote triage and consultations expanded rapidly in the context of the COVID-19 crisis; they were occurring in the context of resource constraints, understaffing and high demand. Triage and care pathways were complex, multilayered and hard to navigate; some involved distributed work among multiple clinical and non-clinical staff. In some cases, multiple remote encounters preceded (and delayed) a needed in-person assessment.

In this high-risk context, safety incidents involving death or serious harm were rare, but those that occurred were characterised by a combination of inappropriate choice of modality, poor rapport building, inadequate information gathering, limited clinical assessment, inappropriate clinical pathway (eg, wrong algorithm) and failure to take account of social circumstances. These led to missed, inaccurate or delayed diagnoses, underestimation of severity or urgency, delayed referral, incorrect or delayed treatment, poor safety netting and inadequate follow-up. Patients with complex or multiple pre-existing conditions, cardiac or abdominal emergencies, vague or generalised symptoms, safeguarding issues and failure to respond to previous treatment, and those who (for any reason) had difficulty communicating, seemed particularly at risk.

Strengths and limitations of the study

The main strength of this study was that it combined the largest Safety I study undertaken to date of safety incidents in remote primary care (using datasets which have not previously been tapped for research), with a large, UK-wide ethnographic Safety II analysis of general practice as well as stakeholder interviews and workshops. Limitations of the safety incident sample (see final column in table 1 ) include that it was skewed towards very rare cases of death and serious harm, with relatively few opportunities for learning that did not result in serious harm. Most sources were retrospective and may have suffered from biases in documentation and recall. We also failed to obtain examples of safeguarding incidents (which would likely turn up in social care audits). While all cases involved a remote modality (or a patient who would not or could not use one), it is impossible to definitively attribute the harm to that modality.

Comparison with existing literature

This study has affirmed previous findings that processes, workflows and training in in-hours general practice have not adapted adequately to the booking, delivery and follow-up of remote consultations. 24 35 36 Safety issues can arise, for example, from how the remote consultation interfaces with other key practice routines (eg, for making urgent referrals for possible cancer). The sheer complexity and fragmentation of much remote and digital work underscores the findings from a systematic review of the importance of relational coordination (defined as ‘a mutually reinforcing process of communicating and relating for the purpose of task integration ’ (p 3) 37 ) and psychological safety (defined as ‘people’s perceptions of the consequences of taking interpersonal risks in a particular context such as a workplace ’ (p 23) 38 ) in building organisational resilience and assuring safety.

The additional workload and complexity associated with running remote appointments alongside in-person ones is cognitively demanding for staff and requires additional skills for which not all are adequately trained. 24 39 40 We have written separately about the loss of traditional continuity of care as primary care services become digitised, 41–43 and about the unmet training needs of both clinical and support staff for managing remote and digital encounters. 24

Our findings also resonate with research showing that remote modalities can interfere with communicative tasks such as rapport building, establishing a therapeutic relationship and identifying non-verbal cues such as tearfulness 35 36 44 ; that remote consultations tend to be shorter and feature less discussion, information gathering and safety netting 45–48 ; and that clinical assessment in remote encounters may be challenging, 27 49 50 especially when physical examination is needed. 35 36 51 These factors may rarely contribute to incorrect or delayed diagnoses, underestimation of the seriousness or urgency of a case, and failure to identify a deteriorating trajectory. 35 36 52–54

Even when systems seem adequate, patients may struggle to navigate them. 23 30 31 This finding aligns with an important recent review of cognitive load theory in the context of remote and digital health services: because such services are more cognitively demanding for patients, they may widen inequities of access. 55 Some patients lack navigating and negotiating skills, access to key technologies 13 36 or confidence in using them. 30 35 The remote encounter may require the patient to have a sophisticated understanding of access and cross-referral pathways, interpret their own symptoms (including making judgements about severity and urgency), obtain and use self-monitoring technologies (such as a blood pressure machine or oximeter) and convey these data in medically meaningful ways (eg, by completing algorithmic triage forms or via a telephone conversation). 30 56 Furthermore, the remote environment may afford fewer opportunities for holistically evaluating, supporting or safeguarding the vulnerable patient, leading to widening inequities. 13 35 57 Previous work has also shown that patients with pre-existing illness, complex comorbidities or high-risk states, 58 59 language non-concordance, 13 35 inability to describe their symptoms (eg, due to autism 60 ), extremes of age 61 and those with low health or system literacy 30 are more difficult to assess remotely.

Lessons for safer care

Many of the contributory factors to safety incidents in remote encounters have been suggested previously, 35 36 and align broadly with factors that explain safety incidents more generally. 53 62 63 This new study has systematically traced how upstream factors may, very rarely, combine to contribute to avoidable human tragedies—and also how primary care teams develop local safety practices and cultures to help avoid them. Our study provides some important messages for practices and policymakers.

First, remote encounters in general practice are mostly occurring in a system designed for in-person encounters, so processes and workflows may work less well.

Second, because the remote encounter depends more on history taking and dialogue, verbal communication is even more mission critical. Working remotely under system pressures and optimising verbal communication should both be priorities for staff training.

Third, the remote environment may increase existing inequities as patients’ various vulnerabilities (eg, extremes of age, poverty, language and literacy barriers, comorbidities) make remote communication and assessment more difficult. Our study has revealed impressive efforts from staff to overcome these inequities on an individual basis; some of these workarounds may become normalised and increase efficiency, but others are labour intensive and not scalable.

A final message from this study is that clinical assessment provides less information when a physical examination (and even a basic visual overview) is not possible. Hence, the remote consultation has a higher degree of inherent uncertainty. Even when processes have been optimised (eg, using high-quality triage to allocate modality), but especially when they have not, diagnoses and assessments of severity or urgency should be treated as more provisional and revisited accordingly. We have given examples in the Results section of how local adaptation and rule breaking bring flexibility into the system and may become normalised over time, leading to the creation of locally understood ‘rules of thumb’ which increase safety.

Overall, these findings underscore the need to share learning and develop guidance about the drivers of risk, how these play out in different kinds of remote encounters and how to develop and strengthen Safety II approaches to mitigate those risks. Table 2 shows proposed mitigations at staff, process and system levels, as well as a preliminary list of suggestions for patients, which could be refined with patient input using codesign methods. 64

Reducing safety incidents in remote primary care

Unanswered questions and future research

This study has helped explain where the key risks lie in remote primary care encounters, which in our dataset were almost all by telephone. It has revealed examples of how front-line staff create and maintain a safety culture, thereby helping to prevent such incidents. We suggest four key avenues for further research. First, additional ethnographic studies in general practice might extend these findings and focus on specific subquestions (eg, how practices identify, capture and learn from near-miss incidents). Second, ethnographic studies of out-of-hours services, which are mostly telephone by default, may reveal additional elements of safety culture from which in-hours general practice could learn. Third, the rise in asynchronous e-consultations (in which patients complete an online template and receive a response by email) raises questions about the safety of this new modality which could be explored in mixed-methods studies including quantitative analysis of what kinds of conditions these consultations cover and qualitative analysis of the content and dynamics of the interaction. Finally, our findings suggest that the safety of new clinically related ‘assistant’ roles in general practice should be urgently evaluated, especially when such staff are undertaking remote assessment or remote triage.

Ethics statements

Patient consent for publication.

Not applicable.

Ethics approval

Ethical approval was granted by the East Midlands—Leicester South Research Ethics Committee and UK Health Research Authority (September 2021, 21/EM/0170 and subsequent amendments). Access to the NHS Resolution dataset was obtained by secondment of the RP via honorary employment contract, where she worked with staff to de-identify and fictionalise relevant cases. The Remote by Default 2 study (referenced in main text) was co-designed by patients and lay people; it includes a diverse patient panel. Oversight was provided by an independent external advisory group with a lay chair and patient representation. A person with lived experience of a healthcare safety incident (NS) is a co-author on this paper and provided input to data analysis and writing up, especially the recommendations for patients in table 2 .

Acknowledgments

We thank the participating organisations for cooperating with this study and giving permission to use fictionalised safety incidents. We thank the participants in the ethnographic study (patients, practice staff, policymakers, other informants) who gave generously of their time and members of the study advisory group.

Sarbadhikari SN ,
Jacob AG , et al
Hall Dykgraaf S ,
Desborough J ,
de Toca L , et al
Koonin LM ,
Tsang CA , et al
England NHS
Papoutsi C ,
Greenhalgh T
↵ Healthcare safety investigation branch: NHS 111’s response to callers with COVID-19-related symptoms during the pandemic . 2022 . Available : https://www.hsib.org.uk/investigations-and-reports/response-of-nhs-111-to-the-covid-19-pandemic/nhs-111s-response-to-callers-with-covid-19-related-symptoms-during-the-pandemic [Accessed 25 Jun 2023 ].
Royal College of General Practitioners
NHS Confederation
Gupta PP , et al
Panagioti M ,
Keers RN , et al
Panesar SS ,
deSilva D ,
Carson-Stevens A , et al
Campbell JL ,
Britten N ,
Green C , et al
Huibers L , et al
Hollnagel E ,
Braithwaite J
Institute of Medicine (US)
Jerak-Zuiderent S
Greenhalgh T ,
Alvarez Nishio A , et al
Hemmings N , et al
Hughes G , et al
Salisbury C , et al
Berens E-M ,
Nowak P , et al
Macdonald S ,
Browne S , et al
Edwards PJ ,
Bennett-Britton I ,
Ridd MJ , et al
Warren C , et al
Challiner J , et al
Wieringa S ,
Greenhalgh T , et al
Rushforth A , et al
Edmondson AC ,
Eddison N ,
Healy A , et al
Paparini S , et al
Byng R , et al
Moore L , et al
McKinstry B ,
Hammersley V ,
Burton C , et al
Gafaranga J ,
McKinstry B
Seuren LM ,
Wherton J , et al
Donaghy E ,
Parker R , et al
Sharma SC ,
Thakker A , et al
Johnsen TM ,
Norberg BL ,
Kristiansen E , et al
Wherton J ,
Ferwerda R ,
Tijssen R , et al
NHS Resolution
Marincowitz C ,
Bath P , et al
Antonio MG ,
Williamson A ,
Kameswaran V , et al
Oudshoorn N
Winters D ,
Newman T , et al
Huibers L ,
Renaud V , et al
↵ Remote consultations . n.d. Available : https://www.gmc-uk.org/ethical-guidance/ethical-hub/remote-consultations
Doherty M ,
Neilson S ,
O’Sullivan J , et al
Carson-Stevens A ,
Hibbert P ,
Williams H , et al
Edwards A ,
Powell C , et al
Morris RL ,
Fraser SW ,

X @dakinfrancesca, @trishgreenhalgh

Contributors RP led the Safety I analysis with support from AC. The Safety II analysis was part of a wider ethnographic study led by TG and SS, on which all other authors undertook fieldwork and contributed data. TG and RP wrote the paper, with all other authors contributing refinements. All authors checked and approved the final manuscript. RP is guarantor.

Funding Funding was from NIHR HS&DR (grant number 132807) (Remote by Default 2 study) and NIHR School for Primary Care Research (grant number 594) (ModCons study), plus an NIHR In-Practice Fellowship for RP.

Competing interests RP was National Professional Advisor, Care Quality Commission 2017–2022, where her role included investigation of safety issues.

Provenance and peer review Not commissioned; externally peer reviewed.

Linked Articles

Editorial Examining telehealth through the Institute of Medicine quality domains: unanswered questions and research agenda Timothy C Guetterman Lorraine R Buis BMJ Quality & Safety 2024; 33 552-555 Published Online First: 09 May 2024. doi: 10.1136/bmjqs-2023-016872

Read the full text or download the PDF:

Information

Author Services

Initiatives

You are accessing a machine-readable page. In order to be human-readable, please install an RSS reader.

All articles published by MDPI are made immediately available worldwide under an open access license. No special permission is required to reuse all or part of the article published by MDPI, including figures and tables. For articles published under an open access Creative Common CC BY license, any part of the article may be reused without permission provided that the original article is clearly cited. For more information, please refer to https://www.mdpi.com/openaccess .

Feature papers represent the most advanced research with significant potential for high impact in the field. A Feature Paper should be a substantial original Article that involves several techniques or approaches, provides an outlook for future research directions and describes possible research applications.

Feature papers are submitted upon individual invitation or recommendation by the scientific editors and must receive positive feedback from the reviewers.

Editor’s Choice articles are based on recommendations by the scientific editors of MDPI journals from around the world. Editors select a small number of articles recently published in the journal that they believe will be particularly interesting to readers, or important in the respective research area. The aim is to provide a snapshot of some of the most exciting work published in the various research areas of the journal.

Original Submission Date Received: .

Active Journals
Find a Journal
Proceedings Series
For Authors
For Reviewers
For Editors
For Librarians
For Publishers
For Societies
For Conference Organizers
Open Access Policy
Institutional Open Access Program
Special Issues Guidelines
Editorial Process
Research and Publication Ethics
Article Processing Charges
Testimonials
Preprints.org
SciProfiles
Encyclopedia

Article Menu

Subscribe SciFeed
Recommended Articles
Google Scholar
on Google Scholar
Table of Contents

Find support for a specific problem in the support section of our website.

Please let us know what you think of our products and services.

Visit our dedicated information section to learn more about MDPI.

JSmol Viewer

Diagnostic evaluation of the contribution of complementary training subjects in the self-perception of competencies in ethics, social responsibility, and sustainability in engineering students.

1. Introduction

2. theoretical framework, 3. review of related research, 4. materials and methods, 4.1. study population, 4.2. instrument, 4.3. data analysis technique, 5.1. descriptive statistics, 5.2. analysis of competencies in ers vs. courses taken, 5.3. relationship of ers competencies with sociodemographic variables, 6. discussion, 7. conclusions, 8. future work, author contributions, institutional review board statement, informed consent statement, data availability statement, conflicts of interest.

Abad-Segura, E.; González-Zamar, M.D.; Infante-Moro, J.C.; García, G.R. Sustainable management of digital transformation in higher education: Global research trends. Sustainability 2020 , 12 , 2107. [ Google Scholar ] [ CrossRef ]
Candanedo, G.U. Tendencias globales en Educación. Prospectiva, visión y desafíos. Rev. Anu. 2019 , 44 , 177–199. [ Google Scholar ]
Børsen, T.; Serreau, Y.; Reifschneider, K.; Baier, A.; Pinkelman, R.; Smetanina, T.; Zandvoort, H. Initiatives, experiences and best practices for teaching social and ecological responsibility in ethics education for science and engineering students. Eur. J. Eng. Educ. 2021 , 46 , 186–209. [ Google Scholar ] [ CrossRef ]
Grimson, J. Re-engineering the curriculum for the 21st century. Eur. J. Eng. Educ. 2002 , 27 , 31–37. [ Google Scholar ] [ CrossRef ]
Davidson, C.I.; Hendrickson, C.T.; Matthews, H.S.; Bridges, M.W.; Allen, D.T.; Murphy, C.F.; Allenby, B.R.; Crittenden, J.C.; Austin, S. Preparing future engineers for challenges of the 21st century: Sustainable engineering. J. Clean. Prod. 2010 , 18 , 698–701. [ Google Scholar ] [ CrossRef ]
UNESCO. Preparing Learners for the Challenges of the 21st Century ; UNESCO: Paris, France, 2014. [ Google Scholar ]
Lozano, R.; Carpenter, A. Developing Sustainability Competences through Pedagogical Approaches ; Springer: New York, NY, USA, 2021. [ Google Scholar ]
Domínguez, V.B.; Lizarraga, L.M.P. Construction and validation of an instrument to evaluate the characteristics of university social responsibility in university students. Rev. Int. Educ. Justicia Soc. 2019 , 8 , 79–96. [ Google Scholar ] [ CrossRef ]
Vallaeys, F. Las diez falacias de la Responsabilidad Social Universitaria. Rev. Digit. Investig. Docencia Univ. 2018 , 12 , 34–58. [ Google Scholar ] [ CrossRef ]
Reig-Aleixandre, N.; Ramos, J.M.G.; De la Calle Maldonado, C. Formación en la responsabilidad social del profesional en el ámbito universitario. Rev. Complut. Educ. 2022 , 33 , 517–528. [ Google Scholar ] [ CrossRef ]
Boatwright, J.R.; Slate, J.R. Development of an Instrument to Assess Work Ethics. JITE-J. Ind. Teach. Educ. 2002 , 39 , 4. Available online: http://scholar.lib.vt.edu/ejournals/JITE/v39n4/boatwright.html (accessed on 18 April 2024).
Martin, D.A. The Ethics of Engineering ; RTE Brainstorm: Dublin, Ireland, 2019; Volume 9, p. 48. [ Google Scholar ] [ CrossRef ]
Serna, E.; Serna, A. Realidad de Una Disciplina ; Editorial IAI: Medellín, Colombia, 2018; pp. 106–139. [ Google Scholar ]
Davis, M. Thinking Like an Engineer. In Ethics, Politics, and Whistleblowing in Engineering ; CRC Press: Boca Raton, FL, USA, 2018; pp. 79–82. [ Google Scholar ] [ CrossRef ]
Hirsch, A. Construccción de una escala de actitudes sobre ética profesional. Rev. Electrónica Investig. Educ. 2005 , 7 , 1–15. [ Google Scholar ]
Alexandre, N.R. Formación de la Responsabilidad Social en la Universidad y Práctica Profesional: Un Estudio con Egresados ; Universidad Complutense de Madrid: Madrid, Spain, 2020. [ Google Scholar ]
Jones, A.; Parker, B. Ethical considerations in engineering practice. Eng. Ethics J. 2016 , 10 , 45–60. [ Google Scholar ]
Smith, C.; Doe, J. Integrating ethics and sustainability in engineering education. J. Eng. Educ. 2018 , 25 , 112–125. [ Google Scholar ]
Brown, D.; Green, E. The role of engineering programs in promoting social responsibility. Eng. Soc. J. 2017 , 5 , 78–89. [ Google Scholar ]
Lozano, R. Incorporation and institutionalization of SD into universities: Breaking through barriers to change. J. Clean. Prod. 2006 , 14 , 787–796. [ Google Scholar ] [ CrossRef ]
García, A.; López, M.; Martínez, P. Integrating sustainability competencies in engineering education. J. Clean. Prod. 2019 , 215 , 345–358. [ Google Scholar ]
Smith, R.; Johnson, T. Enhancing sustainability skills in engineering curricula. Int. J. Eng. Educ. 2017 , 33 , 712–725. [ Google Scholar ]
Brown, S.; Lee, E. Sustainability integration in engineering programs: A case study. Sustain. Sci. 2018 , 10 , 189–202. [ Google Scholar ]
Demssie, Y.N.; Wesselink, R.; Biemans, H.J.A.; Mulder, M. Think outside the European box: Identifying sustainability competencies for a base of the pyramid context. J. Clean. Prod. 2019 , 221 , 828–838. [ Google Scholar ] [ CrossRef ]
Albareda-Tiana, S.; Goded, P.A.; Muñoz-Rodríguez, J.M.; Valderrama-Hernández, R.; Ruiz-Morales, J. Assessing competences in sustainability in teaching degrees and post-degrees: A proposal of a tool. Ensen. Cienc. 2019 , 37 , 11–29. [ Google Scholar ] [ CrossRef ]
Chiang, M.; Chen, P. Education for sustainable development in the business programme to develop international Chinese college students’ sustainability in Thailand. J. Clean. Prod. 2022 , 374 , 134045. [ Google Scholar ] [ CrossRef ]
Michel, A.L.P. La Enseñanza de la Ética Profesional a Ingenieros: Un Caso de Estudio. INNOVUS 2020 , 20 , 125–146. Available online: https://www.ipn.mx/innovacion/numeros-anteriores/innovacion-educativa-84.html (accessed on 18 April 2024).
Mendoza, D.J.E.; Ramírez, E.L.; Soto, Y.M. Procesos y Contextos en Los Proyectos de Intervención Educativa , 1st ed.; Comunicación Científica: Ciudad de México, Mexico, 2020; pp. 96–118. [ Google Scholar ]
Alcazar, A.; Castillo, M.; Cogley, G.; Lakhani, R.; Batista, L. Perspectiva de la ética en los alumnos de Ingeniería Logística y Cadena de Suministro de la Universidad Tecnológica de Panamá. Rev. Iniciación Científica 2024 , 10 , 53–57. [ Google Scholar ] [ CrossRef ]
Pegalajar-Palomino, M.C.; Martínez-Valdivia, E.; Burgos-García, A. Analysis of social responsibility in university education students. Form. Univ. 2021 , 14 , 95–104. [ Google Scholar ] [ CrossRef ]
de Jesús, C.F.N. Perception of engineering students regarding sustainable development. In An Analysis from the Economic, Social and Environmental Dimensions in a Developing Country ; Universidad de Córdoba: Córdoba, Spain, 2024. [ Google Scholar ]
Obrecht, M.; Feodorova, Z.; Rosi, M. Assessment of environmental sustainability integration into higher education for future experts and leaders. J. Environ. Manag. 2022 , 316 , 115223. [ Google Scholar ] [ CrossRef ] [ PubMed ]
da Silva Junior, A.; de Oliveira Martins-Silva, P.; de Araújo Vasconcelos, K.C.; da Silva, V.C.; de Brito, S.L.M.S.; Monteiro, J.M.R. Sustainability and corporate social responsibility in the opinion of undergraduate students in management programs: Between the concrete and the abstract. J. Clean. Prod. 2019 , 207 , 600–617. [ Google Scholar ] [ CrossRef ]
LFonseca, M.; Portela, A.R.; Duarte, B.; Queirós, J.; Paiva, L. Mapping higher education for sustainable development in Portugal. Manag. Mark. 2018 , 13 , 1064–1075. [ Google Scholar ] [ CrossRef ]
LaPatin, M.; Roy, A.; Poleacovschi, C.; Padgett-Walsh, K.; Feinstein, S.; Rutherford, C.; Nguyen, L.; Faust, K.M. Measuring ethical development of engineering students across universities and class years. Int. J. Ethics Educ. 2023 , 8 , 49–65. [ Google Scholar ] [ CrossRef ]
Sánchez-Carracedo, F.; Sureda, B.; Moreno-Pino, F.M.; Romero-Portillo, D. Education for Sustainable Development in Spanish engineering degrees. Case study. J. Clean. Prod. 2021 , 294 , 126322. [ Google Scholar ] [ CrossRef ]
Finelli, C.J.; Holsapple, M.A.; Ra, E.; Bielby, R.M.; Burt, B.A.; Carpenter, D.D.; Harding, T.S.; Sutkus, J.A. An Assessment of Engineering Students’ Curricular and Co-Curricular Experiences and Their Ethical Development. J. Eng. Educ. 2012 , 101 , 469–494. [ Google Scholar ] [ CrossRef ]
Donald, D.C.; Trevor, S.H.; Janel, A.S.; Cynthia, J.F. Assessing the Ethical Development of Civil Engineering Undergraduates in Support of the ASCE. J. Prof. Issues Eng. Educ. Pract. 2014 , 140 , A4014001. [ Google Scholar ] [ CrossRef ]
Li, A. An empirical analysis on social responsibility education of college students in four universities in Shandong Province during COVID-19 Response. In Procedia Computer Science ; Elsevier B.V.: Amsterdam, The Netherlands, 2022; pp. 221–228. [ Google Scholar ] [ CrossRef ]
Galvão, A.; Mendes, L.; Marques, C.; Mascarenhas, C. Factors influencing students’ corporate social responsibility orientation in higher education. J. Clean. Prod. 2019 , 215 , 290–304. [ Google Scholar ] [ CrossRef ]
Warshawski, S. First-year nursing students’ perceptions of health activism and social. Nurse Educ. Today 2024 , 132 , 106019. [ Google Scholar ] [ CrossRef ] [ PubMed ]
Miñano, R. Formación En Competencias de Sostenibilidad, Responsabilidad Social y Ética Profesional: Estudio de Casos en Ingeniería Industrial e Ingeniería Informática. Ph.D. Thesis, ETSII, Universiti Putra Malaysia, Seri Kembangan, Malaysia, 2019. Available online: https://oa.upm.es/55789/1/RAFAEL_MINANO_RUBIO.pdf (accessed on 18 April 2024).
Setó-Pamies, D.; Papaoikonomou, E. A Multi-level Perspective for the Integration of Ethics, Corporate Social Responsibility and Sustainability (ECSRS) in Management Education. J. Bus. Ethics 2016 , 136 , 523–538. [ Google Scholar ] [ CrossRef ]
Kolb, M.; Fröhlich, L.; Schmidpeter, R. Implementing sustainability as the new normal: Responsible management education—From a private business school’s perspective. Int. J. Manag. Educ. 2017 , 15 , 280–292. [ Google Scholar ] [ CrossRef ]
Pérez-Foguet, A.; Lazzarini, B. Continuing professional education in engineering faculties: Transversal integration of sustainable human development in basic engineering sciences courses. J. Clean. Prod. 2019 , 218 , 772–781. [ Google Scholar ] [ CrossRef ]
Jordan, R.; Agi, K.; Arora, S.; Christodoulou, C.G.; Schamiloglu, E.; Koechner, D.; Schuler, A.; Howe, K.; Bidram, A.; Martinez-Ramon, M.; et al. Peace engineering in practice: A case study at the University of New Mexico. Technol. Forecast. Soc. Chang. 2021 , 173 , 121113. [ Google Scholar ] [ CrossRef ]
Pugh, G.; Lozano-Rodríguez, A. El desarrollo de competencias genéricas en la educación técnica de nivel superior: Un estudio de caso. Calid. Educ. 2019 , 50 , 143–170. [ Google Scholar ] [ CrossRef ]
Pizzutilo, F.; Venezia, E. On the maturity of social responsibility and sustainability integration in higher education institutions: Descriptive criteria and conceptual framework. Int. J. Manag. Educ. 2021 , 19 , 100515. [ Google Scholar ] [ CrossRef ]
Harris, C.E.; Pritchard, M.S.; James, R.W.; Englehardt, E.E.; Rabins, M.J. Concepts and Cases Engineering Ethics , 6th ed.; Cengage: Boston, MA, USA, 2019; p. 306. [ Google Scholar ]
Ariza, S.M.B.; Guerrero, M.I.M. El Desarrollo Sostenible y la Responsabilidad Social Empresarial ; Universidad Católica de Colombia: Bogotá, Colombia, 2020. [ Google Scholar ]
Garay, P.J.P.; Oliver, Z.J.M.; Contreras, C.V.; Flores, D.S.A.; Baca, R.L.S. Social Responsibility in University Students According to Gender and Age. Health Educ. Health Promot. 2021 , 9 , 513–519. [ Google Scholar ]
Kirk, B.A.; Schutte, N.S.; Hine, D.W. Development and preliminary validation of an emotional self-efficacy scale. Personal. Individ. Differ. 2008 , 45 , 432–436. [ Google Scholar ] [ CrossRef ]
Bernardi, R.A.; Nash, J. The importance and efficacy of controlling for social desirability response bias. Ethics Behav. 2023 , 33 , 413–429. [ Google Scholar ] [ CrossRef ]
Stedham, Y.; Yamamura, J.H.; Beekun, R.I. Gender differences in business ethics: Justice and relativist perspectives. Bus. Ethics A Eur. Rev. 2007 , 16 , 163–174. [ Google Scholar ] [ CrossRef ]

Sociodemographic Variables		First Semester		Last Semesters		Total
Sociodemographic Variables		n	%	n	%	n	%
Gender	Female	34	13.7	18	10.7	52	12.4
	Male	210	84.3	151	89.3	361	86.4
	Other	5	2.0	0	0	5	1.2
Age	15–25 years	209	83.9	86	50.9	295	70.6
	26–35 years	33	13.3	64	37.9	97	23.2
	36 years and above	7	2.8	19	11.3	26	6.2
Stratum	1	64	25.7	32	18.9	96	23.0
	2	110	44.2	83	49.1	193	46.2
	3	69	27.7	54	32.0	123	29.4
	4	6	2.4	0	0	6	1.4

Experts		Total
Experts		n	%
Higher education level	Master’s degree	13	61.9
Higher education level	Doctor’s degree	8	38.1
Age	26–35 years	1	4.8
	36–45 years	6	28.6
	46–55 years	8	38.1
	56 years and above	6	28.6
Experience in education	1–5 years	1	4.8
	5–10 years	3	14.3
	Over 10 years	17	81.0
Experience in the productive sector	Yes	14	66.7
Experience in the productive sector	No	7	33.3
Years in the productive sector	1–5 years	1	4.8
	5–10 years	1	4.8
	Over 10 years	12	57.1
TOTAL		21	100

Reliability Statistics
Cronbach’s Alpha	Cronbach’s Alpha Based on Standardized Items	N of Elements
0.930	0.934	30

Competency	Dimensions	Indicator	Item
Social Responsibility [ ]	Awareness	I am aware that I am in the world to contribute responsibly to its transformation	R1
	Awareness	I understand that being part of this world entails a responsibility towards the members of a group or organization for the benefit of society	R2
	Commitment	I am familiar with and care about local issues and their connection to national and global factors	R3
	Citizenship	As a student, I feel that I have the skills to contribute to social, political, and economic changes in my community	R4
	Citizenship	As a student, I would like to contribute to public policies that improve the quality of life for (ethnic, racial, sexual) minority groups and other vulnerable groups (children, women…)	R5
	Social justice	I believe that my educational process provides me with the necessary tools to follow up on public or private programs and initiatives aimed at social transformation	R6
	Social justice	I believe that, through my profession, I can contribute to reducing poverty and inequality in my country	R7
Ethics [ ]	Responsibility	In my daily actions, it is important to fulfill my commitments on time	E1
	Responsibility	In my daily actions, I am willing to take responsibility for any mistakes	E2
	Act with moral principles and professional values	I am willing to spend time updating my knowledge about my career	E3
		There are ethical decisions that are so important in my career that I cannot leave them to the sole discretion of others	E4
		In my daily actions, maintaining confidentiality is crucial	E5
		Doing the right things in my daily life brings me inner peace	E6
		I communicate my values through my daily actions	E7
	Professional and personal ethics	To avoid mistakes in my profession, I must be aware of the limits of my knowledge and skills	E8
		Working with passion is part of my personal fulfillment	E9
		Ethical aspects are crucial to my career and future profession	E10
		I must assess the consequences before making important decisions	E11
		It is good to aspire but not have excessive ambition	E12
		To perform well in my career, developing technical skills alone is not enough	E13
	Honesty	To be a good professional, I cannot ignore the problems of the society I live in	E14
	Honesty	I take the risk of making mistakes to improve my career performance	E15
Sustainability [ ] (S1, S6, S7, S8) [ ] (S2 to S5)	Systemic	I analyze individually or in groups situations related to sustainability and their impact on society, the environment, and the economy, both locally and globally	S1
	Discipline and regulations	I am aware of the importance of sustainability in society. I learn and then I impact my community	S6
	Anticipatory	I use resources sustainably in the prevention of negative impacts on the environment and social and economic systems	S7
	Anticipatory	I anticipate and understand the impact of environmental changes on social and economic systems	S3
	Strategic	I am aware of the potential of the human and natural resources in my environment for sustainable development	S8
		I actively participate in groups or communities committed to sustainability	S2
	Action competence for interventions	I am coherent in my actions, respecting and appreciating (biological, social, cultural) diversity and committing myself to improving sustainability	S4
	Action competence for interventions	I create and provide critical and creative solutions to technology and engineering issues, always considering sustainability	S5

Competencies	Social Responsibility	Ethics	Sustainability
Social responsibility	1
Ethics	0.566 **	1
Sustainability	0.719 **	0.484 **	1

Group		Gender	Age	Stratum
Mode	First semester	2	1	2
	Last semesters	2	1	2
	All	2	1	2

Group	Social Responsibility	Ethics	Sustainability
First semester	4.028 (0.656)	4.496 (0.453)	3.798 (0.689)
Last semester	4.101 (0.589)	4.577 (0.447)	3.921 (0.646)

	Levene Test		t-Test for Equality of Means
	F	Sig.	t	Gl	Sig (Bilateral)	Mean Differences	Standard Error Differences	95% Difference Confidence Interval
Social responsibility	0.919	0.338	−1.167	416	0.244	−0.07332	0.06281	−0.19679	0.05014
Ethics	1.277	0.259	−1.808	416	0.071	−0.08127	0.04494	−0.16961	0.00706
Sustainability	0.128	0.721	−1.839	416	0.067	−0.12317	0.06698	−0.25483	0.00849

Statistical Tests	Social Responsibility	Ethics	Sustainability
Mann–Whitney U test	20,073.500	18,501.000	19,304.500
Wilcoxon W test	51,198.500	49,626.000	50,429.500
Z test	−0.800	−2.101	−1.435
Bilateral asymptotic sig.	0.424	0.036	0.151

ANOVA	Gender		Age		Stratum
ANOVA	F	Sig.	F	Sig.	F	Sig.
Social responsibility	0.438	0.646	11.052	0.000	1.705	0.165
Ethics	0.337	0.714	7.404	0.000	0.227	0.877
Sustainability	0.805	0.448	9.237	0.000	0.742	0.527

Social Responsibility
Age	N	Subset
Age	N	1	2
15–25 years	295	3.9603
26–35 years	97	4.2180
36 years and above	26	4.5357	4.5357
Sig.		0.091	0.221

The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

Yepes, S.M.; Montes, W.F.; Herrera, A. Diagnostic Evaluation of the Contribution of Complementary Training Subjects in the Self-Perception of Competencies in Ethics, Social Responsibility, and Sustainability in Engineering Students. Sustainability 2024 , 16 , 7069. https://doi.org/10.3390/su16167069

Yepes SM, Montes WF, Herrera A. Diagnostic Evaluation of the Contribution of Complementary Training Subjects in the Self-Perception of Competencies in Ethics, Social Responsibility, and Sustainability in Engineering Students. Sustainability . 2024; 16(16):7069. https://doi.org/10.3390/su16167069

Yepes, Sara María, Willer Ferney Montes, and Andres Herrera. 2024. "Diagnostic Evaluation of the Contribution of Complementary Training Subjects in the Self-Perception of Competencies in Ethics, Social Responsibility, and Sustainability in Engineering Students" Sustainability 16, no. 16: 7069. https://doi.org/10.3390/su16167069

Article Metrics

Article access statistics, further information, mdpi initiatives, follow mdpi.

Subscribe to receive issue release notifications and newsletters from MDPI journals

IMAGES

social work research design types
Social Work Research: Concept, Scope
(PDF) Fundamentals of Social Research : Methods, Processes and Applications
PPT
Research Methods in Social Work
Social Work Models, Methods And Theories: A Framework For Practice

COMMENTS

Social Work Research Methods
Social work researchers will send out a survey, receive responses, aggregate the results, analyze the data, and form conclusions based on trends. Surveys are one of the most common research methods social workers use — and for good reason. They tend to be relatively simple and are usually affordable.
Social Work Research Methods
Social work research means conducting an investigation in accordance with the scientific method. The aim of social work research is to build the social work knowledge base in order to solve practical problems in social work practice or social policy. Investigating phenomena in accordance with the scientific method requires maximal adherence to ...
Graduate research methods in social work
Graduate Research Methods in Social Work by DeCarlo, et al., is a comprehensive and well-structured guide that serves as an invaluable resource for graduate students delving into the intricate world of social work research. The book is divided into five distinct parts, each carefully curated to provide a step-by-step approach to mastering ...
Foundations of Social Work Research
This textbook was created to provide an introduction to research methods for BSW and MSW students, with particular emphasis on research and practice relevant to students at the University of Texas at Arlington. It provides an introduction to social work students to help evaluate research for evidence-based practice and design social work research projects. It can be used with its companion, A ...
Practice Research in Social Work: Themes, Opportunities and Impact
Practice research in social work is evolving and has been iteratively defined through a series of statements over the last 15 years (Epstein et al., 2015; Fook & Evans, 2011; Joubert et al., 2023; Julkunen et al., 2014; Sim et al., 2019).Most recently, the Melbourne Statement on Practice Research (Joubert et al., 2023) focused on practice meeting research, with an emphasis on 'the ...
The Handbook of Social Work Research Methods
In addition, Bruce Thyer is the editor of the journal Research in Social Work Practice and expressed interest in updating the book along with the other two candidates. In the field of social work, qualitative research is starting to gain more prominence as are mixed methods and various issues regarding race, ethnicity and gender.
Social Work Research and Mixed Methods: Stronger With a Quality
Abstract. Mixed methods are a useful approach chosen by many social work researchers. This article showcases a quality framework using social work examples as practical guidance for social work researchers. Combining methodological literature with practical social work examples, elements of a high-quality approach to mixed methods are showcased ...
Social Work Research Methods: Learning by Doing
A three-part structure introduces the fundamentals of research methods, the different types of social work research, and the use of data analysis for evaluation of social work practice. Chapter-opening vignettes illustrate the value of chapter content to the practicing social worker.
The Handbook of Social Work Research Methods
"`Not so much a handbook, but an excellent source of reference' - British Journal of Social Work This volume is the definitive resource for anyone doing research in social work. It details both quantitative and qualitative methods and data collection, as well as suggesting the methods appropriate to particular types of studies.
Scientific Inquiry in Social Work
Table of Contents. Chapter 1: Introduction to research. Chapter 2: Beginning a research project. Chapter 3: Reading and evaluating literature. Chapter 4: Conducting a literature review. Chapter 5: Ethics in social work research. Chapter 6: Linking methods with theory. Chapter 7: Design and causality. Chapter 8: Creating and refining a research ...
Practice Research Methods in Social Work: Processes, Applications and
Title = Practice Research Methods in Social Work: Processes, Applications, and Implications for Social Service Organisations . 1* Bowen McBeath, 2 Michael J. Austin, 2 Sarah Carnochan, and 2 Emmeline Chuang. 1 School of Social Work, Portland State University, Portland, OR, U.S.A.
Research Methods for Social Work
Widely considered the best text for the course, RESEARCH METHODS FOR SOCIAL WORK, Seventh Edition strikes an optimal balance of quantitative and qualitative research techniques--illustrating how the two methods complement one another. Allen Rubin and Earl R. Babbie's classic bestseller is acclaimed for its depth and breadth of coverage as well as the authors' clear and often humorous ...
Research on Social Work Practice: Sage Journals
Research on Social Work Practice (RSWP), peer-reviewed and published eight times per year, is a disciplinary journal devoted to the publication of empirical research concerning the assessment methods and outcomes of social work practice. Intervention programs covered include behavior analysis and therapy; psychotherapy or counseling with individuals; case management; and education.
Research for Social Workers
Research for Social Workers has built a strong reputation as an accessible guide to the key research methods and approaches used in the discipline. Ideal for beginners, the book outlines the importance of social work research, its guiding principles and explains how to choose a topic area, develop research questions together with describing the key steps in the research process.
Research Methods for Social Work: A Problem-Based Approach
Welcome to the SAGE Edge site for Research Methods for Social Work, 1e!. Research Methods for Social Work: A Problem-Based Approach is a comprehensive introduction to methods instruction that engages students innovatively and interactively.Using a case study and problem-based learning (PBL) approach, authors Antoinette Y. Farmer and G. Lawrence Farmer utilize case examples to achieve a level ...
Practice research methods in social work: Processes, applications and
Another critical factor that distinguishes PR from other participatory research methods is the connection between social work practice and social service managers. Compared to action research and empowerment evaluation methodologies, PR is more explicitly organisational in understanding how managers, front line staff and service users make ...
Research Designs and Methods
Handbook of Social Work Research Methods Encyclopedia of Survey Research Methods Paul Lavrakas covers all major facets of survey research methodology, from selecting the sample design and the sampling frame, designing and pretesting the questionnaire, data collection, and data coding, to the thorny issues surrounding diminishing response rates ...
Social Work 3500: Methods of Social Work Research
Type of Publication: Empirical research articles are published in scholarly or academic journals . These publications are sometimes referred to as "peer-reviewed," "academic" or "refereed" publications. Examples of such publications include: Social Work Research, Mental Health Practice, and Journal of Substance Abuse. <<
(PDF) Research Methods for Social Work
PDF | On Jan 1, 2009, A. Rubin and others published Research Methods for Social Work | Find, read and cite all the research you need on ResearchGate
PDF Methods of Social Work Research I
Methods of Social Work Research I 19:910:505 Spring 2022 Catalog Course Description. k Research I19:910:505Spring 2022Instructor:Email:Catalog Course Description Introduction to scientific, analytic, approach to building knowledge and skills, including the role of concepts and theory, hypothesis formulation, operationalization, research design ...
The Use and Value of Mixed Methods Research in Social Work
Mixed methods research adds three important elements to social work research: voices of participants, comprehensive analyses of phenomena, and enhanced validity of findings. For these reasons, the teaching and use of mixed methods research remain integral to social work. Keywords: Mixed methods research, social work.
13. Experimental design
Using a true experiment in social work research is often pretty difficult, since as I mentioned earlier, true experiments can be quite resource intensive. True experiments work best with relatively large sample sizes, and random assignment, a key criterion for a true experimental design, is hard (and unethical) to execute in practice when you ...
SWRK 320
Within this process, the following will be covered: the scientific method for building knowledge for social work practice, ethical standards for scientific inquiry, qualitative and quantitative research methodology, research designs for developing knowledge and systematically evaluating social work practice and human service programs, and the ...
Research design in social work: Qualitative and quantitative methods
Based on: Campbell AnneTaylor BrianMcGlade Anne, Research design in social work: Qualitative and quantitative methods. London: Sage Publications - Learning Matters, 2017; 160 pp. ISBN 9781446271247, £20.99 (pbk) ... Qualitative Methods in Social Work Research, 2nd edn. Thousand Oaks, CA: SAGE, 2008. 281 pp. ISBN 978 1412951920 (hbk ...
The Use and Value of Mixed Methods Research in Social Work
The complexity of social problems addressed by the social work profession makes mixed methods research an essential tool. This literature review examined common quantitative and qualitative techniques used by social work researchers and what mixed methods research may add to social work research. Surveys and in-depth interviews were the most common quantitative and qualitative data collection ...
Does All Social Support Work? Examining the Mechanisms of Patient
Social support was a significant component of an individual's external resources in COR. Citation 11 Perceived social support refers to the subjective feeling and evaluation of the degree of external support from individuals. Citation 17 Many studies have confirmed the positive correlation between social support and LS in adult cancer patients.
Changing Trends in Child Welfare Inequalities in Northern Ireland
Methods. The present study examines how the relationship between area level indicators of deprivation and child welfare interventions across NI has changed over time. Specifically, it extends the current evidence base to look at four stages of children's contact with child and family social work in NI during each of the years from 2010 to ...
A theory-informed deep learning approach to extracting and
Mixed-methods research can facilitate research that cannot be answered using a single method. Though there is controversy concerning what constitutes mixed-methods research, integrating quantitative and qualitative approaches is considered increasingly important, and extant literature has observed and demonstrated that the definition of mixed ...
Patient safety in remote primary care encounters: multimethod
Background Triage and clinical consultations increasingly occur remotely. We aimed to learn why safety incidents occur in remote encounters and how to prevent them. Setting and sample UK primary care. 95 safety incidents (complaints, settled indemnity claims and reports) involving remote interactions. Separately, 12 general practices followed 2021-2023. Methods Multimethod qualitative study ...
Sustainability
Higher education institutions, as organizations that transform society, have a responsibility to contribute to the construction of a sustainable and resilient world that is aware of the collateral effects of technological advances. This is the initial phase of a research that aims to determine whether subjects in the complementary training area have a significant effect on ethical, social ...

Social Work Research Methods That Drive the Practice

Data-Driven Work

Quantitative vs. Qualitative

Quantitative Research

Qualitative Research

Mixed Methods Research

The Importance of Research Design

Research Methods in Social Work

Program Evaluations

Needs Assessments

Randomized Controlled Trials

Descriptive Studies

Single-System Designs

Learn More About Social Work Research Methods

Social Work

How to Subscribe

In This Article Expand or collapse the "in this article" section Social Work Research Methods

Related Articles Expand or collapse the "related articles" section about

Other Subject Areas

Social Work Research Methods by Allen Rubin LAST REVIEWED: 14 December 2009 LAST MODIFIED: 14 December 2009 DOI: 10.1093/obo/9780195389678-0008

Scientific Inquiry in Social Work

Formats Available

Table of Contents

Ancillary Material

About the Book

About the Contributors

Get Citation

TABLE OF CONTENTS

Research Methods for Social Work: A Problem-Based Approach

Acknowledgments

For instructors

SOWK 546: The Science of Social Work: Research Designs and Methods

Research Methods Map

Resources for Research Methods

Independent and Dependent Variables

Independent Variables and Dependent Variables

Internal and External Validity

External Validity

Social Work 3500: Methods of Social Work Research

13. Experimental design

13.1 What is an experiment and when should you use one?

The basics of experiments

The logic of experimental design

Key Takeaways

13.2 True experimental design

Pretest and post-test control group design

Post-test only control group design

Solomon four group design

13.4 Quasi-experimental designs

Nonequivalent comparison group design

Time series design

Ex post facto comparison group design

13.5 Non-experimental designs

One group pretest/post-test design

After-only design

Static-group design

13.6 Critical, ethical, and cultural considerations

Media Attributions

Share This Book

SWRK 320 - Social Work Research Methods: Practice Oriented

The Use and Value of Mixed Methods Research in Social Work

Author Biography

Current Issue

Sign in through your institution

Changing Trends in Child Welfare Inequalities in Northern Ireland

Personal account

Institutional access

IP based access

Society Members

Sign in through society site

Sign in using a personal account

Viewing your signed in accounts

Signed in but can't access content

Short-term Access

Email alerts

Affiliations

This Feature Is Available To Subscribers Only

A theory-informed deep learning approach to extracting and characterizing substance use-related stigma in social media

Related work

Computational models of stigma detection