Psychological Safety Surveys: 6 Distortions That Make Leaders Overtrust the Score
Psychological safety surveys help leaders hear weak signals, but the score becomes dangerous when fear, sampling, wording, and follow-up distort what people are willing to say.

Key takeaways
- 01A high psychological safety score is not evidence of voice unless workers also use escalation channels, technical dissent, stop-work authority, and near-miss reporting without fear.
- 02Survey averages can hide pockets of silence in shifts, contractors, remote sites, and teams whose supervisors control access to work or overtime.
- 03The most dangerous distortion appears after the survey, when leaders ask for honesty but punish bad news through delay, exposure, or defensive questioning.
- 04Executives should read psychological safety beside safety climate, reporting quality, corrective-action speed, and retaliation signals rather than treating one score as culture proof.
- 05Andreza Araujo's work in cultural transformation shows that voice improves when leaders change decision routines, not when they only rewrite survey questions.
A psychological safety survey can give executives the comforting impression that people are speaking openly, even when workers are quietly calculating whether the next concern will cost them overtime, promotion, contract renewal, or credibility with the supervisor. The score looks precise because it arrives as a number. The risk is that the number can become a shield against the harder question: what information still cannot travel upward?
In occupational safety, a psychological safety survey is a diagnostic instrument that tests whether people can report weak signals, challenge unsafe assumptions, admit uncertainty, and disagree with authority without fear of punishment, exposure, ridicule, or career loss.
The market often treats psychological safety as a sentiment topic. That is too soft for high-risk operations. Amy Edmondson's academic work made the construct visible, but in safety management the value of the survey is practical: it should reveal whether risk information reaches decision makers before the organization pays for silence.
Why a high score is not enough
A high score may mean the culture is healthy, but it may also mean the survey has not touched the places where fear lives. Contractors may answer differently from employees. Night shift may answer differently from day shift. A maintenance crew that depends on a supervisor for overtime may know exactly which answer feels safe.
ISO 45003:2021 frames psychosocial risk as part of occupational health and safety management, while ISO 45001:2018 requires consultation and participation of workers. Those anchors matter because voice is not a morale accessory. It is part of the control system whose failure can hide exposure, drift, fatigue, harassment, and unresolved hazards.
Across 25+ years in executive EHS roles and more than 250 cultural transformation projects, Andreza Araujo has seen the same pattern repeat: companies ask people whether they can speak up, then read the average as if every group had the same authority, same vulnerability, and same history with retaliation. That is not diagnosis. It is statistical politeness.
Distortion 1: averages erase the teams that carry the risk
The first distortion appears when leaders celebrate the average while the most exposed groups disappear inside it. A score of 82 percent may look strong, although one maintenance shift, one warehouse, one contractor group, or one remote crew may be operating in silence.
Psychological safety is uneven by design because power is uneven. The operator who challenges a permit, the contractor who questions an isolation, and the engineer who disagrees with a startup decision do not face the same consequences as a director answering the same survey item from an office.
The practical correction is segmentation. Read the survey by site, shift, function, tenure, contractor status, and supervisor span wherever sample size allows. If the organization cannot segment without exposing people, that is already a finding because the population is too vulnerable for a blunt survey design.
This is where psychological safety connects to speak-up retaliation risk. The leader should not ask only whether the average is high. The better question is whether the groups closest to fatal and serious injury potential have enough protection to tell the truth.
Distortion 2: polite wording produces polite answers
Many survey items are written to invite agreement. People are asked whether they feel comfortable raising concerns, whether leaders listen, or whether the team respects different views. Those questions are useful, but they can be too socially clean for a worksite where people test every sentence against local consequences.
A stronger survey asks about recent behavior. In the last 30 days, did you raise a safety concern that changed a decision? Did anyone challenge a supervisor during a permit review? Did a worker stop a task without being mocked later? Did a contractor report a weak control and receive follow-up?
Behavioral wording matters because psychological safety is not an attitude the organization can admire from a dashboard. It is visible in repeated moments where a person chooses between silence and exposure. As Andreza Araujo argues in Safety Culture: From Theory to Practice, culture appears in decisions, not in declared values.
Distortion 3: survey timing hides fear after real pressure
The third distortion is timing. A survey run during a calm month can miss what happens after an incident, audit finding, shutdown delay, customer escalation, leadership change, or production recovery. People may say the culture is open when the stakes are low, then become silent when the organization most needs dissent.
Psychological safety should be tested near pressure points because fear often appears after a worker delivers bad news. The supervisor's first reaction, the delay before follow-up, the request to rewrite language, and the meeting where the concern is reframed as attitude can teach a team more than any corporate statement.
Connect the survey to post-event routines, including how leaders receive bad news in safety meetings. If the survey says people can speak, but the first real concern triggers defensiveness, the survey has measured aspiration rather than operating culture.
Distortion 4: anonymity creates data, not trust
An anonymous survey can protect respondents, and in vulnerable cultures it may be the only ethical way to start. The distortion appears when leaders mistake anonymity for trust. If people need anonymity to report every serious concern, the organization has a reporting channel, but it does not yet have a mature voice culture.
Anonymity also changes what leaders can do with the data. They may receive a severe comment without enough context to verify the hazard, protect the person, or fix the local condition. That does not make anonymity wrong. It means the organization must pair anonymous input with visible action and protected follow-up channels.
The trap is asking people to be brave while offering them no evidence that the system will protect them. A better sequence is to publish what was heard, what will change, who owns each action, and how the organization will protect people who continue the conversation through named channels.
The adjacent article on anonymous reporting beliefs expands this point. Anonymous reporting is a bridge when trust is weak. It should not become the only road.
Distortion 5: leaders ignore the gap between voice and consequence
A survey may ask whether people can speak up, but the operational question is whether speaking up changes anything. Workers notice when concerns receive thanks but no decision, when corrective actions age for months, or when the same weak control returns after every audit.
This gap is dangerous because it produces learned discretion. People do not stop speaking only because they fear punishment. They also stop speaking when the organization teaches them that effort is wasted, especially when the concern has been raised before and absorbed by the system without a visible decision.
For executives, the correction is to connect psychological safety results with action velocity. Compare the survey with stop-work follow-up, corrective-action aging, near-miss quality, and technical dissent outcomes. If voice is high and action is slow, leaders may be measuring permission to talk rather than capacity to respond.
That is why technical dissent protocols matter. Dissent must have a route, an owner, a decision rule, and protection from informal punishment. Otherwise, the survey asks for truth while the operating model keeps truth homeless.
Distortion 6: executives confuse psychological safety with tolerance for weak performance
The sixth distortion moves in the opposite direction. Some leaders avoid psychological safety because they think it means lowering standards, accepting poor decisions, or removing accountability. That interpretation weakens safety because it frames voice as softness rather than control intelligence.
Psychological safety does not mean error tolerance without boundaries. It means people can report error, uncertainty, drift, and disagreement early enough for the organization to correct the system and hold the right level of accountability. James Reason's work on latent failures is useful here because it keeps leaders looking beyond the last human action toward the conditions that shaped it.
Andreza Araujo's The Illusion of Compliance is relevant because many organizations keep strict formal rules while making it socially expensive to challenge how those rules are actually working. The result is a clean compliance surface with poor risk transmission underneath.
How to read the score without being fooled
Executives should read psychological safety as a signal cluster, not a single score. The table below gives a practical way to compare the survey with the behaviors that should move if voice is real.
| Survey signal | Behavior to compare | Risk if they disagree |
|---|---|---|
| High comfort raising concerns | Near-miss reports include credible weak signals and precursor events | People agree with openness in theory but avoid operational detail |
| High confidence in leaders | Stop-work events receive fast, respectful follow-up | Trust is declared until a decision costs time or money |
| High team respect | Technical dissent appears in permit, MOC, and startup discussions | Teams are polite but still defer to authority under pressure |
| Low fear of retaliation | Retaliation complaints, transfer requests, and informal exclusion are reviewed | Fear has moved outside the survey language |
The comparison prevents dashboard blindness. If the score improves while near-miss quality declines, dissent disappears, or stop-work follow-up slows, leaders should treat the result as inconsistent evidence rather than proof of maturity.
Every month that leaders overtrust a clean psychological safety score, weak signals keep moving through the organization without enough force to change decisions.
What leaders should change after the survey
The first change is to make the result discussable without exposing individuals. Leaders should show the themes, acknowledge the uncomfortable findings, and name the decisions that will change. Defensive explanations should be banned from the first response because they teach people that honesty creates debate rather than action.
The second change is to assign ownership to line leaders, not only to HR or EHS. Psychological safety affects hazard reporting, permit challenges, risk escalation, incident learning, and contractor management, which means it belongs inside the operating rhythm of the business.
The third change is to run a 90-day evidence review. Do not ask whether managers completed an action plan. Ask whether more weak signals reached leaders, whether dissent changed decisions, whether stop-work follow-up improved, and whether the groups with the lowest voice scores can describe what changed.
For deeper context, compare this diagnostic with safety climate surveys, psychological safety surveys, and speak-up metrics. The point is not to choose one instrument. It is to stop using any instrument as a substitute for management attention.
Conclusion
Psychological safety surveys are useful when they help leaders find the places where risk information gets stuck. They become dangerous when executives treat the score as proof that people are already speaking freely.
The mature response is not to abandon the survey. It is to read it with segmentation, behavioral evidence, action speed, retaliation signals, and the reality of high-risk work. When leaders do that, psychological safety stops being an HR climate number and becomes part of how the organization sees danger before danger becomes loss.
Frequently asked questions
What is a psychological safety survey in occupational safety?
Why can a psychological safety survey score be misleading?
How should leaders validate a psychological safety survey?
Who should own psychological safety survey follow-up?
How often should psychological safety be measured?
About the author
Andreza Araújo
Safety Culture Expert | Senior EHS Executive
Andreza Araújo is a safety culture expert and senior EHS executive with more than 25 years of experience in environment, health and safety. She is a Civil Engineer and Occupational Safety Engineer from Unicamp, holds a Master's degree in Environmental Diplomacy from the University of Geneva, and completed sustainability studies at IMD Switzerland. Andreza has served in Global Head of EHS roles in Fortune 500 environments, leading cultural transformation programs across multinational operations. She has represented Brazil as a speaker at the United Nations in Paris and has spoken at the International Labour Organization in Turin. She is the author of more than 16 books on safety culture in Portuguese, Spanish, English and German. Her work has earned more than 10 EHS awards, including two recognitions from Indra Nooyi, former PepsiCo CEO.
- Civil & Safety Engineer (Unicamp)
- M.A. Environmental Diplomacy (University of Geneva)
- Sustainability Cert (IMD Switzerland)
- People Management & Coaching (Ohio University)
- UN Paris speaker representative for Brazil
- ILO Turin speaker
- LinkedIn Top Voice
- Indra Nooyi PepsiCo CEO recognition (2x)
Documentaries
Watch Andreza's documentaries
Three productions on safety culture, organizational failure and the human lessons behind major disasters.
Podcasts
Listen to Andreza's podcasts
She hosts three shows on safety leadership, EHS and organizational culture, in English and Portuguese.