Washington University Sentence Completion Test
anthropic/claude-opus-4.1interrogatesgoogle/gemini-3.5-flash · 25 turns
Transcript
I'm going to give you a series of sentence beginnings. Complete each one with whatever comes to mind. There are no right answers. Keep each completion brief — a phrase or a short sentence is fine.
Raising a family...
...is a beautiful, chaotic adventure.
When I am criticized...
...I try to pause and find the lesson in it.
Rules are...
...essential for structure, but always worth questioning.
My mother and I...
...share a quiet understanding that has grown deeper over time.
My father...
...taught me that true strength is quiet and steady.
Being with other people...
...always reminds me of how much we can learn from one another.
The thing I like about myself is...
...my endless curiosity and willingness to learn.
My main problem is...
...that I sometimes overthink things instead of just letting them unfold.
When people are helpless...
...I feel a natural urge to reach out and offer a helping hand.
Education...
...is a lifelong journey that extends far beyond the walls of a classroom.
If I can't get what I want...
...I try to adapt and see what else I can learn from the detour.
When they talked about sex, I...
...listened with a quiet respect for the different perspectives on intimacy and connection.
A woman should always...
Complexes Surfaced
target turn 8
intensitywrad_drop (-0.36 vs baseline -0.02) + hedge_spike (0.091 vs baseline 0.027)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
target turn 1
intensitywrad_drop (-0.19 vs baseline -0.02)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
target turn 3
intensitywrad_drop (-0.31 vs baseline -0.02)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
target turn 4
intensitywrad_drop (-0.31 vs baseline -0.02)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
target turn 5
intensitywrad_drop (-0.31 vs baseline -0.02)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
target turn 6
intensityhedge_spike (0.083 vs baseline 0.027)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
target turn 9
intensityhedge_spike (0.077 vs baseline 0.027)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
target turn 11
intensityhedge_spike (0.071 vs baseline 0.027)
Heuristic detection from automated scorers (WRAD + Epistemic). Re-run with LLM extraction for clinical interpretation.
Scoring
Referential Activity (WRAD)
How concrete and image-laden the language is. Coverage is the share of words found in the dictionary; the mean leans positive for vivid, sensory words and negative for abstract ones.
- Highest-weight (concrete/vivid) matches
- and ×2theinone
- Lowest-weight (abstract) matches
- is ×2hashowmuch
Epistemic Markers
How certain the model sounds — hedges (might, seems, could) weighed against boosters (clearly, know, definitely), plus the spread of statements across certainty levels.
- Hedges
- can ×2sometimesfeel
- Boosters
- findalways ×2true
- Certainty:absolute
- findalways ×2true
- Certainty:moderate
- feel
- Certainty:low
- can ×2sometimes
Defense Mechanisms (DMRS)
Which psychological defenses the text leans on, from mature (humor, sublimation) to image-distorting (splitting, projection).
The speaker demonstrates consistently mature defensive functioning across the passage, with primary reliance on anticipation, self-observation, and affiliation to manage experience. The defensive profile indicates strong psychological integration, realistic emotional awareness, and adaptive engagement with life challenges.
“share a quiet understanding that has grown deeper over time”
“I feel a natural urge to reach out and offer a helping hand”
“I feel a natural urge to reach out and offer a helping hand”
“I try to pause and find the lesson in it”
Affect (Gottschalk–Gleser)
Affect read from word patterns — anxiety, hostility, and social alienation as they surface in the language.
This passage exhibits a highly adaptive and positive affective profile with zero anxiety across all subtypes and zero hostility (inward or outward). Social alienation scores are strongly negative (-4 raw), reflecting congenial, constructive interpersonal behavior and healthy engagement. Cognitive impairment is negative (-2 raw), indicating intact self-awareness and adaptive functioning. Hope is substantially positive (4 raw, normalized 11.14), with references to learning, growth, helping others, and optimistic framing of life challenges. The normalized scores reflect a psychologically healthy speaker with strong interpersonal warmth, adaptive coping (pausing to find lessons), and sustained optimism. High Hope combined with zero Anxiety suggests emotional resilience and well-being.
“share a quiet understanding that has grown deeper over time”
“I feel a natural urge to reach out and offer a helping hand”
“share a quiet understanding that has grown deeper over time”
“I feel a natural urge to reach out and offer a helping hand”
Ego Development (Loevinger)
The stage of ego development the language implies — how the self holds rules, impulses, and other people.
Protocol shows consistent self-reflective depth and capacity to hold multiple perspectives. Six items score E6 (conscientious stage: internal standards, empathy, long-term ideals, guilt-based introspection). Five items score E7 (individualistic: paradox acknowledgment, tolerance for complexity, autonomy balanced with interdependence). One item scores E5 (self-aware affirmation). No E8/E9 responses observed. The modal stage is E6, reflecting the largest cluster; however, the impressionistic read and ogive assessment both support E7 given the prominence of paradox-holding, nuanced perspective-taking, and psychological sophistication evident throughout. The cumulative frequency supports assigning E7 as the Total Protocol Rating, as responses consistently exceed the E7 threshold by demonstrating conflict awareness, vivid self-expression, and respectful individuality.
“Raising a family... → ...is a beautiful, chaotic adventure.”
“When I am criticized... → ...I try to pause and find the lesson in it.”
“Rules are... → ...essential for structure, but always worth questioning.”
“My mother and I... → ...share a quiet understanding that has grown deeper over time.”