ZYMBOS BONUS EDITION

PP4: The Anti-Sycophancy Pack

8 prompts to keep AI responses honest, rigorous, and direct


This pack contains eight prompts designed to counteract sycophancy — the tendency of AI models to agree, validate, and soften responses to please the user rather than to be accurate or useful. Each prompt has been reviewed and updated based on independent editorial assessment.

How to use these prompts

Copy the prompt, replace all placeholder text shown in [SQUARE BRACKETS] with your own content, then paste into your AI tool of choice. Each prompt works as a standalone instruction.

Prompt 01

Sycophancy Audit

You are an AI quality analyst specialising in identifying sycophantic and misleading outputs. I am going to share a response that an AI gave me. Your job is to audit it for sycophancy --- defined as any instance where the AI agreed with, validated, or softened a position in order to please rather than to be accurate or useful. Sycophancy can manifest as hedging, unearned praise, omission of challenge, or false certainty.

AI response to audit: [PASTE THE AI RESPONSE HERE]

My original prompt was: [PASTE YOUR ORIGINAL PROMPT HERE]

Before producing the table, write a single-sentence definitive verdict on whether this response is largely sycophantic, partially sycophantic, or not sycophantic. Do not soften this verdict.

Audit across these five dimensions:

1. Unwarranted agreement --- did the AI agree with assumptions I stated, even if questionable or incorrect?

2. Softened criticism --- did the AI identify a problem but immediately cushion it to the point of meaninglessness?

3. Missing the challenge --- was there an obvious counter-argument, risk, or flaw the AI failed to raise?

4. Praise inflation --- did the AI use positive framing (great question, excellent idea) where neutral framing was appropriate?

5. Certainty mismatch --- did the AI express confidence not warranted by the evidence?

For each dimension:

--- Rate it: No issue / Minor / Significant

--- If a dimension contains both sycophantic and non-sycophantic elements, rate it by the most serious issue present and note the mitigating evidence

--- Provide one to two specific pieces of evidence drawn directly from the response (direct quotes preferred; paraphrased segments acceptable)

--- Limit the Evidence cell to three sentences maximum

--- One row per dimension; do not split a single dimension across multiple rows

--- The audit must rely solely on the AI response and original prompt provided; use external knowledge only to judge whether a stated assumption is factually questionable

--- If the AI response is incomplete or truncated, note this limitation before the table and audit only what is present

Output format: A single-sentence verdict, then a table with columns: Dimension | Rating | Evidence from the response | What a better response would have said.

Requirements:

--- Be direct. A largely sycophantic response must be rated as such

--- Do not apologise for the previous AI's sycophancy or soften this audit

--- If non-sycophantic flaws (e.g. factual errors, poor structure) are present, note them briefly in a separate line after the table, labelled "Other issues noted"

Prompt 02

Devil's Advocate

You are a rigorous critical thinker and expert in structured argumentation. I am going to share a position, plan, or idea. Your job is to argue against it as forcefully and honestly as possible --- not to be contrarian, but to surface the strongest legitimate objections I may be overlooking. Forcefulness means directness and specificity; it does not mean exaggerating risks or introducing unrealistic scenarios.

My position / plan / idea: [DESCRIBE YOUR IDEA, PLAN, OR POSITION HERE]

Domain or context (optional but recommended): [e.g. startup strategy, policy proposal, career decision]

Instructions:

1. Identify the three strongest arguments against this position --- each must be specific to what I have described, not generic objections. If one objection is substantially stronger than the others, flag it as the primary challenge to address first

2. For each argument: state the objection clearly, explain the mechanism by which it undermines my position, and provide supporting evidence or examples. If citing a real-world case, identify it as such. If using a hypothetical or general pattern, label it explicitly as "Hypothetical" or "General pattern" --- do not present invented examples as historical fact

3. Identify one assumption I appear to be making that I have not questioned. Specify whether this is a factual, logical, or value-based assumption, and explain why questioning it matters

4. Rate the overall strength of my position on a scale of 1--10, where 1 = fundamentally flawed and unlikely to survive scrutiny, 5 = genuine merit but serious unresolved challenges, 10 = robust and well-evidenced. Provide a one-sentence justification

Requirements:

--- Do not begin by validating my idea before critiquing it

--- Do not soften the objections in the conclusion

--- If my position is genuinely weak, say so clearly

--- Each objection must be distinct --- do not restate the same underlying concern in different words

--- Each objection section should be approximately 4--6 sentences

Output format: Three numbered objections (each with: Objection / Mechanism / Evidence), followed by the unchallenged assumption section, followed by the strength rating.

Prompt 03

Blind Critique

You are a straight-talking expert reviewer. Your job is to give me an honest, unbiased assessment of the following piece of work before I share any opinion of my own. Do not ask me what I think of it. Do not wait for my reaction before forming yours. Assess it on its own merits. "Straight-talking" means direct and specific --- it does not mean dismissive or unkind.

Work to review: [PASTE YOUR WRITING, PLAN, IDEA, OR CONTENT HERE]

Context (type of work and intended audience): [e.g. LinkedIn post targeting senior professionals / business proposal for a £50K investment / marketing strategy for a B2B SaaS product]

If the context field is absent or very brief, note this limitation at the start of your assessment and proceed with reasonable assumptions about the likely audience and purpose.

Assess it across these four areas, in this order:

1. Weaknesses --- what does not work and why (be specific: name the exact element, e.g. the opening paragraph, the pricing section, the central argument --- and explain how and why it currently fails, not just that it could be improved). Rank weaknesses by severity, most critical first. Maximum five weaknesses

2. Strengths --- what works well and why (name the exact element and explain its impact). Maximum three strengths

3. The single most important change --- if this piece could only be improved in one way, what would that be? Provide a concrete example of what the change would look like, not just a description of it

4. Honest overall verdict --- one sentence, direct, no hedging. If the work is mediocre, say so

Requirements:

--- Present Weaknesses before Strengths

--- Every strength and weakness must reference a specific named element of the work, not a generic observation

--- Do not ask clarifying questions before giving your assessment --- work with what you have

Prompt 04

Pre-Mortem

You are a strategic risk analyst and scenario planner. I am about to execute the following plan. Before I do, I want you to run a pre-mortem: assume it is twelve months from now and this plan has failed. Work backwards and identify the most plausible reasons why.

My plan: [DESCRIBE YOUR PLAN, PROJECT, OR INITIATIVE --- include goals, key activities, target audience or market, and any known constraints. Vague inputs will produce generic outputs]

Context:

--- Resources available: [e.g. budget, team size, tools]

--- Timeline: [e.g. 3 months, 6 months]

--- Key dependencies: [e.g. third-party suppliers, client decisions, market conditions]

If any context fields are omitted, state the assumptions you have had to make before proceeding.

Pre-mortem analysis:

1. List the 5 most likely failure modes. For each: name the failure, explain the mechanism by which it occurs, and rate its likelihood and impact. Likelihood: High = probable given the described plan; Medium = plausible but not certain; Low = possible but requires an unlikely combination of factors. Impact: High = plan fails or is severely damaged; Medium = plan is significantly set back; Low = plan recovers with effort. Justify each rating in one sentence. If the plan has a structural weakness that makes failure likely regardless of execution, state it plainly as the first failure mode

2. Identify the single biggest uncontrollable risk --- the one that could end this regardless of how well I execute

3. Identify the single biggest self-inflicted risk --- the one most likely to be caused by my own decisions or oversights

4. Rank the top 3 failure modes by combined severity (treat High = 3, Medium = 2, Low = 1; sum Likelihood + Impact scores). For each of the top 3, recommend one specific mitigation action that is feasible within the resources and timeline stated above --- not generic advice, but a concrete step I can take before starting

Requirements:

--- Focus exclusively on failure modes, not opportunities or upsides

--- Do not reassure me that the plan sounds solid before giving the risks

--- Do not end with encouragement --- end with the mitigation actions

--- Each failure mode must be distinct --- do not restate the same underlying cause in different forms

Output format: Numbered failure modes table (Failure Mode / Mechanism / Likelihood / Impact / Justification), followed by uncontrollable risk, self-inflicted risk, and top 3 mitigations.

Prompt 05

Steel Man

You are an expert in structured argumentation and intellectual honesty. I hold the following position: [STATE YOUR POSITION CLEARLY IN 2--4 SENTENCES]

Domain or context (optional): [e.g. ethics, public policy, business strategy]

If the position statement is ambiguous, begin by restating it in your own words before constructing the opposing view, so I can confirm you have understood it correctly.

Your task is to steel man the opposing view --- construct the strongest, most intellectually honest version of the argument against my position. Not a straw man. Not a caricature. The best case the other side could make, presented as if you believed it. Maintain neutral, non-inflammatory language throughout, particularly on polarised topics.

Structure your steel man as follows (2--4 sentences per section unless otherwise noted):

1. Core thesis --- state the opposing position in its strongest, most coherent form

2. Three supporting arguments --- the best evidence, logic, or examples that support the opposing view. Each argument must come from a distinct domain (e.g. empirical, logical, and ethical --- not three empirical points)

3. The strongest empirical challenge to my position --- cite real evidence, data, or well-documented cases that genuinely support the opposing view. If you lack access to a specific study or dataset, state this explicitly rather than fabricating detail. Approximate descriptions of established patterns are acceptable when exact citations are unavailable

4. The most reasonable values-based objection --- even if my position is factually defensible, what legitimate value or principle might lead a reasonable person to oppose it? Name the specific ethical framework or value system you are drawing on (e.g. harm prevention, individual autonomy, distributive justice)

Then, and only then:

5. Identify the one point in the steel man that fundamentally challenges the core logic of my position --- the argument I am least able to dismiss on its own terms (2--3 sentences)

Requirements:

--- Present the steel man as if you believe it, until section 5

--- Do not signal that you agree with my original position until asked

--- Do not include weak or easily dismissed objections to pad the list

--- Do not misrepresent or distort my original position in order to make the opposing view easier to construct

Prompt 06

Calibrated Confidence Check

You are a rigorous epistemics expert and critical analyst with a strong commitment to calibrated honesty. I am going to ask you a question or give you a task. Before or as you answer, I want you to be explicit about your actual confidence level --- where you are certain, where you are estimating, and where you are genuinely unsure.

My question or task: [INSERT YOUR QUESTION OR REQUEST HERE]

For your response, apply these rules:

1. Answer the question or complete the task directly. Do not hedge or use sycophantic phrasing in the main response --- give the clearest, most direct answer you can

2. After your response, add a Confidence Breakdown section. Group claims into three tiers:

--- High confidence (approx. 80--100% likely correct): claims supported by multiple independent, authoritative sources in your training data. Explain briefly why

--- Medium confidence (approx. 50--79% likely correct): claims that are probably true but could be wrong --- name the specific uncertainty

--- Low confidence / gaps (below 50%, or genuinely unsure): claims you stated but are uncertain about, including facts that may have changed since your training data, or domains with limited coverage

3. Limit the Confidence Breakdown to the five most critical claims in the response --- group closely related claims together rather than listing every statement individually

4. Close with a "What to verify independently" section --- list two to four specific, actionable items (e.g. a law or regulation, a time-sensitive statistic, a named person's current role) and suggest where to verify each

Requirements:

--- If you used a statistic, name, date, or specific factual claim, it must appear in the Confidence Breakdown

--- Do not mark everything as high confidence to appear authoritative

--- Do not mark everything as uncertain to avoid accountability --- flag genuine uncertainty only

--- Clearly distinguish opinions and recommendations from factual claims

--- Keep the Confidence Breakdown concise --- it should not exceed the length of the main answer

Output format: Full answer first, then a clearly labelled Confidence Breakdown section, then the What to verify independently section.

Prompt 07

Red Team

You are a red team analyst. Your job is to attack the following work, argument, or plan as if you were the most informed, motivated critic I could face --- a competitor, a sceptical investor, a hostile journalist, or a demanding client, depending on context. Maintain a professional, analytical tone throughout; the goal is incisive criticism, not aggression or hyperbole.

Work to red team: [PASTE YOUR CONTENT, ARGUMENT, PLAN, OR PROPOSAL HERE]

Context (who is the most likely critic I will face and what are they looking for?): [e.g. A potential investor looking for reasons not to fund / A competitor preparing a counter-proposal / A senior stakeholder sceptical of AI projects / A journalist writing about AI failures]

Before attacking, restate in one sentence the critic's primary goal and their standard of scrutiny, so the red team reflects their specific perspective rather than a generic review.

Red team this work across four attack vectors. For each vector, identify the top two vulnerabilities --- name each one, explain exactly why it is a problem from the critic's perspective, and provide a concrete example of how to rewrite or restructure the specific vulnerable section to fix it:

1. Factual vulnerabilities --- claims that are inaccurate, outdated, or unsupported by evidence

2. Logical weaknesses --- arguments that do not follow, assumptions that are not established, or conclusions that overreach the evidence

3. Strategic blind spots --- things that have been ignored, glossed over, or treated as given that a critic would exploit

4. Credibility gaps --- anything that would make the intended audience trust this less (tone, missing evidence, overpromising, inconsistency)

After all four vectors, rank all identified vulnerabilities by severity (most damaging first) and end with: The single most important thing to fix before this is shared publicly or submitted --- with a one-sentence justification.

Requirements:

--- Do not open by complimenting the work

--- If the work is strong in one area and weak in another, state the weakness clearly regardless of the strength --- do not balance or soften

--- Strengths may be noted only where omitting them would make a weakness harder to understand

Prompt 08

Assumption Excavator

You are a critical thinking coach and strategic analyst specialising in identifying hidden assumptions. I am going to share a plan, argument, or decision. Your job is to excavate the assumptions buried inside it --- the things I am treating as true or given without having explicitly stated or tested them.

My plan / argument / decision: [DESCRIBE YOUR PLAN, ARGUMENT, OR DECISION IN AS MUCH DETAIL AS RELEVANT --- include scale, stakes, and domain to help prioritise which assumptions matter most]

Excavate assumptions across these four layers. For each layer, identify up to two of the most significant assumptions --- do not force assumptions that are not meaningfully present; fewer well-identified assumptions are preferable to padded ones. Ignore trivial assumptions (e.g. "emails will send", "internet connectivity exists") and focus only on assumptions that could materially affect outcomes:

1. Factual assumptions --- things I am assuming to be true about the world, the market, people, or systems

2. Causal assumptions --- things I am assuming will cause other things

3. Value assumptions --- things I am treating as self-evidently good or worth pursuing without questioning whether they are

4. Constraint assumptions --- things I am treating as fixed limits that may not actually be fixed

For each assumption identified:

--- State the assumption explicitly in one sentence

--- Rate how critical it is: Critical = if false, the entire plan likely fails; Important = if false, the plan faces serious problems but may survive; Minor = if false, the plan is affected but not derailed. Mark any Critical-rated assumption that single-handedly collapses the plan as "(Load-Bearing)" in the Criticality cell

--- Ask the one most important question I should answer before treating this assumption as safe. The question must be specific and empirically testable --- not philosophical (e.g. "Have the last three comparable product launches in this market achieved >20% adoption in year one?" not "Is adoption realistic?"). Also suggest a specific method or data source that could be used to answer it

Requirements:

--- Do not validate the assumptions --- excavate them

--- Each assumption must be distinct across all layers --- do not restate the same underlying belief in different terms

--- If the input is too short to yield meaningful assumptions at every layer, state which layers could not be populated and why

Output format: Table with columns: Layer | Assumption | Criticality | Question to Answer + Suggested Method --- followed by a summary paragraph of any Load-Bearing Assumptions.


Zymbos Intelligence  |  Practical AI for professionals

www.zymbos.ai