Back to Insights
Inside TARA · the bias engineAI & Bias · 16 min read · 22 June 2026

Many biases. One transparent score.

TARA listens to performance conversations and surfaces where bias may be creeping in — before a rating locks. Here's what it listens for, and how a single signal becomes a score you can audit.

This is a signal, not a verdict.
TARA bias engine — signals flowing into a fairness shield

Why this exists

Bias does its quietest damage in the review

Performance reviews are the moments where months of work collapse into a number. That number shapes pay, promotion, and — over time — whether an employee stays or leaves. It is also, research consistently shows, one of the most bias-saturated moments in a person's working life — not because managers are malicious, but because the whole process depends on memory, language, and human judgement operating under pressure.

TARA doesn't try to eliminate that. Bias is not a bug in human cognition — it is architecture. What TARA does is listen to the conversation as it happens, surface patterns that suggest bias may be shaping the narrative, and hand that signal to a human who can decide what to do with it. The goal is always the same: give the person who matters most — the one being reviewed — a fairer chance.

The goal isn't bias-free reviews. There's no such thing. The goal is fairer reviews — where bias gets caught and named before it hardens into a number.

— TalentSpotify

What TARA listens for

Biases are grouped into three families by the type of distortion they introduce.

8Cognitive

How memory, framing, and mental shortcuts distort the rating

RecencyPrimacyHalo EffectHorn EffectAnchoringConfirmationAttributionCentral Tendency
4Social

How identity, power, and similarity shape who gets the benefit of the doubt

Gender / ToneAgeSimilarity / AffinityHierarchical / Power
2Calibration

How scale usage drifts when managers avoid the extremes

LeniencySeverity
Three families, 14 biases, one weighted score.
Cognitive — 8 biases
01Recency BiasCognitiveHigh weight

What it is

The whole review period collapses into its final few weeks. Whatever happened most recently feels the most true — so a strong finish papers over a weak stretch, or one late stumble erases ten strong months.

Scenario

Priya shipped three major releases between April and December, then slipped on a December deadline while covering for a sick teammate. In the January review, her manager spends most of the conversation on that one slip. The year's body of work barely comes up.

Sounds like

She's been a bit shaky lately, honestly.

The fairer move

Anchor the conversation to evidence spread across the full period, not the last sprint. TARA flags when the language clusters around recent weeks only.
02Primacy BiasCognitiveMedium weight

What it is

The opposite anchor. A first impression — good or bad — sets a frame that later evidence struggles to shift. The opening read quietly becomes the lens for everything after it.

Scenario

Arjun fumbled his first month onboarding into a new team. Two quarters later he's one of its most reliable people — but his manager still describes him as 'someone we had to hand-hold early on,' and the rating reflects the rough start more than the steady climb.

Sounds like

He took a while to find his feet — that's kind of who he is.

The fairer move

Separate where they started from where they are now. Growth across the period is itself a signal worth rating.
03Halo EffectCognitiveMedium weight

What it is

One genuine strength spills over into unrelated areas. Because someone is excellent at one visible thing, they get the benefit of the doubt everywhere — without separate evidence.

Scenario

Neha is a superb presenter, so leadership assumes her project planning and stakeholder follow-through are just as strong. They're average. Because the polished demos are what everyone sees, the gaps never get named — and never get coached.

Sounds like

She's so impressive in the room, she must be on top of everything.

The fairer move

Rate each competency on its own evidence. A strength in one area is not proof of strength in another.
04Horn EffectCognitiveHigh weight

What it is

The mirror of the halo. One weakness darkens the entire evaluation, eclipsing the real strengths the person actually has.

Scenario

Rohan missed a high-visibility deadline early in the cycle. For the rest of the review, his manager reads even his strong collaboration and mentoring through the lens of 'unreliable,' and marks him down across the board.

Sounds like

After that miss, I just can't fully trust his output.

The fairer move

Contain the weakness to where it actually applies. A real gap in one area shouldn't quietly lower unrelated scores.
05Anchoring BiasCognitiveMedium weight

What it is

A number or label from before — last cycle's rating, a first guess — becomes the gravitational centre. The review nudges slightly around the anchor instead of starting fresh from this period's evidence.

Scenario

Last year Sara was a 'meets expectations.' This year she led a turnaround that beat every target, but the conversation keeps circling back to 'she's solid, a 3.' The new evidence inches the old number up a little instead of replacing it.

Sounds like

She's always been around a 3, so… maybe a 3-plus?

The fairer move

Build this cycle's rating from this cycle's evidence first, then check it against history. Not the other way round.
06Confirmation BiasCognitiveMedium weight

What it is

The conclusion comes first; the evidence-gathering then quietly selects for examples that fit it and skips the ones that don't.

Scenario

A manager has already decided Karan 'isn't leadership material.' In the review, they recall the two times he stayed quiet in a meeting — and never mention the project he led end-to-end last quarter. The story is built to confirm what they already believed.

Sounds like

Every example I can think of points the same way.

The fairer move

Actively go looking for the counter-evidence. If you can't find a single example that challenges your read, that's the warning sign.
07Attribution BiasCognitiveMedium weight

What it is

Outcomes get pinned on character instead of circumstance. A miss becomes 'they're not driven' rather than 'the goalposts moved three times' — and the situation that actually shaped the result disappears.

Scenario

Meera's project slipped because requirements changed twice and a key dependency landed late. Her manager records it as 'struggles with ownership.' The context that caused the delay never makes it into the review.

Sounds like

It's a motivation thing with him, I think.

The fairer move

Ask what the situation demanded before judging the person. Separate what happened to them from what they chose.
08Central Tendency BiasCognitiveMedium weight

What it is

Everyone gets rated into the safe middle. Avoiding the top and the bottom dodges hard conversations — and flattens the real differences between people.

Scenario

A manager rates all eight reports between 3.0 and 3.4 to 'keep things fair.' The standout performer and the person genuinely struggling end up looking almost identical on paper — and neither gets what they need.

Sounds like

I keep everyone around the middle — it's cleaner.

The fairer move

Let the evidence push ratings to the edges when it earns them. Compression isn't fairness; it's avoidance.
Social — 4 biases
09Gender / Tone BiasSocialProtectedHigh-risk

What it is

Assumptions tied to gender shape how the same behaviour is read — praised in one person, penalised in another. Assertive becomes 'aggressive'; warm becomes 'soft.'

Scenario

Ananya and a male peer both push back hard in planning meetings. He's described as 'decisive and direct'; she's described as 'abrasive and hard to work with.' The behaviour is identical — only the read differs.

Sounds like

She comes across too strong with the team.

The fairer move

Describe the behaviour and its impact, not the personality — and ask whether the same words would be used for someone else doing the same thing. A protected characteristic: highest weight, with a hard escalation floor.
10Age BiasSocialProtected

What it is

Assumptions tied to age, cutting both ways — younger people seen as not-ready-yet, older people as set-in-their-ways. Capability gets read off a birth year instead of the work.

Scenario

Vikram, 24, delivers a clean client project, but his manager hesitates to staff him on the next one because 'clients won't take him seriously.' A 52-year-old colleague is passed over for a new-tools initiative because 'he won't want to relearn all that.'

Sounds like

He's only 24 — I can't put him in front of the client.

The fairer move

Point to a specific, recent piece of evidence about this person's readiness. Age is not evidence — protected, highest weight, escalation floor.
11Similarity / Affinity BiasSocialHigh-risk

What it is

We rate people who remind us of ourselves more generously — same school, same background, same communication style. Comfort gets mistaken for competence.

Scenario

A manager and one report share an alma mater and an easy rapport. In calibration, that report consistently lands half a point above peers doing comparable work — not because of output, but because the working relationship feels effortless.

Sounds like

He just gets it — we think the same way.

The fairer move

Check whether your highest ratings cluster around the people most like you. Ease of working together is not a performance dimension.
12Hierarchical / Power BiasSocialHigh weight

What it is

Power gaps distort the conversation. Rank substitutes for evidence, or a senior voice flattens a junior one — so the review stops being a two-way exchange.

Scenario

In a skip-level review, a senior leader's read of an employee goes unchallenged even though the direct manager has more day-to-day evidence to the contrary. Nobody pushes back, and the senior opinion quietly becomes the rating.

Sounds like

Let's not overthink this — I've decided how it went.

The fairer move

Make sure the person closest to the work gets heard. Seniority should bring more evidence, not just more weight.
Calibration — 2 biases
13Leniency BiasCalibrationMedium weight

What it is

Ratings drift upward to avoid friction. It feels kind, but it quietly denies people the honest signal they need to grow — and erodes the meaning of the scale for everyone else.

Scenario

A manager who dreads difficult conversations marks a coasting team member a 4. Hearing nothing's wrong, the employee changes nothing — and is blindsided a year later when the gap finally can't be ignored.

Sounds like

I'll mark it a 4 — it's easier than getting into it.

The fairer move

Treat an honest, specific 3 as a gift, not a punishment. Inflated ratings are a debt that comes due later.
14Severity BiasCalibrationMedium weight

What it is

The mirror image — a bar set so high almost nobody clears it. Reads as rigour; lands as discouragement, and makes genuinely strong work look mediocre.

Scenario

A manager who 'doesn't give 5s on principle' rates an exceptional cycle a 3. The employee, having beaten every goal, walks away deflated — and starts quietly looking elsewhere.

Sounds like

Nobody on my team gets a 5 — that's just my standard.

The fairer move

Calibrate to the evidence and the shared scale, not a personal ceiling. If no one can earn the top, the top isn't real.

And one thing TARA does not score: harmful language.

Aggressive, demeaning or dismissive wording is not treated as bias and never touches the score. It routes straight to a separate human safety review. A “we should just let him go” said mid-review is a people-risk event in its own right — whether or not any bias scored high. A number should never decide whether words crossed a line.

From signal to score, transparently

Every bias TARA detects produces a signal. Signals combine into one auditable number. Here is how.

Detection and scoring stay separate

The AI detects

  • Which bias pattern is present
  • The exact quote triggering the signal
  • A plain-language reason
  • How confident it is

never mixes

The system decides

  • Aggregate the signals
  • Apply fixed severity weights
  • Normalise to 0–100
  • Route to the right action
These two never mix. The AI has no access to your settings — so it cannot produce a score, only the evidence behind one.

How the score works

Every signal TARA detects is evaluated against multiple factors — including how clearly the bias is present in the language, the inherent risk level of that bias type, and whether it appeared more than once in the conversation. These factors are combined into a single number between 0 and 100.

Weighted Bias Score — one number, 0 to 100.

The exact calculation is proprietary and not published. What is published: the evidence behind every flag, and the band it lands in.

The result is a single number from 0 to 100. That number maps to one of five action bands — each with a defined, automatic response.

example ≈ 43
Low
Awareness
Coaching
HR Review
Escalate
0–30
31–55
56–75
76–90
91+
Low (0–30)Note for context; no action required
Awareness (31–55)Manager nudge sent before rating closes
Coaching (56–75)Structured debrief with manager
HR Review (76–90)HR partner review before rating is finalised
Escalate (91+)Immediate escalation to senior HR or legal
The same score can trigger different actions depending on context — see infographic 6.

But the score is only half the story. Context determines where the band threshold sits. The same score can land in different bands depending on whether a protected characteristic was involved — because the stakes are different, not because the evidence changed.

Standard context

60WBS — same score
Coaching fires

The same score lands in the Coaching band under standard settings.

Protected-category context

60WBS — same score
Routes to HR Review

With a protected characteristic detected, the same score routes to HR Review instead.

Context changes the action, not the score. The score is always the same given the same evidence.

Two hardcoded floors ensure that certain signals can never be underweighted, regardless of how the rest of the conversation scored.

🛡️

Protected-category floor

Only escalates, never downgrades

When a bias touching a protected characteristic (gender, age) is detected, the score can only go up from the floor — not be averaged away by lower signals elsewhere.

🚨

Safety lane

Bypasses scoring entirely

Harmful, demeaning, or dismissive language is not scored as bias. It routes straight to a separate human safety review — a number should never decide whether words crossed a line.

Putting it all together: here is how a real review conversation — with three bias signals and a floor override — resolves into an outcome.

Illustrative example · numbers simplified to show the logic

1

Three signals surface

Recency BiasClearly present · Mid-severity type
Horn EffectClearly present · High-severity type
Gender / ToneStrongly present · Protected · Highest-severity type
2

Score computed

≈ 43

WBS · Awareness band

Would normally trigger: manager nudge

3

Protected-category floor fires

↑ Floor override: Gender / Tone detected at high confidence. Score can only escalate from the protected-category floor — Awareness band is overridden.

Outcome

Routed to HR Review — not ‘Awareness.’

The floor, not the aggregate score, determined the action.

Illustrative example — numbers simplified to show the logic. Confidence values and weights are representative only.

Built to be questioned

The score is only trustworthy if the system behind it is. These five rails are not features — they are the conditions under which TARA is allowed to operate.

  1. A signal, never a verdict

    Every output carries it in plain words. TARA points to evidence; it doesn't pass judgement on a person.

  2. A human decides, always

    Nothing TARA produces touches an employee record on its own. A person reviews every flag before any action follows.

  3. Protected categories are floored, on by default

    Gender, age and other protected characteristics can't be scored away. The escalation floor ships switched on.

  4. Every score is reproducible

    Input → settings → score → action is fully logged and can be replayed on demand. The same evidence always yields the same result.

  5. India-first by design

    Built around the principles of India's Digital Personal Data Protection (DPDP) Act, with consent and auditability treated as features, not afterthoughts.

Fairer reviews start with hearing the bias out loud

See TARA in a live conversation — real signals, real language, one auditable score.

See TARA in a live conversation

TARA surfaces bias signals to support human judgement. It reduces bias signals; it does not eliminate bias. This is a signal, not a verdict.

Many biases. One transparent score. — TalentSpotify Blog