TalentTuner Research

Our methodology

How we score resumes, how our ATS platform simulators are built, what data we rely on, and the things we deliberately do not claim. We would rather you trust the numbers because you understand them than because we asked you to.

Maintained by the TalentTuner Research team. Last reviewed: May 12, 2026.

The short version

TalentTuner compares the resume you upload against the specific job description you paste in. It produces a match score, the missing and weakly-represented keywords, content-quality feedback, and a set of concrete edits, then it can rewrite the resume for that job and hand back a clean ATS-safe DOCX or PDF. The ATS-platform pages (Workday, Taleo, Greenhouse, Lever) layer on a model of how that particular system filters and ranks candidates.

What we learn from

Our scoring and recommendations are informed by:

  • 50,000+ resume analyses run through the platform: which keywords actually move match scores, where resumes most often lose points, and which fixes correlate with higher scores.
  • The published documentation of the major applicant tracking systems: how Workday handles knockout questions, how Oracle Taleo computes requisition rank, how Greenhouse structures scorecards, how Lever's boolean search and talent pools work.
  • Direct testing: we run controlled resumes through real ATS-powered application flows and observe how formatting, section headings, file types, and keyword placement change what gets parsed and surfaced.
  • Peer-reviewed and industry research on resume screening, recruiter behavior, and information retrieval, cited where we use it on our research hub and in the ATS whitepaper.

How the match score is calculated

The score blends three things:

  1. Keyword match (the largest component). We extract the meaningful terms from the job description using TF-IDF weighting, normalize variants (for example "JavaScript" / "JS", "Project Management" / "PMP"), and check which appear in your resume, how prominently, and in what context. Skills and requirements the job emphasizes count for more than incidental words.
  2. Content quality. A GPT-4-based pass evaluates whether bullets are achievement-oriented and quantified, whether the summary is targeted, and whether the experience reads as relevant to the role rather than a generic job description.
  3. ATS-safety of the format. Tables, text boxes, headers/footers, images, unusual fonts, and non-standard section headings can break or distort parsing. We flag what is likely to be misread and, in the optimizer, output formats engineered for maximum parser compatibility.

The output is a percentage plus an itemized list. The percentage is a guide, not a guarantee: a real ATS does not publish its exact algorithm, and a recruiter's judgment sits on top of whatever the system surfaces.

How the ATS-platform simulators work

Each ATS page models the part of that system that most affects whether a candidate is seen:

  • Workday — knockout/screening questions and how disqualifying answers gate an application before a human ever looks; plus how Workday parses uploaded resumes into its candidate profile fields.
  • Oracle Taleo — requisition rank: how Taleo scores and orders candidates against a requisition's required and preferred criteria, and what raises that rank.
  • Greenhouse — structured scorecards: how interviewers' attribute ratings aggregate, and what a strong, scorecard-aligned application looks like.
  • Lever — boolean search and talent pools: how recruiters surface candidates with keyword and field queries, and how to be findable in those searches.

These are estimators built from public information and our own testing. They are not connected to the vendors' systems and produce approximations to help you prepare, not predictions of any specific outcome.

What we do not claim

  • We are not affiliated with, endorsed by, or partnered with Workday, Oracle, Greenhouse, Lever, or any other ATS vendor. Product names are used for identification only.
  • A high TalentTuner score does not guarantee an interview. ATS systems vary by employer configuration, and humans make the final call.
  • Competitor pricing and feature details we mention (for example on the comparisons pages) are approximate and change; we link out so you can verify current details.
  • We do not sell or share the resumes you upload. See the privacy policy.

Corrections

If you find something on our site that is out of date or wrong, tell us through the contact form. We revise these pages when ATS platforms change behavior or when better data comes in, and we date every revision.

Go deeper

The five measurement layers and where each one can be wrong

The scoring model used by TalentTuner operates across five distinct layers. Each layer has a defined scope, a measurement approach, and a set of conditions under which it produces less reliable output. Knowing those boundaries is part of understanding what the score means.

Layer What we measure Primary tool Known validity limits
1. Keyword match Term overlap between resume and job description, weighted by TF-IDF importance scikit-learn TF-IDF + spaCy lemmatisation Thin job descriptions (under ~150 words) produce noisy weights; highly technical niche roles may lack enough vocabulary surface for reliable matching
2. Content quality Achievement orientation, quantification, targeting, and relevance of experience bullets GPT-4 analysis pass Very short resumes (one-page academic CVs, early-career) or highly creative roles where conventional achievement framing is uncommon may score artificially low
3. Format safety Parser-hostile elements: tables, text boxes, headers/footers, images, unusual encoding PyMuPDF structural extraction PDFs exported from Canva, Adobe InDesign, or Apple Pages have non-standard internal structure; PyMuPDF can extract them, but the internal element classification may differ from what a given ATS parser encounters
4. Intent fit Whether the resume's overall career narrative aligns with the role's seniority, function, and industry GPT-4 contextual pass Career changers and internal-promotion candidates may score lower on intent fit even when their skills genuinely transfer; the model reads signal, not potential
5. Recency Whether the most recent experience and skills align with current requirements for the role Date-weighted TF-IDF pass Candidates with legitimate career pauses (caregiving, illness, education) will receive lower recency signals; the model has no mechanism to evaluate the quality of a gap explanation

The five-layer structure is documented in full on the algorithm page and in the ATS whitepaper match model section.

The match score is a relative ranking tool, not an absolute threshold

A TalentTuner score of 72 does not mean "72 percent of the requirements are met." It means "this resume, against this job description, is positioned better than a resume scoring 61 and worse than one scoring 84." The score is most useful when comparing multiple resume versions against the same job description, or when tracking improvement after specific edits.

What the score cannot account for: employer-specific ATS configurations (a Workday instance configured with a 60-point knockout threshold will behave differently from one with no automated filtering), the weight the hiring manager places on any one requirement, and factors that never appear in a job description (internal referrals, team composition, budget changes).

The detailed mechanics of the scoring pipeline, including the weighting formula for each layer, are documented in the whitepaper's ATS match model section. If you want to challenge or replicate any part of the methodology, that is the right starting point.

How we handle edge cases in the scoring model

After more than 50,000 resume analyses through the TalentTuner platform, a predictable set of edge cases has surfaced. The table below documents the most common, what the model does, and what the output should be understood to mean.

Edge case What the model does How to interpret the result
Resume is in a language other than English PyMuPDF extracts text; TF-IDF and GPT-4 process it; spaCy lemmatisation degrades for non-English languages Keyword match scores will be less accurate; content quality pass will still function if GPT-4 can parse the language, but results are less reliable than for English-language documents
Job description is a one-sentence listing ("Senior Engineer – great opportunity") TF-IDF runs on the sparse vocabulary; score will be low-confidence Use the keyword suggestions as a floor, not a ceiling; paste in a fuller job description when available
Resume contains only a skills list (no experience section) Keyword match may score well if skills overlap; content quality and intent fit scores will be low because there are no achievement bullets to evaluate Overall score will understate readiness; focus on the keyword gap list rather than the composite score
Candidate has a PhD or highly academic background applying to industry roles GPT-4 will identify publication-language patterns as different from industry achievement framing; content quality score will often flag this The flags are usually valid; converting academic framing to industry-outcome language genuinely improves ATS performance
Resume contains a keyword density far above normal (suspected stuffing) TF-IDF will score the match high; GPT-4 pass will flag the quality as low and may note unnatural keyword density TalentTuner does not impose a keyword-stuffing penalty on the match score, but the content quality feedback will flag the issue; real ATS platforms do penalise extreme density
File is a scanned image PDF (no selectable text) PyMuPDF returns empty or near-empty text; analysis will fail or produce a near-zero score Convert to DOCX or use a text-layer PDF before uploading; most real ATS systems also cannot parse image-only PDFs

A note on career changers and industry-pivot candidates. The intent-fit layer is the one most likely to penalise a genuine but unconventional candidate. If you are pivoting industries and your resume scores low on intent fit, the score is not wrong — it is reflecting that your current resume does not yet signal fluency in the target industry's vocabulary. The fix is to mirror that vocabulary explicitly, which is what the match model documentation and the algorithm walk-through are designed to help you do.

What TalentTuner does not measure, and why that matters

Factor Why we do not measure it What you can do about it
Employer-specific ATS configuration Workday, Oracle Taleo, Greenhouse, and Lever are each configured differently by each employer; we model the platform defaults, not any individual instance Use TalentTuner for the keyword and content work; use the platform-specific pages to understand that system's general mechanics
Recruiter's subjective impression Human judgment depends on context we cannot access (team needs, cultural priorities, other candidates in the pool) A strong TalentTuner score helps you pass the algorithmic screen; what happens next is between you and the recruiter
Truthfulness verification We assess fit, not accuracy; we have no way to verify the claims on a resume Only submit information you can substantiate in a reference or interview
Compensation fit Most job descriptions do not include salary ranges; we do not factor compensation matching into the score Use the salary calculator on TalentTuner for market benchmarking separate from the match analysis
Interview performance Beyond scope; we optimise for the application screening stage, not the full hiring funnel Getting to interview is the goal of the tool; preparation for the interview is a different problem

The honest version of what TalentTuner optimises for: the algorithmic screening stage of a competitive application process. If your resume does not pass automated keyword filtering, everything else is moot. We address that specific problem well. We do not address everything that follows.

How the ATS platform model data is structured and updated

The Workday, Oracle Taleo, Greenhouse, and Lever simulators on TalentTuner are built from three types of input: the vendors' own published documentation (recruiter and administrator guides, API references where publicly available), observed behavior from controlled test applications through live employer ATS-powered portals, and documented changes tracked through vendor release notes and community reporting from HR practitioners.

ATS platform variance our simulators model — and what they cannot model
Platform Primary filtering mechanism our model covers What the model cannot account for
Workday Screening question gate logic; candidate profile field mapping from resume upload Employer-custom knockout question sets; pipeline automation rules each employer configures
Oracle Taleo Requisition rank calculation using required vs. preferred criteria weighting Employer-defined requisition configurations; custom fields added by enterprise customers
Greenhouse Structured scorecard attribute alignment; what a strong scorecard-aligned application looks like from the candidate side Individual interviewer scoring behaviors; pipeline stages that vary by employer
Lever Boolean search query patterns; talent pool discoverability for sourcing workflows Employer-specific tag taxonomies; nurture sequence configurations

The platform pages are updated when vendors publish major feature changes (typically quarterly) and when our controlled testing reveals behavior that differs from documented defaults. Each platform page carries a "last verified" date. If you notice outdated information, use the contact form to flag it.

Verdict: Our ATS platform simulators are accurate for the default behavior of each system, unreliable for any employer-specific configuration sitting on top of that default. Use them to understand the mechanics of the platform, not to predict what any specific employer's instance will do.

How TalentTuner's methodology compares to other resume scoring tools

Dimension TalentTuner Typical rule-based checker Academic NLP research
Keyword extraction method TF-IDF with spaCy lemmatisation against the specific job description Static keyword lists by industry or role Varies; often BERT-based contextual embeddings in controlled datasets
Content quality assessment GPT-4 per-bullet evaluation Pattern matching (action verb presence, length rules) Typically out of scope for ATS-specific research
Methodology disclosed Yes — this page, the algorithm page, and the whitepaper Rarely; most commercial tools treat scoring as proprietary Yes — peer-reviewed papers include full methodology
Validated against real outcomes Partially — based on 50,000+ analyses and observed correlations, not a controlled RCT Typically not published Yes, but on research datasets that may not reflect current hiring markets
Claims about the output Relative ranking and gap identification; no interview guarantee Often claim direct ATS pass/fail prediction Careful to scope claims to the dataset studied

For specific audiences: what to read and what to trust

If you are a journalist verifying our claims before citing us:

The factual claims we make in our content — the 50,000+ analysis figure, the ATS platform mechanics, the scoring pipeline — are documented on this page, the algorithm page, and the whitepaper. The 50,000+ figure refers to cumulative resume analyses run through the TalentTuner platform since launch and is based on internal usage data. We do not publish user-level data; that figure is verifiable to us but not independently auditable by a third party. The ATS platform claims are based on publicly available vendor documentation and our own controlled testing. We link to vendor documentation where we cite it. If a specific claim looks wrong, use the contact form and we will respond with sources or correct the content.

If you are a researcher considering our dataset for an academic study:

The 50,000+ analyses are not available as a public dataset due to privacy constraints (resumes contain identifying information). We have published aggregate findings and documented our methodology in the research hub and whitepaper. We are open to discussing research collaboration that does not require sharing individual resume data. Our keyword matching approach uses standard scikit-learn TF-IDF with spaCy 3.x; our content quality pass uses GPT-4 with the prompt documented in the whitepaper. Our ATS-platform testing methodology is described in the research hub. One important limitation to flag in any citation: our dataset overrepresents English-language, North American job applications. Findings should not be generalised to other labor markets without further study.

If you are a hiring manager wondering how our scoring would treat your candidates:

TalentTuner scores a resume against a specific job description — not against your company's requirements unless you paste your job description in. A candidate who scores 85 against your job description has a resume that uses the vocabulary of your JD well and presents achievement-oriented content. What TalentTuner cannot see: whether the candidate's claimed experience is verifiable, whether their compensation expectations fit your range, whether their work style matches the team, or whether they would pass your company's specific ATS configuration. The score is a shortlisting aid, not a hiring recommendation. We are explicit about this to candidates as well.

If you are a candidate trying to decide whether to trust our score:

Trust the gap analysis more than the number. The itemised list of missing keywords and under-represented terms is more actionable than the percentage. A score of 65 does not tell you much by itself; "missing: cloud architecture, Terraform, Kubernetes — three terms that appeared four or more times in the job description" tells you exactly what to fix. The score will rise when you address those gaps, but more importantly, your resume will be meaningfully closer to what the role requires. We recommend running the analysis, working through the specific gap list, and re-running to verify the changes landed. The comparison view makes that iteration easy.

Verdict: The gap list is the product. The score is a summary of the gap list. If you only look at the number, you are using the tool at half its value.

How TalentTuner Research handles opt-in outcome data

We do not require candidates to report interview outcomes, and we do not track application outcomes by default. When users voluntarily indicate that they received an interview or offer after using TalentTuner, we record that as opt-in feedback. This creates selection bias: users who have a positive outcome are more likely to report it. We do not use this data to make causal claims about TalentTuner's impact on hiring outcomes. We use it to identify patterns in what high-scoring, interview-successful resumes look like compared to resumes that were optimised but did not advance — and that comparison informs the recommendations the model generates.

Scenario How we use the data What it cannot tell us
User scores 80+ and reports an interview Logged as positive feedback; contributes to pattern analysis of high-performing resume features Causation (would they have gotten the interview without TalentTuner?); whether TalentTuner was the deciding factor
User scores 80+ and does not report an outcome No outcome data collected; majority of analyses fall into this category Whether they applied, what happened, whether the score was accurate
User scores low and asks why Qualitative feedback used to improve gap list clarity and edge case handling documentation Whether the low score correctly predicted a rejection

The full methodology disclosure — including the limits of our outcome data — is available in the ATS whitepaper. We believe tools that make claims about hiring outcomes have an obligation to disclose how those claims are measured.

Verdict: This methodology page is a binding commitment to what TalentTuner measures and does not measure. When the tool or the underlying model changes in a material way, this page is updated. The date at the top of this page reflects the most recent review.

See it on your own resume

Paste a job description, upload your resume, and get the match score and the fixes in about a minute. Free to start.

Analyze my resume →