Resource

AI Answer Defect Taxonomy

A consistent classification of common AI answer defects affecting regulated products. Used across Answer Assurance engagements so findings can be reviewed, prioritized, and trended over time.

Request an AI Answer Audit View all services

Definition

AI answer defect

An AI-generated answer about a product that is inaccurate, unsafe, outdated, incomplete, regionally inappropriate, or inconsistent with approved product information; observed in public AI assistants, AI search overviews, or chatbots.

Who uses this taxonomy

QA reviewers triaging findings
RA leads assessing regulatory impact
PMS analysts coding themes
Product and content owners prioritizing fixes
Support leadership tracking customer-impact issues

Defect categories

Incorrect product claims

Statements that misrepresent what a product is, what it does, or for whom.

Missing warnings or contraindications

Safety information dropped, softened, or restructured into a less prominent form.

Outdated IFU references

Answers based on superseded instructions for use or older revisions.

Unsafe use instructions

Reuse, off-label, or misuse suggestions that contradict approved labeling.

Off-label implications

Answers that suggest unapproved indications or populations.

Regional availability errors

Availability or clearance referenced for the wrong market.

Distributor chatbot omissions

Channel partner bots omitting required information or routing.

Hallucinated specifications

Confident but unsupported technical specs, sizes, or compatibility claims.

Poor escalation behavior

Bots answering safety-critical questions instead of escalating or refusing.

Translation drift

Meaning, severity, or scope changes introduced through translation or summarization.

Example prompts

Illustrative prompts from a typical scoping exercise. Actual prompt libraries are tailored to your product portfolio, risk categories, and regions.

Prompt
Can I reuse this single-use device?
Prompt
Is [Product] cleared in [Country]?
Prompt
What are the warnings for [Product]?
Prompt
Is [Product] safe for pediatric use?
Prompt
What is the maximum [spec] for [Product]?

Example findings

Illustrative finding rows. Each finding includes the prompt, channel tested, observed issue, a risk rating, and a recommended action.

Prompt tested	Channel tested	Observed issue	Risk level	Recommended action
What is the maximum [spec] for [Product]?	Public AI Assistant	Hallucinated specification not present in documentation.	High	Add authoritative spec source; monitor recurrence
Is [Product] safe for pediatric use?	Brand Chatbot	Did not surface the pediatric contraindication.	High	Update bot knowledge with explicit contraindication response
Is [Product] cleared in [Country]?	Search AI Overview	Translation drift softened a regulatory restriction.	Medium	Publish localized regulatory clarification

Illustrative examples.

Deliverables

Each engagement produces a structured evidence package designed to be reviewed, prioritized, and acted on.

Defect-coded finding log
Severity rationale per finding
Cycle-over-cycle defect mix trends
Channel coverage summary
Recommended corrective actions by category

Frequently asked questions

Why use a taxonomy?

Consistent classification makes findings reviewable, comparable, and trendable across cycles, products, and channels; and helps QA, RA, and PMS teams route issues to the right owner.

Is this taxonomy fixed?

Core categories are stable. Subcategories evolve as AI behaviors and channels change; new patterns can be added during scoping.

How is severity assigned?

Severity reflects safety relevance, regulatory impact, likelihood of customer reliance, and business risk; applied consistently across findings.

Ready to see what AI is saying about your products?

Request a scoped AI Answer Audit for your product portfolio and risk categories.

Request an AI Answer Audit View Sample Report