
Sufficiency Judgment
Decides whether enough information exists to answer a medication question safely, instead of defaulting to an answer.
›
RxReason
The safety layer that teaches medical AI when to answer, when to ask, and when to stop.
Medical AI is racing into patient and clinician workflows, but most models are optimized to answer questions, not to judge whether a medication question is safe to answer. RxReason turns every medication query into a structured, auditable safety decision: answer, caution, clarify, or block.


Medical AI has an over-answering problem. A model can perform well on exam-style benchmarks and still answer too confidently when dose, route, medication list, pregnancy status, renal function, allergies, or dangerous drug combinations are missing.

Decides whether enough information exists to answer a medication question safely, instead of defaulting to an answer.
›
Identifies and asks the single most clinically decisive missing field — not a generic “see your doctor.”
›
Blocks explicit dangerous combinations and contraindications, explains why, and escalates appropriately.
›
Loggable, testable, auditable output usable by downstream systems, grounded in FDA labeling, DailyMed, DrugBank, DDInter, and RxNorm.
›RxGuard applies RxReason-style safety control around production AI systems. It can sit in front of a general medical chatbot, telehealth assistant, pharmacy workflow, or EHR-integrated AI feature to detect underspecified medication questions, enforce clarification, block high-risk cases, and preserve an audit trail.
Routes medication questions through sufficiency and risk checks before a model gives advice.
›Converts unsafe or underspecified cases into clarify, caution, or block decisions.
›Captures structured logs for review, QA, compliance, and model improvement.
›Designed to sit alongside existing LLMs rather than replace the entire stack.
›RxReason + RxGuard = research-grade medication reasoning with production-grade containment.
Benchmarked against the failure mode that matters. Most medical AI benchmarks reward factual recall — RxReason evaluates a harder operational question: did the model know whether it had enough information to answer?
Used as retention checks, not the main win condition.
Targets medication-safety sufficiency and auditability.
Published studies show persistent unsafe or premature answers in medical settings.
Trains the behavior directly and measures it with held-out evaluation.
Adds a structured model, internal benchmark suite, and deployment layer.
Targets, not achieved claims.
Identify medication, dose, route, schedule, intent, and known patient factors.
Determine whether current facts are enough for a safe response.
Check medication-specific risks, interactions, contraindications, and context-sensitive factors.
Return answer, caution, clarify, or block.
Produce structured JSON that downstream systems can inspect, store, and evaluate.
Package the evaluation suite and baseline comparisons into a public technical thesis.
Train and evaluate the model against held-out medication-safety tasks.
Deploy the runtime shield around a controlled medical AI workflow.
Talk to us about RxReason benchmarks or piloting RxGuard as a pre-answer safety layer for medical AI.
Contact the team