FAQS

About RosettaMD

RosettaMD is a free AI-powered tool that instantly translates complex medical notes — including MyChart entries, lab results, and discharge summaries — into plain English you can understand.
After helping his brother with confusing hospital records, Dr. Chris Wixon saw that most people don’t understand medical language. RosettaMD was created to fix that and to standardize data for population health tools.
Dr Chris Wixon is a physician with deep NLP experience built RosettaMD. It’s part of Archimedes Medical, a health-tech company that puts people first with AI.
No. It’s a web-based translation tool — think of it as Google Translate for doctor notes. You upload your document, and RosettaMD returns a plain-language summary, instantly.
Yes. RosettaMD does not store or share your health information. Nothing is saved. Your data is processed securely and immediately discarded.

try Rosetta MD

How It Works

No accounts. No Logins. No emails. Just clarity.
RosettaMD can translate any medical note — charts, blood tests, scans, discharge or visit summaries, and EHR records.
Yes — RosettaMD is fully mobile-optimized. You can use it from any phone, tablet, or desktop browser.
No. RosettaMD helps you understand your records, but it is not a substitute for medical advice. Always consult your doctor for decisions.
Rosetta MD Chrome Extension :
- Can be added to any desktop Chrome browser.
- Once installed, it can process medical text found on any website or any PDF that you open within Chrome.
- You use it directly in the browser: open a webpage or PDF, then activate the extension to translate clinical language into clear, patient-friendly explanations.
- It works only on desktop Chrome (not on mobile browsers or other browsers).
- Installation and use happen within your Chrome browser; no separate app is required.

🟦 Still have questions? Try RosettaMD and see for yourself.
No account. No ads. Just clarity.

💡 Technical FAQs (for Developers)

This FAQ gives developers, architects, and data scientists detailed answers about our NLP engine—covering model performance, security and compliance, architectural best practices, and commercial terms—to support smooth, confident API integration for clinical narrative structuring.

Architecture and Performance

- To ensure high reliability and predictable performance, our API imposes a maximum input size limit of 100,000 characters per single request.
- For extremely long narratives that exceed this limit, we recommend a sharding or chunking architecture.
Latency scales linearly with input length: ~0.6–1.0 ms per input token. Processing may vary with entity count (NER resolution to codes). We optimize for low P90 latency to meet real‑time needs. Latency includes network RTT plus model inference.
Note Types and Targets
Short Note — 450 tokens / 1,800 chars / ~8 entities (e.g., Radiology Impression, Progress Note)
Target P90 latency: 0.2–0.4 s
Medium Note — 1,000 tokens / 4,000 chars / ~30 entities (Operative note, H&P)
Target P90 latency: 0.7–0.9 s
Long Note — 1,600 tokens / 7,500 chars / ~50 entities (Discharge Summary)
Target P90 latency: 1.6–2.0 s
Extreme Note — 24,000 tokens / 80,000 chars / ~1,200 entities (Medical journal review)
Target P90 latency: 25–30 s
Our API is built for fast, modern web use and easy developer work.
Input:
- Plain text: Send the clinical note as a text string in a JSON request — fastest and most efficient.
- Document images: Upload image-based files (PDF, TIFF, JPEG). We run OCR to extract text before NLP.
Output:
- JSON: Primary output is JSON with extracted entities, codes, negation status, and human-readable format.
- FHIR (future) : Convert extracted data into FHIR R4 resources (e.g., Condition, Observation, Medication) for direct import into EHRs, HIEs, and population health systems.
For large-scale population health needs, we offer asynchronous batch processing.

Model &Extraction Capability

API documentation

- SNOMED
- ICD-10
- CPT (requires third-party license)
- RxNorm
- NPPES
- Proprietary dictionary of lay descriptions
- Customized terminology capable
Our mission: API payload includes Patient-Friendly Language (PFL) output for each clinical concept (e.g., "Heart attack: blockage of blood flow to the heart muscle.") This PFL is produced by our proprietary model, built for clear medical communication.
Developers can use PFL to make NLP-structured clinical data immediately actionable and easy to understand in patient portals, mobile apps, and other consumer tools.
"All" means the NER model extracts every supported clinical entity type and resolves them across all terminologies.
Entity scope — If no types are specified, the API labels each NER into comprehensive classes (e.g., Disorders, Medications, Procedures, Findings, Person, Occupation, Religion — 14 classes modeled).
Terminology resolution — Selecting "all" maps each entity to ICD-10-CM, SNOMED CT, CPT, RxNorm, UMLS, and NPPES.
Note: To reduce latency and compute, specify needed terminologies instead of using "all."
No — we don’t use general-purpose LLMs. We use specialized, encoder-based Small Language Models built for high‑precision, low‑latency clinical classification and entity extraction. That design gives production advantages for healthcare:
- Faster, low-latency performance for real-time clinical workflows.
- Deterministic, stable outputs for reliable, auditable data and consistent code selection.
- Lower compute costs, enabling scalable, competitive pricing.
- Improved PHI security: private models keep data in our HIPAA‑compliant environment, avoiding public or multi‑tenant LLM exposure.
For classification and structured extraction, these domain-specific encoders outperform general LLMs on accuracy, cost, latency, and reliability—aligning with our physician‑led focus on clarity, safety, and real-world usability.
Ensuring fairness is crucial for population-health solutions because algorithmic bias can worsen healthcare disparities. Our bias-mitigation spans the model lifecycle:
- Data-focused: Train on large, diverse de-identified clinical notes. Use oversampling and augmentation for underrepresented cases and demographics to avoid spurious correlations.
- Transparency & oversight: Development and audit teams review models to detect biases and reduce “black box” risks.
We commit to equitable outcomes and continually update mitigation strategies to follow best practices in clinical AI fairness.
Not yet, but we're working on it. After NER and entity resolution, the next step is Relationship Extraction (RE) to map connections—essential for knowledge graphs and clinical reasoning. RE is a high-priority roadmap item.
We aim for the API to return structured triples so developers can build knowledge graphs from clinical notes.
Developers interested in piloting can contact our Partnerships team.

Model Accuracy and Robustness

NER: 92% (F1)
Entity Resolver
- SNOMED
- ICD10:
- CPT
- RxNorm
Our engine, trained on large, diverse clinical texts, uses deep learning to capture nuanced context around extracted entities.
Negation (Assertion Status): Assigns Assertion Status to concepts so population-health models don’t mislabel diagnoses from phrases that rule them out. An Assertion Detection module finds negation cues (e.g., “denies,” “no evidence of,” “without”) and their scope.
Temporality (Time and Sequence): Recognizes and timestamps clinical events, distinguishing historical vs. current concepts (e.g., “past history of,” “status post appendectomy”).
The model leverages the surrounding context to identify ambiguous medical documentation with a high degree of precision.
The engine is trained on "noisy" real-world clinical data, making it inherently resilient to common typographical errors.
We employ semantic matching layer which intelligently maps misspelled or non-standard terms to the closest standardized clinical concept, preventing extraction failure due to minor errors. For example, 'diabites' will be correctly mapped to Diabetes.
We maintain and regularly update a proprietary lexicon of common clinical abbreviations and their possible long-form expansions.
The Entity Resolver confidence score measures semantic similarity between the extracted phrase and the final standardized code (e.g., SNOMED CT, ICD-10).
Meaning: It’s the probability the assigned code is the best contextual/semantic match. Scores near 1.0 indicate near-perfect matches.
Scale: Float from 0 to 1.0.
Pre-filtering (minimum): Our API suppresses any entity with confidence < 0.95 to reduce false positives and improve data reliability. We still return the exact score for transparency so developers can set audit thresholds and monitor score distributions in logs.

Deployment and Versioning (MLOps)

The stability and predictability of our engine are core requirements for production environments, and we manage this through strict versioning and a predictable update schedule.
- Version Pinning: Developers are required to specify the API version in their request header (e.g., API-Version: v2.1). This pins their integration to a specific, immutable model and processing pipeline, guaranteeing stability even as newer versions are released.
- The underlying model is not static, as it must evolve to keep pace with new medical knowledge, emerging terminology changes (e.g., new ICD codes), and shifts in clinical language.
- Update Cadence: We anticipate model updates on an annual basis. These updates incorporate the latest terminology, fix edge cases, and enhance accuracy across specific domains.
- Minimal Impact: Our rigorous testing process ensures that new versions are highly compatible. We project that annual model updates will impact the ranking or selection of codes for less than 1% of outputted data. This low delta allows for high confidence in migration.
- Version Sunset: New major versions are announced with a minimum 90-day deprecation notice for the previous version. This provides ample time for developers to test and migrate their systems to the new, enhanced version.
For any given version of the API (e.g., v2.1), the model behavior is highly stable and fully deterministic.
Identical Input, Identical Output: Repeated calls with the same clinical text will yield identical entity extractions, entity rankings, and code selections. This ensures predictable downstream processing and accurate data comparison across runs.

Compliance and data security

NO
For any given version of the API (e.g., v2.1), the model behavior is highly stable and fully deterministic.
Identical Input, Identical Output: Repeated calls with the same clinical text will yield identical entity extractions, entity rankings, and code selections. This ensures predictable downstream processing and accurate data comparison across runs.

Commercial and Support

Licensing & pricing:
Usage-based (default)
- Best for variable or spiky volume; pay for actual use.
- API called per document; billing = total input token count.
- Tiered discounts lower per-token price as monthly volume grows.
Subscription (monthly/annual)
- Best for predictable, high-volume production.
- Fixed fee includes a large pre-allocated token/document allowance.
- Locked low rate for included volume; predictable overage pricing.
- Often includes enterprise features: support SLAs, VPC deployment options.
Custom enterprise
- For very high volume or on-prem/VPC needs; custom license with annual volume commitments, dedicated infra, and specialized support.
Key terms
- Standard terminology: license covers royalty-free, commonly available sets (e.g., SNOMED CT for US, ICD-10-CM).
- Third-party terminology: you must obtain any required external licenses (e.g., proprietary CPT sets).
- Data rights: per our BAA, you retain ownership of your data and extracted outputs.
Yes — two ways to start:
- Free Developer Key: permanent, limited access — up to 100 API calls/day for testing and proofs of concept.
- 14‑Day Full Trial: unlimited API access for 14 days to evaluate performance, latency, and full extraction across your clinical notes.
Item description
Item description
Item description

FAQS

About RosettaMD

What is RosettaMD?

Why was RosettaMD created?

Who is behind RosettaMD?

Is RosettaMD an App ?

Is RosettaMD safe ?

How It Works

Do I need an account to use RosettaMD ?

What type of medical note can I upload ?

Does it work on mobile devices ?

Does RosettaMD provide medical advice ?

What is the Rosetta MD Chrome Extension ?

💡 Technical FAQs (for Developers)

What are the input size limits per request, and how should we architect around long clinical notes in production?

How does latency scale with input size, and what type of performance can we expect ?

What type(s) of documents can be processed by the API, and what are the output formats ?

How do we handle high-volume, concurrent requests ?

Which clinical terminologies does the engine map extracted entities to ?

Can the model also return human-readable formats ?

What exactly does "ALL" include during NER extraction ?

Do you use foundational LLMs for the core NLP extraction pipeline ?

How do you ensure against model bias ?

Does the NLP engine identify [:RELATIONSHIPS] between extracted entities ?

What is the reported model accuracy ?

How does the engine handle common challenges such as NEGATION (patient denies headache) , or TEMPORALITY (history of MI 5 years ago)

How does the model handle ambiguity (abbreviations , slang, misspellings, typographical errors, and acronyms ) ?

The Entity Resolver reports a confidence score (similarity score) for each identified NER, how should developers interpret / use this score ?

Model Updates (versioning): Is the model static ? How frequently is the underlying NLP model updated ?

How stable are entity ranking and code selection across repeated calls with same input ?

Is any data persisted or used beyond scope of request ?

How stable are entity ranking and code selection across repeated calls with same input ?

What are the licensing terms and pricing structure for API usage ?

Do you offer free, limited access to the API for developer to explore the model’s capability ?

How is the API metered and is there an Audit Log of API calls available ?

What levels of technical support are offered ?

What is the complete list of API error codes, and what are the suggested developer actions for each?