IMPORTANT: Risk scores displayed here are statistical anomaly indicators, not evidence of fraud. High scores mean a provider's billing patterns deviate from peers — this requires official investigation to determine if actual fraud occurred.

This tool uses a 500-provider sample from the 227M-row DOGE HHS dataset. T-MSIS source data has known quality issues. See the Known Limitations section below for full details.

ANALYZING THE LARGEST MEDICAID DATASET IN HISTORY

Tracking $90.9B in Suspected Improper Payments

Analyzing the DOGE HHS dataset — 227,000,000 rows of Medicaid provider spending from 2018–2024 — cross-referenced against the NPPES provider registry.

TOTAL SPENDING
$0B

2.1M providers nationwide

IMPROPER PAYMENTS
$0B

5.09% CMS improper payment rate

HIGH-RISK DETECTED
0

$5.6B in flagged spending

TOTAL SPENDING

$1785.0B

2018–2024 cumulative

PROVIDERS ANALYZED

2.1M

500 in sample

FLAGGED PROVIDERS

3247

$5.6B flagged

EST. IMPROPER PAYMENTS

$90.9B

5.09% CMS improper rate

GEOGRAPHIC RISK DISTRIBUTION

Avg risk 13.2 across 49 states

Low
Medium
Elevated
High
Critical

NATIONAL MONTHLY SPENDING

JAN 2018 — DEC 2024
PEAK

$25.9B

AVG

$20.4B

LOW

$14.1B

2018-012019-012020-012021-012022-012023-012024-01$0B$7B$13B$20B$26B

RISK BY SPECIALTY

Audiologist60
Anesthesiology, Pain Medicine30
Physician Assistant30
Family Medicine, Adult Medicine30
Pathology, Clinical Laboratory Director, Non-physician27
Internal Medicine, Sleep Medicine27
Obstetrics & Gynecology26
Psychiatry & Neurology, Child & Adolescent Psychiatry25
Nurse Anesthetist, Certified Registered25
Internal Medicine, Hepatology25

AVG RISK SCORE BY PROVIDER TAXONOMY

TOP PROCEDURES BY SPENDING

HCPCS — 2018–2024
T1019
Personal Care Services, per 15 minutes
$187.0B
99213
Office/Outpatient Visit, Established Patient, Low-Moderate Complexity
$98.5B
99214
Office/Outpatient Visit, Established Patient, Moderate-High Complexity
$112.0B
90834
Psychotherapy, 45 Minutes
$42.0B
T2003
Non-Emergency Transportation, Encounter/Trip
$38.5B
97110
Therapeutic Exercises, Each 15 Minutes
$35.2B
96372
Therapeutic/Prophylactic/Diagnostic Injection, Subcutaneous/IM
$28.7B
90837
Psychotherapy, 60 Minutes
$31.0B
S5125
Attendant Care Services, per 15 Minutes
$26.4B
H0015
Alcohol and/or Drug Services, Intensive Outpatient
$18.9B

HIGH-RISK PROVIDERS

46 RECORDS
RISK NPIPROVIDER STATE TOTAL PAID
100
1003460890BAY SHORE BRIGHTWATERS RESCUE AMBULANCE INC.NY$107.5M
100
1750442844ADVANCED REHABILITATION OF METAIRIELA$52.5M
100
1013951623SAMUEL FINEBERGAL$269.2M
100
18719308261 AFFINITY HEALTHCARE SERVICES INCTX$204.5M
100
1841727021RICHARD FERNANDES ALMEIDAWI$86.4M
85
1417704883NASER UDDINMD$137.2M
80
1215029269VICKI BAKERWV$213.6M
80
1184204711JAMES ABERCROMBIECO$217.1M
80
1699719336MARIA GOMESNC$278.8M
80
1669490348GORDON AHLERSVT$85.3M
80
1093715435JAMES EVANSONME$55.7M
80
1821321647BILLINGS CLINICMT$66.1M
75
1861400095ALEXANDER MALAYEV MD PCME$69.3M
75
1780382721ABBY PERSONAL CARE INC.WI$223.4M
70
1841794211LEILA ABAAB DRIRAMI$191.6M
70
1073170494AIDMEN MEDICAL EQUIPMENT LLCKY$253.1M
70
1700167830ALLISON ALLENAK$117.0M
70
1710416870NARSIS AMINIANDE$140.6M
70
1124738463MONOGRAM HEALTH PROFESSIONAL SERVICES OF SOUTH DAKOTA, PLLCSD$141.0M
70
1134573777AMANDA ADDINGTONTN$200.5M
70
1750502688ADVANCE PHYSCIAL THERAPY AND REHABILITATIONLA$279.8M
70
1851404396SUSAN ALGOODTN$222.2M
70
1851395610JACK BROCKNM$161.8M
70
1548823669KIRSTIN ACUSOH$52.5M
70
1437978236ACUTE BEHAVIORAL HEALTH OF VIRGINIA, P.C.TN$212.4M
1–25 of 46

METHODOLOGY & DATA SOURCES

DATA SOURCES

DOGE HHS Medicaid Provider Spending

PRIMARY

227M rows of provider-level spending data from T-MSIS (2018–2024). Released by the Department of Government Efficiency.

opendata.hhs.gov/datasets/medicaid-provider-spending/

NPPES NPI Registry

LIVE API

Provider identity verification — name, address, taxonomy, enumeration date. Live API lookups for 8M+ providers.

npiregistry.cms.hhs.gov/

OIG LEIE Exclusion List

CROSS-REF

Individuals/entities excluded from federal healthcare programs for fraud, abuse, or other offenses.

oig.hhs.gov/exclusions/

CMS Open Payments

CROSS-REF

Financial relationships between healthcare providers and pharmaceutical/device manufacturers.

openpaymentsdata.cms.gov/

COMPOSITE RISK SCORING

Each provider receives a composite risk score (0–100) based on multiple cross-referenced signals. A score of 60+ triggers a HIGH risk classification. These are heuristic rules, not ML-trained models.

Billing Pattern Analysis

Detects spike or bust-out patterns in monthly spending timelines

25pt

New Entity + High Billing

Providers registered after 2021 with above-average billing volume

25pt

OIG Exclusion Match

Provider or authorized official appears on the LEIE exclusion list

30pt

Multi-NPI Official

Same authorized official listed across multiple provider NPIs

15pt

Spending Percentile

Billing above the 95th percentile for taxonomy group and state

30pt

NOTE: The maximum theoretical score is 125 (all factors triggered), normalized to 0–100. Scores are relative rankings, not probabilities of fraud. A score of 80 does not mean 80% chance of fraud — it means the provider deviates significantly from statistical norms across multiple dimensions.

APPROACH CONSTRAINTS

SAMPLE SIZE

500 / 2.1M

Only 500 providers analyzed from the full 2.1M in the dataset

DATA VINTAGE

2018–2024

Historical snapshot only — no real-time monitoring capability

SCORING MODEL

HEURISTIC

Rule-based thresholds, not validated against confirmed fraud outcomes

FRAUD TYPES

BILLING ONLY

Cannot detect medical necessity, upcoding, phantom billing, or kickbacks

KNOWN LIMITATIONS & UNSOLVED GAPS

Transparency is essential. The following are significant limitations of this analysis tool that users must understand before drawing any conclusions from the data presented above.

T-MSIS Data Quality Issues

HIGH

TMSIS-DQ

The underlying T-MSIS (Transformed Medicaid Statistical Information System) data has well-documented quality problems that vary significantly by state.

Some states submit incomplete or delayed data — CMS has flagged data quality concerns for multiple states

Spending figures may include legitimate adjustments, recoupments, or retroactive corrections that appear as anomalies

Provider taxonomy codes are self-reported and may be inaccurate or outdated

The 2018–2024 time range includes COVID-era emergency waivers that temporarily changed billing rules

Algorithmic False Positives

HIGH

ALG-FP

Risk scores are based on statistical deviation from peer groups, not on verified fraudulent activity. Many flagged providers may be entirely legitimate.

High-volume specialists (e.g., oncologists, transplant surgeons) will naturally score higher due to expensive treatments

Rural providers serving large geographic areas may show unusual billing patterns that are operationally justified

New providers may appear as "spike" patterns simply because they are ramping up a legitimate practice

The composite scoring weights are heuristic-based, not validated against confirmed fraud outcomes

Limited Fraud Detection Scope

MEDIUM

FRD-SCOPE

This tool only detects a narrow subset of potential fraud types. Many common healthcare fraud schemes are invisible to spending-pattern analysis alone.

Cannot detect medical necessity fraud (unnecessary procedures billed correctly)

Cannot detect upcoding (billing for more expensive services than provided)

Cannot detect phantom billing (billing for services never rendered to real patients)

Cannot detect kickback schemes (financial arrangements between providers and referral sources)

Cannot detect identity theft or credential misuse without beneficiary-level data

Historical Data Only — No Real-Time Detection

MEDIUM

HIST-ONLY

All analysis is based on historical spending data from 2018–2024. This tool cannot detect fraud in progress or provide real-time alerts.

The DOGE HHS dataset was released as a static snapshot — it is not continuously updated

Providers flagged here may have already been investigated, sanctioned, or cleared

New fraud schemes emerging after the dataset cutoff date are completely invisible

Time lag between fraud occurrence and data availability can be 6–18 months

Broader Vulnerabilities Not Addressed

MEDIUM

VULN-GAP

Major categories of Medicaid waste and abuse fall entirely outside the scope of provider spending analysis.

Managed care organization (MCO) fraud — capitation payment manipulation, network adequacy issues

Eligibility fraud — individuals receiving benefits they don't qualify for

Pharmacy benefit fraud — drug diversion, counterfeit medications, prescription mills

Foreign-organized fraud rings that use synthetic identities and shell companies

State-level administrative waste and misallocation of federal matching funds

No Machine Learning or Advanced Analytics

LOW

NO-ML

The current risk scoring uses simple heuristic rules and statistical thresholds. No machine learning models, network analysis, or predictive analytics are employed.

A production fraud detection system would use supervised ML trained on confirmed fraud cases

Social network analysis could identify collusion rings — this tool analyzes providers in isolation

Natural language processing on clinical notes could detect documentation fraud — not available here

Beneficiary-level claims data would enable much more precise anomaly detection

This tool is intended for research and transparency purposes only. It should not be used as the sole basis for any accusation, investigation, or administrative action against healthcare providers. Any findings should be verified through official CMS, OIG, or state Medicaid fraud control unit channels.