Health insurance fraud is a significant challenge facing Medicare, a federal health insurance program. As the Fraud Detection and Prevention Unit, our mission is to assist Medicare in identifying and preventing potential fraudulent claims.
-
Healthcare Fraud is one of the major problems in the US Healthcare System, leading to tens of billions of dollars lost each year.
-
Insurance Agencies like Medicare, Medicaid, and other agencies have to spend more money, and to incur financial losses, they have to decide to increase the insurance premium amount. Eventually, the healthcare system is becoming more and more expensive for everyone.
-
Not only is financial loss a major concern, but fraud can also lead to unnecessary procedures, inappropriate medications, and even over-treatment, putting patients at risk.
-
Goal: Detect as many fraud claims as possible
-
The average cost of $1000 per claim reimbursement
-
Every 1% increase in detection rate can save hundreds of millions nationwide
- Benchmark 1: Rely on domain experts to manually review claims, ~0.00%
- Benchmark 2: Claim Audits Team, random or target, vary 20%-40%
-
False Positive: Denial of Insurance Benefits for Legitimate Claims. Business Impact: Degrade the insurance companies’ reputation and undermine customer trust.
-
False Negative: Authorization of Insurance Benefits for Fraudulent Claims. Business Impact: Leads to significant monetary losses for insurance companies.
-
Fβ Score: Balance business interests and the company's reputation, with the exact beta value determined by the company's economic situation and strategic objectives.

- Data source: https://www.kaggle.com/datasets/rohitrox/healthcare-provider-fraud-detection-analysis/
- Data from Medicare, a federal health insurance program in the United States
- Historical claim data from 2009
- Multiple datasets
- Internal data of payer
- EHR of beneficiaries
- Claim transferred from medical provider to payer
- Inpatient
- Outpatient
- Medical Provider is fraudulent or not
We have proposed two strategic approaches to address this issue:
- Identify fraudulent claims: By detecting individual fraudulent claims, we can uncover patterns that may indicate broader systemic issues.
- Identify false healthcare providers: By scrutinizing healthcare providers, we can expose those who engage in fraudulent practices.
Detecting a single fraudulent claim can often reveal larger, underlying patterns of fraud, helping us protect Medicare's integrity and ensure its resources are used appropriately.
- Inpatient Claim processing duration: 5.5 days on average
- Frequency of running model: Weekly
- 764 claims are processed each week on average
- Split the data into training and testing sets in a progressive manner
- Train & predict with different data at each step
- Outpatient Claim processing duration: 1.4 days on average
- Frequency of running model: Daily
- 1,415 claims are processed each day on average







