Estimates suggest that billing errors may cost patients and the healthcare system anywhere from $10 – $20 billion annually. Students on the IHA team will integrate AI techniques to develop a tool that assists auditors in identifying improperly billed patient services, assigning proper billing codes, and predicting such errors to prevent recurrence.
Abstract:
Accurately determining medical billing codes and billing correctly is very challenging due to the complexity and specificity required in coding medical procedures and diagnoses. The coding system, Current Procedural Terminology (CPT), involves a vast array of codes with nuanced distinctions, demanding precision to reflect the exact nature of the medical services provided. Errors can arise from ambiguous documentation, variations in clinical terminology, and human transcription errors. Overbilling errors in the system result in patients and their insurers being charged incorrectly and require significant time and effort for all involved to correct after the fact and can lead to expensive audits for the medical system. Underbilling costs the medical provider in lost income for services that they have provided, impacting the bottom line.
AI has the potential to significantly improve the accuracy and efficiency of clinical billing systems by automating the coding process and reducing human error. Machine learning algorithms can analyze vast amounts of clinical data and documentation to identify the correct codes with high precision, adapting to changes in coding standards and payer requirements in real-time. AI can also assist in flagging discrepancies or inconsistencies in the documentation, thereby enhancing the accuracy of the bills generated. By streamlining the coding process, AI can reduce the administrative burden on healthcare professionals, allowing them to focus more on patient care while potentially lowering the incidence of billing errors and accelerating claim approvals.
Given access to large data sets of claim records, audit reports, and Electronic Medical Records (EMR), several AI techniques are well-suited to identify, correct, and prevent billing errors. Students on the IHA team will identify macro-trends, investigate patterns and anomalies, and generate insights from the data that are actionable by IHA full time employees (FTEs). The goal is to start small with a single CPT code and particular diagnosis that has high probability of misbilling. The team will look at historical data related to this CPT code and apply AI methodologies for this single use case and build out a proof-of-concept deliverable before expanding to additional CPT codes. A successful project will result in time-savings for the IHA audit team, capturing of missed revenue, and prediction of error-prone cases.
The team will investigate techniques and come up with a strategic approach to address the use case(s). Together, these techniques should be able to enhance billing precision, compliance, and efficiency, ultimately streamlining the healthcare billing process. The team will likely incorporate more than one of the following techniques:
Natural Language Processing (NLP): NLP can analyze and interpret unstructured clinical documentation that exists in EMR data, extracting relevant information such as but not limited to diagnoses, symptoms, and treatment plans to determine applicable billing codes. Various NLP techniques can be explored to extract relevant data from the clinical notes and provide structed data for use in the machine learning model. Labeled data helps train models to understand and extract relevant information from text accurately.
Machine Learning (ML) Classification Model: ML classification models can be trained on historical claims data and auditor reports to predict the appropriate codes for new cases. IHA is interested in exploring supervised learning algorithms such as Random Forest, Gradient Boosting Machines (GBM), Support Vector Machines (SVM) or other appropriate algorithms for this context. Labeled data will be available to train the models.
Rule-Based Systems: IHA is interested in exploring combining the ML classification model with a rule-based system to ensure that coding adheres to established guidelines and payer requirements. This system will validate codes against predefined rules and highlight potential discrepancies for auditors to review. Expert auditors will be available to provide the established guidelines and requirements.
Impact:
Estimates from the American Medical Association suggest that up to 80% of medical bills may contain some form of error, ranging from simple clerical mistakes to more complex billing issues. With tens of thousands of patient interactions per week, it is impossible for auditors to review every single case manually to ensure complete billing accuracy. The tool will assist human auditors with reviewing medical bills and analyze large volumes of data to identify likely situations of overbilling, underbilling, or incorrect coding. Continuous learning capabilities could ensure it becomes more accurate over time, further enhancing the overall efficiency and effectiveness of the auditing process.
Scope:
Minimum Viable Product Deliverable (Minimum level of success)
- Develop an understanding of the current business process, technical challenges, and gain foundational understanding of the relevant use cases (CPT codes) provided by the sponsor.
- Demonstrate functional competence in select components of the Microsoft Fabric Analytics Platform by completing a “mini project” to implement basic model concepts on a simplified database. Note – it is unlikely that any student would be fully competent in the entire tech stack before the project. individual training effort is expected.
- Develop a strategic approach for identified priority use cases and selected modeling techniques.
- Design and assemble necessary database(s) to support MVP development.
- Literature review of all relevant techniques and existing documentation (both internal to IHA and public domain)
- Deliver initial model results on first use case and present to stakeholders for critique and review.
Expected Final Deliverable (Expected level of success)
- Expand to at least 1 additional CPT code and/or modeling techniques, validating the results against baseline and historical results.
- Benchmark the value of implementing these techniques and providing a report of the work, including recommended next steps and prepare documentation for IHA FTEs.
Stretch Goal Opportunities: (High level of success)
- Application of additional AI techniques to refine model accuracy or improve functionality (i.e. predictive analytics)
- Demonstrate unsupervised learning techniques.
- Complete models for additional CPT codes and diagnoses.
- Delivered models and tools can be adapted for similar datasets to tackle tangential business problems.
AI Experience (3-4 Students)
Specific Skills: Natural Language Processing, Machine Learning Classification
EECS 281 (or equivalent) is required. Additionally, any of the following would be a plus: EECS 445, 487 or 492.
Likely Majors: CS, ROB, DATA, ECE
Data Analytics (2 Students)
Specific Skills: Business interpretation/application of data results. Data analytics skills, logical evaluation of data results, knowledge of analytics algorithms, good software engineering practice to build your own analysis tools,
EECS 280 (or equivalent) is required.
Likely Majors: CS, DATA, HMI, IOE, SI
Finance and Business Process Quality (1 Students)
Specific Skills: Basic finance, business process mapping, auditing, quality assurance techniques.
EECS 280 (or equivalent) required.
This position will participate in technical development of the project. The student should be motivated to build AI skills.
Likely Majors: BBA, CS, IOE
Additional Desired Skills/Knowledge/Experience
Strong candidates will have familiarity or experience with some of the following items, and a positive attitude to learn what is necessary as the project gets underway.
- Successful team-based project experience
- Excellent interpersonal skills and the willingness to work hard.
- Interest/knowledge in healthcare sector business practices.
- Interest/knowledge in business operations and quality control.
- Project management experience utilizing hybrid Agile.
- Practical Experience with any/most of our tech stack. Note – it is unlikely that any student would be fully competent in the entire tech stack before the project. Individual training effort is expected.
- Microsoft Fabric Platform
- Power BI
- Copilot
- OneLake
- Data Factory
- Azure DevOps
- Machine Learning
- Python / Python Machine Learning packages
Practical project experience with any of the following techniques:
- NLP text evaluation.
- Supervised learning algorithms such as Random Forest, Gradient Boosting Machines (GBM), Support Vector Machines (SVM).
- Rule based systems.
Sponsor Mentor
Victor Wong
Victor is a Project Manager on IHA Medical Group’s Business Intelligence team, where he enjoys contributing to the organization’s efforts to embrace technology to improve patient care. Prior to IHA, Victor spent several years in the corporate IT Services industry serving customers in Healthcare, Life Sciences, and Pharmaceuticals. He is a proud graduate of the UM College of Literature, Science, and the Arts (BS) and Ross School of Business (MBA).
Executive Mentor
Amber MacKenzie
Amber is the Director of Business Intelligence at IHA Medical Group. The department is responsible for delivering data solutions that support patient care and help to optimize the operations. Amber sets the strategic vision for advanced analytics at IHA and sponsors innovation projects using data science. With a background in higher education and healthcare data, she is passionate about data insights that tell a meaningful story and enable leaders to make strategic data-informed decisions.
Faculty Mentor
Professor Jeff Ringenberg
Jeff is a professor in the EECS department. His research interests are mobile learning software development, tactile programming, methods for bringing technology into the classroom, and studying the effects of social networking and collaboration on learning. Jeff is a long time MDP mentor.
Weekly Meetings: The IHA team will meet on Fridays from 1:30 – 3:30 PM. Some in-person collaboration with project stakeholders at the IHA corporate office in Ann Arbor is anticipated.
Work Location: The work will take place on campus, with visits to the IHA corporate office in Ann Arbor. MDP will provide transportation (as needed).
Course Substitutions: CE MDE, ChE Elective, CS Capstone/MDE, Data Science MDE/Capstone, DS Graduate Practicum, EE MDE, CoE Honors, SI Elective/Cognate
Citizenship Requirements: This project is open to all students. Note: International students on an F-1 visa will be required to declare part time CPT during Winter 2025 and Fall 2025 terms.
IP/NDA: Students will sign IP/NDA documents that are unique to IHA.
Summer Project Activities: Students will be guaranteed an interview for a 2025 internship. The interviews will take place before the end of March of 2025. To participate in the internship students must have the permission to work in the United States without sponsorship and have a GPA > 3.0.
Learn more about the expectations for this type of MDP project
If you have any of these characteristics, highlight them on your Experience and Interest Form and talk about them in your (optional) one way video interview.