Automatic construction of rule-based ICD-9-CM coding systems

Published 2008 in BMC Bioinformatics

ABSTRACT

BackgroundIn this paper we focus on the problem of automatically constructing ICD-9-CM coding systems for radiology reports. ICD-9-CM codes are used for billing purposes by health institutes and are assigned to clinical records manually following clinical treatment. Since this labeling task requires expert knowledge in the field of medicine, the process itself is costly and is prone to errors as human annotators have to consider thousands of possible codes when assigning the right ICD-9-CM labels to a document. In this study we use the datasets made available for training and testing automated ICD-9-CM coding systems by the organisers of an International Challenge on Classifying Clinical Free Text Using Natural Language Processing in spring 2007. The challenge itself was dominated by entirely or partly rule-based systems that solve the coding task using a set of hand crafted expert rules. Since the feasibility of the construction of such systems for thousands of ICD codes is indeed questionable, we decided to examine the problem of automatically constructing similar rule sets that turned out to achieve a remarkable accuracy in the shared task challenge.ResultsOur results are very promising in the sense that we managed to achieve comparable results with purely hand-crafted ICD-9-CM classifiers. Our best model got a 90.26% F measure on the training dataset and an 88.93% F measure on the challenge test dataset, using the micro-averaged Fβ=1 measure, the official evaluation metric of the International Challenge on Classifying Clinical Free Text Using Natural Language Processing. This result would have placed second in the challenge, with a hand-crafted system achieving slightly better results.ConclusionsOur results demonstrate that hand-crafted systems – which proved to be successful in ICD-9-CM coding – can be reproduced by replacing several laborious steps in their construction with machine learning models. These hybrid systems preserve the favourable aspects of rule-based classifiers like good performance, and their development can be achieved rapidly and requires less human effort. Hence the construction of such hybrid systems can be feasible for a set of labels one magnitude bigger, and with more labeled data.

PUBLICATION RECORD

Publication year
2008
Venue
BMC Bioinformatics
Publication date
2008-04-11
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.1186/1471-2105-9-S3-S10 PMID 18426545 PMCID 2352868
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Automatic Code Assignment to Medical Text
2007cited by this paper
From indexing the biomedical literature to coding clinical text: experience with MTI and machine learning approaches
2007cited by this paper
Developing Feature Types for Classifying Clinical Notes
2007cited by this paper
A shared task involving multi-label classification of clinical free text
2007cited by this paper
Dragon Toolkit: Incorporating Auto-Learned Semantic Knowledge into Large-Scale Text Retrieval and Mining
2007cited by this paper
Biological, translational, and clinical language processing
2007cited by this paper
Three Approaches to Automatic Assignment of ICD-9-CM Codes to Radiology Reports
2007cited by this paper
Data mining - practical machine learning tools and techniques, Second Edition
2005cited by this paper
A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries
2001cited by this paper
Automating ICD-9-CM Encoding Using Medical Language Processing: A Feasibility Study
2000cited by this paper
A hierarchical approach to the automatic categorization of medical documents
1998cited by this paper
A Maximum Entropy Approach to Natural Language Processing
1996cited by this paper
Automatic Assignment of ICD9 Codes To Discharge Summaries
1995cited by this paper

CITED BY

Integrating Deep Learning Nodes into an Augmented Decision Tree for Automated Medical Coding
2026cites this paper
Artificial intelligence and natural language processing for automated coding of cervical and lumbar spine surgery.
2025cites this paper
Reinforcement Learning for Clinical Reasoning: Aligning LLMs with ACR Imaging Appropriateness Criteria
2025cites this paper
Hybrid-Code: A Privacy-Preserving, Redundant Multi-Agent Framework for Reliable Local Clinical Coding
2025cites this paper
MedDCR: Learning to Design Agentic Workflows for Medical Coding
2025cites this paper
Deep learning for automatic ICD coding: Review, opportunities and challenges
2025cites this paper
Multitask gated interactive network for automatic international classification of diseases coding with dual denoising mechanism
2025cites this paper
Improving Rare and Common ICD Coding via a Multi-Agent LLM-Based Approach
2025cites this paper
Exploring the Accuracy and Performance: A Comparative Analysis of Deep Learning Techniques for ICD Prediction from Clinical Notes
2025cites this paper
Enhancing medical coding efficiency through domain-specific fine-tuned large language models
2025cites this paper
Explainable ICD Code Assignment Using Knowledge-Based Sentence Extraction and Deep Learning
2025cites this paper
Automated Medical Coding Using a Hybrid Decision Tree with Deep Learning Nodes
2025cites this paper
Evaluating Large Language Models for Automated Clinical Abstraction in Pulmonary Embolism Registries: Performance Across Model Sizes, Versions, and Parameters
2025cites this paper
A Comparative Study on Automatic Coding of Medical Letters with Explainability
2024cites this paper
Enhancing Automated Medical Coding: Evaluating Embedding Models for ICD-10-CM Code Mapping
2024cites this paper
Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification
2024cites this paper
EXAMINATION OF SUMMARIZED MEDICAL RECORDS FOR ICD CODE CLASSIFICATION VIA BERT
2024cites this paper
Czech medical coding assistant based on transformer networks
2024cites this paper
Exploring LLM Multi-Agents for ICD Coding
2024cites this paper
Using clinical text to refine unspecific condition codes in Dutch general practitioner EHR data
2024cites this paper
Using clinical text to refine unspecific condition codes in Dutch general practitioner EHR data
2024cites this paper
Creating a computer assisted ICD coding system: performance metric choice and use of the ICD hierarchy
2024cites this paper
A Clustering-Based Optimization Approach for Hospital Miscoding Correction
2024cites this paper
Modelling long medical documents and code associations for explainable automatic ICD coding
2024cites this paper
A Hierarchical Fine-Grained Deep Learning Model for Automated Medical Coding
2024cites this paper
Medical Diagnosis Coding Automation: Similarity Search vs. Generative AI
2024cites this paper
INSIGHTBUDDY-AI: Medication Extraction and Entity Linking using Large Language Models and Ensemble Learning
2024cites this paper
MedCodER: A Generative AI Assistant for Medical Coding
2024cites this paper
OLR-Net: Object Label Retrieval Network for principal diagnosis extraction
2024cites this paper
ICDXML: enhancing ICD coding with probabilistic label trees and dynamic semantic representations
2024cites this paper
Large language models are good medical coders, if provided with tools
2024cites this paper
Mimic-IV-ICD: A new benchmark for eXtreme MultiLabel Classification
2023cites this paper
A Review on Deep Neural Networks for ICD Coding
2023cites this paper
Automating the overburdened clinical coding system: challenges and next steps
2023cites this paper
DILM-ICD: A Deep Iterative Learning Model for Automatic ICD Coding
2023cites this paper
Using a Large Open Clinical Corpus for Improved ICD-10 Diagnosis Coding.
2023cites this paper
Combining unsupervised, supervised and rule-based learning: the case of detecting patient allergies in electronic health records
2023cites this paper
Natural language processing in radiology: Clinical applications and future directions.
2023cites this paper
Automatic Coding at Scale: Design and Deployment of a Nationwide System for Normalizing Referrals in the Chilean Public Healthcare System
2023cites this paper
Development and external validation of automated ICD-10 coding from discharge summaries using deep learning approaches
2023cites this paper
A Two-Stage Decoder for Efficient ICD Coding
2023cites this paper
Automatic assignment of diagnosis codes to free-form text medical note
2023cites this paper
ICDBigBird: A Contextual Embedding Model for ICD Code Classification
2022cites this paper
Revisiting Transformer-based Models for Long Document Classification
2022cites this paper
Can Natural Language Processing and Artificial Intelligence Automate The Generation of Billing Codes From Operative Note Dictations?
2022cites this paper
A Survey of Automated ICD Coding: Development, Challenges, and Applications
2022cites this paper
Automated Diagnosis Code Assignment of Thai Free-text Clinical Notes
2022cites this paper
Implementation of specialised attention mechanisms: ICD-10 classification of Gastrointestinal discharge summaries in English, Spanish and Swedish
2022cites this paper
AnEMIC: A Framework for Benchmarking ICD Coding Models
2022cites this paper
AI-based ICD coding and classification approaches using discharge summaries: A systematic literature review
2022cites this paper
Classification of user queries according to a hierarchical medical procedure encoding system using an ensemble classifier
2022cites this paper
An Automatic ICD Coding Network Using Partition-Based Label Attention
2022cites this paper
GrabQC: Graph Based Query Contextualization for Automated ICD Coding
2022cites this paper
Entity Anchored ICD Coding
2022cites this paper
Fine-Grained ICD Code Assignment Using Ontology-Based Classification
2022cites this paper
Deep-ADCA: Development and Validation of Deep Learning Model for Automated Diagnosis Code Assignment Using Clinical Notes in Electronic Medical Records
2022cites this paper
Automated clinical coding: what, why, and where we are?
2022influential citation
A Unified Review of Deep Learning for Automated Medical Coding
2022cites this paper
JLAN: medical code prediction via joint learning attention networks and denoising mechanism
2021cites this paper
Multi-channel, convolutional attention based neural model for automated diagnostic coding of unstructured patient discharge summaries
2021cites this paper
Does the Magic of BERT Apply to Medical Code Assignment? A Quantitative Study
2021cites this paper
Automated Machine Learning for Healthcare and Clinical Notes Analysis
2021cites this paper
Automatic Classification of Electronic Nursing Narrative Records Based on Japanese Standard Terminology for Nursing
2021cites this paper
Leveraging electronic health record data to inform hospital resource management
2021cites this paper
A Deep Learning Framework for Automated ICD-10 Coding
2021cites this paper
A Pseudo Label-Wise Attention Network for Automatic ICD Coding
2021cites this paper
Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review
2021cites this paper
A Systematic Literature Review of Automated ICD Coding and Classification Systems using Discharge Summaries
2021cites this paper
Analyzing Code Embeddings for Coding Clinical Narratives
2021cites this paper
Fusion: Towards Automated ICD Coding via Feature Compression
2021cites this paper
Medical code prediction via capsule networks and ICD knowledge
2021cites this paper
Multi-label Diagnosis Classification of Swedish Discharge Summaries – ICD-10 Code Assignment Using KB-BERT
2021influential citation
Automatic RadLex coding of Chinese structured radiology reports based on text similarity ensemble
2021cites this paper
Unsupervised learning approach for understanding critical infectious disease progression in ICU patients
2021cites this paper
Query-Focused EHR Summarization to Aid Imaging Diagnosis
2020cites this paper
Fraunhofer AICOS at CLEF eHealth 2020 Task 1: Clinical Code Extraction From Textual Data Using Fine-Tuned BERT Models
2020cites this paper
Medical Code Assignment with Gated Convolution and Note-Code Interaction
2020cites this paper
Mapping Clinical Narrative Texts of Patient Discharge Summaries to UMLS Concepts
2020cites this paper
Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning
2020cites this paper
Leveraging Semantics in WordNet to Facilitate the Computer-Assisted Coding of ICD-11
2020cites this paper
Can structured EHR data support clinical coding? A data mining approach
2020influential citation
Using Deep Learning for Automatic Icd-10 Classification from Free-Text Data
2020cites this paper
Dilated Convolutional Attention Network for Medical Code Assignment from Clinical Text
2020cites this paper
KAICD: A knowledge attention-based deep learning framework for automatic ICD coding
2020cites this paper
Classification of Syncope Cases in Norwegian Medical Records
2020cites this paper
A Comparison of Deep Learning Methods for ICD Coding of Clinical Records
2020cites this paper
Experimental Evaluation and Development of a Silver-Standard for the MIMIC-III Clinical Coding Dataset
2020cites this paper
Automated ICD coding via unsupervised knowledge integration (UNITE)
2020cites this paper
Analysing Effectiveness of Multi-Label Classification in Clinical Coding
2019cites this paper
Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN
2019cites this paper
Negation and Speculation Detection
2019cites this paper
CASCADENET: An LSTM Based Deep Learning Model for Automated ICD-10 Coding
2019cites this paper
Improving Medical Code Prediction from Clinical Text via Incorporating Online Knowledge Sources
2019cites this paper
Interpretable deep learning to map diagnostic texts to ICD-10 codes
2019cites this paper
Distributed Knowledge Based Clinical Auto-Coding System
2019cites this paper
Multi-label clinical document classification: Impact of label-density
2019cites this paper
Ontological attention ensembles for capturing semantic concepts in ICD code prediction from clinical text
2019cites this paper
Assigning Medical Codes at the Encounter Level by Paying Attention to Documents
2019cites this paper
Hybrid Text Feature Modeling for Disease Group Prediction Using Unstructured Physician Notes
2019cites this paper
Towards the Development of a Web Support System for Improving Accuracy in Coding Discharge Diagnosis
2019cites this paper