The Efficacy of Utilizing BPJS Health Claim Big Data on the Accuracy of Diagnosis Coding in Type B Hospital Medical Records

Authors

  • Fauzia Laili Universitas Kadiri
  • Siti Aminah Universitas Kadiri
  • Siswi Wulandari Universitas Kadiri
  • Khofifah Rafika Universitas Kadiri
  • Nadia Vivi K Universitas Kadiri

DOI:

https://doi.org/10.69855/rekammedis.v1i2.308

Keywords:

BPJS Health, Diagnosis Coding Accuracy, Big Data, Machine Learning, Type B Hospitals, JKN

Abstract

The pervasive problem of diagnostic coding inaccuracies significantly impacts the financial integrity and efficiency of Indonesia's National Health Insurance (JKN) system in Type B hospitals. This study aims to assess the efficacy of utilizing large-scale BPJS Health claims data to improve coding accuracy and identify its key determinants. A quantitative, retrospective secondary data analysis was conducted on 150,000 claim records spanning 2020–2024. Big Data analytics employing Random Forest (RF) and Classification and Regression Tree (CART) models successfully detected coding discrepancies, achieving an overall accuracy of 87.2% for primary diagnoses. Statistical analysis indicated that the maturity of the Electronic Medical Record (EMR) system (p<0.01) and staff ICD-10 training (p<0.05) are highly significant determinants. Crucially, the application of this predictive analysis resulted in a 12% reduction in coding errors compared to historical methods. In conclusion, the utilization of BPJS claim Big Data substantially enhances coding accuracy and reliability, confirming the necessity of integrating data-driven technology with simultaneous investments in digital infrastructure and continuous human capacity building for the sustainable quality improvement of the Indonesian health system.

References

Berman, S. (2024). Meeting International Health Coding Standards through Big Data in Indonesia. Journal of International Health Policy, 12(2), 200-215.

BPJS Kesehatan. (2024). Statistical Report on the Distribution and Competence of Indonesian Health Workers. Ministry of Health of the Republic of Indonesia.

Hassan, R., et al. (2021). Big Data Analytics for Improving Healthcare Diagnosis Accuracy: A Comprehensive Review. International Journal of Medical Informatics, 150, 104443. https://doi.org/10.1016/j.ijmedinf.2021.104443

Jensen, P., & Schön, T. (2020). Machine Learning Applications in Healthcare Claims Data. Health Informatics Journal, 26(1), 45–58.

Kumar, A., & Rajendran, R. (2023). Automated Detection of Diagnostic Inconsistencies Using Big Data Algorithms. Journal of Healthcare Engineering, 2023, 6798352. https://doi.org/10.1155/2023/6798352

Kusuma, R., & Tseng, C. (2020). Data Sanitation Methods in Healthcare Big Data Analytics. Journal of Health Data Science, 4(2), 112-124.

Kusuma, R., et al. (2021). Implementing Big Data Solutions for Healthcare Claims Management. International Journal of Health Management, 14(3), 190-202.

Morris, L., et al. (2023). Enhancing Diagnosis Coding Accuracy with Machine Learning: Evidence from Health Claims Data. Journal of Biomedical Informatics, 140, 104234. https://doi.org/10.1016/j.jbi.2023.104234

Pratama, A., & Sari, R. (2024). Analysis of accuracy in determining diagnosis coding and its effect on BPJS claim. Journal of Hospital Management and Services. 6 (1), 6-11 https://thejhms.org/index.php/JHMS/article/view/60

Putri, R., et al. (2020). Random Forest Application for Health Data Classification: A Case Study in Indonesian Hospitals. Medical Informatics and Decision Making, 20(1), 56. https://doi.org/10.1186/s12911-020-01152-3

Setiawan, D. (2023). Legal and Ethical Framework for Health Data Usage in Indonesia. Indonesian Journal of Health Law, 5(1), 10-25.

Silva, P., & Rashid, M. (2022). Patient Data Privacy and Security in the Era of Big Data. Health Policy and Technology, 11(3), 100589.

Suryanto, A., & Fadli, M. (2022). Diagnostic Coding Errors and Their Impact on Health Insurance Claims in Indonesian Hospitals. Journal of Health Administration, 15(3), 120–135.

Thompson, J., et al. (2024). Ethical and Governance Challenges in Big Data Healthcare Analytics. Journal of Medical Ethics, 50(2), 98-105.

Wang, H., et al. (2024). Leveraging Big Data for Diagnostic Coding Accuracy in National Health Insurance Systems. Healthcare Analytics, 7(1), 15-28.

Downloads

Published

2025-12-12

How to Cite

Fauzia Laili, Siti Aminah, Siswi Wulandari, Khofifah Rafika, & Nadia Vivi K. (2025). The Efficacy of Utilizing BPJS Health Claim Big Data on the Accuracy of Diagnosis Coding in Type B Hospital Medical Records. Research and Evidence on Knowledge in Administration and Management — Medical Electronic Data and Information Systems, 1(2), 34–44. https://doi.org/10.69855/rekammedis.v1i2.308