The Phantom Borrower: Machine Learning Detection of Synthetic Identities and the Econometrics of Precision-Recall Trade-Offs in Digital Credit Markets

Authors

  • Narvind Kumar Institute of Business Administration, University of Sindh, Jamshoro, Pakistan.
  • RajKumar Lohano the Institute of Business Administration, University of Sindh, Jamshoro, Pakistan.
  • Dr. Ain Bemisal Alavi Bath Spa University Ras Al Khaima Campus, UAE.

DOI:

https://doi.org/10.62019/t7hjsa65

Keywords:

Synthetic Identity Fraud, machine Learning, Digital Credit, Precision-recall trade-off, Fraud Detection, Financial Econometrics.

Abstract

Digital lending platforms face escalating synthetic identity fraud—fabricated identities combining real and fictitious data—causing an estimated $20 billion in U.S. losses in 2020. Traditional verification systems fail to detect these "phantom borrowers," creating severe information asymmetries in credit markets that undermine portfolio quality and financial stability.This study compares machine learning algorithms to detect synthetic identities and explore the econometrics impact of precision-recall trade-offs in fraud detection systems, and includes evidence-based recommendations for selecting optimal threshold. This is secondary research, which is a synthesis of empirical evidence from recent literature.  Asymmetric information theory and cost-sensitive learning approaches (2019-2024). Ensemble methods (XGBoost: AUPRC 0.847) and graph neural networks (AUPRC 0.891) outperform logistic regression (AUPRC 0.724). Feature importance analysis reveals identity consistency (28.4%), network/relational patterns (24.1%), and behavioral biometrics (22.7%) as primary detection signals. Optimal operating points vary by context: high-value loans require recall of 0.88-0.92, while small-dollar products prioritize precision at 0.82-0.90. Contextual factors significantly influence threshold selection, with F-beta scores ranging from 0.71 to 0.79 across scenarios. Machine learning is an effective method for detecting synthetic identities but thresholds should be optimized according to the costs of the product. To prevent fraud effectively, there needs to be a balance between the accuracy of fraud detection and the financial inclusion objectives, which means there is not a universal threshold for fraud. Regulatory framework should support a range of operating points to ensure financial stability and equitable access to credit.

Downloads

Published

2026-03-30

How to Cite

The Phantom Borrower: Machine Learning Detection of Synthetic Identities and the Econometrics of Precision-Recall Trade-Offs in Digital Credit Markets. (2026). The Asian Bulletin of Big Data Management , 6(1), 482-494. https://doi.org/10.62019/t7hjsa65

Similar Articles

11-20 of 229

You may also start an advanced similarity search for this article.