A Data-Driven Framework For Real-Time Fraud Detection In Financial Transactions Using Machine Learning And Big Data Analytics

Main Article Content

Murali Malempati

Abstract

The rapid growth of electronic commerce and the gradual increase in customer confidence in the security of electronic payments have led to a persistent increase in the number of on-line transactions in the past years. Credit and debit cards account for the majority of the on-line payments. Consequently, the credit card financial ecosystem growth has been accompanied by a similar growth of illicit actions, by which con-men try to benefit from this huge financial exchange. Fraud detection is critical for credit institutions, merchants, and national services to minimize money losses. In recent years, several initiatives to enhance systems aimed at detecting fraudulent credit card transactions have been taken. Detecting fraud is very challenging since machine learning approaches rely on training sets limited to the observations that were available at the moment of the training. The newly triggered events, which were never observed in the training phase, can lead to severe issues such as alarm fatigue, in which frauds are detected only after a large amount of incurred losses.


Detecting fraud requires applying scalable learning techniques able to analyze the huge amount of streaming data generated by the transactions and able to mitigate the two main situations complicating the problem: the class imbalance and the concept drift since the world evolves and consequently the fraud patterns change. The need to detect frauds in real-time gives rise to several challenges. Recent advances in analytics, and the availability of open source solutions for storage, processing, and analytics of Big Data, have opened new perspectives for the real-time detection of frauds in massive amounts of transactions. In this paper, the SCAlable Real-time Fraud Finder (SCARFF) framework is presented. SCARFF is a distributed (anomaly) detection machine learning approach for the fraud detection that integrates Big Data tools both for the massive storage and processing of the transactions and for the predictive analysis.


SCARFF makes a contribution to the literature in four directions. First, the integration of the Hadoop and Spark ecosystems and of the sophisticated learning approach, addressing the inherent problems of imbalance, nonstationarity, and feedback latency, is a unique contribution. Second, the capability of handling a never-seen-before massive dataset of real credit card transactions is a unique achievement. Third, the formal description of the methods implemented to tackle data imbalance in real-time is presented. Fourth, the implementation in real-time of an ensemble learning engine capable of detecting credit card frauds at the rate of records−1, with large computational savings compared to batch implementations, is a further unique achievement.

Article Details

How to Cite
Murali Malempati. (2023). A Data-Driven Framework For Real-Time Fraud Detection In Financial Transactions Using Machine Learning And Big Data Analytics. Journal for ReAttach Therapy and Developmental Diversities, 6(10s(2), 1954–1963. https://doi.org/10.53555/jrtdd.v6i10s(2).3563
Section
Articles
Author Biography

Murali Malempati

Senior Software Engineer, Mastercard International INC

References

JPMorgan Chase. (2024). AI-Driven Fraud Prevention and Risk Management. Yallo+1 Leading Tech Trends+1

Mastercard. (2024). AI-Powered Decision Intelligence Solution. Yallo

PayPal. (2024). AI-Driven Fraud Detection Algorithms. SoftStream.ai

FBD Insurance. (2024). AI-Based Security Software for Real-Time Threat Analysis. Financial Times

MPowered Mortgages. (2023). AI in Lending Processes.