178
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      scite_
       
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Fraud Detection using Machine Learning

      Preprint
      In review
      research-article
      Bookmark

            Abstract

            The winding path of this research into fraud detection using traditional machine learning (ML) led us through landscapes of both impressive strides and thought-provoking hurdles. As we retrace our steps, revisiting key findings and peering into the technology's potential, a nuanced understanding of its effectiveness comes into focus. Our primary contribution lies in demonstrating the surprising efficacy of traditional ML algorithms in combating fraudulent activities. By meticulously dissecting the Credit Card Fraud dataset and employing a rigorous methodological approach, we established that these algorithms can achieve commendable accuracy in identifying suspicious transactions. This not only reaffirms the viability of ML in fraud detection but also offers a valuable roadmap for future research and practical implementation. Our journey unveiled a fascinating tapestry of insights. We observed that different ML algorithms excel in specific domains. While Random Forest and Gradient Boosting proved to be overall champions, Logistic Regression [19] displayed exceptional talent in pinpointing certain types of fraudulent behavior. This underscores the importance of wielding a diverse arsenal of algorithms to achieve comprehensive fraud detection. Furthermore, we discovered that feature engineering is not merely a parlor trick, but a crucial stage in the dance with fraud. By crafting tailored features from the raw data, we were able to significantly enhance the accuracy of our models. This highlights the critical role of domain expertise in understanding the subtle nuances of fraudulent transactions and translating them into actionable features for ML algorithms. However, the ever-shifting sands of fraud necessitate constant vigilance and adaptation. Our research emphasizes the paramount importance of real-time monitoring and model updates to stay ahead of evolving fraudster tactics. This agility will be the key to unlocking the full potential of ML in this dynamic arena. Yet, our exploration would be remiss if it failed to acknowledge the limitations that linger within traditional ML. The "black box" nature of some algorithms can hinder interpretability and raise concerns about bias. Additionally, their reliance on historical data can render them vulnerable to novel fraud schemes that haven't yet graced the training dataset. However, these challenges should not overshadow the immense potential of traditional ML. Its ability to handle large datasets, learn from experience, and adapt to changing patterns remains invaluable in the fight against fraud. By combining traditional ML with other approaches, such as explainable AI and deep learning, we can unlock even greater capabilities and build robust, interpretable fraud detection systems that are not only effective, but also transparent and accountable. This research journey into traditional ML for fraud detection has illuminated both its strengths and weaknesses, paving the way for future advancements. By embracing its potential and addressing its limitations, we can leverage this powerful technology to create a safer and more secure digital landscape for everyone. The path forward lies in harnessing the diverse power of ML, while remaining mindful of its limitations, to build a future where fraudsters find themselves perpetually outwitted and outmaneuvered.

            Content

            Author and article information

            Journal
            ScienceOpen Preprints
            ScienceOpen
            15 January 2024
            Affiliations
            [1 ] Indiana University of Pennsylvania;
            [2 ] Austin Peay State University;
            Author notes
            Author information
            https://orcid.org/0009-0007-3428-4442
            https://orcid.org/0009-0000-9832-879X
            Article
            10.14293/PR2199.000647.v1
            055d16f2-8a40-4bba-a071-ecece6916d1d

            This work has been published open access under Creative Commons Attribution License CC BY 4.0 , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Conditions, terms of use and publishing policy can be found at www.scienceopen.com .

            History
            : 15 January 2024
            Categories

            The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
            Computer science,Mathematics
            Fraud detection, Machine learning, Feature engineering, Random Forest, Gradient Boosting, Logistic Regression, Deep learning.

            Comments

            Comment on this article