Case Background
Problem: Verne Global wanted well-researched, long-form blog content to inform their customers how HPC power was shaping industry and could benefit their businesses.
Solution: We created a sustained schedule of blog content to explore a range of HPC and machine learning use cases. These blogging efforts help spur rapid growth at Verne Global .
HPC and Big Data Are Powering
a New Era of Fraud Detection
The convergence of two existing disciplines can be an explosively creative force. Take for example the hybrid of computer science, biology, and mathematics that has given rise to bioinformatics, which although a new field now plays a crucial role in the drug discovery process and is expected to grow at an impressive 25% per year until 2025.
​
Another great example convergence leading to greater productivity is HPC technology merging with big data and machine learning. Though in many ways this convergence is still in its early stages, the merging of these technologies is already starting to deliver concrete, real-world benefit in the fraud detection field, helping saving financial firms hundreds of millions of dollars.
​
Unsurprisingly, PayPal is one of the companies at the forefront of this convergence. As online transaction processor that was conceived on the Internet, Paypal has grown up exposed to virtually every cybersecurity threat and fraud imaginable. Because of this, the company has been aggressively pursuing a security strategy that used both HPC and big data technologies since as early as 2001. Although PayPal keeps details of its fraud protection systems a secret, it has been very open about leveraging the flexibility of the open-source H20 machine learning framework in conjunction with its big data infrastructure, which gathers more than 20 terabytes of log data every day.
​
To gain insight from this massive Hadoop dataset, Paypal, which handles over 13 million online monetary transactions per day, combines the three types of machine learning — linear, nonlinear, and deep learning — to help identify and stop fraudsters. The company estimates that just in the first few years of deploying their fraud detection systems, they’ve saved over $700 million dollars in fraudulent transactions that they otherwise wouldn’t have noticed.
​
Though PayPal may have been one of the first to recognize the value of converging HPC and big data technologies, today virtually all the major financial services firms are seeking novel ways to combine these technologies in order to protect themselves. Another high-profile example is MasterCard, which has a staggering 2.2 billion cards in use in 330 countries, and handles roughly 160 million transactions per hour, or 52 billion transactions a year.
​
MasterCard, much like PayPal, employs a hybrid machine learning approach that uses supervised and unsupervised learning, in addition to traditional big data technologies like Hadoop and Spark, to examines the location, spending habits, and travel patterns of the customer before each purchase is made. According to Vice President of Global Big Data Consulting at MasterCard, Nick Curcuru, the company’s infrastructure applies 1.9 million distinct rules to examine each transaction, and processes each of these transactions in just milliseconds.
Other major institutions have developed their own mix of HPC and big data technologies to protect themselves against fraud. CitiBank, for example, has recently made investments in machine learning companies like Feedzai, Cylance and Ayasdi in order to bolster its fraud detection capability, and the company just recently announced that it would open a branch of its Global Innovation Lab at a London WeWork, specializing in the development of big data and high performance computing technologies.
​
The quest for better fraud protection is a never-ending one, however, as credit card fraud continues to increase in severity. Credit card fraud in Europe caused 1.8 billion Euro in damages in 2016 (with the UK and France accounting for 73% of that), while in the US credit card fraud has been a persistent problem, and even debit cards, which until now have been relatively safe, are now starting to see a rise in fraud. The problems of controlling fraud are compounded by demands from consumers, who want faster, easier payments, and greater flexibility. This pressure was almost certainly a factor when the Payment Card Industry (PCI) Security Standards Council, the regulatory body that oversees transactions from credit and debit card, decide to start allowing PIN numbers to be entered on mobile phones. The decision, which may benefit the user experience, will also introduces a new level of vulnerability into the transaction process, and therefore has elicited some skepticism.
​
Delivering an improved user experience while also maintaining maximum security will necessitate improved fraud detection systems. This points to a need for stronger, more deeply converged systems that can gather deeper insight from customer and transaction data. Realizing the better union of HPC, big data, and machine learning will require a deeper harmony between once heterogeneous computational infrastructures, and perhaps also a shift in perception. Challenges aside, major players in the HPC industry are already making serious commitments to this convergence, and envision a not-to-distant future where the three workloads can be seamlessly processed on one system. Verne Global, which helps its clients process HPC, big data, and machine learning workloads with the low-cost, fully-sustainable power resources, is in a key position to help facilitate this transformation, a role we’re eager to fulfill.