Credit Risk

Our latest paper is live! This is the work of our Ph.D. researcher Sahab Zandi, co-supervised by Cristián and María, in collaboration with Christophe Mues, J. C. Moreno-Paredes (at Santander Bank at the time and also Cristián’s Ph.D. classmate), and Kamesh Korangi. This paper is the conclusion of two years of work on our collaboration with Santander Spain, who provided direct access to their database. Our work is, to the best of our knowledge, the first to measure how risk propagates among small businesses on an individual basis, using advanced deep learning paired with financial behaviour and transactional information.

The Problem We Set Out to Solve

SME lending is challenging to judge from spreadsheets alone. Many small firms don’t have long financial histories, and traditional ratios can miss how trouble moves through supply chains. If a key customer stops paying, cash dries up and loan payments follow – that risk lives in relationships, not just in financial information.

The Idea

We added a “map of relationships” to the usual credit data. One layer captures who pays whom (financial transactions); another captures who shares ownership. Together, those multilayer links give a fuller picture of where stress might spread.

How We Built It

We trained models that look at both worlds at once: the tabular records lenders already keep and graph representations of the two networks. A cross-attention fusion step lets each side inform the other before making a call – so the model isn’t blind to either the firm’s own profile or its neighbourhood.
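To make the fusion idea more concrete, here is a minimal NumPy sketch of single-head cross-attention running in both directions. It is not the code from the paper: the token counts, the embedding size, the random projection matrices (which stand in for learned weights), and the final mean-pooling step are all assumptions chosen for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, keys_values, w_q, w_k, w_v):
    """Single-head scaled dot-product attention: `queries` attend to `keys_values`."""
    q = queries @ w_q                        # (n_q, d)
    k = keys_values @ w_k                    # (n_kv, d)
    v = keys_values @ w_v                    # (n_kv, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])  # (n_q, n_kv)
    weights = softmax(scores, axis=-1)       # each row sums to 1
    return weights @ v, weights

rng = np.random.default_rng(0)
d_model = 8
# Hypothetical inputs: 5 embedded groups of tabular features for one firm,
# and 2 graph embeddings (one per network layer: transactions, ownership).
tabular_tokens = rng.normal(size=(5, d_model))
graph_tokens = rng.normal(size=(2, d_model))

# Random projections stand in for weights that would be learned in training.
names = ("tq", "tk", "tv", "gq", "gk", "gv")
w = {n: rng.normal(size=(d_model, d_model)) / np.sqrt(d_model) for n in names}

# Tabular side queries the graph side, and the graph side queries the tabular side.
tab_ctx, tab_w = cross_attention(tabular_tokens, graph_tokens, w["tq"], w["gk"], w["gv"])
graph_ctx, graph_w = cross_attention(graph_tokens, tabular_tokens, w["gq"], w["tk"], w["tv"])

# One simple way to fuse: mean-pool each side and concatenate.
fused = np.concatenate([tab_ctx.mean(axis=0), graph_ctx.mean(axis=0)])
```

In a real model the fused vector would feed a classifier head that outputs a default probability; here it only shows how each modality ends up carrying information from the other.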

What Changed

Adding the networks made predictions sharper than using standard data alone, with our best cross-attention model on the double-layer graph topping the baselines. We were able to catch more future defaulters, more reliably. We also saw modest gains when we kept the real-world direction and size of money flows, which help the model tell strong ties from weak ones.

What the Links Actually Tell Us

The transaction layer tends to be more informative than common ownership because it reflects live dependencies: who owes you money and how concentrated those receipts are. Firms exposed to defaulted customers (money flowing in from a troubled partner) look riskier than those exposed to defaulted suppliers, an effect that’s stronger when the ties are big and recurring.
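As a toy illustration of what exposure to defaulted customers versus defaulted suppliers means on a directed, weighted transaction network, the sketch below computes both shares for one firm. The firms, amounts, defaulted set, and the helper name `exposure_shares` are made up for the example and are not features taken from the paper.

```python
from collections import defaultdict

# Each edge is (payer, payee, amount): money flows from a customer to a supplier.
# Toy data, purely illustrative.
transactions = [
    ("A", "B", 100.0),
    ("C", "B", 300.0),
    ("B", "D", 50.0),
    ("B", "E", 150.0),
]
defaulted = {"C", "D"}

def exposure_shares(firm, transactions, defaulted):
    """Return (share of inflows from defaulted customers,
               share of outflows to defaulted suppliers) for `firm`."""
    inflow = defaultdict(float)   # customer -> total paid to the firm
    outflow = defaultdict(float)  # supplier -> total paid by the firm
    for payer, payee, amount in transactions:
        if payee == firm:
            inflow[payer] += amount
        if payer == firm:
            outflow[payee] += amount

    def share(flows):
        total = sum(flows.values())
        if total == 0:
            return 0.0
        return sum(a for party, a in flows.items() if party in defaulted) / total

    return share(inflow), share(outflow)

customer_share, supplier_share = exposure_shares("B", transactions, defaulted)
# Firm B receives 300 of its 400 in receipts from defaulted customer C,
# and sends 50 of its 200 in payments to defaulted supplier D.
```

Keeping the direction and size of each flow is what lets the two shares differ; an undirected, unweighted graph would treat C and D as equivalent neighbours of B.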

Why It Matters

For lenders, network-aware scores flag clusters of correlated risk earlier and support fairer pricing. For SMEs, diversified, healthy counterparties become a credit asset in their own right. More broadly, better risk signals can widen access to finance—especially where traditional histories are thin—while still keeping decisions explainable and auditable.

Read the preprint at this arXiv link. The paper is still a preprint, so please take it with a grain of salt as it will evolve through the peer-review process. We welcome any feedback!

Our latest paper, from our lab member Mahsa Tavakoli, is out. This was in collaboration with Prof. Rohitash Chandra at UNSW in Australia.

When it comes to understanding how credit ratings are determined, most studies have focused only on numbers—like financial ratios and balance sheets. But in the real world, a lot of important information is found in written documents, such as company reports or news articles. In our study, we looked at how combining both numbers and text using deep learning models could improve the prediction of credit ratings. We tested different types of models and ways to blend the data, and we found that a model based on CNN (a type of deep learning model) that mixes information early and in the middle of the process gave the best results. Surprisingly, simpler models worked better than more complex ones, and the text information—like what’s written about a company—was even more useful than numbers in predicting credit ratings.
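For readers who want to see what mixing information "early" and "in the middle" can look like, here is a rough NumPy sketch. It is an assumption-heavy illustration, not the architecture from the paper: the embedding sizes, the number of filters, the random kernels, and the choice to tile the numeric features onto every token for early fusion are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

def conv1d_relu_maxpool(seq, kernels):
    """seq: (T, d); kernels: (n_filters, k, d).
    Valid 1-D convolution over time, ReLU, then global max-pooling."""
    k = kernels.shape[1]
    windows = np.stack([seq[i:i + k] for i in range(seq.shape[0] - k + 1)])  # (T-k+1, k, d)
    feature_maps = np.einsum("wkd,fkd->wf", windows, kernels)                # (T-k+1, n_filters)
    return np.maximum(feature_maps, 0.0).max(axis=0)                         # (n_filters,)

# Hypothetical inputs: 20 tokens with 16-dim embeddings, and 4 financial ratios.
text_emb = rng.normal(size=(20, 16))
numeric = rng.normal(size=4)

# Early fusion: attach the numeric features to every token before the convolution.
early_input = np.concatenate(
    [text_emb, np.tile(numeric, (text_emb.shape[0], 1))], axis=1
)                                                                            # (20, 20)
early_repr = conv1d_relu_maxpool(early_input, rng.normal(size=(8, 3, 20)))   # (8,)

# Intermediate fusion: encode the text first, then concatenate the numeric features.
text_repr = conv1d_relu_maxpool(text_emb, rng.normal(size=(8, 3, 16)))       # (8,)
mid_repr = np.concatenate([text_repr, numeric])                              # (12,)

# Combining both: the early-fused representation meets the numeric stream again mid-network.
hybrid_repr = np.concatenate([early_repr, numeric])                          # (12,)
```

Any of these representations would then pass through further layers and a softmax over rating classes; the sketch stops at the fusion step because that is the part the paragraph above describes.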

We also checked how reliable our model is when things change, such as during big events like the COVID-19 pandemic. The results showed our model stayed stable and still made good predictions, especially when using text data. Another interesting finding was that credit ratings given by Moody’s were more accurate over time than those from other rating agencies. This could help financial institutions trust those ratings more when making decisions. Overall, our study shows that using both text and numbers together can lead to smarter and more reliable credit rating predictions. It also opens the door for better tools that can help people make confident financial decisions—even during uncertain times.

The paper was published a while ago in Applied Soft Computing. You can find the paper Open Access here, and its arXiv version here.

Our latest paper is live! In this work, we study how to model financial contagion over dynamic networks. When people apply for loans, banks have a pretty important decision to make: can the borrower pay it back? Traditionally, banks use credit scores to assess risk, but our new paper extends our previous research by delving deeper into the relationships borrowers have with others to better understand their chances of default.

Why Networks Matter in Credit Risk

Imagine you’re considering lending money to someone, but that person is part of a group where others have also borrowed money. This idea is at the heart of our study. Rather than seeing borrowers as isolated, we treat them as part of a bigger network. Their connections to other borrowers—like being in the same neighbourhood or using the same mortgage provider—might influence their financial behaviour.

Predicting Loan Defaults Using Dynamic Networks

Borrowers can be connected in various ways and these relationships evolve over time, making them dynamic. To better capture these connections, we developed a model that combines two powerful tools: Graph Neural Networks (GNNs) and Recurrent Neural Networks (RNNs). The GNNs help us map out these borrower networks, while the RNNs allow us to track how these relationships change over time. But that’s not all—we added an attention mechanism that prioritizes certain time points over others, based on their relevance to the borrower’s default risk.
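The pipeline described above can be sketched in a few lines of NumPy. This is a simplified stand-in, not the model from the paper: it uses mean-aggregation as the graph layer, a plain tanh recurrent cell, random weights instead of trained ones, and random toy networks, all of which are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(42)
n_nodes, n_feat, n_hidden, n_steps = 4, 3, 5, 6

def gnn_layer(adj, x, w):
    """Average each borrower's own and neighbours' features, then linear map + ReLU."""
    adj_self = adj + np.eye(adj.shape[0])
    deg = adj_self.sum(axis=1, keepdims=True)
    return np.maximum((adj_self / deg) @ x @ w, 0.0)

def rnn_step(h_prev, z, w_h, w_z):
    """Plain tanh recurrent update (a stand-in for any recurrent cell)."""
    return np.tanh(h_prev @ w_h + z @ w_z)

def temporal_attention(states, v):
    """states: (T, n_nodes, hidden). Returns a per-node context vector and
    attention weights over time steps (each node's weights sum to 1)."""
    scores = states @ v                                   # (T, n_nodes)
    scores = scores - scores.max(axis=0, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=0, keepdims=True)
    context = (weights[..., None] * states).sum(axis=0)   # (n_nodes, hidden)
    return context, weights

# Random weights stand in for trained parameters.
w_gnn = rng.normal(size=(n_feat, n_hidden))
w_h = rng.normal(size=(n_hidden, n_hidden)) * 0.1
w_z = rng.normal(size=(n_hidden, n_hidden)) * 0.1
v = rng.normal(size=n_hidden)
w_out = rng.normal(size=n_hidden)

h = np.zeros((n_nodes, n_hidden))
states = []
for t in range(n_steps):
    # Toy dynamic network: a fresh symmetric adjacency matrix and features each period.
    upper = np.triu(rng.integers(0, 2, size=(n_nodes, n_nodes)), k=1)
    adj = (upper + upper.T).astype(float)
    x = rng.normal(size=(n_nodes, n_feat))
    z = gnn_layer(adj, x, w_gnn)      # who is connected to whom at time t
    h = rnn_step(h, z, w_h, w_z)      # how the picture evolves over time
    states.append(h)

states = np.stack(states)
context, time_weights = temporal_attention(states, v)
default_prob = 1.0 / (1.0 + np.exp(-(context @ w_out)))  # one score per borrower
```

The `time_weights` array is the part that corresponds to the attention mechanism: for each borrower it indicates which snapshots contributed most to the final score.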

What Did We Find?

We tested our model using real-world data from Freddie Mac, a major U.S. mortgage financier. The results were exciting: our model did a better job of predicting which borrowers were likely to default compared to traditional methods. It wasn’t just more accurate; it also provided a deeper understanding of why certain borrowers might struggle to repay loans.

Why This Matters

For banks and lenders, this research could change how they think about credit risk. By considering the connections between borrowers, lenders can make more informed decisions. This could even lead to more people getting approved for loans, especially those who might not have had a chance based on traditional credit scores alone. For borrowers, this kind of model could mean more opportunities. If banks can better understand the factors affecting risk, they might be more willing to take a chance on people who were previously overlooked.

What’s Next?

Our research shows that networks play a significant role in financial decisions, and there’s much more to explore. We’re excited to keep building on this work to better understand financial risk. The more we learn, the more we can help lenders and borrowers alike make informed financial decisions.

The paper is available, open to all and with CC-BY license, here. The code to replicate the paper can be found here.

Also, Juan Cristóbal Constain from Quipu created a podcast using NotebookLM from this post and the paper. Give it a listen below if you prefer!

We had a great time attending the 2023 INFORMS Annual Meeting, which took place from October 15th to October 18th in Phoenix, AZ. This is one of the largest conferences in the field of OR, with 6,000+ attendees and 1,400+ sessions.

The BAL had a strong presence at the conference with four presentations:

On October 15th, we had two presentations:
- Cristián Bravo presented the work with Kamesh Korangi and Christophe Mues on “Large-Scale Portfolio Optimization using Graph Attention Networks”.
- Daniel Abib presented the work with Cristián Bravo, Raffaella Calabrese, and María Óskarsdóttir on “Optimal Feature Split in Classification Models with Dependency”.

On October 17th, Sahab Zandi presented the work with Kamesh Korangi, María Óskarsdóttir, Christophe Mues, and Cristián Bravo on “Leveraging Dynamic Multilayer Networks for Modelling Credit Risk Contagion in SMEs”.

On October 18th, Yuhao (Jet) Zhou presented the work with Collins Ntim, María Óskarsdóttir, Matthew Davison, and Cristián Bravo on “Uncovering the Network Power Gap: A Deep Learning Approach to Investigating Gender Disparities in the Boardrooms of Canadian Public Firms”.

We also had fun trying a few good restaurants in Scottsdale, AZ!

This was a great chance to showcase the preprints that will come out in the next few months. Stay tuned for them!

We had a great time attending the Credit Scoring and Credit Control Conference XVIII, which took place from August 30th to September 1st in Edinburgh, UK. This conference bridges the academic/practitioner divide and is the world’s premier conference for credit scoring and credit risk related topics.

The BAL had a strong presence at the conference with six presentations:

On August 30th, Cristián presented the work with our PhD student Mahsa Tavakoli, cosupervised by Rohitash Chandra from UNSW, on “Multi-Modal Deep Learning for Midcap Credit Rating Prediction Using Text and Numerical Data”.
On August 31st, we had two presentations:
- Our collaborator Prof. María Óskarsdóttir from Reykjavík University, Iceland, presented the work by our PhD students Sahab Zandi and Kamesh Korangi, cosupervised by Prof. Christophe Mues from Southampton University and Cristián, titled “Credit Scoring with Dynamic Multilayer Graph Neural Networks”.
- Cristián presented the work led by our PhD student Sherly Alfonso Sánchez, cosupervised by Prof. Kristina Sendova here at Western, called “Causal Learning for Credit Limit Adjustment in Revolving Lending Under Adversarial Goals”.
On September 1st, we had three:
- Daniel Abib, who joined earlier this year as a postdoc at the Lab, presented the work coauthored with Prof. Raffaella Calabrese from Edinburgh University, Prof. María Óskarsdóttir, and Cristián. The work was called “Optimal Feature Split in Credit Risk Models with Dependency”.
- Our PhD student Kamesh Korangi presented the work from his PhD, coauthored with Christophe Mues and Cristián, on “Deep Temporal Graph Networks for Behavioural Scoring Prediction in Revolving Credit Lines”.
- Our PhD student Sahab Zandi presented the work coauthored with Kamesh and cosupervised by Prof. María Óskarsdóttir, Prof. Christophe Mues, and Cristián. These last two works are part of the collaboration with one of the largest consumer banks in the world. Sahab’s presentation was titled “Modelling Credit Risk Contagion for SMEs over Supply Chains using Dynamic Multilayer Networks”.

The conference provided a great opportunity to meet and network with people in the field of credit risk from both academia and industry. We were honestly surprised and happy with the reception that we had from the conference attendees. We had many interesting conversations and we look forward to what will come out of these chats!

We also had a blast reuniting with some friends and colleagues in Edinburgh after a long while!

We would like to thank the organizers, Professor Galina Andreeva and Professor Jonathan Crook from the Credit Research Centre at the University of Edinburgh, plus of course our collaborator Prof. Christophe Mues for hosting this wonderful conference. We look forward to attending the next one in 2025!

Now that the summer is over I was invited once again to the Weekend Business panel on CBC News. You can watch it below!

The TL;DW version is:

- Latest inflation numbers: Not very good news, as inflation seems to be supply-side and is therefore much harder to control. Gas prices will also push food prices up even more for at least the next quarter. This means that interest rates will remain high for a while, possibly even into 2025. Also, deflation is not a bad thing if it is transitory and limited to basic necessities, as opposed to affecting consumption in the long run.
- The UAW strike: Not really my topic, but my comment here was that the strike has expanded significantly, and that can affect car prices in the future as it now targets in-demand cars. Also, some factories in Canada may be facing temporary work stoppages.
- Equifax report on the increase in lending application fraud: While this is a relatively minor issue, it mixes two different things. First, mortgage fraud is on the rise. Most of this type of fraud is misrepresentation of income, which may be considered a white lie by some borrowers (16% according to a relatively old survey), but it actually is fraud and can have serious consequences for borrowers. The second is auto and credit card fraud. This one is mostly done by criminals who steal identities. The recommendation here is clear: monitor your credit at least monthly and, if you see anything that you don’t recognize, immediately contact your financial institution.

I’m on again on October 14 and November 4.

Another interest rate hike, another hit to Canadians to keep inflation in check, another time journalists reach out to the BAL for insights. I was on CTV national speaking about it. You can see the interview in this link. What’s cool about this link (active for 30 days) is that it also shows how many people viewed the interview: 3,520,000 people. Wow, I’m amazed by the reach of these activities and humbled that I get the chance to speak directly to so many Canadians. Thank you to everyone who tuned in and I hope I helped explain what’s going on!

The second piece of coverage was on CTV London. This one did have a shareable link and a written news piece. The written piece is in this link, and I’ve also embedded the interview below.

I had a bit of a slip that made the segment: what I wanted to say was that one of the factors within core inflation is service inflation, and that one hasn’t come down. Also, this round we had a surprisingly strong demand for goods. According to the BoC this is both due to savings from the pandemic that households are spending, and also because of very strong demand from the US for our goods.

The BoC is much more pessimistic about when they will control inflation, now targeting the second half of 2025. This would come, however, with no recession. This is very uncertain though, as they themselves acknowledge. We’ll have to see.

In my personal opinion, I believe the BoC is ok with a moderate recession as long as inflation comes back down, so they would rather overdo it. Inflation expectations are really high among both consumers and businesses. These decisions are aimed at convincing everyone that they will keep hiking rates as long as necessary. I, for one, believe them.

Our research focuses on using reinforcement learning (RL) to address the credit limit modification problem for companies offering credit card products. This involves two main challenges: defining the RL problem for this specific task and training the RL agent without conducting online experiments with customers.

To define the RL problem, we consider the financial history of credit card holders and the expected losses due to defaults when deciding whether to increase or maintain their credit limits. The actions available are increasing the limit or keeping it the same. We calculate the reward function based on the expected profit, considering the revolving aspect of credit card usage. This differs from previous studies that overlooked this aspect in profit calculations.
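To give a feel for why the revolving aspect matters in the reward, here is a deliberately simple one-period expected-profit function. The formula, the parameter names, and the numbers are assumptions made for illustration only; the reward actually used in the paper is defined there and may differ.

```python
def expected_profit(balance, revolving_share, monthly_rate, pd_default, lgd):
    """Illustrative one-period reward (not the paper's formula).

    Interest is earned only on the part of the balance that is revolved
    (carried over), and only if the customer does not default; the expected
    loss applies to the whole balance.
    """
    interest_income = (1.0 - pd_default) * monthly_rate * revolving_share * balance
    expected_loss = pd_default * lgd * balance
    return interest_income - expected_loss

# Hypothetical customers with the same balance and risk, but different usage.
revolver_reward = expected_profit(
    balance=1000.0, revolving_share=0.5, monthly_rate=0.02, pd_default=0.01, lgd=0.8
)
transactor_reward = expected_profit(
    balance=1000.0, revolving_share=0.0, monthly_rate=0.02, pd_default=0.01, lgd=0.8
)
```

Under these made-up numbers the customer who revolves half the balance yields a positive expected profit, while the one who pays in full each month yields a negative one, which is why ignoring revolving behaviour can distort the reward signal.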

To train the RL agent offline, we use a two-stage model to simulate the balance after taking an action. This involves selecting the balance type and predicting the balance amount using a regressor model. Through our experiments, we found that our trained Double Q-learning agent outperformed other strategies, including the one used by Rappi, our collaborator in this research. Rappi is a Latin American fintech company known for its delivery and commerce services that has also ventured into banking with its RappiCard credit card.
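For readers unfamiliar with Double Q-learning, the following tabular sketch shows the core update trained against a toy simulator. Everything about the environment is hypothetical (three risk buckets as states, the `MEAN_REWARD` table, random state transitions, and the hyperparameters); it is a stand-in for the two-stage balance simulator described above, not a reproduction of it.

```python
import numpy as np

def double_q_update(q_a, q_b, s, a, r, s_next, alpha, gamma, update_a):
    """One Double Q-learning step: one table picks the best next action,
    the other table evaluates it, which reduces overestimation bias."""
    if update_a:
        best = int(np.argmax(q_a[s_next]))
        target = r + gamma * q_b[s_next, best]
        q_a[s, a] += alpha * (target - q_a[s, a])
    else:
        best = int(np.argmax(q_b[s_next]))
        target = r + gamma * q_a[s_next, best]
        q_b[s, a] += alpha * (target - q_b[s, a])

rng = np.random.default_rng(0)
n_states, n_actions = 3, 2  # states: low/medium/high risk; actions: 0 = keep, 1 = increase

# Hypothetical mean profit per (state, action); a real setup would use a fitted simulator.
MEAN_REWARD = np.array([[1.0, 2.0],
                        [0.5, -0.5],
                        [-0.5, -2.0]])

def simulate(state, action, rng):
    """Toy offline simulator: noisy reward, next risk bucket drawn at random."""
    reward = MEAN_REWARD[state, action] + rng.normal(scale=0.1)
    next_state = int(rng.integers(n_states))
    return reward, next_state

q_a = np.zeros((n_states, n_actions))
q_b = np.zeros((n_states, n_actions))
state = 0
for step in range(20000):
    # Epsilon-greedy behaviour policy on the sum of both tables.
    if rng.random() < 0.2:
        action = int(rng.integers(n_actions))
    else:
        action = int(np.argmax(q_a[state] + q_b[state]))
    reward, next_state = simulate(state, action, rng)
    double_q_update(q_a, q_b, state, action, reward, next_state,
                    alpha=0.1, gamma=0.5, update_a=rng.random() < 0.5)
    state = next_state

policy = np.argmax(q_a + q_b, axis=1)  # greedy action per risk bucket
```

In this toy setup the greedy policy should favour a limit increase only for the low-risk bucket, since that is the only state where the made-up reward for increasing exceeds the reward for keeping the limit.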

Our research contributes by providing a conceptual framework for applying RL to credit limit adjustments and emphasizes data-driven decision-making rather than relying solely on expert judgments. Furthermore, we discovered that incorporating additional predictors did not improve the performance of our simulator. This implies that fintech companies do not necessarily have an advantage over traditional banking institutions in this specific task. Figure 1 provides an overview of the proposed methodology’s general workflow.

Figure 1: Methodology’s general workflow.

Link to the working paper: https://arxiv.org/abs/2306.15585

By Mahsa Tavakoli @Bal:

Our research study was undertaken with the aim of enhancing the accuracy of predicting company credit ratings, a critical factor in evaluating their financial stability. Unlike previous studies that solely focused on structured data, such as numbers and figures, we recognized the significance of incorporating other, non-structured information. Thus, our primary objective was to evaluate the effectiveness of employing advanced deep learning models to merge both structured and unstructured data, particularly textual information. Through this approach, we sought to provide a more comprehensive analysis and improve the overall predictive capabilities of the models. In our quest for the optimal approach, we conducted thorough testing of various fusion strategies and deep learning models, including CNN, LSTM, GRU, and BERT. To our surprise, we discovered that a CNN-based model (Figure 2), which effectively amalgamated data from diverse sources, outperformed more intricate models. Leveraging this model enabled us to achieve highly precise credit rating predictions.

Furthermore, we delved into the contribution of different data types to these predictions. Textual data, such as insights shared by company managers, played a pivotal role, particularly during challenging periods like the COVID-19 pandemic. This underscored the significance of contextual information and managerial perspectives in accurately predicting credit ratings.

Additionally, our research encompassed a comparative analysis of ratings provided by various agencies. Moody’s credit ratings emerged as the frontrunner, surpassing those of other agencies like Standard & Poor’s and Fitch Ratings, especially in medium-term predictions.

Collectively, our research provides a comprehensive framework that empowers rating agencies and financial institutions to make well-informed decisions when assigning credit ratings. By incorporating a combination of structured and unstructured data and leveraging the most effective deep learning models, they can significantly enhance the precision of credit rating predictions, thereby augmenting their overall decision-making process.

Figure 1: Blending Textual Managers’ Insights and Companies’ Numerical Data for Precise Credit Rating Predictions

Figure 2: Architecture of the CNN ensemble for the best model, showing the convolution and dropout layers with two streams of data that include text and numerical data (N1, N2, N3, N4).

Link to the working paper: https://arxiv.org/abs/2304.10740

Credit Risk

A Multimodal Approach to SME Credit Scoring Integrating Transaction and Ownership Networks – New Preprint Available!

Our Latest Paper: Multi-modal deep learning for credit rating prediction using text and numerical data streams

New Paper Published! Attention-based Dynamic Multilayer Graph Neural Networks for Loan Default Prediction

The BAL at INFORMS 2023

The BAL at the Credit Scoring Conference 2023

CBC Weekend Business Panel September 23, 2023

The July 2023 Rate Hike

Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning

Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams

The Banking Analytics Lab