Beyond Structured Data - Deep Learning Approaches for Multimodal Risk Assessment and Bias Detection
Abstract
This thesis investigates the integration of multimodal deep learning techniques for enhanced decision-making in financial and public sector domains, with a focus on fairness, transparency, and performance. Although traditional models rely predominantly on structured data, this research explores the synergistic potential of combining structured and unstructured sources, such as text, images, and numerical data, through advanced data fusion strategies. The first line of inquiry focuses on corporate credit rating predictions by evaluating various fusion levels and techniques using convolutional neural networks, recurrent neural networks, and transformer-based language models. The results show that hybrid fusion strategies significantly outperform simpler or more complex architectures and that textual data plays a more influential role than numerical counterparts. The second strand addresses the detection of bias in multilingual customer service feedback within public tax administrations. A novel framework is proposed, integrating quantized large language models with human-in-the-loop validation to enhance bias detection and ensure equitable service across demographic groups. This approach demonstrated greater alignment with expert evaluations and adaptability to specific organizational contexts. The final study focuses on mortgage default prediction using multimodal inputs, such as news articles and spatial imagery. To address this, we introduce a novel fusion architecture, CapsFusion, which not only captures modality-specific features but also incorporates trainable weights that dynamically adjust the contribution of each modality. Together, these contributions demonstrate the viability and necessity of multimodal, interpretable AI systems for responsible decision-making in high-stakes environments. The findings underscore the importance of fusing diverse data types, embedding fairness principles, and improving accessibility for greater stakeholder participation.