A Deep Learning Framework for Automated Railway Inspection of Track Environments
Abstract
Automated inspection of railway track defects is essential for improving safety and reducing maintenance costs in large-scale rail networks. This thesis proposes a practical two-stage vision-based inspection framework designed for real-world railway environments. In the first stage, object detection models are systematically evaluated for Fastener localization and region-of-interest (ROI) generation. Comparative experiments demonstrate that YOLO-based detectors outperform CNN-FPN and transformer-based approaches in both detection accuracy and computational efficiency. Consequently, YOLO v11m is selected to provide reliable and real-time ROI extraction. In the second stage, a multi-task learning network operates on the extracted ROIs to jointly predict slot-wise Fastener presence and tightness. Conditional supervision is introduced to align tightness prediction with physical Fastener existence. Experimental results show stable and consistent performance across slot-level metrics, validating the effectiveness of decoupling coarse localization from fine-grained structural inspection. The proposed framework offers a robust and deployable solution for automated railway Fastener inspection.