Real-Time Semantic Segmentation for Autonomous Driving Based on Swin Transformer

Authors

  • Sophia Koch Faculty of Computer Science, Goethe University Frankfurt, 60323 Frankfurt, Germany
  • Jakob Bauer Faculty of Computer Science, Goethe University Frankfurt, 60323 Frankfurt, German
  • Jonas Fischer Faculty of Computer Science, Goethe University Frankfurt, 60323 Frankfurt, Germany

DOI:

https://doi.org/10.64972/jaat.2026v4.107

Keywords:

Deep Learning, Real-Time Semantic Segmentation, Swin Transformer, Autonomous Driving, Model Compression, Edge Computing

Abstract

Real-time semantic segmentation helps safely construct detailed maps of dynamic cities and improves the robustness of autonomous driving perception. To meet the high precision and high efficiency requirements of automotive hardware, a universal Swin Transformer framework has been developed for autonomous driving. This new method is optimized at runtime by using small decoder modules, hierarchical window self-attention mechanisms, quantization-aware training, channel pruning, and adaptive normalization. By using these additions, global and local contextual data can be obtained at a relatively low computational cost. A large number of experiments were conducted on the built-in city driving dataset, aiming to improve its generalization ability under various lighting and weather conditions, and to enhance its capability to recognize fine-grained object boundaries and rare semantic categories. The system will run stably at a frame rate of over 30 FPS and can operate on both GPUs and embedded devices. Comprehensive performance evaluations indicate that it outperforms traditional CNNs and transformer-based baselines, with significant improvements in mean Intersection over Union (IoU) and boundary delineation. As shown above, scalable attention, controllable model compression, and system-level acceleration constitute the practical application of this framework in safety-critical environments. According to the above research, advanced vision transformers can be used for efficient and high-performance semantic segmentation in intelligent vehicles.

Downloads

Published

2026-01-05

How to Cite

Koch, S., Bauer, J., & Fischer, J. (2026). Real-Time Semantic Segmentation for Autonomous Driving Based on Swin Transformer. Journal of Applied Automation Technologies, 4, 14–27. https://doi.org/10.64972/jaat.2026v4.107

Issue

Section

Articles

Similar Articles

<< < 1 2 

You may also start an advanced similarity search for this article.