Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Hybrid Deep Learning approach for Lane Detection: Combining convolutional and transformer networks with a post-processing temporal information mechanism, for efficient road lane detection on a road image scene
Jönköping University, School of Engineering, JTH, Department of Computing, Jönköping AI Lab (JAIL).
Jönköping University, School of Engineering, JTH, Department of Computing, Jönköping AI Lab (JAIL).
2023 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Lane detection is a crucial task in the field of autonomous driving and advanced driver assistance systems. In recent years, convolutional neural networks (CNNs) have been the primary approach for solving this problem. However, interesting findings from recent research works regarding the use of Transformer models and attention-based mechanisms have shown to be beneficial in the task of semantic segmentation of the road lane markings. In this work, we investigate the effectiveness of incorporating a Vision Transformer (ViT) to process feature maps extracted by a CNN network for lane detection. We compare the performance of a baseline CNN-based lane detection model with that of a hybrid CNN-ViT pipeline and test the model over a well known dataset. Furthermore, we explore the impact of incorporating temporal information from a road scene on a lane detection model’s predictive performance. We propose a post-processing technique that utilizes information from previous frames to improve the accuracy of the lane detection model. Our results show that incorporating temporal information noticeably improves the model’s performance, and manages to make effective corrections over the originally predicted lane masks. Our SegNet backbone, exploiting the proposed post-processing mechanism, reached an F1 scoreof 0.52 and Intersection-over-Union (IoU) of 0.36 over the TuSimple test set. However, the findings from the testing of our CNN-ViT pipeline and a relevant ablation study, do indicate that this hybrid approach might not be a good fit for lane detection. More specifically, the ViT module fails to exploit the feature sextracted by our CNN backbone and therefore, our hybrid pipeline results in less accurate lane marking spredictions.

Place, publisher, year, edition, pages
2023. , p. 41
Keywords [en]
Lane Detection, CNN, Vision Transformer, Deep Learning, Semantic Segmentation, Computer Vision
National Category
Other Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
URN: urn:nbn:se:hj:diva-61469ISRN: JU-JTH-DTT-2-20230007OAI: oai:DiVA.org:hj-61469DiVA, id: diva2:1772730
Supervisors
Examiners
Available from: 2023-08-23 Created: 2023-06-21 Last updated: 2023-08-23Bibliographically approved

Open Access in DiVA

Thesis_Graded(6020 kB)159 downloads
File information
File name FULLTEXT01.pdfFile size 6020 kBChecksum SHA-512
c5c88c24e0f45f8164b55f9ce780d00c9972fff7b872f730a37f2089527ddae3be1c553e14d575616326c4e06d49859ba01471a5e1ed63d5b71076754f892723
Type fulltextMimetype application/pdf

By organisation
Jönköping AI Lab (JAIL)
Other Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 159 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 650 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf