Volume 41 Issue 1
Feb.  2023
Turn off MathJax
Article Contents
MA Haowei, ZHANG Di, LI Yuli, FAN Liang. A Ship Detection Algorithm for Infrared Images under Hazy Environment based on an Improved YOLOv5 Algorithm[J]. Journal of Transport Information and Safety, 2023, 41(1): 95-104. doi: 10.3963/j.jssn.1674-4861.2023.01.010
Citation: MA Haowei, ZHANG Di, LI Yuli, FAN Liang. A Ship Detection Algorithm for Infrared Images under Hazy Environment based on an Improved YOLOv5 Algorithm[J]. Journal of Transport Information and Safety, 2023, 41(1): 95-104. doi: 10.3963/j.jssn.1674-4861.2023.01.010

A Ship Detection Algorithm for Infrared Images under Hazy Environment based on an Improved YOLOv5 Algorithm

doi: 10.3963/j.jssn.1674-4861.2023.01.010
  • Received Date: 2022-09-26
    Available Online: 2023-05-13
  • Accurately detecting ships from surveillance images is crucial for intelligent ship traffic surveillance around port waters. To address the issues of low accuracy and capability of small target feature extraction from traditional YOLOv5 object detection algorithms from the infrared images under hazy weather, an improved YOLOv5 algorithm based on Swin Transformer is proposed. To expand the diversity of the original dataset, the improved algorithm considers the characteristics of ship infrared images with strong resistance to cloud and fog interference but blurred image contour features and low contrast, and enhances the dataset based on an atmospheric scattering model. To enhance the algorithm's attention to global features during feature extraction, the backbone network of the improved algorithm uses Swin Transformer to extract ship infrared image features and expands the window view range using a multi-head self-attention mechanism controlled by a sliding window. To enhance the capability of extracting spatial features of dense small targets, a multi-scale feature fusion Path Aggregation Network (PANet) is improved by adding a bottom-up feature sampling module and a coordinate attention (CA) mechanism, in order to capture the position, direction, and cross-channel information of small target ships. To reduce false negatives and false positives, a complete intersection over union loss function (CIoU) is used to calculate the coordinate prediction loss of the original bounding box and combined with the non-maximum suppression algorithm (NMS) to judge and filter candidate boxes in a multi-loop structure to improve the reliability of object detection. Study results show that under certain concentrations of haze, the average recognition accuracy, recall rate, and detection rate of the improved algorithm is 93.73%, 98.10%, and 38.6 frames per second, respectively. Compared with the following algorithms: RetinaNet, Faster R-CNN, YOLOv3 SPP, YOLOv4, YOLOv5, and YOLOv6-N, the average recognition accuracy of the proposed algorithm is improved by 13.90%, 11.53%, 8.41%, 7.21%, 6.20%, and 3.44% respectively; and the average recall rate is improved by 11.81%, 9.67%, 6.29%, 5.53%, 4.87%, and 2.39%, respectively. The proposed Swin-YOLOv5s algorithm has a strong generalization ability for ship target recognition of different sizes and has a high detection accuracy, which helps to improve the surveillance capability of ships around port waters.

     

  • loading
  • [1]
    王岩, 孙寿保, 徐峰, 等. 提升尹公洲段航道通过能力的探讨[J]. 水运工程, 2020(12): 161-165, 190.

    WANG Y, SUN S B, XU F, et al. Discussion on improving passage capacity of Yingongzhou channel[J]. Port & Waterway Engineering, 2020(12): 161-165, 190. (in Chinese)
    [2]
    郝姝馨, 郝增周, 黄海清, 等. 基于Himawari-8数据的夜间海雾识别[J]. 海洋学报, 2021, 43(11): 166-180.

    HAO S X, HAO Z Z, HUANG H Q, et al. Nighttime sea fog recognition based on Himawari-8 data[J]. Acta OceanologicaSinca, 2021, 43(11): 166-180. (in Chinese)
    [3]
    李云红, 刘宇栋, 苏雪平, 等. 红外与可见光图像配准技术研究综述[J]. 红外技术, 2022, 44(7): 641-651. https://www.cnki.com.cn/Article/CJFDTOTAL-HWJS202207001.htm

    LI Y H, LIU Y D, SU X P, et al. Review of infrared and visible image registration[J]. Infrared Technology, 2022, 44(7): 641-651. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-HWJS202207001.htm
    [4]
    SHU Q, WU C, ZHONG Q, et al. Alternating minimization algorithm for hybrid regularized variational image dehazing[J]. Optik, 2019(185): 943-956.
    [5]
    ZHANG J, FENG F, SONG W. A compensation textures dehazing method for water alike area[J]. The Journal of Supercomputing, 2021, 77(4): 3555-3570. doi: 10.1007/s11227-020-03406-8
    [6]
    MA Z, WEN J, ZHANG C, et al. An effective fusion defogging approach for single sea fog image[J]. Neurocomputing, 2016(173): 1257-1267.
    [7]
    GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]. 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbia, USA: IEEE, 2014.
    [8]
    车凯, 向郑涛, 陈宇峰, 等. 基于改进Fast R-CNN的红外图像行人检测研究[J]. 红外技术, 2018, 40(6): 578-584. https://www.cnki.com.cn/Article/CJFDTOTAL-HWJS201806010.htm

    CHE K, XIANG Z T, CHEN Y F, et al. Research on infrared image pedestrian detection based on improved Fast R-CNN[J]. Infrared Technology, 2018, 40(6): 578-584. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-HWJS201806010.htm
    [9]
    顾燕, 李臻, 杨锋, 等. 基于改进Faster R-CNN的复杂背景红外车辆检测算法[J]. 激光与红外, 2022, 52(4): 614-619. https://www.cnki.com.cn/Article/CJFDTOTAL-JGHW202204022.htm

    GU Y, LI Z, YANG F, et al. Infrared vehicle detection algorithm with complex background based on improved Fast R-CNN[J]. Laser & Infrared, 2022, 52(4): 614-619. (in Chinese) https://www.cnki.com.cn/Article/CJFDTOTAL-JGHW202204022.htm
    [10]
    CAI Z, VASCONCELOS N. Cascade R-CNN: delving into high quality object detection[C]. 2018 IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA: IEEE, 2018.
    [11]
    ZHANG C, XIONG B, KUANG G. Ship detection and recognition in optical remote sensing images based on scale enhancement rotating Cascade R-CNN networks[C]. 2021 IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium: IEEE, 2021.
    [12]
    LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detector[C]. 2016 IEEE European Conference on Computer Vision, Amsterdam: IEEE, 2016.
    [13]
    REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]. 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA: IEEE, 2016.
    [14]
    ZOU Y, ZHAO L, QIN S, et al. Ship target detection and identification based on SSD_MobilenetV2[C]. 2020 IEEE Information Technology and Mechatronics Engineering Conference, Changsha, China: IEEE, 2020.
    [15]
    CHANG Y L, ANAGAW A, CHANG L, et al. Ship detection based on YOLOv2 for SAR imagery[J]. Remote Sensing, 2019, 11(7): 786-800.
    [16]
    陈信强, 郑金彪, 凌峻, 等. 基于异步交互聚合网络的港船作业区域人员异常行为识别[J]. 交通信息与安全, 2022, 40 (2): 22-29. doi: 10.3963/j.jssn.1674-4861.2022.02.003

    CHEN X Q, ZHENG J B, LING J, et al. Detecting abnormal behaviors of workers at ship working fields via asynchronous interaction aggregation network[J]. Journal of Transport Information and Safety, 2022, 40(2): 22-29. (in Chinese) doi: 10.3963/j.jssn.1674-4861.2022.02.003
    [17]
    LIU R W, YUAN W, CHEN X, et al. An enhanced CNN-enabled learning method for promoting ship detection in maritime surveillance system[J]. Ocean Engineering, 2021, (235): 109435.
    [18]
    LIU W, REN G, YU R, et al. Image-adaptive YOLO for object detection in adverse weather conditions[C]. The AAAI Conference onArtificial Intelligence, Beijing, China: AAAI, 2022.
    [19]
    LIU Z, LIN Y, CAO Y, et al. Swin transformer: hierarchical vision transformer using shifted windows[C]. 2021 IEEE International Conference on Computer Vision, Montreal, Canada: IEEE, 2021.
    [20]
    HOU Q, ZHOU D, FENG J. Coordinate attention for efficient mobile network design[C]. 2021 IEEE Conference on Computer Vision and Pattern Recognition, Nashville, USA: IEEE, 2021.
    [21]
    HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]. 2018 IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA: IEEE, 2018.
    [22]
    WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module[C]. 2018 IEEE European Conference on Computer Vision, Munich, Germany: IEEE, 2018.
    [23]
    PARK J, WOO S, LEE J Y, et al. BAM: bottleneck attention module[C]. 2018 British Machine Vision Conference, Newcastle, UK: IAPR, 2018.
    [24]
    Infiray. Infiray infrared open source offshore vessel dataset[R/OL]. (2021-12)[2022-10-30]. http://iray.iraytek.com:7813/apply/E_Sea_shipping.html/
    [25]
    CHEN X, LING J, WANG S, et al. Ship detection from coastal surveillance videos via an ensemble Canny-Gaussian-morphology framework[J]. The Journal of Navigation, 2021, 74 (6): 1252-1266.
    [26]
    CHEN Z, CHEN D, ZHANG Y, et al. Deep learning for autonomous ship-oriented small ship detection[J]. Safety Science, 2020, (130): 104812.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(9)  / Tables(3)

    Article Metrics

    Article views (985) PDF downloads(72) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return