Research on terminal identification detection and recognition algorithm based on optimized DBNet-CRNN
Author:
Affiliation:

1.School of Electrical an Electronic Engineering, Shijiazhuang Tiedao University;2.Shijiazhuang Power Supply Section of China Railway Beijing Bureau Group Co.Ltd.

Fund Project:

The National Natural Science Foundation of Chi-na,The Scientific research project of Guoneng Shuohuang Railway Development Co., Ltd

  • Article
  • | |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • | |
  • Comments
    Abstract:

    In order to improve the efficiency of substation inspection, a detection and recognition method of distorted terminal logo text in substation based on deep learning was proposed. The terminal identification detection model uses DBNet as the basic framework, replaces the original ResNet backbone network with ConvNeXt V2, and uses its modern architecture design to significantly improve the global information modeling ability of the model and the feature extraction ability of the terminal identification. In order to further improve the detection accuracy of distorted terminal signs, an efficient multi-scale attention module EMA and a deformable convolutional network DCNv4 are introduced to effectively improve the ability to capture global context information and enhance the robustness to irregular text shapes. After optimization, the detection accuracy of the terminal sign detection model reaches 97.4%, which is 18.9 percentage points higher than the original model. And the amount of calculation is only increased by 2.9%. The terminal identification model takes CRNN as the basic framework, and introduces spatial and channel convolution SCConv to optimize the feature extraction process, which significantly reduces redundant features and reduces the computational burden. In the sequence modeling part, the selective state space network Mamba is used to replace the original LSTM, its state space model and selective mechanism can dynamically model the sequence data, adaptively focus on the important part of the sequence, and significantly enhance the ability to capture long sequence dependencies. The recognition accuracy of the optimized terminal identification model reaches 98.2%. It is 2.7 percentage points higher than the original model. Experimental results show that the proposed method has excellent detection ability and recognition accuracy for negative factors such as distortion, weak light and blur in substation terminal identification.

    Reference
    Related
    Cited by
Get Citation
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:February 21,2025
  • Revised:March 20,2025
  • Adopted:March 25,2025
Article QR Code

Address:No. 219, Ningliu Road, Nanjing, Jiangsu Province

Postcode:210044

Phone:025-58731025