Corpus construction and named entity recognition for landslide geological hazards
Affiliation:

1. School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology; 2. Beijing Harzard of Geological Disaster Prevention; 3. School of Geographical Sciences, Nanjing University of Information Science and Technology, Nanjing, China; 4. Anhui University of Science and Technology; 5. School of Geography, Nanjing Normal University, Nanjing

    Abstract:

    Extracting landslide geohazard entities from large volumes of text describing landslide geological hazards is the basis for constructing a landslide geohazard knowledge graph. Using unstructured text such as landslide geological hazard exploration reports, this paper analyzes the linguistic characteristics of landslide hazard descriptions in light of the mechanism of landslide geological hazards, formulates an annotation scheme and annotation specifications for the semantic information of landslide geological hazards, and constructs a domain corpus for landslide geological hazards. Entity recognition experiments on the corpus show that the accuracy, recall, and precision of the named entity recognition model all exceed 90%, which verifies the applicability of the corpus and provides data support for subsequent research on landslide geohazard knowledge graphs.

History
  • Received: April 29, 2024
  • Revised: May 21, 2024
  • Accepted: May 27, 2024
