Back to Search Start Over

Arukikata Travelogue Dataset with Geographic Entity Mention, Coreference, and Link Annotation

Authors :
Higashiyama, Shohei
Ouchi, Hiroki
Teranishi, Hiroki
Otomo, Hiroyuki
Ide, Yusuke
Yamamoto, Aitaro
Shindo, Hiroyuki
Matsuda, Yuki
Wakamiya, Shoko
Inoue, Naoya
Yamada, Ikuya
Watanabe, Taro
Publication Year :
2023

Abstract

Geoparsing is a fundamental technique for analyzing geo-entity information in text. We focus on document-level geoparsing, which considers geographic relatedness among geo-entity mentions, and presents a Japanese travelogue dataset designed for evaluating document-level geoparsing systems. Our dataset comprises 200 travelogue documents with rich geo-entity information: 12,171 mentions, 6,339 coreference clusters, and 2,551 geo-entities linked to geo-database entries.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2305.13844
Document Type :
Working Paper