Back to Search
Start Over
Chunk Segmentation of Chinese Sentences Using a Combined Statistical and Rule-based Approach (CSRA).
- Source :
-
International Journal of Computer Processing of Oriental Languages . Jun2007, Vol. 20 Issue 2/3, p197-218. 22p. 3 Diagrams, 10 Charts, 7 Graphs. - Publication Year :
- 2007
-
Abstract
- Deep parsing of Chinese sentences is a very challenging task due to their complexity such as ambiguous word boundaries and meanings. An alternative mode of Chinese language processing is to perform shallow parsing of Chinese sentences in which chunk segmentation plays an important role. In this paper, we present a chunk segmentation algorithm using a combined statistical and rule-based approach (CSRA). The decision rules for refining chunk segmentation are generated from incorrectly segmented chunks from a statistical model which is built on a training corpus. Experimental results show that the CSRA works well and produces satisfactory chunk segmentation results for subsequent processes such as chunk tagging and chunk collocation extraction. [ABSTRACT FROM AUTHOR]
- Subjects :
- *SENTENCES (Grammar)
*CHINESE language
*ALGORITHMS
*STATISTICS
Subjects
Details
- Language :
- English
- ISSN :
- 02194279
- Volume :
- 20
- Issue :
- 2/3
- Database :
- Academic Search Index
- Journal :
- International Journal of Computer Processing of Oriental Languages
- Publication Type :
- Academic Journal
- Accession number :
- 30028071