1. A Novel Approach on Automatic Building of Word Correlation Net Based on Statistic
- Author
-
Chang Yang, Xue Guang Zhou, Hao Zhang, and Yang Wang
- Subjects
Thesaurus (information retrieval) ,Computer science ,business.industry ,Generalization ,General Engineering ,WordNet ,computer.software_genre ,Word lists by frequency ,Semantic memory ,Artificial intelligence ,business ,computer ,Word (computer architecture) ,Statistic ,Text retrieval ,Natural language processing - Abstract
Semantic knowledge-base has important meaning for increasing the deepness of NLP. Some comparatively mature Semantic knowledge-base such as WordNet, HowNet and Thesaurus was developed by manpower, and has many difficulties on actual application. In order to capture Chinese word knowledge of relating status moue automatically and demonstrably, this paper presented the concept of word correlation and a calculation method of word correlation based on statistic. Then a correlation net based on Chinese words which have strong domain characteristic was built. In order to resolve the difficulty of processing the huge amount of data, a hard disk storing method of array segmentation was designed. The semantic knowledge gained by the experiment had the advantage of empiricism. It is veracity and generalization is strong so it can play an important role in many fields such as text categorization, text retrieval, text filtering, etc.
- Published
- 2013
- Full Text
- View/download PDF