1. Big data-assisted urban governance: A comprehensive system for business documents classification of the government hotline.
- Author
- Zhang, Zicheng; Li, Anguo; Wang, Li; Cao, Wei; Yang, Jianlin
- Subjects
- *BIG data, *NETWORK governance, *GOVERNMENT publications, *NEW words, *DATA structures
- Abstract
- The government service platform, exemplified by the government hotline, must handle large volumes of business documents that contain rich and timely public opinion information and citizens' demands. However, manual processing cannot keep pace with large-scale text data, which raises operating costs and lowers the quality of government services. This study proposes a comprehensive system for business document classification of the government hotline (BDCGHS) in China to address these challenges. BDCGHS uses information entropy fused with term frequency-inverse document frequency (TF-IDF) weights to mine new words from business documents of the government hotline and stores them in a new word repository. These new words improve Chinese word segmentation and text representation for text classification. We introduce a novel data structure, the nested balanced binary tree, to expedite new word mining; it runs almost five times faster than Trie trees. Comparative experiments on the THUNews and government hotline datasets show that the improved BDCGHS algorithm outperforms other text classification algorithms by 3 %. Compared with the latest bidirectional encoder representations from transformers (BERT) model, BDCGHS improves the accuracy of order dispatch based on business documents by almost 3 %. It has also operated stably in two Chinese cities for over a year, with favorable results.
• An embedded balanced binary tree structure is proposed for new word discovery.
• A new word database is constructed for the government hotline.
• The effects of mainstream classification algorithms are compared based on a new word database.
• An intelligent text classification system for the government hotline is constructed and yields promising results.
[ABSTRACT FROM AUTHOR]
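The abstract names the ingredients of new word mining (information entropy and a TF-IDF weight) but not the exact way they are fused. The following is a minimal sketch under stated assumptions, not the authors' implementation: candidates are character n-grams, "information entropy" is taken to be the smaller of the left and right neighbour entropies, and the fusion is assumed to be a simple product with a smoothed TF-IDF weight. The function names `entropy` and `score_candidates` are hypothetical, and plain dictionaries stand in for the nested balanced binary tree, which the record does not describe in enough detail to reproduce.

```python
import math
from collections import Counter, defaultdict


def entropy(counter):
    """Shannon entropy (in nats) of a Counter of neighbouring characters."""
    total = sum(counter.values())
    return -sum((c / total) * math.log(c / total) for c in counter.values())


def score_candidates(docs, max_len=3):
    """Score every character n-gram (length 2..max_len) as a new-word candidate."""
    tf = Counter()               # total occurrences across the corpus
    df = Counter()               # number of documents containing the n-gram
    left = defaultdict(Counter)  # characters seen immediately to the left
    right = defaultdict(Counter) # characters seen immediately to the right

    for doc in docs:
        seen = set()
        for n in range(2, max_len + 1):
            for i in range(len(doc) - n + 1):
                gram = doc[i:i + n]
                tf[gram] += 1
                seen.add(gram)
                if i > 0:
                    left[gram][doc[i - 1]] += 1
                if i + n < len(doc):
                    right[gram][doc[i + n]] += 1
        df.update(seen)

    scores = {}
    for gram, freq in tf.items():
        # A free-standing word tends to have varied neighbours on both sides.
        boundary = min(entropy(left[gram]), entropy(right[gram]))
        # Smoothed inverse document frequency.
        idf = math.log((1 + len(docs)) / (1 + df[gram])) + 1
        # ASSUMPTION: the fusion is a plain product; the abstract does not give the formula.
        scores[gram] = boundary * freq * idf
    return scores


docs = ["abxyc", "dxye", "fxyg"]
scores = score_candidates(docs)
print(max(scores, key=scores.get))  # "xy": the only n-gram with varied neighbours on both sides
```

In this toy corpus every n-gram other than "xy" has at most one distinct neighbour on each side, so its boundary entropy, and therefore its score, is zero. In practice the high-scoring candidates would be filtered against an existing dictionary before being added to a new word repository.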
- Published
- 2024