Research on Word Attributes and Word Meaning Difficulty Based on Time Series Analysis
DOI:
https://doi.org/10.54097/0y242z91Keywords:
Time Series Analysis (ARMA), Gradient Boosted Tree (GBDT) Algorithm, Correlation Analysis.Abstract
In recent years, natural language processing has increasingly focused on revealing the relationship between word attributes and word meaning difficulty through data mining and statistical modeling to provide new perspectives for lexical cognition research. In this study, the daily result data from January 7 to December 31, 2022, classify word attributes into 30 categories of related attributes. A time-series model was built to derive the number of reported results on March 1, 2023, to be about 12,295 items. It was found that 29 attributes such as the commonness of words are not related to the difficulty pattern and the meaning of words are related to the difficulty pattern. Then corresponding to the data on the number of attempts to solve different subcategories, a gradient boosting tree (GBDT) [1] regression model was built to derive the correlation percentages under the number of attempts 1, 2, 3, 4, 5, 6, and X for players solving the EERIE word case as 1%, 4%, 18%, 28%, 28%, 16%, and 5%. In this paper, it is found that in solving EERIE words, players have a higher probability of solving the word under 4 and 5 attempts and a smaller probability of solving the word under 1 and 6 attempts.
Downloads
References
[1] Feng Naixing, Wang Huan, Zhu Zixian, et al. Perfectly matched single-layer intelligent algorithm based on limit gradient boosting for efficient absorption of aero transient electromagnetic problems [J]. Physics Letters, 2024, 73 (06): 173 - 81.
[2] Liu Enchen, Yang Xiaolong, Luo Xinwu. The Application of Logical Thinking Training in Professional English Teaching: Starting from the Representation of “Properties” of Things by Words [J]. Overseas English, 2021, (02): 100 - 1.
[3] DU Zhiyong, YANG Fan, YANG Wenjie. Study on daily maximum temperature prediction for long time series based on Conv1D-LSTM hybrid model [J]. Journal of Beijing Institute of Printing, 2024, 32 (09): 52 - 7.
[4] SUN Le Zheng, ZHAO Meng Di, DU Shaobo, et al. Prediction of game difficulty and influencing factors based on multivariate linear regression model [J]. Digital Technology and Application, 2023, 41 (12): 85- 7.
[5] Cai Zhongzhe, Zeng Riwei, Lin Chengcheng, et al. Prediction and Analysis of Wordle Answering [J]. Journal of Taizhou College, 2023, 45 (06): 8 - 13+21.
[6] ZHANG Mingming, ZHANG Mingdong. Application of combined gray Verhulst-time series model in settlement monitoring [J]. Mapping and Spatial Geographic Information, 2024, 47 (09): 151 - 3.
[7] LI Junfeng, YANG Xin. Impact of water level change on water quality in Caizihu Lake [J]. Anhui Chemical Industry, 2024, 50 (02): 124 - 9.
[8] Xutuo Wang, Yuting Wei, Huanhuan Zhang. Credit scoring model selection method based on Kolmogorov-Smirnov (KS) statistic [J]. Mathematical Statistics and Management, 2024, 43 (01): 100 - 16.
[9] Liu Jindong, Wang Tencheng, Chen Yonghong, et al. Improved method based on Anderson-Darling normality test and its application [J]. Practice and Understanding of Mathematics, 2023, 53 (04): 184 - 93.
[10] Li Peng, Luo Xiangchun, Meng Qingwei, et al. Ultra-short-term load forecasting for customer-level integrated energy systems based on Spearman correlation threshold optimization and VMD-LSTM [J]. Global Energy Internet, 2024, 7 (04): 406 - 20.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







