A Comparative Study of Deep Learning’s Performance Methods for News Article using Word Representations

Iman Saladin B. Azhar, Winda Kurnia Sari, Naretha Kawadha Pasemah Gumay

Abstract


In natural language processing (NLP), text classification is a crucial task that involves analyzing textual data, which often has high dimensionality. A good word representation is essential to address this challenge, and the word representation using GloVe is one of the popular methods that provides pre-trained word representations in high-dimensional vectors. This research evaluates the effectiveness of three deep learning techniques Convolutional Neural Network (CNN), Deep Neural Network (DNN), and Long Short-Term Memory (LSTM) for online news classification using 300-dimensional GloVe word representations. The CNN model utilizes convolutional and pooling layers to extract local features, the DNN relies on dense layers to learn abstract representations, while the LSTM excels at capturing long-term dependencies between words. The results show that the LSTM model achieved the best accuracy at 93.45%, followed by CNN at 91.24%, and DNN at 90.67%. The superiority of LSTM is attributed to its ability to effectively capture temporal relationships and context, while CNN offers efficiency with faster training times. Although DNN produced solid performance, it is less optimal in understanding word sequences. These findings indicate that LSTM outperforms the other models in online news text classification tasks.

Keywords


Deep Learning, CNN, DNN, LSTM, News Classification

Full Text:

PDF

References


M. Wankhade, A. C. S. Rao, and C. Kulkarni, “A Survey on Sentiment Analysis Methods, Applications, and Challenges,” Artif. Intell. Rev., vol. 55, no. 7, pp. 5731–5780, 2022.

L. Yue, W. Chen, X. Li, W. Zuo, and M. Yin, “A Survey of Sentiment Analysis un Social Media,” Knowl. Inf. Syst., vol. 60, no. 2, pp. 617–663, 2019.

W. K. Sari, I. S. B. Azhar, Z. Yamani, and Y. Florensia, “Fake News Detection using Optimized Convolutional Neural Network and Bidirectional Long Short-Term Memory.” Computer Engineering and Applications, 2024.

J. Guo et al., “A Deep Look Into Neural Ranking Models for Information Retrieval,” Inf. Process. Manag., vol. 57, no. 6, p. 102067, 2020.

A. Kumar Sharma, B. Bajpai, R. Adhvaryu, S. Dhruvi Pankajkumar, P. Parthkumar Gordhanbhai, and A. Kumar, “An Efficient Approach of Product Recommendation System using NLP Technique,” Mater. Today Proc., vol. 80, pp. 3730–3743, 2023.

B. T. Jijo and A. M. Abdulazeez, “Classification based on Decision Tree Algorithm for Machine Learning.” Journal Of Applied Science And Technology Trends, 2021.

S. Ruan, B. Chen, K. Song, and H. Li, “Weighted Naïve Bayes Text Classification Algorithm based on Improved Distance Correlation Coefficient,” Neural Comput. Appl., vol. 34, no. 4, pp. 2729–2738, 2022.

D. E. Cahyani and I. Patasik, “Performance Comparison of TF-IDF and Word2Vec Models for Emotion Text Classification,” Bull. Electr. Eng. Informatics; Vol 10, No 5 Oct. 2021DO - 10.11591/eei.v10i5.3157 , Oct. 2021.

Z. Ren, Q. Shen, X. Diao, and H. Xu, “A Sentiment-Aware Deep Learning Approach for Personality Detection from Text,” Inf. Process. Manag., vol. 58, no. 3, p. 102532, 2021.

S. Hakak, M. Alazab, S. Khan, T. R. Gadekallu, P. K. R. Maddikunta, and W. Z. Khan, “An Ensemble Machine Learning Approach Through Effective Feature Extraction to Classify Fake News,” Futur. Gener. Comput. Syst., vol. 117, pp. 47–58, 2021.

W. K. Sari, D. P. Rini, R. F. Malik, and I. S. B. Azhar, “Multilabel Text Classification in News Articles using Long-Term Memory with Word2Vec.” Jurnal Resti, pp. 276–285, 2020.

E. M. Dharma, F. L. Gaol, H. L. H. S. Warnars, and B. Soewito, “The Accuracy Comparison among Word2vec, Glove, and Fasttext Towards Convolution Neural Network (CNN) Text Classification.” Journal of Theoretical and Applied Information Technology, 2022.

R. A. Stein, P. A. Jaques, and J. F. Valiati, “An Analysis of Hierarchical Text Classification using Word Embeddings,” Inf. SCI. (Ny)., vol. 471, pp. 216–232, 2019.

J. Pennington and R. Socher, “Glove: Global Vectors for Word Representation,” in AES: Journal of the Audio Engineering Society, 1971, vol. 19, no. 5, pp. 417–425.

F. Gasparetti, “News Aggregator.” UCI Machine Learning Repository, 2017.

M. Lichman, “UCI Machine Learning Repository [http://archive.ics.uci.edu/ml].” Irvine, CA: University of California, School of Information and Computer Science, 2013.

A. K. Sharma, S. Chaurasia, and D. K. Srivastava, “Sentimental Short Sentences Classification by using CNN Deep Learning Model with Fine Tuned Word2Vec,” Procedia Comput. SCI., vol. 167, pp. 1139–1147, 2020.

A. Kulkarni and A. Shivananda, “Deep Learning for NLP BT - Natural Language Processing Recipes: Unlocking Text Data with Machine Learning and Deep Learning using Python,” A. Kulkarni and A. Shivananda, Eds. Berkeley, CA: Apress, 2021, pp. 213–262.

B. Alshemali and J. Kalita, “Improving the Reliability of Deep Neural Networks in NLP: A Review,” Knowledge-Based Syst., vol. 191, p. 105210, 2020.

S.-H. Noh, “Analysis of Gradient Vanishing of RNNs and Performance Comparison,” Information, vol. 12, no. 11. 2021.

M. Roodschild, J. Gotay Sardiñas, and A. Will, “A New Approach for the Vanishing Gradient Problem on Sigmoid Activation,” Prog. Artif. Intell., vol. 9, no. 4, pp. 351–360, 2020.

A. Yadav, C. K. Jha, and A. Sharan, “Optimizing LSTM for Time Series Prediction in Indian Stock Market,” Procedia Comput. Sci., vol. 167, pp. 2091–2100, 2020.

B. Ren, “The use of Machine Translation Algorithm based on Residual and LSTM Neural Network in Translation Teaching.” Plos one, 15(11), e0240663, 2020.

D. G. S. N. Murthy, S. R. Allu, B. Andhavarapu, M. Bagadi, and M. Belusonti, “Text based Sentiment Analysis using LSTM.” International Journal of Engineering Research & Technology (IJERT), 2020.




DOI: https://doi.org/10.32520/stmsi.v14i2.5090

Article Metrics

Abstract view : 124 times
PDF - 18 times

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.