An Efficient Cucconi Based Feature Extraction With Random Decision Forest Classification for Improved Sentiment Analysis

Authors

DOI:

https://doi.org/10.2298/YJOR240315034A

Keywords:

Sentiment analysis, review statements, weak learner, random decision forest classifier, ID3 decision tree, cucconi projective feature extraction

Abstract

Sentiment analysis is a form of opinion mining technique that identifies the polarity of extracted opinions. Nowadays, opinion mining has become an important research area in recent decades to identify the polarity of the statements. Various research works have been carried out on sentiment analysis. However, the existing sentimental analysis techniques, such as time and space complexity, still have considerable limitations. To deal with these issues, this paper proposed the Cucconi Feature Extracted Random Decision Forest Classification (CFDFC) Approach. The main objective of the CFDFC approach is to provide effective sentiment analysis with improved accuracy and reduced time complexity. The proposed CFDFC framework comprisespre-processing, feature extraction, and classification. The pre-processing step eliminates stop words and stem words from user reviews. After the pre-processing step, the feature extraction process is carried out to minimize the dimensionality and time consumption for opinion classification. Cucconi's projective feature extraction process is used in this work to reduce dimensionality. Finally, the classification process is formulated using a random decision forest classifier. The random decision forest classifier uses the ID3 DT (decision tree) as a weak learner to classify the review statements. The performance evaluation of the proposed approach is carried out using performance metrics such as accuracy, error rates, recall values, and time and space complexities concerning the number of review statements gathered from the dataset. The results show that the proposed CFDFC model achieves remarkable accuracy, recall, and minimal time complexity compared to existing methods.

References

T. Li, H. Xu, Z. Liu, Z. Dong, Q. Liu, J. Li, S. Fan, and X. Sun, “A spatiotemporal multi-feature extraction framework for opinion mining”, Neurocomputing, vol. 490, pp.337-346. 2022.

A.F. Pathan, and C. Prakash, “Attention-based position-aware framework for aspect-based opinion mining using bidirectional long short-term memory”, Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 10, pp.8716-8726, 2022.

A.F. Pathan, and C. Prakash, “Unsupervised aspect extraction algorithm for opinion mining using topic modeling”, Global Transitions Proceedings, vol. 2, no. 2, pp.492-499, 2021.

K. Gangadharan, G.R.N. Kumari, D. Dhanasekaran, and K. Malathi, “Detection and classification of various pest attacks and infection on plants using RBPN with GA based PSO algorithm”, Indonesian Journal of Electrical Engineering and Computer Science (IJEECS), vol. 20, no. 3, pp.1278-1288, 2020.

S. Kumar, and M. Zymbler, “A machine learning approach to analyze customer satisfaction from airline tweets”, Journal of Big Data, vol. 6, no. 1, pp.1-16, 2019.

H. Alzahrani, P. Duverger, and N.P. Nguyen, “Contextual Polarity and Influence Mining in Online Social Networks”, In 2017 IEEE 15th Intl Conf on Dependable, Autonomic and Secure Computing, 15th Intl Conf on Pervasive Intelligence and Computing, 3rd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC/PiCom/DataCom/CyberSciTech) (pp. 1054-1061), IEEE, 2017.

S. Gottipati, V. Shankararaman, and J.R. Lin, “Text analytics approach to extract course improvement suggestions from students’ feedback”, Research and Practice in Technology Enhanced Learning, vol. 13, pp.1-19, 2018.

S. Zervoudakis, E. Marakakis, H. Kondylakis, and S. Goumas, “OpinionMine: A Bayesian-based framework for opinion mining using Twitter Data”, Machine Learning with Applications, vol. 3, p.100018, 2021.

B. Yoon, Y. Jeong, K. Lee, and S. Lee, “A systematic approach to prioritizing R&D projects based on customer-perceived value using opinion mining”, Technovation, vol. 98, p. 102164, 2020.

M. Arun, “Experimental investigation on energy and exergy analysis of solar water heating system using zinc oxide-based Nanofluid”, Arabian Journal for Science and Engineering, vol. 48, no. 3, pp.3977-3988, 2023.

Y. Zuo, J. Wu, H. Zhang, D. Wang, and K. Xu, “Complementary aspect-based opinion mining”, IEEE Transactions on Knowledge and Data Engineering, vol. 30, no. 2, pp.249-262, 2017.

C. Prajitha, K.P. Sridhar, and S. Baskar, “Noise Removal-based Thresholding framework for Arrhythmia classification”, Journal of Intelligent & Fuzzy Systems, vol. 45, no. 2, pp.2657-2668, 2023.

B. Zhang, Y. Bai, Q. Zhang, J. Lian, and M. Li, “An opinion-leader mining method in social networks with a phased-clustering perspective”, IEEE Access, vol. 8, pp.31539-31550, 2020.

S. Al-Azani, and E.S.M. El-Alfy, “ Early and late fusion of emojis and text to enhance opinion mining”, IEEE Access, vol. 9, pp.121031-121045, 2021.

A. Mullick, P. Goyal, N. Ganguly, and M. Gupta, “Harnessing twitter for answering opinion list queries”, IEEE Transactions on Computational Social Systems, vol. 5, no. 4, pp.1083-1095, 2018.

A. Mohammadinejad, R. Farahbakhsh, and N. Crespi, “Consensus opinion model in online social networks based on influential users”, IEEE Access, vol. 7, pp.28436-28451, 2019.

A.D. Vo, Q.P. Nguyen, and C.Y. Ock, “Opinion-aspect relations in cognizing customer feelings via reviews”, IEEE Access, vol. 6, pp.5415-5426, 2018.

S. Siddiqui, M.A. Rehman, S.M. Doudpota, and A. Waqas, “Ontology driven feature engineering for opinion mining”, IEEE Access, vol. 7, pp.67392-67401, 2019.

J.S. He, M. Han, S. Ji, T. Du, and Z. Li, “Spreading social influence with both positive and negative opinions in online networks”, Big Data Mining and Analytics, vol. 2, no. 2, pp.100-117, 2019.

F. Riquelme, P. Gonzalez-Cantergiani, D. Hans, R. Villarroel, and R. Munoz, “Identifying opinion leaders on social networks through milestones definition”, IEEE Access, vol. 7, pp.75670-75677, 2019.

T. Nguyen Thi Ngoc, H. Nguyen Thi Thu, and V.A. Nguyen, “Mining aspects of customer’s review on the social network”, Journal of Big Data, vol. 6, pp.1-21, 2019.

S. Poria, E. Cambria, and A. Gelbukh, “Aspect extraction for opinion mining with a deep convolutional neural network”, Knowledge-Based Systems, vol. 108, pp.42-49, 2016.

S.N. Almuayqil, M. Humayun, N.Z. Jhanjhi, M.F. Almufareh, and D. Javed, “Framework for improved sentiment analysis via random minority oversampling for user tweet review classification”, Electronics, vol. 11, no. 19, p.3058, 2022.

M.M.R. Mamun, O. Sharif, and M.M. Hoque, “Classification of textual sentiment using ensemble technique”, SN Computer Science, vol. 3, no. 1, p.49, 2022.

R.S. Patil, and S.R. Kolhe, “Supervised classifiers with TF-IDF features for sentiment analysis of Marathi tweets”, Social Network Analysis and Mining, vol. 12, no. 1, p.51, 2022.

A. Samih, A. Ghadi, &A. Fennan, “Enhanced sentiment analysis based on improved word embeddings and XGboost”, International Journal of Electrical & Computer Engineering (2088-8708), vol. 13, no. 2, 2023.

T.A. Khan, R. Sadiq, Z. Shahid, M.M. Alam, & M. B. M. Su'ud, “Sentiment Analysis using Support Vector Machine and Random Forest”, Journal of Informatics and Web Engineering, vol. 3, no. 1, 67-75, 2024.

A. Go, R. Bhayani, and L. Huang, “Twitter sentiment classification using distant supervision.”, CS224N Project Report, Stanford, vol. 1, p. 12, 2009.

Downloads

Published

2024-10-21

How to Cite

Anuradha, K., Mallik, B., & Krishna, V. M. (2024). An Efficient Cucconi Based Feature Extraction With Random Decision Forest Classification for Improved Sentiment Analysis. Yugoslav Journal of Operations Research, 34(4), 765–783. https://doi.org/10.2298/YJOR240315034A

Issue

Section

Special Issue