Sentiment Analysis using Latent Dirichlet Allocation for Aspect Term Extraction
Keywords:Sentiment analysis, Latent Dirichlet Allocation (LDA), Product design, User-generated text data, Opinion mining
This work proposes a sentiment analysis approach for decision-making in product design, analysis, and market share. The approach incorporates user-generated text data in the form of consumer reviews to extract product features using topic-based modeling methods. Latent Dirichlet Allocation (LDA) is employed to extract aspect categories from the data and identify the sentiment of each review using the VADER sentiment analyzer. The performance of the proposed method is evaluated in terms of accuracy, with an achieved result of 80%. The extracted topics are also summarized to provide leads for product design and quality assurance. The approach can be used by manufacturers, retailers, and suppliers to understand customers' opinions about their products better and make better decisions. LDA is a powerful unsupervised method that can extract latent topics from a collection of documents; this method has been widely used in text mining, information retrieval, and natural language processing. The accuracy can be improved by using more sophisticated models or more data.
Z. Li, Y. Fan, B. Jiang, T. Lei, and W. Liu, A survey on sentiment analysis and opinion mining for social multimedia, “Multimedia Tools and Applications,” 78 (6), pp. 6939–6967, 2019, doi: https://doi.org/10.1007/s11042-018-6445-z.
C. Zong, R. Xia, and J. Zhang, Sentiment analysis and opinion mining, in Text Data Mining, Singapore: Springer Singapore, 2021, pp. 163–199. doi: https://doi.org/10.1007/978-981-16-0100-2_8.
R. Nimesh, P. Veera Raghava, S. Prince Mary, and B. Bharathi, A survey on opinion mining and sentiment analysis, “IOP Conference Series: Materials Science and Engineering,” 590 (1), pp. 14–46, 2019, doi: https://doi.org/10.1088/1757-899X/590/1/012003.
M. S. Hossain and M. F. Rahman, Customer sentiment analysis and prediction of insurance products’ Reviews Using Machine Learning Approaches, “FIIB Business Review,” p. 231971452211157, 2022, doi: https://doi.org/10.1177/23197145221115793.
D.-F. Ciocodeică, R.-G. (Popa) Chivu, I.-C. Popa, H. Mihălcescu, G. Orzan, and A.-M. (Dumitrache) Băjan, The degree of adoption of business intelligence in romanian companies—the case of sentiment analysis as a marketing analytical tool, “Sustainability,” 14 (12), p. 7518, 2022, doi: https://doi.org/10.3390/su14127518.
M. Birjali, M. Kasri, and A. Beni-Hssane, A comprehensive survey on sentiment analysis: Approaches, challenges and trends, “Knowledge-Based Systems,” 226, p. 107134, 2021, doi: https://doi.org/10.1016/j.knosys.2021.107134.
H. Jelodar, Y. Wang, C. Yuan, X. Feng, X. Jiang, Y. Li, and L. Zhao, Latent dirichlet allocation (LDA) and topic modeling: models, applications, a survey, “Multimedia Tools and Applications,” 78 (11), pp. 15169–15211, 2019, doi: https://doi.org/10.1007/s11042-018-6894-4.
R. Priyantina and R. Sarno, Sentiment analysis of hotel reviews using latent dirichlet allocation, semantic similarity and LSTM, “International Journal of Intelligent Engineering and Systems,” 12 (4), pp. 142–155, 2019, doi: https://doi.org/10.22266/ijies2019.0831.14.
M. F. A. Bashri and R. Kusumaningrum, Sentiment analysis using latent dirichlet allocation and topic polarity wordcloud visualization, in 2017 5th International Conference on Information and Communication Technology (ICoIC7), May 2017, pp. 1–5. doi: https://doi.org/10.1109/ICoICT.2017.8074651.
A. Goyal and I. Kashyap, Latent dirichlet allocation - an approach for topic discovery, in 2022 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COM-IT-CON), May 2022, pp. 97–102. doi: https://doi.org/10.1109/COM-IT-CON54601.2022.9850912.
T. A. Rana and Y.-N. Cheah, Aspect extraction in sentiment analysis: comparative analysis and survey, “Artificial Intelligence Review,” 46 (4), pp. 459–483, 2016, doi: https://doi.org/10.1007/s10462-016-9472-z.
Y.-C. Chang, C.-H. Ku, and C.-H. Chen, Social media analytics: Extracting and visualizing Hilton hotel ratings and reviews from TripAdvisor, “International Journal of Information Management,” 48, pp. 263–279, 2019, doi: https://doi.org/10.1016/j.ijinfomgt.2017.11.001.
O. Alqaryouti, N. Siyam, A. Abdel Monem, and K. Shaalan, Aspect-based sentiment analysis using smart government review data, “Applied Computing and Informatics,” 2020, doi: https://doi.org/10.1016/j.aci.2019.11.003.
S. De, S. Dey, S. Bhatia, and S. Bhattacharyya, An introduction to data mining in social networks, in Advanced Data Mining Tools and Methods for Social Computing, Elsevier, 2022, pp. 1–25. doi: https://doi.org/10.1016/B978-0-32-385708-6.00008-4.
W. Zhang, X. Li, Y. Deng, L. Bing, and W. Lam, Towards generative aspect-based sentiment analysis, in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), 2021, 2, pp. 504–510. doi: https://doi.org/10.18653/v1/2021.acl-short.64.
J. Wang, B. Xu, and Y. Zu, Deep learning for aspect-based sentiment analysis, in 2021 International Conference on Machine Learning and Intelligent Systems Engineering (MLISE), Jul. 2021, pp. 267–271. doi: https://doi.org/10.1109/MLISE54096.2021.00056.
B. Verma and R. S. Thakur, Sentiment analysis using lexicon and machine learning-based approaches: a survey, in Lecture Notes in Networks and Systems, 34, 2018, pp. 441–447. doi: https://doi.org/10.1007/978-981-10-8198-9_46.
K. Crowston, X. Liu, and E. E. Allen, Machine learning and rule-based automated coding of qualitative data, “Proceedings of the American Society for Information Science and Technology,” 47 (1), pp. 1–2, 2010, doi: https://doi.org/10.1002/meet.14504701328.
Z. Fachrina and D. H. Widyantoro, Aspect-sentiment classification in opinion mining using the combination of rule-based and machine learning, in 2017 International Conference on Data and Software Engineering (ICoDSE), Nov. 2017, 2018-Janua, pp. 1–6. doi: https://doi.org/10.1109/ICODSE.2017.8285850.
T. A. Rana and Y.-N. Cheah, Sequential patterns rule-based approach for opinion target extraction from customer reviews, “Journal of Information Science,” 45 (5), pp. 643–655, 2019, doi: https://doi.org/10.1177/0165551518808195.
V. Bonta, N. Kumaresh, and N. Janardhan, A comprehensive study on lexicon based approaches for sentiment analysis, “Asian Journal of Computer Science and Technology,” 8 (S2), pp. 1–6, 2019, doi: https://doi.org/10.51983/ajcst-2019.8.S2.2037.
S. Sohangir, N. Petty, and Di. Wang, Financial sentiment lexicon analysis, in 2018 IEEE 12th International Conference on Semantic Computing (ICSC), Jan. 2018, 2018-Janua, pp. 286–289. doi: https://doi.org/10.1109/ICSC.2018.00052.
A. Veluchamy, H. Nguyen, M. L. Diop, and R. Iqbal, Comparative study of sentiment analysis with product reviews using machine learning and lexicon-based approaches, “SMU Data Science Review,” 1 (4), pp. 1–22, 2018.
Z. Madhoushi, A. R. Hamdan, and S. Zainudin, Sentiment analysis techniques in recent works, in 2015 Science and Information Conference (SAI), Jul. 2015, pp. 288–291. doi: https://doi.org/10.1109/SAI.2015.7237157.
S. Momtazi and F. Naumann, Topic modeling for expert finding using latent Dirichlet allocation, “Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery,” 3 (5), pp. 346–353, 2013, doi: https://doi.org/10.1002/widm.1102.
F. Gurcan and N. E. Cagiltay, Big data software engineering: analysis of knowledge domains and skill sets using LDA-based topic modeling, “IEEE Access,” 7, pp. 82541–82552, 2019, doi: https://doi.org/10.1109/ACCESS.2019.2924075.
M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth, The author-topic model for authors and documents, 2012.
How to Cite
Copyright (c) 2022 Journal of Computers, Mechanical and Management
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
The Journal of Computers, Mechanical and Management applies the Creative Commons Attribution- Non-Commercial 4.0 license to published articles. Under this license, authors retain ownership of the copyright for their content, but they allow anyone to download, reuse, reprint, modify, distribute and/or copy the content as long as the original authors and source are cited. Appropriate attribution can be provided by simply citing the original article.