SENTIMENT ANALYSIS OF ELECTRIC CAR PRODUCT TRENDS IN INDONESIA USING BM25 AND K-NEAREST NEIGHBOR

Ariel Alfaro, Iffatul Mardhiyah

Abstract


The global and Indonesian shift towards electric vehicles (EVs) is driven by efforts to reduce emissions and promote sustainable energy. Social media, especially Twitter, serves as an important gauge of public sentiment towards electric vehicles in Indonesia and can therefore inform policy making. This research uses BM25 term weighting and K-Nearest Neighbor (KNN) classification to analyze that sentiment, with the aim of improving EV adoption strategies. Conducted in 2023, the research applies data mining, specifically Knowledge Discovery and Data Mining (KDD), and analyzes primary and secondary data descriptively and quantitatively. The process starts with data collection by crawling Twitter, followed by initial text processing. Next come labeling, term frequency (TF) and inverse document frequency (IDF) calculations with BM25, and classification with KNN; an Evaluation and Validation Diagram visualizes the process. The findings show that negative sentiment dominates at 48% (4,800 records), followed by neutral sentiment at 34% (3,400 records) and positive sentiment at 18% (1,800 records). The spread of sentiment across all three classes highlights the diverse perceptions of society. Combining BM25 weighting with KNN effectively reduces overfitting and underfitting, especially for the negative and neutral classes. Accuracy without BM25 ranged from 58.6% to 60.25%, while integrating BM25 with KNN raised it by about 12.5 percentage points, to between 71% and 72.75%. Understanding sentiment provides a basis for decision making and policy development, as well as insight into public perceptions of electric vehicles in Indonesia. Implications include leveraging positive sentiment for marketing, adjusting strategies, refining pricing, addressing infrastructure and reliability issues, and collaborating with governments to increase adoption of electric vehicles in society.
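The abstract does not spell out how BM25 and KNN are combined, so the following is only a minimal sketch of one common arrangement: each labeled document is scored against the new text with BM25, and the labels of the k highest-scoring documents are put to a majority vote. Everything in it is an assumption made for illustration: the class name `BM25KNN`, the whitespace tokenizer, the parameter values `k1=1.2` and `b=0.75` (widely used defaults, not values reported by the study), the non-negative IDF variant, and the six toy English sentences, which are not taken from the study's data.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical minimal preprocessing: lowercase and split on whitespace.
    return text.lower().split()


class BM25KNN:
    """Illustrative BM25-ranked nearest-neighbor classifier (not the authors' code)."""

    def __init__(self, docs, labels, k1=1.2, b=0.75):
        self.docs = [tokenize(d) for d in docs]
        self.labels = list(labels)
        self.k1, self.b = k1, b  # assumed defaults, not from the study
        self.n_docs = len(self.docs)
        self.avg_len = sum(len(d) for d in self.docs) / self.n_docs
        self.tfs = [Counter(d) for d in self.docs]
        # Document frequency: number of documents containing each term.
        df = Counter(term for d in self.docs for term in set(d))
        # Non-negative IDF variant (an assumption; other variants exist).
        self.idf = {
            t: math.log((self.n_docs - n + 0.5) / (n + 0.5) + 1.0)
            for t, n in df.items()
        }

    def score(self, query_tokens, i):
        # BM25 score of training document i for the distinct query terms.
        tf, length = self.tfs[i], len(self.docs[i])
        norm = self.k1 * (1 - self.b + self.b * length / self.avg_len)
        total = 0.0
        for term in set(query_tokens):
            f = tf.get(term, 0)
            if f:
                total += self.idf[term] * f * (self.k1 + 1) / (f + norm)
        return total

    def predict(self, text, k=3):
        # Rank training documents by BM25 score and vote among the top k.
        q = tokenize(text)
        ranked = sorted(range(self.n_docs),
                        key=lambda i: self.score(q, i), reverse=True)
        votes = Counter(self.labels[i] for i in ranked[:k])
        return votes.most_common(1)[0][0]


# Toy training data, invented purely for this example.
train = [
    ("charging stations are rare and the price is too expensive", "negative"),
    ("battery is expensive and charging takes too long", "negative"),
    ("electric car is quiet and cheap to run", "positive"),
    ("love the clean quiet ride of this electric car", "positive"),
    ("government announces electric car subsidy today", "neutral"),
    ("new electric car model launches next month", "neutral"),
]
model = BM25KNN([text for text, _ in train], [label for _, label in train])
print(model.predict("charging is too expensive", k=3))  # negative
```

In this arrangement BM25 plays the role of the similarity measure inside KNN, so rare, informative terms count for more than frequent ones and long documents are not favored merely for their length. A real pipeline would also need the cleaning, stemming, and labeling steps the abstract mentions, which this sketch leaves out.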

Keywords


trend sentiment analysis, electric car, Indonesia, BM25 and K-Nearest Neighbor



DOI: 10.33751/komputasi.v21i2.9382


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.