Implementation of the c4.5 decision tree learning algorithm for sentiment analysis in e-commerce application reviews on google play store
DOI:
https://doi.org/10.31315/cip.v1i1.6128Abstract
Objective: Forknowing the level of accuracy of the C4.5 Decision Tree Learning Algorithm in sentiment analysis of reviews of e-commerce applications on the google play store.
Design/method/approach: Using C4.5 AlgorithmDecision Tree Learning.
Results: This study uses a confusion matrix test with a comparison of 80% for training data and 20% for test data, where 750 is used for training data and 190 is used for test data. This test obtained an average accuracy of 92.63%, precision 69.58%, and recall 69.99%.
Authenticity/state of the art: In this study using the C4.5 Algorithm Decision Tree Learningto conduct sentiment analysis of e-commerce reviews, which use the gain value to perform feature selection. There are four categories, namely display, service, access, and product. The data in this study were obtained from the google play store.
References
Y. Jahja, “Pengaruh service performance value, emotional value, monetary value, social value terhadap customer loyalty melalui perceived value dan customer satisfaction pada konsumen zalora di Surabaya,” pp. 1–13, 2018.
Ilmawan, “Aplikasi Mobile untuk Analisis Sentimen pada Google Play,” IJCCS (Indonesian J. Comput. Cybern. Syst., vol. 9, no. 1, pp. 53–64, 2015.
A. Kumar and Y. K. Kim, “The store-as-a-brand strategy: The effect of store environment on customer responses,” J. Retail. Consum. Serv., vol. 21, no. 5, pp. 685–695, 2014.
Murphy, Rosie. 2020. “Local Consumer Review Survey: How Customer Reviews Affect Behavior", https://www.brightlocal.com/Research/Local-Consumer-Review-Survey/ .
F. Gunawan, M. A. Fauzi, and P. P. Adikara, “Analisis Sentimen Pada Ulasan Aplikasi Mobile Menggunakan Naive Bayes dan Normalisasi Kata Berbasis Levenshtein Distance (Studi Kasus Aplikasi BCA Mobile),” Syst. Inf. Syst. Informatics J., vol. 3, no. 2, pp. 1–6, 2017.
H. Zulfa, “ANALISIS PENGARUH PERSEPSI RISIKO, KUALITAS SITUS WEB, DAN KEPERCAYAAN KONSUMEN TERHADAP KEPUTUSAN PEMBELIAN KONSUMEN E-COMMERCE SHOPEE DI KOTA SEMARANG,” JSHP J. Sos. Hum. dan Pendidik., vol. 4, no. 2, pp. 1–11, 2020.
A. Rakhman and M. R. Tsani, “Analisis Sentimen Review Media Massa,” Smart Comp, vol. 8, no. 2, pp. 78–82, 2019.
Z. A. Gumilang, “IMPLEMENTASI NAÏVE BAYES CLASSIFIER DAN ASOSIASI UNTUK ANALISIS SENTIMEN DATA ULASAN APLIKASI E-COMMERCE SHOPEE PADA SITUS GOOGLE PLAY TUGAS,” 經濟研究, 2018.
M. Lailiyah, “Sentiment Analysis Menggunakan Rule Based Method Pada Data Pengaduan Publik Berbasis Lexical Resources,” 2017.
And M. A. F. P. Antinasari, R. S. Perdana, “‘Analisis Sentimen Tentang Opini Film Pada Dokumen Twitter Berbahasa Indonesia Menggunakan Naive Bayes Dengan Perbaikan Kata Tidak Baku,’” מים והשקייה, vol. 549, no. 12, pp. 40–42, 2017.
Y. Kustiyahningsih and E. Rahmanita, “Aplikasi Sistem Pendukung Keputusan Menggunakan Algoritma C4.5. untuk Penjurusan SMA,” J. Semantec, vol. 5, no. 2, pp. 101–108, 2016.
Mulholland, “Application of the C4 . 5 classifier to building an expert system for ionchromatography, 27, 95–104,” עלון הנוטע, vol. 66, no. September, pp. 37–39, 1995.
And P. V. C. P. N. Patil, P. R. Lathi, “‘Customer Card Classification Based on C5 . 0 & CART Algorithms,’” עלון הנוטע, vol. 66, no. 3, pp. 37–39, 2012.
J. Han, J., Kamber, M., & Pei, “Data Mining Concepts and Techniques Third Edition. Waltham: Elsevier Inc.,” p. 2011, 2011.
Downloads
Published
Issue
Section
License
License and Copyright Agreement
In submitting the manuscript to the journal, the authors certify that:
- They are authorized by their co-authors to enter into these arrangements.
- The work described has not been formally published before, except in the form of an abstract or as part of a published lecture, review, thesis, or overlay journal. Please also carefully read SITech's Posting Your Article Policy.
- That it is not under consideration for publication elsewhere,
- That its publication has been approved by all the author(s) and by the responsible authorities – tacitly or explicitly – of the institutes where the work has been carried out.
- They secure the right to reproduce any material that has already been published or copyrighted elsewhere.
- They agree to the following license and copyright agreement.
Copyright
Authors who publish with Computing and Information Processing Letters agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
Licensing for Data Publication
Science in Information Technology Letters use a variety of waivers and licenses, that are specifically designed for and appropriate for the treatment of data:
Open Data Commons Attribution License, http://www.opendatacommons.org/licenses/by/1.0/ (default)
Creative Commons CC-Zero Waiver, http://creativecommons.org/publicdomain/zero/1.0/
Open Data Commons Public Domain Dedication and Licence, http://www.opendatacommons.org/licenses/pddl/1-0/
Other data publishing licenses may be allowed as exceptions (subject to approval by the editor on a case-by-case basis) and should be justified with a written statement from the author, which will be published with the article.
Open Data and Software Publishing and Sharing
The journal strives to maximize the replicability of the research published in it. Authors are thus required to share all data, code, or protocols underlying the research reported in their articles. Exceptions are permitted but have to be justified in a written public statement accompanying the article.
Datasets and software should be deposited and permanently archived in appropriate, trusted, general, or domain-specific repositories (please consult http://service.re3data.org and/or software repositories such as GitHub, GitLab, Bioinformatics.org, or equivalent). The associated persistent identifiers (e.g., DOI or others) of the dataset(s) must be included in the data or software resources section of the article. Reference(s) to datasets and software should also be included in the article's reference list with DOIs (where available). Where no domain-specific data repository exists, authors should deposit their datasets in a general repository such as ZENODO, Dryad, Dataverse, or others.
Small data may also be published as data files or packages supplementary to a research article; however, the authors should prefer a deposition in data repositories in all cases.