TY - JOUR
T1 - Exploring and mining rationale information for low-rating software applications
AU - Ullah, Tahir
AU - Khan, Javed Ali
AU - Khan, Nek Dil
AU - Yasin, Affan
AU - Arshad, Hasna
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2023
Y1 - 2023
N2 - Rationale refers to making human judgments, sets of reasons, or intentions to explain a particular decision. Nowadays, crowd-users argue and justify their decisions on social media platforms about market-driven software applications, thus generating a software rationale. Such rationale information can be of pivotal importance for the software and requirements engineers to enhance the performance of existing software applications by revealing end-users tactic knowledge to improve software designing and development decision-making. For this purpose, we proposed an automated approach to capture and analyze end-user reviews containing rationale information, focusing on low-rating applications in the amazon store using Natural Language Processing (NLP) and supervised machine learning (ML) classification methods. In the literature, high-rating applications have been emphasized while ignoring low-rating software application that causes potential biasness. Therefore, we examined 59 comparatively low-ranked market-based software applications from the Amazon app store covering various software categories to capture and identify crowd-users justifications. Next, using a developed grounded theory and content analysis approach, we studied and recorded how crowd-users analyze and explain their rationale based on issues encountered, attacking or supporting arguments registered, and updating or uninstalling software applications. Also, to achieve the best results, an experimental study is conducted by comparing various ML and deep learning (DL) algorithms, i.e., MNB, LR, RF, MLP, KNN, AdaBoost, Voting, LSTM, and BILSTM classifiers on the end-users rationale data set by preprocessing the input data, applying feature engineering, balancing the data set, and then training and testing the ML algorithms with a standard cross-validation approach. We obtained satisfactory results with MLP, voting, LSTM, and RF Classifiers, having 93%, 93%, 91%, and 90% average accuracy, respectively. Also, we plot the ROC curves for the high-performing ML and DL classifiers to identify and capture classifiers yielding the best performance with an under-sampling or oversampling balancing approach. The proposed research approach outerperforms the existing rationale approaches with better Precision, Recall, and F-measure values. Additionally, the paper discusses various aspects of the proposed approach by extending it in multiple directions to be utilized by software developers and vendors to help improve the performance of existing software applications.
AB - Rationale refers to making human judgments, sets of reasons, or intentions to explain a particular decision. Nowadays, crowd-users argue and justify their decisions on social media platforms about market-driven software applications, thus generating a software rationale. Such rationale information can be of pivotal importance for the software and requirements engineers to enhance the performance of existing software applications by revealing end-users tactic knowledge to improve software designing and development decision-making. For this purpose, we proposed an automated approach to capture and analyze end-user reviews containing rationale information, focusing on low-rating applications in the amazon store using Natural Language Processing (NLP) and supervised machine learning (ML) classification methods. In the literature, high-rating applications have been emphasized while ignoring low-rating software application that causes potential biasness. Therefore, we examined 59 comparatively low-ranked market-based software applications from the Amazon app store covering various software categories to capture and identify crowd-users justifications. Next, using a developed grounded theory and content analysis approach, we studied and recorded how crowd-users analyze and explain their rationale based on issues encountered, attacking or supporting arguments registered, and updating or uninstalling software applications. Also, to achieve the best results, an experimental study is conducted by comparing various ML and deep learning (DL) algorithms, i.e., MNB, LR, RF, MLP, KNN, AdaBoost, Voting, LSTM, and BILSTM classifiers on the end-users rationale data set by preprocessing the input data, applying feature engineering, balancing the data set, and then training and testing the ML algorithms with a standard cross-validation approach. We obtained satisfactory results with MLP, voting, LSTM, and RF Classifiers, having 93%, 93%, 91%, and 90% average accuracy, respectively. Also, we plot the ROC curves for the high-performing ML and DL classifiers to identify and capture classifiers yielding the best performance with an under-sampling or oversampling balancing approach. The proposed research approach outerperforms the existing rationale approaches with better Precision, Recall, and F-measure values. Additionally, the paper discusses various aspects of the proposed approach by extending it in multiple directions to be utilized by software developers and vendors to help improve the performance of existing software applications.
KW - Amazon store
KW - Deep learning
KW - Grounded theory
KW - Machine learning
KW - NLP
KW - Software rationale
KW - User feedback
UR - http://www.scopus.com/inward/record.url?scp=85166902679&partnerID=8YFLogxK
U2 - 10.1007/s00500-023-09054-3
DO - 10.1007/s00500-023-09054-3
M3 - Article
AN - SCOPUS:85166902679
SN - 1432-7643
JO - Soft Computing
JF - Soft Computing
ER -