Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Study on ML-Based Software Defect Detection for Security Traceability in Smart Healthcare Applications
Jönköping University, School of Engineering. Department of Computer Science, Kristianstad University, Kristianstad, SE-29188, Sweden.
Department of Computer Science, Kristianstad University, Kristianstad, SE-29188, Sweden.
2023 (English)In: Sensors, E-ISSN 1424-8220, Vol. 23, no 7, article id 3470Article in journal (Refereed) Published
Abstract [en]

Software Defect Prediction (SDP) is an integral aspect of the Software Development Life-Cycle (SDLC). As the prevalence of software systems increases and becomes more integrated into our daily lives, so the complexity of these systems increases the risks of widespread defects. With reliance on these systems increasing, the ability to accurately identify a defective model using Machine Learning (ML) has been overlooked and less addressed. Thus, this article contributes an investigation of various ML techniques for SDP. An investigation, comparative analysis and recommendation of appropriate Feature Extraction (FE) techniques, Principal Component Analysis (PCA), Partial Least Squares Regression (PLS), Feature Selection (FS) techniques, Fisher score, Recursive Feature Elimination (RFE), and Elastic Net are presented. Validation of the following techniques, both separately and in combination with ML algorithms, is performed: Support Vector Machine (SVM), Logistic Regression (LR), Naïve Bayes (NB), K-Nearest Neighbour (KNN), Multilayer Perceptron (MLP), Decision Tree (DT), and ensemble learning methods Bootstrap Aggregation (Bagging), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Random Forest(RF), and Generalized Stacking (Stacking). Extensive experimental setup was built and the results of the experiments revealed that FE and FS can both positively and negatively affect performance over the base model or Baseline. PLS, both separately and in combination with FS techniques, provides impressive, and the most consistent, improvements, while PCA, in combination with Elastic-Net, shows acceptable improvement.

Place, publisher, year, edition, pages
MDPI , 2023. Vol. 23, no 7, article id 3470
Keywords [en]
Algorithms, Bayes Theorem, Machine Learning, Neural Networks, Computer, Software, Support Vector Machine, Adaptive boosting, Application programs, Decision trees, Defects, Extraction, Learning systems, Least squares approximations, Life cycle, Nearest neighbor search, Principal component analysis, Software design, Support vector machines, Elastic net, Ensemble learning, Features extraction, Features selection, Machine-learning, Partial least square regression, Principal-component analysis, Selection techniques, Software defect prediction, Software development life-cycle, algorithm, Feature extraction, feature selection, software defects prediction
National Category
Software Engineering
Identifiers
URN: urn:nbn:se:hj:diva-60226DOI: 10.3390/s23073470ISI: 000970228700001PubMedID: 37050529Scopus ID: 2-s2.0-85152322265Local ID: GOA;;876657OAI: oai:DiVA.org:hj-60226DiVA, id: diva2:1752585
Available from: 2023-04-24 Created: 2023-04-24 Last updated: 2023-05-15Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus
By organisation
School of Engineering
In the same journal
Sensors
Software Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 110 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf