Enhancing Software Requirement Classification via Dataset Fusion and Machine Learning
| dc.authorwosid | FWU-2100-2022 | |
| dc.authorwosid | NQT-6529-2025 | |
| dc.authorwosid | AAY-5207-2020 | |
| dc.authorwosid | OLC-0756-2025 | |
| dc.authorwosid | ABW-9013-2022 | |
| dc.contributor.author | Raza, Muhammad Owais | |
| dc.contributor.author | Mir, Vajeeha | |
| dc.contributor.author | Rasheed, Jawad | |
| dc.contributor.author | Yeşiltepe, Mirşat | |
| dc.contributor.author | Alsubai, Shtwai | |
| dc.contributor.author | Rasheed, Jawad | |
| dc.contributor.department-temp | ||
| dc.date.accessioned | 2026-05-06T12:16:22Z | |
| dc.date.issued | 2025 | |
| dc.department | Mühendislik ve Doğa Bilimleri Fakültesi | |
| dc.description.abstract | Software engineering involves numerous steps; a successful software product follows these guidelinesto the core. One such step is gathering requirements for a software product. This step is quite expensivein terms of time and money; a potential solution is to automate the requirement collection process.Automating the process of gathering software requirements requires separating requirements into types.An approach to predict the type of requirement is using text classification and machine learning; however,the problem with this approach is that it requires a large amount of data, which is not available forthis use case. In this study, we perform dataset fusion to create a large dataset. We applied verticalfusion, which increased the number of instances in the dataset. Once a fusion-based dataset is created,machine learning algorithms are applied, and based on empirical results, the performance of the machinelearning model after fusion drastically improved to 87.78% f1score with support vector machine (SVM).This improvement shows the efficacy of data fusion in improving the performance of a text classifierand demonstrates that it can overcome the limitations of small datasets by combining data from diversesources. Our study demonstrated the robustness of our approach in software requirement classificationby surpassing the highest recall scores from the previous four years, achieving 94.20% with fusion-basedSVC and outperforming previous models even in non-fusion settings. | |
| dc.identifier.citation | Raza, M., Mir, V., Rasheed, J., Yeşiltepe, M., & Alsubai, S.. (2025). Enhancing Software Requirement Classification via Dataset Fusion and Machine Learning. Acta Infologica, 9(1), 275–292. https://doi.org/10.26650/acin.1634472 | |
| dc.identifier.doi | 10.26650/acin.1634472 | |
| dc.identifier.endpage | 292 | |
| dc.identifier.issn | 2602-3563 | |
| dc.identifier.issue | 1 | |
| dc.identifier.orcid | 0000-0003-3761-1641 | |
| dc.identifier.startpage | 275 | |
| dc.identifier.uri | https://doi.org/10.26650/acin.1634472 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12436/9498 | |
| dc.identifier.volume | 9 | |
| dc.identifier.wos | 001511513000001 | |
| dc.identifier.wosquality | Q4 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.language.iso | en | |
| dc.publisher | İstanbul University | |
| dc.relation.ispartof | Acta Infologica | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Öğrenci | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.subject | Software Engineering | |
| dc.subject | Machine Learning | |
| dc.subject | Software Requirement Engineering | |
| dc.subject | Hybrid Models | |
| dc.subject | Ensemble Modeling | |
| dc.title | Enhancing Software Requirement Classification via Dataset Fusion and Machine Learning | |
| dc.type | Article | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | f9b9b46c-d923-42d3-b413-dd851c2e913a | |
| relation.isAuthorOfPublication.latestForDiscovery | f9b9b46c-d923-42d3-b413-dd851c2e913a |









