MACHINE LEARNING

MACHINE LEARNING-BASED AI FRAMEWORK FOR SQL INJECTION IN WEB APPICATION FIREWALL

Year of Publication
Publication Type
Abstract
This research develops an AI-powered Web Application Firewall (WAF) to detect SQL injection( SQLi) attacks, addressing the limitations of traditional signature-based systems. Using the Kaggle SQLi dataset (30,905 queries), the study applied TF-IDF character-level n-grams and three machine learning models: XGBoost, Random Forest, and SVM, with hyperparameter tuning using grid search and cross-validation. The SVM model performed best, achieving 99.48% accuracy, 99.59% F1-score, 99.90% AUC- ROC, very low false positives and false negatives, and real-time detection with 1.52 ms latency and throughput of 658 queries/second per CPU core. Character n-grams successfully captured common SQLi patterns such as UNION SELECT, OR operators, comments, and tautologies. A Flask-based web application and REST API demonstrated that the system is production-ready, highly scalable, and far cheaper than commercial WAFs. The research confirms that traditional machine learning with good feature engineering can match deep learning performance while remaining simpler and more efficient. Limitations include reliance on one dataset, binary classification, and reduced effectiveness against highly obfuscated or second-order attacks. Future work should involve multi-dataset
testing, adversarial robustness, attack subtype classification, and exploring contextual embeddings. Overall, the study shows that ensemble machine learning provides an accurate, fast, and cost- effective alternative for real-time SQL injection detection.
Supervisor(s)
co-supervisor

A COMPARATIVE ANALYSIS ON PREDICTING FOOTBALL MATCHES USING MACHINE LEARNING. (A CASE STUDY OF SPANISH LEAGUE)

Year of Publication
upload
Publication Type
Abstract
Football appears to be the most popular sports the world over, making it a game of betting for money making among other thing. This business of betting, over the years has gown making it a difficult and complex task in predicting correctly the outcome of football matches. This is as a result of the numerous number of factors that are considered but cannot be quantitatively valued or modeled. The aim of the project is to develop a machine learning algorithms for the prediction of football matches. The classification algorithms adopted in this project includes: K-Nearest Neighbor (KNN), support vector machines (SVM), Gaussian naïve Bayes (GNB), decision tree (DT) and Logistic Regression (LR) techniques. The dataset used was gathered from football- data-co.uk. The models was built using python programming language environment. The comparative analysis carried out in this project support that machine learning algorithms perform well and shows room for future improvement.
Supervisor(s)
co-supervisor