DATA SCIENCE WORKSHOP: CERVICAL CANCER CLASSIFICATION AND PREDICTION USING MACHINE LEARNING AND DEEP LEARNING WITH PYTHON GUI
Vivian Siahaan, Rismon Hasiholan SianiparSubsequently, the project delves into an illuminating visualization of categorized feature distributions. Through the ingenious use of pie charts, bar plots, and heatmaps, the project unveils the distribution patterns of key attributes such as 'Hormonal Contraceptives,' 'Smokes,' 'IUD,' and others. These visualizations illuminate potential relationships between these features and the target variable 'Biopsy,' which signifies the presence or absence of cervical cancer. Such exploratory analyses serve as a vital foundation for identifying influential trends within the dataset.
Transitioning into the core phase of predictive modeling, the workshop orchestrates a meticulous ensemble of machine learning models to forecast cervical cancer outcomes. The repertoire includes Logistic Regression, Decision Trees, Random Forests, Support Vector Machines (SVM), K-Nearest Neighbors (KNN), Gradient Boosting, Naïve Bayes, and the power of ensemble methods like AdaBoost and XGBoost. The models undergo rigorous hyperparameter tuning facilitated by Grid Search and Random Search to optimize predictive accuracy and precision.
As the workshop progresses, the spotlight shifts to the realm of deep learning, introducing advanced neural network architectures. An Artificial Neural Network (ANN) featuring multiple hidden layers is
…