Performance Analysis of Classification and Regression Tree (CART) Algorithm in Classifying Male Fertility Levels with Mobile-Based


  • Arif Rahman Hakim Medika Suherman University
  • Dewi Marini Umi Atmaja Medika Suherman University
  • Amat Basri Buddhi Dharma University
  • Andri Ariyanto Jenderal Achmad Yani University



Android, CART, Classification, Fertility, K-Fold Cross Validation


Fertility is the ability to produce offspring in a man or the ability of the reproductive organs to work optimally in fertilization. Fertility rates have declined drastically in the last fifty years. Machine Learning is a field devoted to understanding and building learning methods. This study will use machine learning algorithms to classify male fertility levels, namely the Classification and Regression Tree (CART) algorithm and the K-Fold Cross Validation validation method. The fertility dataset used in this study was obtained from the UCI Machine Learning website, with a total of 100 data and the variables used are Age, Childish diseases, Accident or serious trauma, Surgical intervention, High fevers in the last year, Frequency of alcohol consumption, Smoking habit, Number of hours spent sitting per day and Diagnosis. K-Fold Cross Validation can be used together with CART to measure the performance of the CART model on different data, so as to avoid overfitting or underfitting the CART model. Based on the calculation of the CART algorithm and the K-Fold Cross Validation validation method (K = 1 to K = 9), the average accuracy value for training data is 98.70% and the average accuracy value for testing data is 81.16%. The results of this study have proven that the CART algorithm can be used to classify the level of fertility in men well. In addition, the classification model formed can be implemented into a mobile application (android) so that it is easy to use and understand.


Download data is not yet available.