PCOS Detection Using Machine Learning

Abstract

Reproductive-age women worldwide suffer from metabolic problems, hormonal imbalance, and irregular menstruation due to PCOS. Complex symptoms of PCOS often lead to misdiagnosis or underdiagnosis, causing suffering and increasing the risk of obesity, diabetes, and cardiovascular disease. Treating these symptoms requires early, correct diagnosis. The project tests machine learning techniques such as Linear Regression, Ridge Regression, Lasso Regression, Random Forest Regressor, K-Nearest Neighbors, Decision Tree Regressor, and Support Vector Machines employing Mean Squared Error to address diagnostic issues. Since it handled noisy and non-linear data better, the Random Forest Regressor was best. Django-based internet applications and predictive algorithm help clinicians identify PCOS risk. The approach instantly assesses risk using BMI, age, blood pressure, and lifestyle factors. This simple method lets clinicians identify high-risk patients for rapid intervention and personalized treatment. Accuracy, scalability, and usability tests validated the system's clinical value. Finally, our machine learning-based solution will improve early PCOS identification, clinical resource use, and global women's health.

Country : India

1 G Dinesh Reddy2 Dr. M. Sireesh Kumar

  1. MCA student, Department of Computer Applications, Mohan Babu University, Tirupati, Andhra Pradesh, India
  2. Associate Professor, Department of Computer Applications, Mohan Babu University, Tirupati, Andhra Pradesh, India

IRJIET, Volume 9, Special Issue of INSPIRE’25 April 2025 pp. 364-370

doi.org/10.47001/IRJIET/2025.INSPIRE59

References

  1. S. B. M. R. Sokwala and R. Dodia, “Global approach to polycystic ovary syndrome in Africa,” in Elsevier eBooks, 2023, pp. 220–228. doi: 10.1016/b978-0-323-87932-3.00038-4. Available: https://doi.org/10.1016/b978-0-323-87932-3.00038-4.
  2. M. Thoma, J. Fledderjohann, C. Cox, and R. K. Adageba, “Biological and Social Aspects of Human Infertility: a Global perspective,” Oxford Research Encyclopedia of Global Public Health, Apr. 2019, doi: 10.1093/acrefore/9780190632366.013.184. Available: https://doi.org/10.1093/acrefore/9780190632366.013.184.
  3. I.H. Sarker, “Machine learning: algorithms, Real-World applications and research directions,” SN Computer Science, vol. 2, no. 3, Mar. 2021, doi: 10.1007/s42979-021-00592-x. Available: https://doi.org/10.1007/s42979-021-00592-x.
  4. M. Bansal, A. Raj, and A. Raj, “Comparative analysis of ML models for electricity price Forecasting,” in Lecture notes in networks and systems, 2024, pp. 551–578. doi: 10.1007/978-981-97-7710-5_42. Available: https://doi.org/10.1007/978-981-97-7710-5_42.
  5. M. Almutairi, A. A. Almutairi, and A. M. Alodhialah, “The Influence of Lifestyle Modifications on Cardiovascular Outcomes in Older Adults: Findings from a Cross-Sectional Study,” Life, vol. 15, no. 1, p. 87, Jan. 2025, doi: 10.3390/life15010087. Available: https://doi.org/10.3390/life15010087.
  6. D. Vine, M. Ghosh, T. Wang, and J. Bakal, “Increased prevalence of adverse health outcomes across the lifespan in those affected by polycystic ovary syndrome: a Canadian population cohort,” CJC Open, vol. 6, no. 2, pp. 314–326, Dec. 2023, doi: 10.1016/j.cjco.2023.12.010. Available: https://doi.org/10.1016/j.cjco.2023.12.010.
  7. C. Arslanian-Engoren, R. Gary, C. Irwin, and W. Zhang, “Chronic and other conditions that increase CVD risk,” in Springer eBooks, 2024, pp. 181–227. doi: 10.1007/978-3-031-53705-9_7. Available: https://doi.org/10.1007/978-3-031-53705-9_7.
  8. V. Gupta and P. V. Suresh, “A comprehensive review of predicting Lifestyle-Based Disease specifically PCOS among women using data mining and machine learning approaches,” Lecture Notes in Networks and Systems, pp. 419–433, Jan. 2024, doi: 10.1007/978-981-97-2089-7_37. Available: https://doi.org/10.1007/978-981-97-2089-7_37.
  9. T. Kavzoglu and A. Teke, “Predictive performances of ensemble machine learning algorithms in landslide susceptibility mapping using random forest, extreme gradient boosting (XGBOOST) and natural gradient boosting (NGBOOST),” Arabian Journal for Science and Engineering, vol. 47, no. 6, pp. 7367–7385, Jan. 2022, doi: 10.1007/s13369-022-06560-8. Available: https://doi.org/10.1007/s13369-022-06560-8.
  10. A.Alamoudi et al., “A deep learning fusion approach to diagnosis the polycystic ovary syndrome (PCOS),” Applied Computational Intelligence and Soft Computing, vol. 2023, pp. 1–15, Feb. 2023, doi: 10.1155/2023/9686697. Available: https://doi.org/10.1155/2023/9686697.
  11. K. Hussein and M. Karami, “Association between insulin resistance and abnormal menstrual cycle in Saudi females with polycystic ovary syndrome,” Saudi Pharmaceutical Journal, vol. 31, no. 6, pp. 1104–1108, Apr. 2023, doi: 10.1016/j.jsps.2023.03.021. Available: https://doi.org/10.1016/j.jsps.2023.03.021.
  12. J. H. Joloudari, A. Marefat, M. A. Nematollahi, S. S. Oyelere, and S. Hussain, “Effective Class-Imbalance learning based on SMOTE and convolutional neural networks,” Applied Sciences, vol. 13, no. 6, p. 4006, Mar. 2023, doi: 10.3390/app13064006. Available: https://doi.org/10.3390/app13064006.
  13. C. C. Yang, “Explainable artificial intelligence for predictive modeling in healthcare,” Journal of Healthcare Informatics Research, vol. 6, no. 2, pp. 228–239, Feb. 2022, doi: 10.1007/s41666-022-00114-1. Available: https://doi.org/10.1007/s41666-022-00114-1.