Smart Speaker: Enhancing Any Persons' Ability to Deliver English Speeches Independently With a Web Application

Abstract

This paper introduces a state-of-the-art online application called "SMART SPEAKER" to improve English speaking abilities, especially in public speaking. The system uses Machine Learning, Deep Learning, and Natural Language Processing Techniques to evaluate the user's speech based on various aspects such as Content analysis, Flow Completeness analysis, Grammar analysis, and Facial Expressions analysis. The tool is designed to be user-friendly and simple, providing an easy and efficient solution for those looking to improve their English-speaking skills, gain confidence, and deliver well-articulated speeches. This system meets the growing demand for a practical and effective tool that can support English speakers around the world. Through "SMART SPEAKER", users can practice and improve their public speaking skills.

Country : Sri Lanka

1 Malshan E.G.C2 Buddhini B.A.D3 Isurandi I.G4 Habalakkawa W.V.K.I5 Suranjini Silva6 Thamali Kelegama

  1. Department of Information Technology, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  2. Department of Information Technology, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  3. Department of Information Technology, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  4. Department of Information Technology, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  5. Department of Computer System Engineering, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka
  6. Department of Information Technology, Sri Lanka Institute of Information Technology, Malabe, Sri Lanka

IRJIET, Volume 7, Issue 10, October 2023 pp. 66-73

doi.org/10.47001/IRJIET/2023.710009

References

  1. O. Abdel-Hamid, A. Mohamed, H. Jiang; L. Deng, G. Penn, D. Yu, Convolutional Neural Networks for Speech Recognition, Vol. 22. ACM Transactions on Audio, Speech, and Language Processing: IEEE, 2014, 1533 – 1545.
  2. A. Deshpande, R. Pandharkar, S. Deolekar, Speech Coach: A framework to evaluate and improve speech delivery. International Conference on Computer, Communication and Signal Processing (ICCCSP): IEEE, 2020.
  3. Team Ummo, Ummo, Ummoapp.herokuapp.com Ummo | Speech Coaching App, Webrate.org, 2016.
  4. A. M. Cronin, P. Lam, LikeSo App, 2017.
  5. F. Chiarello, P. Belingheri, G. Fantoni, Data science for engineering design: State of the art and future directions. Vol.129. Model from NLTK. Journal of Information Science, 2021.
  6. A. A. Pise, M. A. Alqahtani, P. Verma, Purushothama K, D. A. Karras, Prathibha S, A. Halifacorresponding, Methods for Facial Expression Recognition with Applications in Challenging Situations, Vol.2022. Comput Intell Neurosci, 2022.
  7. M. Bargaje, Emotion recognition and emotion-based classification of audio using genetic algorithm - an optimized approach. International Conference on Industrial Instrumentation and Control: IEEE, 2015.
  8. D. V. Sang, N. V. Dat, D. P. Thua, Facial expression recognition using deep convolutional neural networks. 9th International Conference on Knowledge and Systems Engineering (KSE), 2019.
  9. H. H. Al-Baity, H. J. Alshahrani, M. K. Nour, A. Yafoz, O. Alghushairy, R. Alsini, M. Othman, Computational Linguistics Based Emotion Detection and Classification Model on Social Networking Data, Vol.12. 2022.
  10. B. Tucker, M. Lehew, Exploring Public Speaking: The Free Dalton State College Public Speaking Textbook: Chapter 8: Introductions and Conclusions, 4th Edition. University System of Georgia, 2019.
  11. Public Speaking: Practice And Ethics: The Importance of an Introduction. 2012.
  12. Mueller, M. Orcholski, J. Miller, K. Woodward, M. L. Mcdonough, and L. Roth, The Basis For Strong Public Speaking: Effective Introductions and Powerful Conclusions.
  13. “O/L English Paper Marking Schema”. 2020.
  14. V. Pashek, R. H. Brookshire, Effects of Rate of Speech and Linguistic Stress on Auditory Paragraph Comprehension of Aphasic Individuals, Vol.25. Journal of Speech and Hearing Research, 1982, 377-383.
  15. Stouten, J. P. Martens, A feature-based filled pause detection system for Dutch. Workshop on Automatic Speech Recognition and Understanding: IEEE, 2003.
  16. Afroz, S. G. Koolagudi, Recognition and Classification of Pauses in Stuttered Speech Using Acoustic Features. International Conference on Signal Processing and Integrated Network: IEEE, 2019.
  17. A. Chatziagapi, D. Sgouropoulos, C. Karouzos, T. Melistas, T. Giannakopoulos, A. Katsamanis, S. Narayanan, Audio and ASR-based Filled Pause Detection. International Conference on Affective Computing and Intelligent Interaction: IEEE, 2022.
  18. M. Goto, K. Itou, S. Hayamizu, A Real-Time Filled Pause Detection System For Spontaneous Speech Recognition. 1999.
  19. M. Belz, J. Trouvain, Are ‘Silent’ Pauses Always Silent, Physics. 2019.
  20. S. P. Singh, A. Kumar, L. Singh, M. Bhargava, K. Goyal, B. Sharma, Frequency-based spell checking and rule-based grammar checking. International Conference on Electrical, Electronics, and Optimization Techniques: IEEE, 2016
  21. Dizon, Gilbert, Gayed, John M, Examining the Impact of Grammarly on the Quality of Mobile L2 Writing, Vol.17. JALT CALL Journal, 2021, 74 – 92.
  22. Murakami, N. C. Ellis, Effects of Availability, Contingency, and Formulaicity on the Accuracy of English Grammatical Morphemes in Second Language Writing, Vol.72. Empirical Study, 2022, 899-940.
  23. S. Shaumiwaty, I. Munandar, The Analysis of Grammatical and Lexical Errors on The Students' Theses of Iain Takengon, Vol.6. Journal As-Salam, 2022, 269–285.
  24. R. A. Andania, The Grammatical Accuracy, Cohesion, and Coherence of Thai Students’ English Writing at Darawithaya School, Narathiwat – Thailand, Vol.6. 2017, 10-25.
  25. C. Mellish and G. Ritchie, “The Grammatical Analysis of Sentences”, University of Aberdeen.