This paper presents the design, implementation, and pedagogical evaluation of the ESP32 AI Tutor – a low cost, voice controlled embedded system that serves as an interface to cloud based large language models for personalised education. Using an ESP32 S3 microcontroller with I2S digital microphone and amplifier, the system captures audio, streams it to AssemblyAI for speech recognition, sends the transcribed text to Groq LLM, and synthesises the response via Google TTS. Four design principles for educational embedded systems are formulated: hardware autonomy, sensory integration, network resilience, and low latency turn taking. The open source implementation is validated through two realistic scenarios – individual adaptive tutoring and group work in a STEM laboratory – demonstrating that the device provides Socratic questioning, adaptive difficulty adjustment, and real time visual feedback without requiring a PC or smartphone. Experimental results show a total response latency of 0.5–1 s, satisfying natural conversation requirements. The reproducible blueprint and complete code are publicly available on GitHub.
Keywords
Embedded SystemsAI TutorVoice InterfacePersonalised LearningESP32Large Language Models (Llms)Edge ComputingSpeech RecognitionText to SpeechSTEM Education.
References
B. S. Bloom, "The 2 Sigma Problem: The Search for Methods of Group Instruction as Effective as One-to-One Tutoring," Educational Researcher, vol. 13, no. 6, pp. 4-16, 1984, [Online]. Available: https://doi.org/10.3102/0013189X013006004.
T. B. Brown et al., "Language Models are Few-Shot Learners," in Advances in Neural Information Processing Systems (NeurIPS 2020), vol. 33, pp. 1877-1901, 2020, [Online]. Available: https://doi.org/10.48550/arXiv.2005.14165.
J. Sweller, "Cognitive Load During Problem Solving: Effects on Learning," Cognitive Science, vol. 12, no. 2, pp. 257-285, 1988, [Online]. Available: https://doi.org/10.1016/0364-0213(88)90023-7.
M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. Katz, A. Konwinski, G. Lee, D. Patterson, A. Rabkin, I. Stoica, and M. Zaharia, "A view of cloud computing," Communications of the ACM, vol. 53, no. 4, pp. 50-58, 2010, [Online]. Available: https://doi.org/10.1145/1721654.1721672.
M. Satyanarayanan, "The emergence of edge computing," Computer, vol. 50, no. 1, pp. 30-39, 2017, doi: 10.1109/MC.2017.9.
W. Shi, J. Cao, Q. Zhang, Y. Li, and L. Xu, "Edge computing: Vision and challenges," IEEE Internet of Things Journal, vol. 3, no. 5, pp. 637-646, 2016, [Online]. Available: https://doi.org/10.1109/JIOT.2016.2579198.
S. Papert, Mindstorms: Children, Computers, and Powerful Ideas. New York: Basic Books, 1980.
W. Wolf, Computers as Components: Principles of Embedded Computing System Design, 3rd ed. Morgan Kaufmann, 2012.
S. Petrovych and S. D. Horvath, "ESP32 AI companion," GitHub repository, 2026, [Online]. Available: https://github.com/HorvathStefanDaniel/esp32AICompanion.
B. J. Zimmerman, "Becoming a self-regulated learner: An overview," Theory into Practice, vol. 41, no. 2, pp. 64-70, 2002, doi: 10.1207/s15430421tip4102_2.
O. Soia, M. Kovtoniuk, O. Kosovets, and S. Petrovych, "Project-Based Learning as an Integration of Critical Thinking and Teamwork Skills of Future Teachers of Mathematics and Computer Science," in ITEST 2024, Springer, 2024, [Online]. Available: https://doi.org/10.1007/978-3-031-71804-5_26.
R. Luckin, W. Holmes, M. Griffiths, and L. B. Forcier, Intelligence Unleashed: An Argument for AI in Education. Pearson Education, 2016.