Home Mathematics Chapter 10 Drishti: a generative AI-based application for gesture recognition and execution
Chapter
Licensed
Unlicensed Requires Authentication

Chapter 10 Drishti: a generative AI-based application for gesture recognition and execution

  • Tathagata Bhattacharya , Harshavardhan Meka , Srikanth Ponaganti , Peddi Adithya Vardhan and Irshad Ali Mohammad
Become an author with De Gruyter Brill
Imaging Science
This chapter is in the book Imaging Science

Abstract

This study explores the evolution of the inclusive educational tool, now named “Drishti,” a new release of the preceding “Dishari” project. Drishti integrates current technologies, specifically hand gesture detection, and generative AI, to cater to individuals with hearing and speech impairments. Traditional engines like Google frequently overlook the unique accessibility desires of these users, developing barriers to digital engagement. Drishti bridges this gap by using machine learning algorithms and computer vision to interpret hand gestures captured via web cameras, translating them into both sign language and keyword inputs for search engines like Google and Yahoo. The updated version extends the functionality of Dishari by incorporating not only alphabet inputs but also numerical inputs (0-9), delete button gesture, and space button gesture. Generative AI further complements the quest procedure, permitting seamless query inputs through both textual content and gestures. Through an in-depth literature analysis, we list the advancements in gesture recognition and the role of generative AI in improving accessibility, marking Drishti as an enormous step toward empowering people with hearing and speech impairments, to engage with digital platforms more efficiently.

Abstract

This study explores the evolution of the inclusive educational tool, now named “Drishti,” a new release of the preceding “Dishari” project. Drishti integrates current technologies, specifically hand gesture detection, and generative AI, to cater to individuals with hearing and speech impairments. Traditional engines like Google frequently overlook the unique accessibility desires of these users, developing barriers to digital engagement. Drishti bridges this gap by using machine learning algorithms and computer vision to interpret hand gestures captured via web cameras, translating them into both sign language and keyword inputs for search engines like Google and Yahoo. The updated version extends the functionality of Dishari by incorporating not only alphabet inputs but also numerical inputs (0-9), delete button gesture, and space button gesture. Generative AI further complements the quest procedure, permitting seamless query inputs through both textual content and gestures. Through an in-depth literature analysis, we list the advancements in gesture recognition and the role of generative AI in improving accessibility, marking Drishti as an enormous step toward empowering people with hearing and speech impairments, to engage with digital platforms more efficiently.

Downloaded on 20.11.2025 from https://www.degruyterbrill.com/document/doi/10.1515/9783111436425-010/html
Scroll to top button