Enhancing Engagement in Online Courses: Using AI-Powered Gaze Tracking Technology for Improved Eye Contact

Brief Abstract

Online instructors often struggle to maintain direct eye contact with the camera when recording videos, which reduces engagement with learners. To address this issue, this presentation proposes using Nvidia Gaze, an AI-powered software development kit (SDK) that tracks a presenter's gaze and redirects it toward the camera in real time. This method is non-intrusive, flexible, and scalable, making it ideal for both instructors and content producers. The presentation will demonstrate before-and-after results, walk through the steps required to use the SDK, and discuss its limitations and the necessary workarounds.

Extended Abstract

Imagine taking an online course where most content is delivered via pre-recorded videos. What would a participant think if the instructor on screen is looking elsewhere instead of at the camera, their gaze shifting from one place to another, seeming distracted or anxious? This scenario occurs often when online instructors prepare their content. Most of them do not have the training or experience to look consistently at the camera recording them, which is entirely understandable: most instructors are used to the in-person classroom. A standard engagement technique there is to scan the whole classroom during a lecture, addressing the entire class rather than fixing on any single point. Consequently, when instructors record in a studio setting, being on camera can be challenging, and the resulting video often lacks the direct eye contact that creates the feeling of a face-to-face conversation. As a result, instructors either avoid appearing on screen altogether or appear on screen looking distracted. Both outcomes reduce the video's capacity to engage learners.

There are methods to mitigate this problem. The most common is a teleprompter, which brings its own challenges: the script must be written ahead of time, a heavy pre-production burden for instructors who are accustomed to presenting on the fly. The method discussed in this presentation is instead non-intrusive, flexible, and scalable.

Using a software development kit (SDK) called Nvidia Gaze, video can be analyzed by an artificial intelligence (AI) engine that harnesses Nvidia's RTX line of graphics processing units (GPUs) to track the presenter's body posture, face, and gaze direction. The method is non-intrusive because it can, in real time and within certain limitations, redirect the instructor's gaze toward the camera at all times. The result is remarkably realistic and imperceptible to the audience, who sees only the processed, enhanced video. Presenters appreciate this method because it removes the burden of consciously looking straight at the camera, letting them focus entirely on their delivery.
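
As an illustration, the processing can be scripted around the SDK's command-line tooling. The Python sketch below assumes a gaze-redirection executable named GazeRedirect that accepts --in and --out flags; these names are placeholders for illustration only, and the actual executable and options should be taken from the SDK documentation.

    # Minimal sketch: run a gaze-redirection tool on one recorded lecture.
    # "GazeRedirect", "--in", and "--out" are placeholder names; the real
    # executable and flags come from the SDK's samples and may differ.
    import subprocess
    from pathlib import Path

    def redirect_gaze(source: Path, output: Path) -> None:
        """Process one video so the presenter appears to look at the camera."""
        subprocess.run(
            ["GazeRedirect", "--in", str(source), "--out", str(output)],
            check=True,  # raise an error if the tool fails
        )

    redirect_gaze(Path("lecture_raw.mp4"), Path("lecture_eye_contact.mp4"))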

Instructional designers and content producers appreciate the flexibility and scalability of this method, since existing videos can also be processed: the SDK can be run as a post-production tool to enhance pre-recorded materials. Moreover, depending on the project's needs, multiple GPUs can process existing materials in batches, as sketched below.
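
A batch run might look like the following sketch, which spreads videos across several GPUs using the standard CUDA_VISIBLE_DEVICES environment variable. The GazeRedirect executable and its flags remain placeholders, as above, and the GPU count is an assumption about the workstation.

    # Hypothetical batch sketch: distribute pre-recorded videos across GPUs.
    import os
    import subprocess
    from concurrent.futures import ThreadPoolExecutor
    from pathlib import Path

    GPU_COUNT = 2  # assumption: two RTX cards installed

    def process(job: tuple[int, Path]) -> None:
        index, video = job
        env = os.environ.copy()
        # Pin each job to one GPU, round-robin across the available cards.
        env["CUDA_VISIBLE_DEVICES"] = str(index % GPU_COUNT)
        out = video.with_name(video.stem + "_eye_contact.mp4")
        subprocess.run(
            ["GazeRedirect", "--in", str(video), "--out", str(out)],
            check=True, env=env,
        )

    videos = sorted(Path("course_videos").glob("*.mp4"))
    # One worker per GPU, so each card processes one video at a time.
    with ThreadPoolExecutor(max_workers=GPU_COUNT) as pool:
        list(pool.map(process, enumerate(videos)))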

During the presentation, we will demonstrate before-and-after results of applying this method to a one-minute video and walk through the steps we followed to use the SDK. We will close by explaining the limitations and the workarounds necessary to use this technology, which is still in its beta stage.