Share Your Science: Training a Machine to Answer Questions About Images

May 13, 2016

By Brad Nemire

Aishwarya Agrawal, a PhD student at Virginia Tech, shares how her team is using NVIDIA GPUs and deep learning to automatically answer a wide range of questions about arbitrary images.
According to Agrawal and her collaborators, the system may one day be used by the visually impaired to help navigate real-world environments, such as informing the user when it is safe to cross the street.

To learn more, try the online demo or read their research paper “VQA: Visual Question Answering”.
Share your GPU-accelerated science with us at http://nvda.ly/Vpjxr and with the world on #ShareYourScience.
Watch more scientists and researchers share how accelerated computing is benefiting their work at http://nvda.ly/X7WpH

About the Authors

About Brad Nemire
Brad Nemire leads the Developer Communications team at NVIDIA. Prior to NVIDIA, he worked at Arm on the Developer Relations team. Brad graduated from San Diego State University and currently resides in Silicon Valley.
