Aishwarya Agrawal, PhD student at Virginia Tech shares how her team is using NVIDIA GPUs and deep learning to automatically answer a wide range of questions about arbitrary images.
According to Agrawal and her collaborators, the system may one day be used by the visually impaired to help navigate real-world environments, such as informing the user when it is safe to cross the street.
To learn more, try the online demo or read their research paper “VQA: Visual Question Answering”.
Share your GPU-accelerated science with us at http://nvda.ly/Vpjxr and with the world on #ShareYourScience.
Watch more scientists and researchers share how accelerated computing is benefiting their work at http://nvda.ly/X7WpH
Share Your Science: Training a Machine to Answer Questions About Images
May 13, 2016
Discuss (0)
AI-Generated Summary
- Aishwarya Agrawal and her team are using NVIDIA GPUs and deep learning to develop a system that can automatically answer questions about any image.
- The system has the potential to assist the visually impaired in navigating real-world environments, such as determining when it is safe to cross the street.
- The team's research is documented in their paper VQA: Visual Question Answering, and an online demo is available for further exploration.
AI-generated content may summarize information incompletely. Verify important information. Learn more