Aishwarya Agrawal, a PhD student at Virginia Tech, shares how her team is using NVIDIA GPUs and deep learning to automatically answer a wide range of questions about arbitrary images.
According to Agrawal and her collaborators, the system may one day help visually impaired people navigate real-world environments, for example by telling the user when it is safe to cross the street.
To learn more, try the online demo or read their research paper “VQA: Visual Question Answering”.
Share your GPU-accelerated science with us at http://nvda.ly/Vpjxr and with the world on #ShareYourScience.
Watch more scientists and researchers share how accelerated computing is benefiting their work at http://nvda.ly/X7WpH
Share Your Science: Training a Machine to Answer Questions About Images
May 13, 2016