PV-RNN visual predictions in response to language instructions
Visual representations of the different modules of the PV-RNN model as it predicts the visual output to a language instruction. The first clips show accurate predictions in response to “put green on blue”, whereas the latter show erroneous predictions to “put blue on yellow”.
Date:
19 December 2024
Creator:
adrian-skov
Credit:
Vijayaraghavan et al., 2025
Copyright OIST (Okinawa Institute of Science and Technology Graduate University, 沖縄科学技術大学院大学). Creative Commons Attribution 4.0 International License (CC BY 4.0).