PV-RNN visual predictions in response to language instructions

Visual representations of the different modules of the PV-RNN model as it predicts the visual output to a language instruction. The first clips show accurate predictions in response to “put green on blue”, whereas the latter show erroneous predictions to “put blue on yellow”.

Date:
19 December 2024
Creator:
adrian-skov
Credit:
Vijayaraghavan et al., 2025
Share on: