Visual Learning and Recognition: Difference between revisions

Line 645: Line 645:
# There are 5 conv blocks, followed by two 4-dir IRNN blocks which extract context features.
# There are 5 conv blocks, followed by two 4-dir IRNN blocks which extract context features.
# The whole image passes through this entire network.
# The whole image passes through this entire network.
IRNN
* This is a set of 4 RNNs which move across the image. Up, down, left, right.
* The outputs of each RNN are concatenated, yielding an image with the same shape.


* For each RoI identified using object proposals:
* For each RoI identified using object proposals: