Visual Learning and Recognition: Difference between revisions

167 bytes added, 28 October 2020

@@ Line 645: / Line 645: @@
 # There are 5 conv blocks, followed by two 4-dir IRNN blocks which extract context features.
 # The whole image passes through this entire network.
+IRNN
+* This is a set of four RNNs that sweep across the image, one in each direction: up, down, left, and right.
+* The outputs of the four RNNs are concatenated, yielding a feature map with the same spatial shape (height and width) as the input.
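
The two IRNN bullets above can be sketched in NumPy as follows. This is a minimal illustration, not the network's actual implementation: the function names `sweep` and `four_dir_irnn` are hypothetical, and the code assumes the usual IRNN reading (a ReLU RNN whose recurrent weights start as the identity matrix) with no learned input projection. Under these assumptions the concatenation keeps the height and width but stacks the four outputs along the channel axis.

```python
import numpy as np


def sweep(x, w, axis, reverse):
    """Run one ReLU RNN along `axis` (1 = rows, 2 = columns) of a (C, H, W) map.

    Update rule (assumed): h_t = relu(w @ h_{t-1} + x_t).
    """
    x = np.asarray(x, dtype=float)
    n = x.shape[axis]
    out = np.zeros_like(x)
    # Hidden state for every position orthogonal to the sweep direction.
    h = np.zeros_like(np.take(x, 0, axis=axis))
    steps = range(n - 1, -1, -1) if reverse else range(n)
    for t in steps:
        x_t = np.take(x, t, axis=axis)      # slice at step t, shape (C, other)
        h = np.maximum(w @ h + x_t, 0.0)    # recurrent update with ReLU
        if axis == 1:
            out[:, t, :] = h
        else:
            out[:, :, t] = h
    return out


def four_dir_irnn(x, weights=None):
    """Sweep down, up, right, and left, then concatenate along channels."""
    c = x.shape[0]
    if weights is None:
        # Identity initialisation: the "I" in IRNN.
        weights = [np.eye(c) for _ in range(4)]
    directions = [(1, False), (1, True), (2, False), (2, True)]
    outs = [sweep(x, w, axis, rev) for w, (axis, rev) in zip(weights, directions)]
    return np.concatenate(outs, axis=0)     # shape (4 * C, H, W)


if __name__ == "__main__":
    features = np.ones((2, 3, 4))           # toy (C, H, W) feature map
    context = four_dir_irnn(features)
    print(context.shape)
```

With identity weights and non-negative inputs, each sweep reduces to a running sum in its direction, so every output position accumulates information from all positions behind it along that sweep. That is one way to see how the block gathers context from across the whole image.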
 * For each RoI identified using object proposals: