Visual Learning and Recognition: Difference between revisions

Line 495: Line 495:
{{reflist|refs=
{{reflist|refs=
<ref name="torralba2008tinyimages">Antonio Torralba, Rob Fergus and William T. Freeman (2008). 80 million tiny images: a large dataset for
<ref name="torralba2008tinyimages">Antonio Torralba, Rob Fergus and William T. Freeman (2008). 80 million tiny images: a large dataset for
non-parametric object and scene recognition (PAMI 2008) [https://people.csail.mit.edu/torralba/publications/80millionImages.pdf https://people.csail.mit.edu/torralba/publications/80millionImages.pdf]</ref>
non-parametric object and scene recognition (PAMI 2008) [https://people.csail.mit.edu/torralba/publications/80millionImages.pdf Link]</ref>
<ref name="standing1973learning">Lionel Standing (1973). Learning 10000 pictures. ''Journal
<ref name="standing1973learning">Lionel Standing (1973). Learning 10000 pictures. ''Journal
Quarterly Journal of Experimental Psychology'' [https://www.tandfonline.com/doi/abs/10.1080/14640747308400340 https://www.tandfonline.com/doi/abs/10.1080/14640747308400340]</ref>
Quarterly Journal of Experimental Psychology'' [https://www.tandfonline.com/doi/abs/10.1080/14640747308400340 Link]</ref>
<ref name="brady2008visual">Timothy F. Brady, Talia Konkle, George A. Alvarez, and Aude Oliva (2008). Visual long-term memory has a massive storage capacity for object details. [http://olivalab.mit.edu/MM/pdfs/BradyKonkleAlvarezOliva2008.pdf http://olivalab.mit.edu/MM/pdfs/BradyKonkleAlvarezOliva2008.pdf].</ref>
<ref name="brady2008visual">Timothy F. Brady, Talia Konkle, George A. Alvarez, and Aude Oliva (2008). Visual long-term memory has a massive storage capacity for object details. [http://olivalab.mit.edu/MM/pdfs/BradyKonkleAlvarezOliva2008.pdf Link].</ref>
<ref name="torralba2011unbiased>Antonio Torralba, Alexei A. Efros (2011). Unbiased Look at Dataset Bias (CVPR 2011) [https://people.csail.mit.edu/torralba/publications/datasets_cvpr11.pdf https://people.csail.mit.edu/torralba/publications/datasets_cvpr11.pdf]</ref>
<ref name="torralba2011unbiased>Antonio Torralba, Alexei A. Efros (2011). Unbiased Look at Dataset Bias (CVPR 2011) [https://people.csail.mit.edu/torralba/publications/datasets_cvpr11.pdf Link]</ref>
<ref name="dale2009restoration">Kevin Dale, Micah K. Johnson, Kalyan Sunkavalli, Wojciech Matusik, Hanspeter Pfister (2009) Image Restoration using Online Photo Collections (ICCV 2009) [https://faculty.idc.ac.il/arik/seminar2010/papers/ImageRestoration/restoration_iccv09.pdf https://faculty.idc.ac.il/arik/seminar2010/papers/ImageRestoration/restoration_iccv09.pdf]</ref>
<ref name="dale2009restoration">Kevin Dale, Micah K. Johnson, Kalyan Sunkavalli, Wojciech Matusik, Hanspeter Pfister (2009) Image Restoration using Online Photo Collections (ICCV 2009) [https://faculty.idc.ac.il/arik/seminar2010/papers/ImageRestoration/restoration_iccv09.pdf Link]</ref>
<ref name="heys2007scene">James Hays, Alexei A. Efros (2007). Scene Completion Using Millions of Photographs (SIGGRAPH 2007) [http://graphics.cs.cmu.edu/projects/scene-completion/scene-completion.pdf http://graphics.cs.cmu.edu/projects/scene-completion/scene-completion.pdf]</ref>
<ref name="heys2007scene">James Hays, Alexei A. Efros (2007). Scene Completion Using Millions of Photographs (SIGGRAPH 2007) [http://graphics.cs.cmu.edu/projects/scene-completion/scene-completion.pdf Link]</ref>
<ref name="heys2008gps">James Hays, Alexei A. Efros (2008). IM2GPS: estimating geographic information from a single image. (CVPR 2008) [http://graphics.cs.cmu.edu/projects/im2gps/im2gps.pdf http://graphics.cs.cmu.edu/projects/im2gps/im2gps.pdf]</ref>
<ref name="heys2008gps">James Hays, Alexei A. Efros (2008). IM2GPS: estimating geographic information from a single image. (CVPR 2008) [http://graphics.cs.cmu.edu/projects/im2gps/im2gps.pdf Link]</ref>
<ref name="kaneva2008matching">Biliana Kaneva, Josef Sivic, Antonio Torralba, Shai Avidan, William T. Freeman (2008). Matching and Predicting Street Level Images (ECCV Workshops 2008) [https://people.csail.mit.edu/biliana/papers/eccv2010/eccv_workshop_2010.pdf https://people.csail.mit.edu/biliana/papers/eccv2010/eccv_workshop_2010.pdf]</ref>
<ref name="kaneva2008matching">Biliana Kaneva, Josef Sivic, Antonio Torralba, Shai Avidan, William T. Freeman (2008). Matching and Predicting Street Level Images (ECCV Workshops 2008) [https://people.csail.mit.edu/biliana/papers/eccv2010/eccv_workshop_2010.pdf Link]</ref>
<ref name="xie2018rethinking">Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, and Kevin Murphy (2018). Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification (ECCV 2018) [https://arxiv.org/pdf/1712.04851.pdf https://arxiv.org/pdf/1712.04851.pdf]</ref>
<ref name="xie2018rethinking">Saining Xie, Chen Sun, Jonathan Huang, Zhuowen Tu, and Kevin Murphy (2018). Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification (ECCV 2018) [https://arxiv.org/pdf/1712.04851.pdf Link]</ref>
<ref name="huang2018densenet">Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger (2017). Densely Connected Convolutional Networks (CVPR 2017) [https://arxiv.org/pdf/1608.06993.pdf Link]</ref>
<ref name="huang2018densenet">Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger (2017). Densely Connected Convolutional Networks (CVPR 2017) [https://arxiv.org/pdf/1608.06993.pdf Link]</ref>
<ref name="krizhevsky2012alexnet">Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton (2012) ImageNet Classification with Deep Convolutional
<ref name="krizhevsky2012alexnet">Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton (2012) ImageNet Classification with Deep Convolutional
Line 511: Line 511:
<ref name="liu2016ssd">Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg (2016) SSD: Single Shot MultiBox Detector [https://arxiv.org/abs/1512.02325 Link]</ref>
<ref name="liu2016ssd">Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg (2016) SSD: Single Shot MultiBox Detector [https://arxiv.org/abs/1512.02325 Link]</ref>
<ref name="redmon2016yolo">Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi (2016) You Only Look Once: Unified, Real-Time Object Detection [https://pjreddie.com/media/files/papers/yolo.pdf Link]</ref>
<ref name="redmon2016yolo">Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi (2016) You Only Look Once: Unified, Real-Time Object Detection [https://pjreddie.com/media/files/papers/yolo.pdf Link]</ref>
<ref name="shotton2009texton">Jamie Shotton John Winn Carsten Rother Antonio Criminisi (2009) TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context. [https://www.microsoft.com/en-us/research/publication/textonboost-for-image-understanding-multi-class-object-recognition-and-segmentation-by-jointly-modeling-texture-layout-and-context/ Link]
<ref name="shotton2009texton">Jamie Shotton John Winn Carsten Rother Antonio Criminisi (2009) TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context. [https://www.microsoft.com/en-us/research/publication/textonboost-for-image-understanding-multi-class-object-recognition-and-segmentation-by-jointly-modeling-texture-layout-and-context/ Link]</ref>
}}
}}