ImageNet: ImageNet Large Scale Visual Recognition Challenge, by Olga
Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean
Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein,
Alexander C. Berg, and Li Fei-Fei in International Journal of Computer Vision
December 2015, Volume 115, Issue 3, pp. 211–252.
Pascal: The Pascal Visual Object Classes (VOC) Challenge, by Mark Everingham,
Luc Van Gool, Christopher K. I. Williams, John Winn, and Andrew
Zisserman, International Journal of Computer Vision, June 2010, Volume 88,
Issue 2, pp. 303–338.
VGGNet: Very Deep Convolutional Networks for Large-Scale Image Recognition
by Karen Simonyan and Andrew Zisserman, Proc. Int. Conf. Learned
Representations, 2015. You can find a version of this here
Inception: Going Deeper with Convolutions, by Christian Szegedy, Wei Liu,
Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru
Erhan, Vincent Vanhoucke, and Andrew Rabinovich, Proc Computer Vision
and Pattern Recognition, 2015. You can find a version of this here
ResNets: Deep Residual Learning for Image Recognition by Kaiming He,
Xiangyu Zhang, Shaoqing Ren, and Jian Sun, Proc Computer Vision and
Pattern Recognition, 2015. You can find a version of this here
Object detection
Chapter 18, Applied Machine Learning D.A. Forsyth; get the e-book from the library here.
Selective search: Selective Search for Object Recognition by J. R. R. Uijlings,
K. E. A. van de Sande, T. Gevers, and A. W. M. Smeulders, International
Journal of Computer Vision September 2013, Volume 104, Issue 2, pp.
154–171.
R-CNN: Rich feature hierarchies for accurate object detection and semantic
segmentation, by R. Girshick, J. Donahue, T. Darrell, and J. Malik, IEEE
Conf. on Computer Vision and Pattern Recognition, 2014. You can find a
version of this here
Fast R-CNN: Fast R-CNN, by Ross Girshick, IEEE Int. Conf. on Computer
Vision (ICCV), 2015, pp. 1440–1448. You can find a version of this
here
Faster R-CNN: Faster R-CNN: Towards Real-Time Object Detection with
Region Proposal Networks, by Shaoqing Ren, Kaiming He, Ross Girshick,
and Jian Sun, Advances in Neural Information Processing Systems 28 (NIPS
2015). You can find a version of this here
YOLO: You Only Look Once: Unified, Real-Time Object Detection, by Joseph
Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi, Proc Computer
Vision and Pattern Recognition, 2016. You can find a version of this at
here.
There’s a home page here
Multiple View Geometry in Computer Vision
Second Edition
Richard Hartley and Andrew Zisserman,
Cambridge University Press, March 2004.
Y. Boykov, O. Veksler and R. Zabih (2001), "Fast approximate energy minimisation via graph cuts", IEEE Transactions on Pattern Analysis and Machine Intelligence, 29, 1222–1239.
Li Hong and G. Chen, "Segment-based stereo matching using graph cuts," Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., Washington, DC, USA, 2004, pp. I-I, doi: 10.1109/CVPR.2004.1315016.
R. Szeliski et al., "A Comparative Study of Energy Minimization Methods for Markov Random Fields with Smoothness-Based Priors," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 6, pp. 1068-1080, June 2008, doi: 10.1109/TPAMI.2007.70844.
T. Brox, C. Bregler and J. Malik, "Large displacement optical flow", CVPR 2009
High Accuracy Optical Flow Estimation Based on a
Theory for Warping,
Thomas Brox, Andr´es Bruhn, Nils Papenberg, and Joachim Weickert, ECCV, 2004