A journal of IEEE and CAA , publishes high-quality papers in English on original theoretical/experimental research and development in all areas of automation
Volume 7 Issue 4
Jun.  2020

IEEE/CAA Journal of Automatica Sinica

  • JCR Impact Factor: 11.8, Top 4% (SCI Q1)
    CiteScore: 17.6, Top 3% (Q1)
    Google Scholar h5-index: 77, TOP 5
Turn off MathJax
Article Contents
Jacob H. White and Randal W. Beard, "An Iterative Pose Estimation Algorithm Based on Epipolar Geometry With Application to Multi-Target Tracking," IEEE/CAA J. Autom. Sinica, vol. 7, no. 4, pp. 942-953, July 2020. doi: 10.1109/JAS.2020.1003222
Citation: Jacob H. White and Randal W. Beard, "An Iterative Pose Estimation Algorithm Based on Epipolar Geometry With Application to Multi-Target Tracking," IEEE/CAA J. Autom. Sinica, vol. 7, no. 4, pp. 942-953, July 2020. doi: 10.1109/JAS.2020.1003222

An Iterative Pose Estimation Algorithm Based on Epipolar Geometry With Application to Multi-Target Tracking

doi: 10.1109/JAS.2020.1003222
Funds:  This work was funded by the Center for Unmanned Aircraft Systems (C-UAS), a National Science Foundation Industry/University Cooperative Research Center (I/UCRC) under NSF award Numbers IIP-1161036 and CNS-1650547, along with significant contributions from C-UAS industry members
More Information
  • This paper introduces a new algorithm for estimating the relative pose of a moving camera using consecutive frames of a video sequence. State-of-the-art algorithms for calculating the relative pose between two images use matching features to estimate the essential matrix. The essential matrix is then decomposed into the relative rotation and normalized translation between frames. To be robust to noise and feature match outliers, these methods generate a large number of essential matrix hypotheses from randomly selected minimal subsets of feature pairs, and then score these hypotheses on all feature pairs. Alternatively, the algorithm introduced in this paper calculates relative pose hypotheses by directly optimizing the rotation and normalized translation between frames, rather than calculating the essential matrix and then performing the decomposition. The resulting algorithm improves computation time by an order of magnitude. If an inertial measurement unit (IMU) is available, it is used to seed the optimizer, and in addition, we reuse the best hypothesis at each iteration to seed the optimizer thereby reducing the number of relative pose hypotheses that must be generated and scored. These advantages greatly speed up performance and enable the algorithm to run in real-time on low cost embedded hardware. We show application of our algorithm to visual multi-target tracking (MTT) in the presence of parallax and demonstrate its real-time performance on a 640 × 480 video sequence captured on a UAV. Video results are available at https://youtu.be/HhK-p2hXNnU.

     

  • loading
  • [1]
    M. A. Fischler and R. C. Bolles, “Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography,” Communications of the ACM, vol. 24, no. 6, pp. 381–395, 1981. doi: 10.1145/358669.358692
    [2]
    P. J. Rousseeuw, “Least median of squares regression,” J. American Statistical Association, vol. 79, no. 388, pp. 871–880, 1984. doi: 10.1080/01621459.1984.10477105
    [3]
    D. Nistér, “An efficient solution to the five-point relative pose problem,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, no. 6, pp. 756–770, 2004. doi: 10.1109/TPAMI.2004.17
    [4]
    “OpenCV 3.1: Open source computer vision library,” [Online]. Available: https://github.com/opencv/opencv/releases/tag/3.1.0, 2015.
    [5]
    Y. Ma, J. Košecká, and S. Sastry, “Optimization criteria and geometric algorithms for motion and structure estimation,” Int. J. Computer Vision, vol. 44, no. 3, pp. 219–249, 2001. doi: 10.1023/A:1012276232049
    [6]
    T. Botterill, S. Mills, and R. Green, “Refining essential matrix estimates from RANSAC,” in Proc. Image and Vision Computing New Zealand, 2011, pp. 1–6.
    [7]
    T. Botterill, S. Mills, and R. Green, “Fast RANSAC hypothesis generation for essential matrix estimation,” in Proc. IEEE Int. Conf. Digital Image Computing Techniques and Applications, 2011, pp. 561–566.
    [8]
    U. Helmke, K. Hüper, P. Y. Lee, and J. Moore, “Essential matrix estimation using Gauss-Newton iterations on a manifold,” Int. J. Computer Vision, vol. 74, no. 2, pp. 117–136, 2007. doi: 10.1007/s11263-006-0005-0
    [9]
    M. Sarkis, K. Diepold, and K. Hüper, “A fast and robust solution to the five-point relative pose problem using Gauss-Newton optimization on a manifold,” in Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, 2007.
    [10]
    R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision. Cabridge University Press, 2003.
    [11]
    J. Shi and Tomasi, “Good features to track,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 1994.
    [12]
    S. Baker and I. Matthews, “Lucas-Kanade 20 years on: A unifying framework,” Int. J. Computer Vision, vol. 56, no. 3, pp. 221–255, 2004. doi: 10.1023/B:VISI.0000011205.11775.fd
    [13]
    S. Belongie, “Cse 252b: Computer vision II, lecture 11,” [Online]. Available: https://cseweb.ucsd.edu/classes/sp04/cse252b/notes/lec11/lec11.pdf, 2006.
    [14]
    C. Hertzberg, R. Wagner, U. Frese, and L. Schröder, “Integrating generic sensor fusion algorithms with sound state representations through encapsulation of manifolds,” Information Fusion, vol. 14, no. 1, pp. 57–77, 2013. doi: 10.1016/j.inffus.2011.08.003
    [15]
    C. Forster, L. Carlone, F. Dellaert, and D. Scaramuzza, “On-manifold preintegration for real-time visual-inertial odometry,” IEEE Trans. Robotics, vol. 33, no. 1, pp. 1–21, Fe. 2017. doi: 10.1109/TRO.2016.2597321
    [16]
    P. J. Rousseeuw and A. M. Leroy, Robust Regression and Outlier Detection. John Wiley & Sons, 2005, vol. 589.
    [17]
    V. Santhaseelan and V. K. Asari, “Moving object detection and tracking in wide area motion imagery,” in Wide Area Surveillance. Springer, 2014, pp. 49–70.
    [18]
    Y. Sheikh, O. Javed, and T. Kanade, “Background subtraction for freely moving cameras,” in Proc. IEEE 12th Int. Conf. Computer Vision, 2009, pp. 1219–1225.
    [19]
    A. Elqursh and A. Elgammal, “Online moving camera background subtraction,” in Proc. European Conf. Computer Vision. Springer, 2012, pp. 228–241.
    [20]
    J. Kang, I. Cohen, G. Medioni, and C. Yuan, “Detection and tracking of moving objects from a moving platform in presence of strong parallax,” in Proc. 10th IEEE Int. Conf. Computer Vision, vol. 1, pp. 10–17, 2005.
    [21]
    S. Dey, V. Reilly, I. Saleemi, and M. Shah, “Detection of independently moving objects in non-planar scenes via multi-frame monocular epipolar constraint,” in Proc. 12th European Conf. Computer Vision, vol. 7576, pp. 860–873, 2012.
    [22]
    P. C. Niedfeldt and R. W. Beard, “Multiple target tracking using recursive RANSAC,” in Proc. American Control Conf., Jun. 2014, pp. 3393–3398.
    [23]
    P. C. Niedfeldt, K. Ingersoll, and R. W. Beard, “Comparison and analysis of recursive-RANSAC for multiple target tracking,” IEEE Trans Aerospace and Electronic Systems, vol. 53, no. 1, pp. 461–476, Feb. 2017. doi: 10.1109/TAES.2017.2650818
    [24]
    Y. Bar-Shalom, F. Daum, and J. Huang, “The probabilistic data association filter,” IEEE Control Systems, vol. 29, no. 6, 2009.
    [25]
    “BYU holodeck: A high-fidelity simulator for deep reinforcement learning,” [Online]. Available: https://github.com/byu-pccl/holodeck, 2018.

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(10)  / Tables(4)

    Article Metrics

    Article views (1511) PDF downloads(121) Cited by()

    Highlights

    • This paper introduces a new algorithm for estimating the relative pose of a moving camera.
    • A novel optimization algorithm solves for the relative pose using the epipolar constraint.
    • Applications include multi-target tracking, visual odometry, and 3D scene reconstruction.
    • If IMU information is available, it is used to seed the pose estimation algorithm.
    • Real-time execution of the algorithm is demonstrated on an embedded flight platform.

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return