Abstract
In this paper, we address the problem of long-term visual tracking where the target objects undergo signifificant appearance variation due to deformation, abrupt motion, heavy occlusion and out-of-view. In this setting, we decompose the task of tracking into translation and scale estimation of objects. We show that the correlation between temporal context considerably improves the accuracy and reliability for translation estimation, and it is effective to learn discriminative correlation fifilters from the most confifi- dent frames to estimate the scale change. In addition, we train an online random fern classififier to re-detect objects in case of tracking failure. Extensive experimental results on large-scale benchmark datasets show that the proposed algorithm performs favorably against state-of-the-art methods in terms of effificiency, accuracy, and robustness.