TY - GEN
T1 - High-quality stereo video matching via user interaction and space-time propagation
AU - Zhang, Chenxi
AU - Price, Brian
AU - Cohen, Scott
AU - Yang, Ruigang
PY - 2013
Y1 - 2013
N2 - Even current state-of-the-art automatic stereo matching methods often struggle on natural images and videos, in large part due to fundamental matching ambiguities in low-texture regions and a lack of higher-level object knowledge. Stereo image matching can benefit greatly from user input to guide the matching process and help disambiguate matches. Applying interactive correction tools from scratch on each frame of a video would not only throw away valuable information provided by the user on other frames, but would also likely be too time-consuming to be practical for video, even if excellent disparity results could be obtained within a few minutes on each frame. In this work, we propose a stereo video matching system that allows user interaction to obtain high-quality, dense disparity maps on key frames and then intelligently propagates the user input and key frame disparities to automatically produce high-quality disparity maps on intermediate frames. The disparity maps on key frames are obtained using several novel, easy-to-use, and effective interactive tools. Our novel propagation algorithm estimates 3D transformations that map user-corrected areas in key frames to intermediate frames. Experiments demonstrate the effectiveness and efficiency of our hybrid interactive/automatic approach.
AB - Even current state-of-the-art automatic stereo matching methods often struggle on natural images and videos, in large part due to fundamental matching ambiguities in low-texture regions and a lack of higher-level object knowledge. Stereo image matching can benefit greatly from user input to guide the matching process and help disambiguate matches. Applying interactive correction tools from scratch on each frame of a video would not only throw away valuable information provided by the user on other frames, but would also likely be too time-consuming to be practical for video, even if excellent disparity results could be obtained within a few minutes on each frame. In this work, we propose a stereo video matching system that allows user interaction to obtain high-quality, dense disparity maps on key frames and then intelligently propagates the user input and key frame disparities to automatically produce high-quality disparity maps on intermediate frames. The disparity maps on key frames are obtained using several novel, easy-to-use, and effective interactive tools. Our novel propagation algorithm estimates 3D transformations that map user-corrected areas in key frames to intermediate frames. Experiments demonstrate the effectiveness and efficiency of our hybrid interactive/automatic approach.
UR - http://www.scopus.com/inward/record.url?scp=84886073415&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84886073415&partnerID=8YFLogxK
U2 - 10.1109/3DV.2013.18
DO - 10.1109/3DV.2013.18
M3 - Conference contribution
AN - SCOPUS:84886073415
SN - 9780769550671
T3 - Proceedings - 2013 International Conference on 3D Vision, 3DV 2013
SP - 71
EP - 78
BT - Proceedings - 2013 International Conference on 3D Vision, 3DV 2013
T2 - 2013 International Conference on 3D Vision, 3DV 2013
Y2 - 29 June 2013 through 1 July 2013
ER -