Hierarchical concepts have proven useful in many classical and learning-based optical flow methods regarding both accuracy and robustness. In this paper we show that such concepts are still useful in the context of recent neural networks that follow RAFT’s paradigm refraining from hierarchical strategies by relying on recurrent updates based on a single-scale all-pairs transform. To this end, we introduce MS-RAFT+: a novel recurrent multi-scale architecture based on RAFT that unifies several successful hierarchical concepts. It employs a coarse-to-fine estimation to enable the use of finer resolutions by useful initializations from coarser scales. Moreover, it relies on RAFT’s correlation pyramid that allows to consider non-local cost information during the matching process. Furthermore, it makes use of advanced multi-scale features that incorporate high-level information from coarser scales. And finally, our method is trained subject to a sample-wise robust multi-scale multi-iteration loss that closely supervises each iteration on each scale, while allowing to discard particularly difficult samples. In combination with an appropriate mixed-dataset training strategy, our method performs favorably. It not only yields highly accurate results on the four major benchmarks (KITTI 2015, MPI Sintel, Middlebury and VIPER), it also allows to achieve these results with a single model and a single parameter setting. Our trained model and code are available at https://github.com/cv-stuttgart/MS_RAFT_plus.
%0 Journal Article
%1 jahedi2023msraft
%A Jahedi, Azin
%A Luz, Maximilian
%A Rivinius, Marc
%A Mehl, Lukas
%A Bruhn, Andrés
%D 2023
%I Springer
%J International Journal of Computer Vision
%K sfbtrr161 b04 2023
%P 1573-1405
%R 10.1007/s11263-023-01930-7
%T MS-RAFT+: High Resolution Multi-Scale RAFT
%U https://doi.org/10.1007/s11263-023-01930-7
%X Hierarchical concepts have proven useful in many classical and learning-based optical flow methods regarding both accuracy and robustness. In this paper we show that such concepts are still useful in the context of recent neural networks that follow RAFT’s paradigm refraining from hierarchical strategies by relying on recurrent updates based on a single-scale all-pairs transform. To this end, we introduce MS-RAFT+: a novel recurrent multi-scale architecture based on RAFT that unifies several successful hierarchical concepts. It employs a coarse-to-fine estimation to enable the use of finer resolutions by useful initializations from coarser scales. Moreover, it relies on RAFT’s correlation pyramid that allows to consider non-local cost information during the matching process. Furthermore, it makes use of advanced multi-scale features that incorporate high-level information from coarser scales. And finally, our method is trained subject to a sample-wise robust multi-scale multi-iteration loss that closely supervises each iteration on each scale, while allowing to discard particularly difficult samples. In combination with an appropriate mixed-dataset training strategy, our method performs favorably. It not only yields highly accurate results on the four major benchmarks (KITTI 2015, MPI Sintel, Middlebury and VIPER), it also allows to achieve these results with a single model and a single parameter setting. Our trained model and code are available at https://github.com/cv-stuttgart/MS_RAFT_plus.
@article{jahedi2023msraft,
  abstract       = {Hierarchical concepts have proven useful in many classical and learning-based optical flow methods regarding both accuracy and robustness. In this paper we show that such concepts are still useful in the context of recent neural networks that follow RAFT’s paradigm refraining from hierarchical strategies by relying on recurrent updates based on a single-scale all-pairs transform. To this end, we introduce MS-RAFT+: a novel recurrent multi-scale architecture based on RAFT that unifies several successful hierarchical concepts. It employs a coarse-to-fine estimation to enable the use of finer resolutions by useful initializations from coarser scales. Moreover, it relies on RAFT’s correlation pyramid that allows to consider non-local cost information during the matching process. Furthermore, it makes use of advanced multi-scale features that incorporate high-level information from coarser scales. And finally, our method is trained subject to a sample-wise robust multi-scale multi-iteration loss that closely supervises each iteration on each scale, while allowing to discard particularly difficult samples. In combination with an appropriate mixed-dataset training strategy, our method performs favorably. It not only yields highly accurate results on the four major benchmarks (KITTI 2015, MPI Sintel, Middlebury and VIPER), it also allows to achieve these results with a single model and a single parameter setting. Our trained model and code are available at https://github.com/cv-stuttgart/MS_RAFT_plus.},
  added-at       = {2024-04-12T11:29:04.000+0200},
  affiliation    = {Jahedi, A; Luz, M (Corresponding Author), Univ Stuttgart, Inst Visualizat & Interact Syst, Stuttgart, Germany. Luz, M (Corresponding Author), Univ Freiburg, Robot Learning Lab, Freiburg, Germany. Jahedi, Azin; Luz, Maximilian; Mehl, Lukas; Bruhn, Andres, Univ Stuttgart, Inst Visualizat & Interact Syst, Stuttgart, Germany. Luz, Maximilian, Univ Freiburg, Robot Learning Lab, Freiburg, Germany. Rivinius, Marc, Univ Stuttgart, Inst Informat Secur, Stuttgart, Germany.},
  author         = {Jahedi, Azin and Luz, Maximilian and Rivinius, Marc and Mehl, Lukas and Bruhn, Andrés},
  biburl         = {https://puma.ub.uni-stuttgart.de/bibtex/222e991e9ac2eb1409536c7dc30b6abd3/sfbtrr161},
  doi            = {10.1007/s11263-023-01930-7},
  interhash      = {febb53cee99d4c2b391ac4dbe83309cc},
  intrahash      = {22e991e9ac2eb1409536c7dc30b6abd3},
  issn           = {0920-5691, 1573-1405},
  journal        = {International Journal of Computer Vision},
  keywords       = {sfbtrr161 b04 2023},
  publisher      = {Springer},
  pubstate       = {prepublished},
  research-areas = {Computer Science},
  timestamp      = {2024-04-12T11:29:04.000+0200},
  title          = {{MS-RAFT+}: High Resolution Multi-Scale {RAFT}},
  unique-id      = {WOS:001126025000002},
  url            = {https://doi.org/10.1007/s11263-023-01930-7},
  internal-note  = {pages field removed during review: exported value 1573-1405 duplicated the electronic ISSN, not a page range; entry is online-first (see pubstate) -- TODO add real pages/volume once assigned},
  year           = {2023}
}