TUdatalib : Dense Unsupervised Learning for Video Segmentation

Count of file(s): 3

dense-ulearn-vos-code.zip (Training and inference code (PyTorch)) (7.654MB)

snapshots.tar.gz (Model parameters (snapshots)) (402.4MB)

results.tar.gz (Inference results) (8.377GB)

Date

2021-12

Author

Araslanov, Nikita

Schaub-Meyer, Simone

Roth, Stefan

Type

Software

Description

We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows dense feature representations to be learned directly in a fully convolutional regime. We rely on uniform grid sampling to extract a set of anchors and train our model to disambiguate between them on both inter- and intra-video levels. However, a naive scheme to train such a model results in a degenerate solution. We propose to prevent this with a simple regularisation scheme, accommodating the equivariance property of the segmentation task to similarity transformations. Our training objective admits efficient implementation and exhibits fast training convergence. On established VOS benchmarks, our approach exceeds the segmentation accuracy of previous work despite using significantly less training data and compute power.
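
To illustrate the uniform grid sampling of anchors mentioned in the description, the following is a minimal sketch in plain Python. It is not taken from the published code: the function name `uniform_grid_anchors`, its parameters, and the choice to place one anchor at the centre of each grid cell are assumptions made for illustration, and the actual implementation in dense-ulearn-vos-code.zip may sample differently.

```python
def uniform_grid_anchors(height, width, grid_size):
    """Return (row, col) anchor positions on a grid_size x grid_size uniform grid.

    Hypothetical helper for illustration only; it assumes one anchor at the
    centre of each equally sized cell of a height x width feature map.
    """
    if grid_size <= 0 or grid_size > min(height, width):
        raise ValueError("grid_size must be between 1 and min(height, width)")
    # Centre of the i-th cell along each axis, truncated to an integer index.
    rows = [int((i + 0.5) * height / grid_size) for i in range(grid_size)]
    cols = [int((j + 0.5) * width / grid_size) for j in range(grid_size)]
    # Cartesian product of row and column positions gives the anchor set.
    return [(r, c) for r in rows for c in cols]


anchors = uniform_grid_anchors(8, 8, 2)
print(anchors)
```

In a training setup of the kind the description outlines, the features at such positions would serve as the anchors that the model learns to tell apart, both within one video and across videos.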

Related Resources

Is described by: arXiv:2111.06265

Collections

Segmentation [4]

The following license files are associated with this item:

License description

Except where otherwise noted, this item's license is described as Apache License 2.0

Version	Item	Description version	Date	Summary
2	tudatalib/3365.2*		2022-01-04T16:58:21Z	Adding third party funding
1	tudatalib/3365		2021-12-22T11:09:23Z
