TUdatalib : Single-stage Semantic Segmentation from Image Labels

Am Montag, 7.4.2025 wird TUdatalib wegen geplanten Wartungsarbeiten am Speichersystem von 9:00 bis voraussichtlich 9:30 nur eingeschränkt nutzbar sein (kein Datenupload und Download) | Due to scheduled maintenance on the storage system, using TUdatalib will be limited on Monday, April 7 2025 from 9:00 to approx. 9:30 (no data upload or download)

Zur Kurzanzeige

dc.contributor.advisor
dc.contributor.author	Araslanov, Nikita
dc.contributor.author	Roth, Stefan
dc.date.accessioned	2021-12-22T11:09:34Z
dc.date.available	2021-12-22T11:09:34Z
dc.date.issued	2020-06
dc.identifier.uri	https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/3367
dc.description	Recent years have seen a rapid growth in new approaches improving the accuracy of semantic segmentation in a weakly supervised setting, i.e. with only image-level labels available for training. However, this has come at the cost of increased model complexity and sophisticated multi-stage training procedures. This is in contrast to earlier work that used only a single stage − training one segmentation network on image labels − which was abandoned due to inferior segmentation accuracy. In this work, we first define three desirable properties of a weakly supervised method: local consistency, semantic fidelity, and completeness. Using these properties as guidelines, we then develop a segmentation-based network model and a self-supervised training scheme to train for semantic masks from image-level annotations in a single stage. We show that despite its simplicity, our method achieves results that are competitive with significantly more complex pipelines, substantially outperforming earlier single-stage methods.	de_DE
dc.language.iso	en	de_DE
dc.relation	IsDescribedBy;arXiv;2005.08104
dc.rights	Apache License 2.0
dc.rights.uri	https://www.apache.org/licenses/LICENSE-2.0
dc.subject	weak supervision	de_DE
dc.subject	semantic segmentation	de_DE
dc.subject.classification	4.43-04 Künstliche Intelligenz und Maschinelle Lernverfahren	de_DE
dc.subject.classification	4.43-05 Bild- und Sprachverarbeitung, Computergraphik und Visualisierung, Human Computer Interaction, Ubiquitous und Wearable Computing
dc.subject.ddc	004
dc.title	Single-stage Semantic Segmentation from Image Labels	de_DE
dc.type	Software	de_DE
tud.unit	TUDa
tud.history.classification	Version=2016-2020;409-05 Interaktive und intelligente Systeme, Bild- und Sprachverarbeitung, Computergraphik und Visualisierung

Dateien zu dieser Ressource

Name:: 1-stage-wseg-code.zip
Größe:: 6.895MB
Format:: application/zip
Beschreibung:: Training and inference code ...

Anzahl der Dateien

Name:: snapshots.zip
Größe:: 862.3MB
Format:: application/zip
Beschreibung:: Model parameters (snapshots)

Anzahl der Dateien

Name:: results.zip
Größe:: 4.076GB
Format:: application/zip
Beschreibung:: Inference results

Anzahl der Dateien

Name:: license_Apache-License.txt
Größe:: 11.09KB
Format:: Text

Anzahl der Dateien

Der Datensatz erscheint in:

Segmentation [8]

Zur Kurzanzeige

Solange nicht anders angezeigt, wird die Lizenz wie folgt beschrieben: Apache License 2.0