dc.contributor.author: Hesse, Robin
dc.contributor.author: Schaub-Meyer, Simone
dc.contributor.author: Roth, Stefan
dc.date.accessioned: 2025-04-03T14:31:17Z
dc.date.available: 2025-04-03T14:31:17Z
dc.date.issued: 2023-06
dc.identifier.uri: https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4534
dc.description: Many convolutional neural networks (CNNs) rely on progressive downsampling of their feature maps to increase the network's receptive field and decrease computational cost. However, this comes at the price of losing granularity in the feature maps, limiting the ability to correctly understand images or recover fine detail in dense prediction tasks. To address this, common practice is to replace the last few downsampling operations in a CNN with dilated convolutions, which retains the feature map resolution without reducing the receptive field, albeit at increased computational cost. This makes it possible to trade off predictive performance against cost, depending on the output feature resolution. By either regularly downsampling or not downsampling the entire feature map, existing work implicitly treats all regions of the input image and subsequent feature maps as equally important, which generally does not hold. We propose an adaptive downsampling scheme that generalizes the above idea by processing informative regions at a higher resolution than less informative ones. In a variety of experiments, we demonstrate the versatility of our adaptive downsampling strategy and empirically show that it improves the cost-accuracy trade-off of various established CNNs. [de_DE]
dc.language.iso: en [de_DE]
dc.relation: IsDescribedBy;arXiv;2305.09504
dc.rights: Apache License 2.0
dc.rights.uri: https://www.apache.org/licenses/LICENSE-2.0
dc.subject: deep learning [de_DE]
dc.subject: semantic segmentation [de_DE]
dc.subject: keypoint estimation [de_DE]
dc.subject: efficiency [de_DE]
dc.subject.classification: 4.43-05 Image and Language Processing, Computer Graphics and Visualisation, Human Computer Interaction, Ubiquitous and Wearable Computing [de_DE]
dc.subject.ddc: 004
dc.title: Content-Adaptive Downsampling in Convolutional Neural Networks [de_DE]
dc.type: Software [de_DE]
dc.type: Model [de_DE]
tud.project: EC/H2020 | 866008 | RED [de_DE]
tud.project: HMWK | 500/10.001-(00012) | TAM - TP Roth [de_DE]
tud.project: HMWK | 519/03/06.001-(0010) | WhiteBox - TP Roth [de_DE]
tud.unit: TUDa




Unless otherwise indicated, this item is licensed under: Apache License 2.0