
Details

Author(s) / Contributors
Title
Unsupervised Pre-Training for Detection Transformers
Is Part of
  • IEEE transactions on pattern analysis and machine intelligence, 2023-11, Vol.45 (11), p.1-11
Place / Publisher
New York: IEEE
Year of Publication
2023
Source
IEL
Descriptions/Notes
  • DEtection TRansformer (DETR) for object detection reaches performance competitive with Faster R-CNN via a transformer encoder-decoder architecture. However, with its transformers trained from scratch, DETR needs large-scale training data and an extremely long training schedule, even on the COCO dataset. Inspired by the great success of pre-training transformers in natural language processing, we propose a novel pretext task named random query patch detection in Unsupervised Pre-training DETR (UP-DETR). Specifically, we randomly crop patches from the given image and then feed them as queries to the decoder. The model is pre-trained to detect these query patches in the input image. During pre-training, we address two critical issues: multi-task learning and multi-query localization. (1) To trade off classification and localization preferences in the pretext task, we find that freezing the CNN backbone is a prerequisite for the success of pre-training transformers. (2) To perform multi-query localization, we develop UP-DETR with multi-query patch detection with an attention mask. In addition, UP-DETR provides a unified perspective for fine-tuning on object detection and one-shot detection tasks. In our experiments, UP-DETR significantly boosts the performance of DETR, with faster convergence and higher average precision on object detection, one-shot detection, and panoptic segmentation. Code and pre-trained models: https://github.com/dddzg/up-detr .
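  • The following is a minimal sketch (in PyTorch-style Python) of the random query patch detection pretext task described above, assuming a DETR-style model `detr` that accepts extra query-patch features, a frozen CNN `patch_encoder`, and a DETR-style set prediction loss `set_criterion`. These names and the exact interfaces are hypothetical placeholders, not the authors' actual API; see the linked repository for the reference implementation.

    import torch
    import torch.nn.functional as F

    def random_query_patch_step(detr, patch_encoder, set_criterion, images, num_patches=10):
        # One unsupervised pre-training step on a batch of unlabeled images.
        # images: tensor of shape (B, 3, H, W); no ground-truth annotations are used.
        B, _, H, W = images.shape
        targets, query_feats = [], []
        for img in images:
            boxes, feats = [], []
            for _ in range(num_patches):
                # Randomly crop a patch and record its normalized (cx, cy, w, h) box.
                ph = torch.randint(H // 8, H // 2, (1,)).item()
                pw = torch.randint(W // 8, W // 2, (1,)).item()
                y0 = torch.randint(0, H - ph, (1,)).item()
                x0 = torch.randint(0, W - pw, (1,)).item()
                patch = img[:, y0:y0 + ph, x0:x0 + pw].unsqueeze(0)
                boxes.append(torch.tensor([(x0 + pw / 2) / W, (y0 + ph / 2) / H, pw / W, ph / H]))
                # Encode the crop with the frozen CNN backbone; its feature is
                # later added to the decoder's object queries.
                with torch.no_grad():
                    patch = F.interpolate(patch, size=(128, 128))
                    feats.append(patch_encoder(patch).flatten(1))
            targets.append({"boxes": torch.stack(boxes),
                            "labels": torch.zeros(num_patches, dtype=torch.long)})
            query_feats.append(torch.cat(feats))
        # The decoder receives the patch features as extra query content and is
        # trained to localize each query patch in its source image (the attention
        # mask separating query groups is omitted here for brevity).
        outputs = detr(images, query_patches=torch.stack(query_feats))
        loss = set_criterion(outputs, targets)  # DETR-style set prediction loss
        loss.backward()
        return loss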
Language
English
Identifiers
ISSN: 0162-8828
eISSN: 1939-3539, 2160-9292
DOI: 10.1109/TPAMI.2022.3216514
Title ID: cdi_crossref_primary_10_1109_TPAMI_2022_3216514
