DETR ResNet-50 DC5

DETR with dilated ResNet-50 backbone for improved small object detection

DETR ResNet-50 DC5 (dilated C5 stage) modifies the standard DETR architecture by removing the stride in the final (C5) stage of the ResNet-50 backbone and using dilated convolutions there instead. This doubles the spatial resolution of the backbone's output feature map, which gives the transformer finer spatial information, improves localization, and improves small object detection while keeping the transformer-based detection approach unchanged.
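
To make the resolution change concrete, the sketch below compares the final feature-map size of a standard ResNet-50 backbone (assumed output stride 32) with the DC5 variant (assumed output stride 16). The function names and the 800x1216 input size are illustrative assumptions, not part of any library, and the sizes are approximations that ignore padding details.

```python
import math

def feature_map_size(height, width, output_stride):
    """Approximate spatial size (rows, cols) of the last backbone feature map."""
    return math.ceil(height / output_stride), math.ceil(width / output_stride)

def encoder_tokens(height, width, output_stride):
    """Number of positions the transformer encoder attends over."""
    rows, cols = feature_map_size(height, width, output_stride)
    return rows * cols

# Hypothetical input resolution, chosen so both strides divide it evenly.
HEIGHT, WIDTH = 800, 1216

standard_tokens = encoder_tokens(HEIGHT, WIDTH, output_stride=32)  # assumed standard stride
dc5_tokens = encoder_tokens(HEIGHT, WIDTH, output_stride=16)       # assumed DC5 stride

token_ratio = dc5_tokens / standard_tokens
# Self-attention compares every token with every other token,
# so its cost grows with the square of the token count.
attention_cost_ratio = token_ratio ** 2

print(f"standard: {standard_tokens} tokens, DC5: {dc5_tokens} tokens")
print(f"token ratio: {token_ratio:.1f}x, self-attention cost ratio: {attention_cost_ratio:.1f}x")
```

Under these assumptions the encoder sees four times as many positions with DC5, which is where the extra memory and compute come from.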

When to Use DETR ResNet-50 DC5

Use DETR ResNet-50 DC5 when:

  • Your dataset contains many small objects (smaller than 32x32 pixels)
  • You need better localization accuracy than standard DETR provides
  • You have sufficient GPU memory (higher-resolution features require more memory)
  • Standard DETR misses small objects in your images
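
One way to check whether a dataset is dominated by small objects is to count annotations below the 32x32 pixel threshold. The sketch below assumes COCO-format annotations with `area` and `bbox` (`[x, y, width, height]`) fields; the function names and the sample data are hypothetical.

```python
import json

SMALL_AREA = 32 * 32  # objects below this pixel area count as "small"

def small_object_fraction(coco):
    """Return the fraction of annotations whose area is below SMALL_AREA."""
    annotations = coco.get("annotations", [])
    if not annotations:
        return 0.0
    small = 0
    for ann in annotations:
        area = ann.get("area")
        if area is None:
            # Fall back to the bounding-box area when no area is stored.
            _, _, box_w, box_h = ann["bbox"]
            area = box_w * box_h
        if area < SMALL_AREA:
            small += 1
    return small / len(annotations)

def small_object_fraction_from_file(path):
    """Same check, reading a COCO-format JSON file from disk."""
    with open(path, "r", encoding="utf-8") as f:
        return small_object_fraction(json.load(f))

# Tiny in-memory example (made-up boxes, for illustration only).
sample = {
    "annotations": [
        {"bbox": [10, 10, 20, 20], "area": 400},     # small
        {"bbox": [50, 50, 100, 80], "area": 8000},   # not small
        {"bbox": [5, 5, 30, 30]},                    # small, area derived from bbox
        {"bbox": [0, 0, 200, 150], "area": 30000},   # not small
    ]
}
fraction = small_object_fraction(sample)
print(f"{fraction:.0%} of annotations are small objects")
```

What fraction counts as "many" is a judgment call that depends on the dataset and on how much the small classes matter.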

Strengths

  • Better small object detection than standard DETR (3-5% mAP improvement on small objects)
  • Higher resolution features through dilated convolutions
  • Improved localization with finer spatial information
  • Same transformer architecture benefits as standard DETR

Weaknesses

  • Higher memory usage than standard DETR ResNet-50
  • Slower training due to higher resolution processing
  • Slower inference (1.5-2x slower than standard DETR)
  • May overfit on small datasets due to increased capacity

Parameters

Training Configuration

  • Training Images: Folder with images
  • Annotations: COCO-format JSON
  • Batch Size (Default: 2): Range 1-4 (memory-intensive)
  • Epochs (Default: 1): Range 1-8
  • Learning Rate (Default: 5e-5)
  • Eval Steps (Default: 1)
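
The defaults and ranges above can be captured in a small validated configuration object. This is a minimal sketch: the class name, field names, and example paths are hypothetical and do not correspond to an actual API of the training tool. Only the default values and ranges come from the parameter list.

```python
from dataclasses import dataclass

@dataclass
class TrainingConfig:
    """Hypothetical container for the documented training parameters."""
    images_dir: str            # folder with training images
    annotations_path: str      # COCO-format JSON file
    batch_size: int = 2        # documented range: 1-4 (memory-intensive)
    epochs: int = 1            # documented range: 1-8
    learning_rate: float = 5e-5
    eval_steps: int = 1

    def __post_init__(self):
        # Reject values outside the documented ranges early.
        if not 1 <= self.batch_size <= 4:
            raise ValueError(f"batch_size must be in 1-4, got {self.batch_size}")
        if not 1 <= self.epochs <= 8:
            raise ValueError(f"epochs must be in 1-8, got {self.epochs}")
        if self.learning_rate <= 0:
            raise ValueError("learning_rate must be positive")
        if self.eval_steps < 1:
            raise ValueError("eval_steps must be at least 1")

# Example with placeholder paths; all other fields use the documented defaults.
config = TrainingConfig(
    images_dir="data/images",
    annotations_path="data/annotations.json",
)
print(config)
```

Validating at construction time surfaces an out-of-range batch size before a memory-intensive run starts.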

Configuration Tips

  • Use when standard DETR ResNet-50 misses small objects
  • batch_size=2 is typical even on a 16 GB GPU because of the higher memory usage
  • Particularly effective for small object classes
  • Monitor memory usage closely

Expected Performance

  • 3-5% better mAP on small objects vs standard DETR ResNet-50
  • Overall mAP is typically 1-2% better than standard DETR ResNet-50
  • Trade-off: slower inference for better small object detection

Comparison with Alternatives

vs Standard DETR ResNet-50: Choose DC5 when small objects are critical; choose standard DETR when faster inference matters more.

vs Deformable DETR: Deformable DETR is generally better overall, but DC5 has a simpler architecture and suits cases where you want standard DETR with better small object handling.

