본문 바로가기

Data-science/deep learning

[논문 읽기] EfficientDet: Scalable and Efficient Object Detection

728x90

Challenge 1: efficient multi-scale feature fusion

Challenge 2: model scaling

3.2. Cross-Scale Connections

3.3. Weighted Feature Fusion

Unbounded fusion: However, since the scalar weight is unbounded, it could potentially cause training instability.

Softmax-based fusion: the extra softmax leads to significant slowdown on GPU hardware.

Fast normalized fusion: this fast fusion approach has very similar learning behavior and accuracy as the softmax-based fusion, but runs up to 30% faster on GPUs

4.1. EfficientDet Architecture

4.2. Compound Scaling

Backbone network – we reuse the same width/depth scaling coefficients of EfficientNet-B0 to B6 [39] such that we can easily reuse their ImageNet-pretrained checkpoints

BiFPN network

Box/class prediction network

Input image resolution