AggLoss: Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

October 2020

tl;dr: Encourage different anchors matched to the same GT to produce the same prediction, and use occlusion-aware RoI pooling.

Overall impression

The paper has two main contributions. The first is AggLoss, which encourages different anchors associated with the same GT to produce the same output. The second is occlusion-aware, part-based RoI pooling.

Both RepLoss and AggLoss propose additional penalties that make the predicted bounding boxes more compact and the detector less sensitive to NMS. They also impose additional penalties on bboxes that appear between two pedestrians.

Key ideas

AggLoss
- If one GT bbox is associated with more than one anchor, AggLoss encourages the predictions from all these anchors to be the same. It enforces an SL1 (smooth L1) loss between the average prediction of the anchors and the corresponding GT. --> There seems to be something wrong in the paper's formulation. Shouldn't it take the average of the SL1 losses (~abs) of the individual differences, rather than the SL1 loss of the average difference?
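The question above can be made concrete with a small sketch. It compares the two candidate forms for a single GT box: SL1 of the averaged residual (the form described in the note) versus the average of per-anchor SL1 losses (the alternative the note asks about). The function names, the `beta` parameter, and the toy numbers are assumptions for illustration, not taken from the paper.

```python
def smooth_l1(x, beta=1.0):
    """Smooth L1 penalty on a scalar residual (beta is an assumed default)."""
    ax = abs(x)
    return 0.5 * ax * ax / beta if ax < beta else ax - 0.5 * beta


def sl1_of_mean(preds, gt):
    """SL1 between the GT and the mean of the anchor predictions, summed over coordinates."""
    n = len(preds)
    total = 0.0
    for k in range(len(gt)):
        mean_k = sum(p[k] for p in preds) / n
        total += smooth_l1(gt[k] - mean_k)
    return total


def mean_of_sl1(preds, gt):
    """Average over anchors of the per-anchor SL1 loss against the GT."""
    n = len(preds)
    total = 0.0
    for p in preds:
        total += sum(smooth_l1(gt[k] - p[k]) for k in range(len(gt)))
    return total / n


# Toy case: two anchors matched to one GT, off by equal and opposite amounts.
gt = [0.0, 0.0, 0.0, 0.0]
preds = [[0.5, 0.0, 0.0, 0.0], [-0.5, 0.0, 0.0, 0.0]]

print(sl1_of_mean(preds, gt))  # errors cancel in the mean
print(mean_of_sl1(preds, gt))  # each anchor is still penalized
```

In this toy case the opposite errors cancel in the first form, so two disagreeing anchors incur no penalty, while the second form still penalizes each anchor. Since SL1 is convex, the first form is never larger than the second.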
PORoI (Part occlusion aware RoI pooling)
- A part-based model: an inductive bias that introduces prior structural information about the human body, together with visibility prediction, into the network.
- The human body is divided into 5 parts. Each part region U is compared with the visible region of the bbox (V): the intersection over U (intersection area divided by the area of U, not the usual intersection over union) is used to generate a binary visibility score.
- An additional "occlusion" loss on the predicted visibility is calculated with BCE loss.
- The predicted visibility is used to modulate the pooled features before aggregating them with an element-wise sum.
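A minimal sketch of the two PORoI steps described above: deriving binary visibility labels from the part regions U and the visible box V, then using visibility scores to weight per-part features before an element-wise sum. The part layout, the 0.5 threshold, the function names, and the feature values are hypothetical and chosen only for illustration.

```python
def area(box):
    """Area of an (x1, y1, x2, y2) box; degenerate boxes have zero area."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def intersection_over_part(part, visible):
    """Intersection of part U and visible box V, divided by the area of U."""
    inter = area((
        max(part[0], visible[0]), max(part[1], visible[1]),
        min(part[2], visible[2]), min(part[3], visible[3]),
    ))
    a = area(part)
    return inter / a if a > 0 else 0.0


def visibility_labels(parts, visible, thresh=0.5):
    """Binary label per part: 1 if enough of the part is visible (thresh is assumed)."""
    return [1 if intersection_over_part(u, visible) >= thresh else 0 for u in parts]


def modulated_sum(part_feats, vis_scores):
    """Scale each part's feature vector by its visibility score, then sum element-wise."""
    dim = len(part_feats[0])
    return [sum(s * f[d] for f, s in zip(part_feats, vis_scores)) for d in range(dim)]


# Hypothetical 5-part layout inside a 10x20 full-body box.
parts = [
    (2, 0, 8, 5),     # head
    (0, 5, 5, 12),    # left upper body
    (5, 5, 10, 12),   # right upper body
    (0, 12, 5, 20),   # left lower body
    (5, 12, 10, 20),  # right lower body
]
visible = (0, 0, 10, 14)  # lower body mostly occluded

labels = visibility_labels(parts, visible)
feats = [[1.0, 2.0] for _ in parts]  # toy 2-d pooled feature per part
fused = modulated_sum(feats, [float(v) for v in labels])
print(labels, fused)
```

Here the ground-truth labels stand in for the predicted visibility scores; in training, the predicted scores would be supervised against such labels with the BCE "occlusion" loss.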

Technical details

A common trick to boost pedestrian detection performance: use x1.3 input resolution.

Notes

Questions and notes on how to improve/revise the current work
