Skip to content

Commit 06dcd33

Browse files
committed
update res2net_v1b
1 parent 4cc462a commit 06dcd33

File tree

6 files changed

+501
-6
lines changed

6 files changed

+501
-6
lines changed

README.md

Lines changed: 10 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
# Res2Net for Pose Estimation
22

3+
## Update
4+
- [2020.3.13] Res2Net_v1b based Pose Estimation results are released now.
35

46
## Introduction
57
This repo uses [*Simple Baselines*](http://openaccess.thecvf.com/content_ECCV_2018/html/Bin_Xiao_Simple_Baselines_for_ECCV_2018_paper.html) as the baseline method for Pose Estimation.
@@ -13,12 +15,16 @@ More detail can be found on [ "Res2Net: A New Multi-scale Backbone Architecture"
1315
| Arch |Person detector | Input size | AP | Ap .5 | AP .75 | AP (M) | AP (L) |
1416
|---------------------------|----------------|--------------|-------|-------|--------|--------|--------|
1517
| pose_resnet_50 | prdbox | 256x192 | 0.704 | 0.886 | 0.783 | 0.671 | 0.772 |
16-
| **pose_res2net_50** | prdbox | 256x192 | 0.715 | 0.890 | 0.793 | 0.682 | 0.784 |
18+
| pose_res2net_50 | prdbox | 256x192 | 0.715 | 0.890 | 0.793 | 0.682 | 0.784 |
1719
| pose_resnet_50 | GTbox | 256x192 | 0.724 | 0.915 | 0.804 | 0.697 | 0.765 |
18-
| **pose_res2net_50** | GTbox | 256x192 | 0.737 | 0.925 | 0.814 | 0.708 | 0.782 |
20+
| pose_res2net_50 | GTbox | 256x192 | 0.737 | 0.925 | 0.814 | 0.708 | 0.782 |
1921
| pose_resnet_101 | prdbox | 256x192 | 0.714 | 0.893 | 0.793 | 0.681 | 0.781 |
20-
| **pose_res2net_101** | prdbox | 256x192 | 0.722 | 0.894 | 0.798 | 0.689 | 0.792 |
21-
| **pose_res2net_101** | GTbox | 256x192 | 0.744 | 0.926 | 0.826 | 0.720 | 0.785 |
22+
| pose_res2net_101 | prdbox | 256x192 | 0.722 | 0.894 | 0.798 | 0.689 | 0.792 |
23+
| pose_res2net_101 | GTbox | 256x192 | 0.744 | 0.926 | 0.826 | 0.720 | 0.785 |
24+
| **pose_res2net_v1b_50** | prdbox | 256x192 | 0.722 | 0.895 | 0.797 | 0.685 | 0.794 |
25+
| **pose_res2net_v1b_50** | GTbox | 256x192 | 0.743 | 0.926 | 0.816 | 0.713 | 0.792 |
26+
| **pose_res2net_101** | prdbox | 256x192 | 0.730 | 0.895 | 0.803 | 0.695 | 0.800 |
27+
| **pose_res2net_101** | GTbox | 256x192 | 0.753 | 0.926 | 0.825 | 0.722 | 0.801 |
2228

2329

2430
### Note:
Lines changed: 85 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,85 @@
1+
AUTO_RESUME: true
2+
CUDNN:
3+
BENCHMARK: true
4+
DETERMINISTIC: false
5+
ENABLED: true
6+
DATA_DIR: ''
7+
GPUS: (0,1,2,3)
8+
OUTPUT_DIR: 'output'
9+
LOG_DIR: 'log'
10+
WORKERS: 24
11+
PRINT_FREQ: 100
12+
13+
DATASET:
14+
COLOR_RGB: false
15+
DATASET: 'coco'
16+
ROOT: 'data/coco/'
17+
TEST_SET: 'val2017'
18+
TRAIN_SET: 'train2017'
19+
FLIP: true
20+
ROT_FACTOR: 40
21+
SCALE_FACTOR: 0.3
22+
MODEL:
23+
NAME: 'pose_res2net_v1b'
24+
SCALE: 4
25+
BASEWIDTH: 26
26+
PRETRAINED: '/home/shgao/.torch/models/res2net101_v1b_26w_4s-0812c246.pth'
27+
IMAGE_SIZE:
28+
- 192
29+
- 256
30+
HEATMAP_SIZE:
31+
- 48
32+
- 64
33+
SIGMA: 2
34+
NUM_JOINTS: 17
35+
TARGET_TYPE: 'gaussian'
36+
EXTRA:
37+
FINAL_CONV_KERNEL: 1
38+
DECONV_WITH_BIAS: false
39+
NUM_DECONV_LAYERS: 3
40+
NUM_DECONV_FILTERS:
41+
- 256
42+
- 256
43+
- 256
44+
NUM_DECONV_KERNELS:
45+
- 4
46+
- 4
47+
- 4
48+
NUM_LAYERS: 101
49+
LOSS:
50+
USE_TARGET_WEIGHT: true
51+
TRAIN:
52+
BATCH_SIZE_PER_GPU: 32
53+
SHUFFLE: true
54+
BEGIN_EPOCH: 0
55+
END_EPOCH: 140
56+
OPTIMIZER: 'adam'
57+
LR: 0.001
58+
LR_FACTOR: 0.1
59+
LR_STEP:
60+
- 90
61+
- 120
62+
WD: 0.0001
63+
GAMMA1: 0.99
64+
GAMMA2: 0.0
65+
MOMENTUM: 0.9
66+
NESTEROV: false
67+
TEST:
68+
BATCH_SIZE_PER_GPU: 32
69+
COCO_BBOX_FILE: 'data/coco/person_detection_results/COCO_val2017_detections_AP_H_56_person.json'
70+
BBOX_THRE: 1.0
71+
IMAGE_THRE: 0.0
72+
IN_VIS_THRE: 0.2
73+
MODEL_FILE: ''
74+
NMS_THRE: 1.0
75+
OKS_THRE: 0.9
76+
FLIP_TEST: true
77+
POST_PROCESS: true
78+
SHIFT_HEATMAP: true
79+
USE_GT_BBOX: true
80+
DEBUG:
81+
DEBUG: true
82+
SAVE_BATCH_IMAGES_GT: true
83+
SAVE_BATCH_IMAGES_PRED: true
84+
SAVE_HEATMAPS_GT: true
85+
SAVE_HEATMAPS_PRED: true
Lines changed: 85 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,85 @@
1+
AUTO_RESUME: true
2+
CUDNN:
3+
BENCHMARK: true
4+
DETERMINISTIC: false
5+
ENABLED: true
6+
DATA_DIR: ''
7+
GPUS: (0,1,2,3)
8+
OUTPUT_DIR: 'output'
9+
LOG_DIR: 'log'
10+
WORKERS: 24
11+
PRINT_FREQ: 100
12+
13+
DATASET:
14+
COLOR_RGB: false
15+
DATASET: 'coco'
16+
ROOT: 'data/coco/'
17+
TEST_SET: 'val2017'
18+
TRAIN_SET: 'train2017'
19+
FLIP: true
20+
ROT_FACTOR: 40
21+
SCALE_FACTOR: 0.3
22+
MODEL:
23+
NAME: 'pose_res2net_v1b'
24+
SCALE: 4
25+
BASEWIDTH: 26
26+
PRETRAINED: '/home/shgao/.torch/models/res2net50_v1b_26w_4s-3cf99910.pth'
27+
IMAGE_SIZE:
28+
- 192
29+
- 256
30+
HEATMAP_SIZE:
31+
- 48
32+
- 64
33+
SIGMA: 2
34+
NUM_JOINTS: 17
35+
TARGET_TYPE: 'gaussian'
36+
EXTRA:
37+
FINAL_CONV_KERNEL: 1
38+
DECONV_WITH_BIAS: false
39+
NUM_DECONV_LAYERS: 3
40+
NUM_DECONV_FILTERS:
41+
- 256
42+
- 256
43+
- 256
44+
NUM_DECONV_KERNELS:
45+
- 4
46+
- 4
47+
- 4
48+
NUM_LAYERS: 50
49+
LOSS:
50+
USE_TARGET_WEIGHT: true
51+
TRAIN:
52+
BATCH_SIZE_PER_GPU: 32
53+
SHUFFLE: true
54+
BEGIN_EPOCH: 0
55+
END_EPOCH: 140
56+
OPTIMIZER: 'adam'
57+
LR: 0.001
58+
LR_FACTOR: 0.1
59+
LR_STEP:
60+
- 90
61+
- 120
62+
WD: 0.0001
63+
GAMMA1: 0.99
64+
GAMMA2: 0.0
65+
MOMENTUM: 0.9
66+
NESTEROV: false
67+
TEST:
68+
BATCH_SIZE_PER_GPU: 32
69+
COCO_BBOX_FILE: 'data/coco/person_detection_results/COCO_val2017_detections_AP_H_56_person.json'
70+
BBOX_THRE: 1.0
71+
IMAGE_THRE: 0.0
72+
IN_VIS_THRE: 0.2
73+
MODEL_FILE: ''
74+
NMS_THRE: 1.0
75+
OKS_THRE: 0.9
76+
FLIP_TEST: true
77+
POST_PROCESS: true
78+
SHIFT_HEATMAP: true
79+
USE_GT_BBOX: true
80+
DEBUG:
81+
DEBUG: true
82+
SAVE_BATCH_IMAGES_GT: true
83+
SAVE_BATCH_IMAGES_PRED: true
84+
SAVE_HEATMAPS_GT: true
85+
SAVE_HEATMAPS_PRED: true

lib/models/__init__.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,3 +14,4 @@
1414

1515
import models.pose_resnet
1616
import models.pose_res2net
17+
import models.pose_res2net_v1b

0 commit comments

Comments
 (0)