@@ -7,28 +7,28 @@ We provide a spectrum of pre-trained models on different datasets.
77``` python
88import layoutparser as lp
99model = lp.Detectron2LayoutModel(
10- config = ' lp://PubLayNet/faster_rcnn_R_50_FPN_3x/config' , # In model catalog
11- label_map = {0 : " Text" , 1 : " Title" , 2 : " List" , 3 :" Table" , 4 :" Figure" }, # In model`label_map`
10+ config_file = ' lp://PubLayNet/faster_rcnn_R_50_FPN_3x/config' , # In model catalog
11+ label_map = {0 : " Text" , 1 : " Title" , 2 : " List" , 3 :" Table" , 4 :" Figure" }, # In model`label_map`
1212 extra_config = [" MODEL.ROI_HEADS.SCORE_THRESH_TEST" , 0.8 ] # Optional
1313 )
1414model.detect(image)
1515```
1616
1717## Model Catalog
1818
19- | Dataset | Model | Config Path |
20- | ------------------------------------------------------------ | ------------------------------------------------------------ | --------------------------------------------- |
21- | [ HJDataset] ( https://dell-research-harvard.github.io/HJDataset/ ) | [ faster_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/j4yseny2u0hn22r/config.yml?dl=1 ) | lp://HJDataset/faster_rcnn_R_50_FPN_3x/config |
22- | [ HJDataset] ( https://dell-research-harvard.github.io/HJDataset/ ) | [ mask_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/4jmr3xanmxmjcf8/config.yml?dl=1 ) | lp://HJDataset/mask_rcnn_R_50_FPN_3x/config |
23- | [ HJDataset] ( https://dell-research-harvard.github.io/HJDataset/ ) | [ retinanet_R_50_FPN_3x] ( https://www.dropbox.com/s/z8a8ywozuyc5c2x/config.yml?dl=1 ) | lp://HJDataset/retinanet_R_50_FPN_3x/config |
24- | [ PubLayNet] ( https://github.com/ibm-aur-nlp/PubLayNet ) | [ faster_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/f3b12qc4hc0yh4m/config.yml?dl=1 ) | lp://PubLayNet/faster_rcnn_R_50_FPN_3x/config |
25- | [ PubLayNet] ( https://github.com/ibm-aur-nlp/PubLayNet ) | [ mask_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/u9wbsfwz4y0ziki/config.yml?dl=1 ) | lp://PubLayNet/mask_rcnn_R_50_FPN_3x/config |
26- | [ PrimaLayout] ( https://www.primaresearch.org/dataset/ ) | [ mask_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/thdqhkvdihtr8yb /config.yml ?dl=1 ) | lp://PrimaLayout/mask_rcnn_R_50_FPN_3x/config |
19+ | Dataset | Model | Config Path | Eval Result (mAP) |
20+ | ------------------------------------------------------------ | ------------------------------------------------------------ | --------------------------------------------- | ------------------------------------------------------------ |
21+ | [ HJDataset] ( https://dell-research-harvard.github.io/HJDataset/ ) | [ faster_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/j4yseny2u0hn22r/config.yml?dl=1 ) | lp://HJDataset/faster_rcnn_R_50_FPN_3x/config | |
22+ | [ HJDataset] ( https://dell-research-harvard.github.io/HJDataset/ ) | [ mask_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/4jmr3xanmxmjcf8/config.yml?dl=1 ) | lp://HJDataset/mask_rcnn_R_50_FPN_3x/config | |
23+ | [ HJDataset] ( https://dell-research-harvard.github.io/HJDataset/ ) | [ retinanet_R_50_FPN_3x] ( https://www.dropbox.com/s/z8a8ywozuyc5c2x/config.yml?dl=1 ) | lp://HJDataset/retinanet_R_50_FPN_3x/config | |
24+ | [ PubLayNet] ( https://github.com/ibm-aur-nlp/PubLayNet ) | [ faster_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/f3b12qc4hc0yh4m/config.yml?dl=1 ) | lp://PubLayNet/faster_rcnn_R_50_FPN_3x/config | |
25+ | [ PubLayNet] ( https://github.com/ibm-aur-nlp/PubLayNet ) | [ mask_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/u9wbsfwz4y0ziki/config.yml?dl=1 ) | lp://PubLayNet/mask_rcnn_R_50_FPN_3x/config | |
26+ | [ PrimaLayout] ( https://www.primaresearch.org/dataset/ ) | [ mask_rcnn_R_50_FPN_3x] ( https://www.dropbox.com/s/yc92x97k50abynt /config.yaml ?dl=1 ) | lp://PrimaLayout/mask_rcnn_R_50_FPN_3x/config | 69.35 [ eval.csv ] ( https://www.dropbox.com/s/9uuql57uedvb9mo/eval.csv?dl=0 ) |
2727
2828## Model ` label_map `
2929
3030| Dataset | Label Map |
3131| ------------------------------------------------------------ | ------------------------------------------------------------ |
3232| [ HJDataset] ( https://dell-research-harvard.github.io/HJDataset/ ) | ` {1:"Page Frame", 2:"Row", 3:"Title Region", 4:"Text Region", 5:"Title", 6:"Subtitle", 7:"Other"} ` |
3333| [ PubLayNet] ( https://github.com/ibm-aur-nlp/PubLayNet ) | ` {0: "Text", 1: "Title", 2: "List", 3:"Table", 4:"Figure"} ` |
34- | [ PrimaLayout] ( https://www.primaresearch.org/dataset/ ) | ` {0 :"TextRegion", 1 :"ImageRegion", 2 :"TableRegion", 3 :"MathsRegion", 4 :"SeparatorRegion", 5 :"OtherRegion"} ` |
34+ | [ PrimaLayout] ( https://www.primaresearch.org/dataset/ ) | ` {1 :"TextRegion", 2 :"ImageRegion", 3 :"TableRegion", 4 :"MathsRegion", 5 :"SeparatorRegion", 6 :"OtherRegion"} ` |
0 commit comments