Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

Requirements

pytorch 1.1+
torchvision 0.3+
pyclipper
opencv3
gcc 4.9+

Data Preparation

train: prepare a text file with one sample per line, using '\t' as the separator between the image path and the label path

/path/to/img.jpg	/path/to/label.txt
...
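
The list-file format above can be written and checked with a few lines of Python. This is a minimal sketch, not code from this repository: the helper names `write_train_list` and `read_train_list` are hypothetical, and the only thing taken from the README is the "image path, tab, label path" layout.

```python
def write_train_list(pairs, list_path):
    """Write one "image<TAB>label" line per sample (hypothetical helper)."""
    with open(list_path, "w", encoding="utf-8") as f:
        for img_path, label_path in pairs:
            f.write(f"{img_path}\t{label_path}\n")


def read_train_list(list_path):
    """Parse the list file back into (image, label) tuples, skipping blank lines."""
    pairs = []
    with open(list_path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.rstrip("\n")
            if not line.strip():
                continue
            parts = line.split("\t")
            if len(parts) != 2:
                # A space instead of a tab is the most likely cause of this error.
                raise ValueError(
                    f"line {line_no}: expected 2 tab-separated fields, got {len(parts)}"
                )
            pairs.append((parts[0], parts[1]))
    return pairs
```

Running `read_train_list` on a list file before training is a cheap way to catch lines that were separated with spaces instead of tabs.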

val: use a folder with the following layout

img/ stores the images
gt/ stores the ground-truth files
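
Before evaluating, it can help to confirm that every image in img/ has a ground-truth file in gt/. The sketch below is an illustration under stated assumptions: the function `pair_val_files` is hypothetical, and the README does not say how gt files are named, so the default rule (same stem with a `.txt` suffix) is a guess that can be overridden through `gt_name`.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to match the dataset.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".bmp"}


def pair_val_files(val_root, gt_name=lambda stem: f"{stem}.txt"):
    """Return sorted (image, gt) path pairs; raise if any gt file is missing.

    gt_name maps an image stem to its gt file name. The default is an
    assumption, not a convention documented by this repository.
    """
    root = Path(val_root)
    pairs, missing = [], []
    for img in sorted((root / "img").iterdir()):
        if img.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        gt = root / "gt" / gt_name(img.stem)
        if gt.is_file():
            pairs.append((img, gt))
        else:
            missing.append(gt.name)
    if missing:
        raise FileNotFoundError(f"missing gt files: {missing}")
    return pairs
```

If the gt files carry a prefix, pass a different rule, for example `pair_val_files(root, gt_name=lambda stem: f"gt_{stem}.txt")`.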

Train

Set train_data_path and val_data_path in config.json,
then run the following script:

python3 train.py
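
The two paths can also be set programmatically instead of editing config.json by hand. This is a hedged sketch: the README names the keys `train_data_path` and `val_data_path` but does not show where they sit inside config.json or what value type they expect, so the hypothetical helper below simply overwrites each key wherever it already occurs and fails loudly if it is absent.

```python
import json


def set_key_everywhere(node, key, value):
    """Recursively set `key` to `value` wherever it already exists; return the hit count."""
    hits = 0
    if isinstance(node, dict):
        for k in node:
            if k == key:
                node[k] = value
                hits += 1
            else:
                hits += set_key_everywhere(node[k], key, value)
    elif isinstance(node, list):
        for item in node:
            hits += set_key_everywhere(item, key, value)
    return hits


def update_data_paths(config_path, train_data_path, val_data_path):
    """Rewrite config.json in place with the given data paths (hypothetical helper)."""
    with open(config_path, encoding="utf-8") as f:
        config = json.load(f)
    updates = (("train_data_path", train_data_path), ("val_data_path", val_data_path))
    for key, value in updates:
        if set_key_everywhere(config, key, value) == 0:
            raise KeyError(f"{key} not found in {config_path}")
    with open(config_path, "w", encoding="utf-8") as f:
        json.dump(config, f, indent=4)
    return config
```

Check the existing values in config.json first to see whether each key holds a single path or a list of paths, and pass the same type.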

Test

eval.py evaluates a model on the test dataset.

Set model_path, img_path, gt_path, and save_path in eval.py,
then run the following script:

python3 eval.py

Predict

predict.py runs inference on a single image.

Set model_path and img_path in predict.py,
then run the following script:

python3 predict.py

The project is still under development.

Performance

ICDAR 2015

Trained only on the ICDAR 2015 dataset.

Method	Image size (short side)	Learning rate	Precision (%)	Recall (%)	F-measure (%)	FPS
paper (resnet18)	736	x	x	x	80.4	26.1
this repo (resnet18 + FPEM_FFM + PSE expansion)	736	1e-3	84.24	74.14	78.87	21.31 (P100)
this repo (resnet50 + FPEM_FFM + PSE expansion)	736	1e-3	69.04	66.66	67.83	14.22 (P100)
this repo (resnet18 + FPEM_FFM + PSE expansion)	736	1e-4	62.93	62.41	62.61	21.31 (P100)
this repo (resnet50 + FPEM_FFM + PSE expansion)	736	1e-4	61.19	69.18	64.94	14.22 (P100)
this repo (resnet18 + FPN + PSE expansion)	736	1e-3	76.50	74.70	75.59	14.47 (P100)
this repo (resnet50 + FPN + PSE expansion)	736	1e-3	71.82	75.73	73.72	10.67 (P100)
this repo (resnet18 + FPN + PSE expansion)	736	1e-4	74.19	72.34	73.25	14.47 (P100)
this repo (resnet50 + FPN + PSE expansion)	736	1e-4	78.96	76.27	77.59	10.67 (P100)
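
The F-measure column is the harmonic mean of precision and recall, which makes it easy to sanity-check a row. The snippet below is a small sketch (the function name `f_measure` is chosen here for illustration and is not taken from eval.py); it reproduces the F-measure of the two FPEM_FFM rows trained with learning rate 1e-3.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall, in the same units as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# resnet18 + FPEM_FFM, learning rate 1e-3
print(round(f_measure(84.24, 74.14), 2))  # 78.87
# resnet50 + FPEM_FFM, learning rate 1e-3
print(round(f_measure(69.04, 66.66), 2))  # 67.83
```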

Examples

TBD

Reference

If this repository helps you, please star it. Thanks.

xgmiao/PAN.pytorch
