A deep learning approach for robust, multi-oriented, and curved text detection
dc.authorid | Erfan Babaee Tirkolaee / 0000-0003-1664-9210 | en_US |
dc.authorscopusid | Erfan Babaee Tirkolaee / 57196032874 | en_US |
dc.authorwosid | Erfan Babaee Tirkolaee / U-3676-2017 | en_US |
dc.contributor.author | Ranjbarzadeh, Ramin | |
dc.contributor.author | Jafarzadeh Ghoushchi, Saeid | |
dc.contributor.author | Anari, Shokofeh | |
dc.contributor.author | Safavi, Sadaf | |
dc.contributor.author | Tataei Sarshar, Nazanin | |
dc.contributor.author | Babaee Tirkolaee, Erfan | |
dc.contributor.author | Bendechache, Malika | |
dc.date.accessioned | 2022-12-02T07:34:06Z | |
dc.date.available | 2022-12-02T07:34:06Z | |
dc.date.issued | 2022 | en_US |
dc.department | İstinye Üniversitesi, Mühendislik ve Doğa Bilimleri Fakültesi, Endüstri Mühendisliği Bölümü | en_US |
dc.description.abstract | Automatic text localization and segmentation in a normal environment with vertical or curved texts are core elements of numerous tasks comprising the identification of vehicles and self-driving cars, and preparing significant information from real scenes to visually impaired people. Nevertheless, texts in the real environment can be discovered with a high level of angles, profiles, dimensions, and colors which is an arduous process to detect. In this paper, a new framework based on a convolutional neural network (CNN) is introduced to obtain high efficiency in detecting text even in the presence of a complex background. Due to using a new inception layer and an improved ReLU layer, an excellent result is gained to detect text even in the presence of complex backgrounds. At first, four new m.ReLU layers are employed to explore low-level visual features. The new m.ReLU building block and inception layer are optimized to detect vital information maximally. The effect of stacking up inception layers (kernels with the dimension of 3 x 3 or bigger) is explored and it is demonstrated that this strategy is capable of obtaining mostly varying-sized texts further successfully than a linear chain of convolution layers (Conv layers). The suggested text detection algorithm is conducted in four well-known databases, namely ICDAR 2013, ICDAR 2015, ICDAR 2017, and ICDAR 2019. Text detection results on all mentioned databases with the highest recall of 94.2%, precision of 95.6%, and F-score of 94.8% illustrate that the developed strategy outperforms the state-of-the-art frameworks. | en_US |
dc.identifier.citation | Ranjbarzadeh, R., Jafarzadeh Ghoushchi, S., Anari, S., Safavi, S., Tataei Sarshar, N., Babaee Tirkolaee, E., & Bendechache, M. (2022). A Deep Learning Approach for Robust, Multi-oriented, and Curved Text Detection. Cognitive Computation, 1-13. | en_US |
dc.identifier.doi | 10.1007/s12559-022-10072-w | en_US |
dc.identifier.issn | 1866-9956 | en_US |
dc.identifier.issn | 1866-9964 | en_US |
dc.identifier.scopus | 2-s2.0-85141868073 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.uri | http://dx.doi.org/10.1007/s12559-022-10072-w | |
dc.identifier.uri | https://hdl.handle.net/20.500.12713/3397 | |
dc.identifier.wos | WOS:000882753700002 | en_US |
dc.identifier.wosquality | Q2 | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.institutionauthor | Tirkolaee, Erfan Babaee | |
dc.language.iso | en | en_US |
dc.publisher | SPRINGER | en_US |
dc.relation.ispartof | COGNITIVE COMPUTATION | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Deep Learning | en_US |
dc.subject | Text Detection | en_US |
dc.subject | Curved Texts | en_US |
dc.subject | Convolutional Neural Networks | en_US |
dc.subject | Text Segmentation | en_US |
dc.title | A deep learning approach for robust, multi-oriented, and curved text detection | en_US |
dc.type | Article | en_US |
Dosyalar
Orijinal paket
1 - 1 / 1
Küçük Resim Yok
- Ä°sim:
- s12559-022-10072-w.pdf
- Boyut:
- 2.37 MB
- Biçim:
- Adobe Portable Document Format
- Açıklama:
- Tam Metin / Full Text
Lisans paketi
1 - 1 / 1
Küçük Resim Yok
- Ä°sim:
- license.txt
- Boyut:
- 1.44 KB
- Biçim:
- Item-specific license agreed upon to submission
- Açıklama: