A deep learning approach for robust, multi-oriented, and curved text detection

Ranjbarzadeh, Ramin; Jafarzadeh Ghoushchi, Saeid; Anari, Shokofeh; Safavi, Sadaf; Tataei Sarshar, Nazanin; Babaee Tirkolaee, Erfan; Bendechache, Malika

A deep learning approach for robust, multi-oriented, and curved text detection

dc.authorid	Erfan Babaee Tirkolaee / 0000-0003-1664-9210	en_US
dc.authorscopusid	Erfan Babaee Tirkolaee / 57196032874	en_US
dc.authorwosid	Erfan Babaee Tirkolaee / U-3676-2017	en_US
dc.contributor.author	Ranjbarzadeh, Ramin
dc.contributor.author	Jafarzadeh Ghoushchi, Saeid
dc.contributor.author	Anari, Shokofeh
dc.contributor.author	Safavi, Sadaf
dc.contributor.author	Tataei Sarshar, Nazanin
dc.contributor.author	Babaee Tirkolaee, Erfan
dc.contributor.author	Bendechache, Malika
dc.date.accessioned	2022-12-02T07:34:06Z
dc.date.available	2022-12-02T07:34:06Z
dc.date.issued	2022	en_US
dc.department	İstinye Üniversitesi, Mühendislik ve Doğa Bilimleri Fakültesi, Endüstri Mühendisliği Bölümü	en_US
dc.description.abstract	Automatic text localization and segmentation in a normal environment with vertical or curved texts are core elements of numerous tasks comprising the identification of vehicles and self-driving cars, and preparing significant information from real scenes to visually impaired people. Nevertheless, texts in the real environment can be discovered with a high level of angles, profiles, dimensions, and colors which is an arduous process to detect. In this paper, a new framework based on a convolutional neural network (CNN) is introduced to obtain high efficiency in detecting text even in the presence of a complex background. Due to using a new inception layer and an improved ReLU layer, an excellent result is gained to detect text even in the presence of complex backgrounds. At first, four new m.ReLU layers are employed to explore low-level visual features. The new m.ReLU building block and inception layer are optimized to detect vital information maximally. The effect of stacking up inception layers (kernels with the dimension of 3 x 3 or bigger) is explored and it is demonstrated that this strategy is capable of obtaining mostly varying-sized texts further successfully than a linear chain of convolution layers (Conv layers). The suggested text detection algorithm is conducted in four well-known databases, namely ICDAR 2013, ICDAR 2015, ICDAR 2017, and ICDAR 2019. Text detection results on all mentioned databases with the highest recall of 94.2%, precision of 95.6%, and F-score of 94.8% illustrate that the developed strategy outperforms the state-of-the-art frameworks.	en_US
dc.identifier.citation	Ranjbarzadeh, R., Jafarzadeh Ghoushchi, S., Anari, S., Safavi, S., Tataei Sarshar, N., Babaee Tirkolaee, E., & Bendechache, M. (2022). A Deep Learning Approach for Robust, Multi-oriented, and Curved Text Detection. Cognitive Computation, 1-13.	en_US
dc.identifier.doi	10.1007/s12559-022-10072-w	en_US
dc.identifier.issn	1866-9956	en_US
dc.identifier.issn	1866-9964	en_US
dc.identifier.scopus	2-s2.0-85141868073	en_US
dc.identifier.scopusquality	Q1	en_US
dc.identifier.uri	http://dx.doi.org/10.1007/s12559-022-10072-w
dc.identifier.uri	https://hdl.handle.net/20.500.12713/3397
dc.identifier.wos	WOS:000882753700002	en_US
dc.identifier.wosquality	Q2	en_US
dc.indekslendigikaynak	Web of Science	en_US
dc.indekslendigikaynak	Scopus	en_US
dc.institutionauthor	Tirkolaee, Erfan Babaee
dc.language.iso	en	en_US
dc.publisher	SPRINGER	en_US
dc.relation.ispartof	COGNITIVE COMPUTATION	en_US
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Deep Learning	en_US
dc.subject	Text Detection	en_US
dc.subject	Curved Texts	en_US
dc.subject	Convolutional Neural Networks	en_US
dc.subject	Text Segmentation	en_US
dc.title	A deep learning approach for robust, multi-oriented, and curved text detection	en_US
dc.type	Article	en_US

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1

İsim:: s12559-022-10072-w.pdf
Boyut:: 2.37 MB
Biçim:: Adobe Portable Document Format
Açıklama:: Tam Metin / Full Text

İndir

Lisans paketi

Listeleniyor 1 - 1 / 1

İsim:: license.txt
Boyut:: 1.44 KB
Biçim:: Item-specific license agreed upon to submission
Açıklama:

İndir

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu