journal of la neural architecture based on fuzzy

14
JOURNAL OF L A T E X CLASS FILES, VOL. , NO. , MAY 2019 1 Neural Architecture based on Fuzzy Perceptual Representation For Online Multilingual Handwriting Recognition Hanen Akouaydi, Student Member, IEEE, Sourour Njah, Member, IEEE, Wael Ouarda, Member, IEEE, Anis Samet, Professional tutor, Thameur Dhieb, Member, IEEE, Mourad Zaied,Senior Member, IEEE, and Adel M. Alimi, Senior Member, IEEE Abstract—Due to the omnipresence of mobile devices, online handwritten scripts have become the most important feeding input to smartphones and tablet devices. To increase online handwriting recognition performance, deeper neural networks have extensively been used. In this context, our paper handles the problem of online handwritten script recognition based on extraction features system and deep approach system for sequences classification. Many solutions have appeared in order to facilitate the recognition of handwriting. Accordingly, we used an existent method and combined with new classifiers in order to get a flexible system. Good results are achieved compared to online characters and words recognition system on Latin and Arabic scripts. The performance of our two proposed systems is assessed by using five databases. Indeed, the recognition rate exceeds 98%. Index Terms—Online handwriting recognition, Fuzzy Perceptual Code, Beta-elliptic model, Fuzzy logic, Deep Fuzzy Neural Network, Convolutional LSTM, Fuzzy Ground Truth Training, CTC, Noise. 1 I NTRODUCTION H ANDWRITING recognition system is the tool whereby a computer system can recognize characters and other symbols written by hand. It recognizes individual characters and an entire word, just as the human eye and mind would. Humans can understand any type of writing in different constraints. Thus, many researches are conducted to analyze handwriting and recognize it automatically. In this context, we develop, in our work, two systems for online handwritten script. In fact, the first system uses the perceptual theory, inspired from the human perceptual system, transforming language from graphical marks into symbolic representation. It proceeds by the detection of common properties, and then it gathers them according to some perceptual laws proximity, similarity...,etc. Finally, it attempts to build a representation of the handwritten form based on the assumption that the perception of form is the identification of basic features that are arranged till we identify an object. Therefore, the representation of hand- writing is a combination of primitive strokes. Handwriting is a sequence of basic codes which are grouped together to define a character or a shape. This perceptual theory segments an handwritten script into elliptic strokes by Beta- elliptic model of handwriting generation. These strokes will be classified into primitives codes then we apply LSTM classifier in order to recognize online handwritten script. We choose multilingual scripts : Latin and Arabic. In the second system, we apply convolutional LSTM on (x, y, H.Akouaydi, S.Njah, T.Dhieb, M.Zaied and A.M.Alimi, REGIM-Lab.: REsearch Groups in Intelligent Machines, University of Sfax, National Engineering School of Sfax (ENIS), BP 1173, Sfax, 3038, Tunisia. Anis Samet, Professional tutor, Sifast. E-mail: [email protected] Manuscript received August 01, 2019; z) collected from Arabic/Latin datasets and also from data acquired from Android application. The structure of this paper is as follows: Section 2 presents an overview of systems for handwriting recognition in literature based on extraction features and deep learning approaches. Section 3 illustrates our two systems for online handwritten script recognition. Section 4 reports the rate recognition performed on online handwriting and section 5 draws concluding remarks of future works. 2 RELATED WORK Handwriting recognition is transforming graphical marks into symbolic representation. Although humans are similar and have a common education, they produce significantly different handwritten styles. The factors of the variability are related to the writer, emotional factors, language of writ- ing, the writing with different size, styles and speed...etc. We found two main approaches for Handwriting Recognition : The first approach is based on segmentation and classifica- tion (or segment-and-decode) and the second is related to time-sequence interpretation. 2.1 Handcrafted Approaches Those approaches segment then decode the script in order to identify it. Traditional classification are based on extraction features that are engineered and extracted from kinetic signals. Google online handwriting system is one of the most pow- erful systems that presently supports 22 scripts and 97 languages. The same architecture of its system is used for mobile devices with more limited computational power by means of changing some of the settings of the system [3]. We arXiv:1908.00634v1 [cs.CV] 1 Aug 2019

Upload: others

Post on 24-Oct-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 1

Neural Architecture based on Fuzzy PerceptualRepresentation For Online Multilingual

Handwriting RecognitionHanen Akouaydi, Student Member, IEEE, Sourour Njah, Member, IEEE, Wael Ouarda, Member, IEEE,Anis Samet, Professional tutor, Thameur Dhieb, Member, IEEE, Mourad Zaied,Senior Member, IEEE,

and Adel M. Alimi, Senior Member, IEEE

Abstract—Due to the omnipresence of mobile devices, online handwritten scripts have become the most important feeding input tosmartphones and tablet devices. To increase online handwriting recognition performance, deeper neural networks have extensivelybeen used. In this context, our paper handles the problem of online handwritten script recognition based on extraction features systemand deep approach system for sequences classification. Many solutions have appeared in order to facilitate the recognition ofhandwriting. Accordingly, we used an existent method and combined with new classifiers in order to get a flexible system. Good resultsare achieved compared to online characters and words recognition system on Latin and Arabic scripts. The performance of our twoproposed systems is assessed by using five databases. Indeed, the recognition rate exceeds 98%.

Index Terms—Online handwriting recognition, Fuzzy Perceptual Code, Beta-elliptic model, Fuzzy logic, Deep Fuzzy Neural Network,Convolutional LSTM, Fuzzy Ground Truth Training, CTC, Noise.

F

1 INTRODUCTION

HANDWRITING recognition system is the tool wherebya computer system can recognize characters and other

symbols written by hand. It recognizes individual charactersand an entire word, just as the human eye and mindwould. Humans can understand any type of writing indifferent constraints. Thus, many researches are conductedto analyze handwriting and recognize it automatically. Inthis context, we develop, in our work, two systems foronline handwritten script. In fact, the first system usesthe perceptual theory, inspired from the human perceptualsystem, transforming language from graphical marks intosymbolic representation. It proceeds by the detection ofcommon properties, and then it gathers them according tosome perceptual laws proximity, similarity...,etc. Finally, itattempts to build a representation of the handwritten formbased on the assumption that the perception of form isthe identification of basic features that are arranged till weidentify an object. Therefore, the representation of hand-writing is a combination of primitive strokes. Handwritingis a sequence of basic codes which are grouped togetherto define a character or a shape. This perceptual theorysegments an handwritten script into elliptic strokes by Beta-elliptic model of handwriting generation. These strokes willbe classified into primitives codes then we apply LSTMclassifier in order to recognize online handwritten script.We choose multilingual scripts : Latin and Arabic.In the second system, we apply convolutional LSTM on (x, y,

• H.Akouaydi, S.Njah, T.Dhieb, M.Zaied and A.M.Alimi, REGIM-Lab.:REsearch Groups in Intelligent Machines, University of Sfax, NationalEngineering School of Sfax (ENIS), BP 1173, Sfax, 3038, Tunisia.

• Anis Samet, Professional tutor, Sifast. E-mail: [email protected]

Manuscript received August 01, 2019;

z) collected from Arabic/Latin datasets and also from dataacquired from Android application.The structure of this paper is as follows: Section 2 presentsan overview of systems for handwriting recognition inliterature based on extraction features and deep learningapproaches. Section 3 illustrates our two systems for onlinehandwritten script recognition. Section 4 reports the raterecognition performed on online handwriting and section5 draws concluding remarks of future works.

2 RELATED WORKHandwriting recognition is transforming graphical marksinto symbolic representation. Although humans are similarand have a common education, they produce significantlydifferent handwritten styles. The factors of the variabilityare related to the writer, emotional factors, language of writ-ing, the writing with different size, styles and speed...etc. Wefound two main approaches for Handwriting Recognition :The first approach is based on segmentation and classifica-tion (or segment-and-decode) and the second is related totime-sequence interpretation.

2.1 Handcrafted Approaches

Those approaches segment then decode the script in order toidentify it. Traditional classification are based on extractionfeatures that are engineered and extracted from kineticsignals.Google online handwriting system is one of the most pow-erful systems that presently supports 22 scripts and 97languages. The same architecture of its system is used formobile devices with more limited computational power bymeans of changing some of the settings of the system [3]. We

arX

iv:1

908.

0063

4v1

[cs

.CV

] 1

Aug

201

9

Page 2: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 2

TABLE 1Summary of systems for handwriting recognition.

Approaches Authors Database Method Accuracy

Handcrafted Approaches [1] LMCA + manually Segmented characters form ADAB DNN 82.12%[2] LMCA + manually Segmented characters form ADAB MLP+HMM system 96.45%[3] UNIPEN+ Google cloud Language Model 99.02%[4] Farsi NN+GA 94.20%[5] ADAB Grapheme segmentation + MLP 93.50%

Deep Learning Approaches [6] Chinese CNN 97.20%[7] Mathematical Expressions GRU 52.43%[8] Chinese CNN+Drop Weight 96.88%[9] Drawing & Recognition Chinese characters RNN 93.98%

also find another approach for online Arabic handwritingrecognition, based on neural networks approach. This worksuggests a method based on TDNN classifier and multi-layer perceptron,it achieved 98.50% for characters, 96.90%for words [4]. There is another work which recognizesmathematical expression by using HMM and the systemwas conducted by covering two things: feature modifi-cation experiment and code words number experiments.The best result is gained with respect to four combinationfeatures [10]. We also notice a system for online Farsicharacters recognition. The proposed system is tested onTMU database and 94.2% obtained as recognition rate forthe recognition of the group and the character, respectively[4]. Hamdi et al. [5] suggest a hybridization of neural net-works and genetic algorithm for online Arabic handwrit-ing recognition. In fact, based on Beta-elliptic model andbaseline detection. The proposed system reached 93.5% asaverage of recognition. Another work deals with OnlineArabic Handwritten Character Recognition based on a DeepNeural Networks DNN in which we attempt to optimizethe training process by a smooth construct of the deeparchitecture. This system uses LMCA database for trainingand testing data. Tagougui et al. gained 96.16% as testingrate [1]. There is a new work which proposes a model basedon Bzier curve interpolation and recurrent neural networksfor online handwriting recognition and that able to support102 languages [11]. Besides, another work describes a hybridmodel based on bi-directional LSTM and ConnectionistTemporal Classifier (CTC) for handwriting recognition onmobile devices. [12]The success of deep neural networks approaches hasbeen achieved based on handwriting recognition. However,recognition is not only a way to understand a language, butalso a challenging task to teach machine how to write au-tomatically. Most research works did not propose universalmethods to recognize special languages as Farsi, Chineseor Mathematical Expressions [13]. The major motivation ofthis paper consists in dealing with online script by usingsome novel techniques in training deep recurrent neuralnetworks. In fact,our proposed models for handwritingrecognition applied various lexicons by using perceptualcodes, LSTM and convolutional LSTM.

2.2 Deep Learning Approaches

Generative approaches based on Convolutional NeuralNetworks (CNN) and LSTM to recognize handwritten

script. The extraction features process requires a deepknowledge of application domain, or human experienceor hardware memory to store script in order to identifyscript from user. Accordingly, automatic and deep methodsare attracting in the field of handwriting Recognition. Byadoption data-driven approaches for signal (x, y, z) scriptclassification, the process of selecting meaningful featuresfrom the data is deferred to the learning model. CNNs arecapable of detecting both spatial and temporal dependen-cies among signals, and can effectively model scale invariantfeatures. It is noteworthy that two main advantages canbe considered, when applying deep neural networks tohandwriting recognition which are as follows:

• Local Dependency:New Deep Neural Networks cap-ture local dependencies of handwriting signals. Inimage recognition tasks, the nearby pixels typicallyhave strong relationship with each other. Similarly,in handwritten script(online (x, y, z)) given an script(signal) the nearby acceleration readings are likely tobe correlated.

• Scale Invariance: Deep neural Networks preservefeature scale invariant. In image recognition, thetraining images could probably different scales. Inhandwritten script , a person may write with differ-ent styles (e.g., with different motion intensity).

Those approaches used are: hidden Markov models(HMMs) [14], time-delay neural networks (TDNN) [15],and recurrent neural networks (RNN) [16], of which long-short-term-memory networks (LSTM) [17] are a specificcase, presently receiving significant attention with regardsto machine learning.

3 PROPOSED SYSTEM

Online handwritten scripts are always stored as form ofsequence points. Divers features can be extracted from themsuch as stroke coordinates, times stamp, writing angle andcurvature, etc. The aim of the turn-out of such system is torecognize the sequence points. In this paper, two modelsare suggested, both of them are based on deep neuralnetworks first corresponds to Online Handwriting Segmen-tation & Recognition using LSTM, OnHSR-LSTM, whichrequires a step of segmentation, and the second, consists inOnline Handwriting Recognition using convLSTM, OnHR-convLSTM. The two models will be described below:

Page 3: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 3

Fig. 1. Architecture OnHS-LSTM.

TABLE 2Basic Perceptual Codes

Number Basic Perceptual Codes Abbreviation Shape

1 Valley V2 Left-oblique-shaft L-O-S3 Shaft S4 Right-oblique-shaft R-O-S5 Right-Half-Occlusion R-H-O6 Left-Half-Occlusion L-H-O7 Up-Half-Occlusion U-H-O8 Down-Half-Occlusion D-H-O9 Occlusion Occ

3.1 Online Handwriting Segmentation & Recognitionusing LSTM, OnHSR-LSTM

Our first model OnHSR-LSTM, is based on the fact thathandwriting is a form created by hand movement and allwe perceive is only a form [18], [19], [20] It proposes thathandwriting is a concatenation of perceptual codes groupedtogether, so as to form a character, a digit or a word. Eachonline script can be composed of elementary basic shapes.With (x, y, z) coordinates of online trace and the beta-ellipticmodel we detect basic shapes forming the script. Then, weclassify them relying on LSTM in order to recognize the

character or the written digit. Figure2 represents the ar-chitecture of LSTM-Character-recognizer, which recognizesOnline Handwritten letters. Our first model is endowedwith crucial phases:the phase of segmentation and the phaseof classification and recognition.

3.1.1 Segmentation StepThe segmentation step is concerned with the regenerationof the online handwritten script by means of strokes or by aset of control points that describe the handwritten trace.

3.1.1.1 Preprocessing:Preprocessing is a key step with respect to recognition in

Page 4: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 4

Fig. 2. Illustrated example of composed forms.

order to obtain better recognition rate. The principles of pre-processing steps are used to eliminate trembles in writing ,to reduce noise and to remove the hardware imperfections.The preprocessing operations used in our system are:

• Interpolation: adding missing points caused by thevariation of writing velocity.

• De-hooking: eliminating trembles in writing due toinaccuracies in rapid pen-down/up detection andtrembles in writing;

• Removing noise: we apply a Chebyshev low passfiler to eliminate the noise generated by spatial andtemporal sampling;

3.1.1.2 Beta-elliptic Features:Handwriting is a message which can be decoded into basicshapes. The main assumption of our proposed model con-sists in the fact that handwriting is a concatenation of visualcodes grouped together so as to get a shape, a character ora digit. The principal basic shapes to write are: −, /, |,\.[21], [22]. The basic shape which can compose any script.Indeed, the basic shapes are illustrated in Table 2. Theanalysis of any language has demonstrated that a script is anarrangement of vertical and diagonal lines as shown Figure1, which demonstrates that a letter is a combination of twoor more basic shapes. In fact we have 9 basic perceptualshapes that can be simple or complex. These basic shapeshave been identified as Global Perceptual Codes with GPC”Ain” which only characterize the Arabic script. [18], [19],[23], [24], [25] Thus, in this paper we use only 9 basicperceptual shapes which can form any script. In Figure 1,we notice that handwriting is an arrangement of perceptualcodes in a special order, and we can from different scriptswith the same group of basic shape such as the two Arabicletter �« hamza Z. In this step, we extract the basic shapethat gather together to identify a script. Hence, we use thebeta-elliptic model as a model of segmentation. Basic shapeare not directly detectable. Therefore, we use in the step ofsegmentation the beta-elliptic model.Beta-elliptic model propose that handwriting can be recon-structed by the sum of impulse signals by the curvilinear ve-locity and can be also approximated by a sequence of ellipticshapes. [25], [26], [27] So, handwriting can be segmentedinto simple movements called strokes, which are the resultof superposition of time overlapped velocity profiles. Con-sequently, each stroke with its curvilinear velocity obeys theBeta-elliptic approach and online script can be segmentedinto different ones by tracting Beta-elliptic parameters. [19],

[20], [24], [28]The Beta function is presented in the equation(1)

β(t, p, q, t0, t1) = (t− t0tc − t1

)p(t1 − tt1 − tc

)q, IF t ∈]t0, t1[

= 0, IFnot(1)

where t0: starting time of Beta function.tc: is the instant when the curvilinear velocity reaches theamplitude of the inflexion point.t1: the ending time of Beta function.With(p, q, t0 < t1) ∈ R, and

tc =p ∗ t1 + q ∗ t0

p+ q, p = q ∗ t− t0

tc − t1

p , q intermediate parameters which have an influence onthe symmetry and the width of Beta shape.As a result, a stroke is characterized by 10 parameters.Thefollowing beta-elliptic parameters a, b, x0,y0 and θ reflect thegeometric properties of the set of muscles and joints used ina particular handwriting movement and describe the staticaspect of handwriting. These parameters are:

• x0,x1:the coordinates of the ellipse center,• a: big axe of the ellipse• b: small axe of the ellipse• θ: the angle of the deviation of the elliptic arc and

the horizontal which is obtained by the followingequation 2:

θ = arctan(Y1 − Y0X1 −X0

) (2)

The Beta elliptic was performed in systems for writeridentification and verification. [29], [30], [31] An onlinehandwriting script can be segmented into segments andeach segment is a group of strokes. In fact, the numberof strokes is identified automatically from the curvilinearvelocity representation. In the trajectory domain, each strokeis limited by both M1 and M3 that correspond to themaximum curvature and minimum curvilinear velocity. M2is defined by the maximum tangential velocity.

3.1.1.3 Elementary Perceptual Codes (EPC) Detec-tion with Beta-points:The main idea of this part is to associate an EPC (ElementaryPerceptual Code) to each stroke. It means that we classifystrokes of the handwritten script generated by the beta-elliptic model into EPCs.This classification is performed byusing the parameter, corresponding to the deviation angleof each ellipse ”θ”, by the horizontal axis. An EPC is limitedby beta control points M1, M3 and goes through H whichthe orthogonal projection of M2. The number of strokes isidentified automatically by beta-elliptic model. This numberis equal to the number of EPCs. The EPCs are similarto Freeman’s code, which are frequently used for onlinehandwriting recognition, being able to express stroke direc-tions. Based on this technique, features are extracted fromeach online trajectory segment. By means of analogy withFreemans codes, an EPC is a whole region incorporating agroup of directions as depicted in Figure 2.(a) showing the 8directions of Freeman’s codes and 2.(b) illustrating all EPCsin their corresponding parts. [19]Different factors(style, speed, position, size, orientation )

Page 5: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 5

Fig. 3. Segmentation Step of Arabic Letter ”yaa”.

Fig. 4. Segmentation and Detection of Beta-points of Arabic word ”Kalaa” .

Fig. 5. Directions: (a): Freeman′s chain code, (b): the proposed regions, (c):Deviation angles regions and EPCs on the trigonometric cir-cle,(d):Elementary Perceptual Codes

Page 6: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 6

Fig. 6. Example of perceptual problem decision using fuzzy logic.

can affect the handwritten script, thus making its treatmentand analysis difficult. Figure 6 presents some regions oftrigonometric circle, we consider an overlapped intervalvalue noted cst equal to /16 where there is a problem of in-decision pertaining to the corresponding EPC. This indicatesan example of the perceptual problem decision related tothe belongings of elliptic stroke. To prevent some perceptionproblem, we associate an EPC to each stroke with a certainmembership degree. Thus, we opt for employing fuzzy logicrepresentation to resolve the problems of fuzzy perceptionand the uncertainty of the assessed EPC [18], [19], [23].Correspondingly, for each stroke a certain belongingnessdegree will be allowed.

3.1.2 LSTM FOR CLASSIFICATION

After extracting the Basic Perceptual Codes (BPC) of onlinescript, a classifier is required to identify it. We opt to useLSTM, Long Short-Term Memory as classifier. LSTM isspecially constructed RNNs, Reccurent Neural Networks,node to preserve long lasting dependencies by avoidingthe vanishing gradient problem. It is explicitly conceivedto prevent the long-term dependency problem, a tool ofsequence classification in order to identify time-sequences.In fact, their default behavior of LSTM cells lies mainlyin remembering information forlong periods of time. Itprovides a memory of the previous network internal state.Long Short-Term memory (LSTM) layer: The LSTM networknodes posses a specific architecture, referred to as memoryblock. Each memory block incorporates a memory cell, andits interaction with the rest of the network is controlled bythree gates, i.e: an input gate, an output gate and a forgetgate. This permits the memory cell to preserve its statefor a long period. The following illustrates the architectureof LSTM To identify the online script or to identify basicperceptual codes, we use LSTM. The handwritten script issegmented into strokes limited by beta-points. We applyfuzzy logic in order to obtain the membership of one stroke

Fig. 7. Architecture of LSTM cells.

to the 4 EPCs. A segment is a group of strokes, and strokeis composed of 4 EPCs with degree of membership. Eachstroke is classified into 4 EPCs, and a segment is a group of nstrokes. A BPC is a group of n segments or a script(character,digit ,word) is a group of n EPCs. Handwriting is a group ofperceptual codes which are the bases pattern to read, write,identify, segment and recognize a script. In this section,we detail how from these perceptual codes, we can forma character, a shape or a digit. Handwriting has an equationas our perceptual theory which is:

handwriting = {Segment1i, Segment2i, Segmentni} (3)

n:the total number of segments composing handwriting.

handwriting = {BPC1i, BPC2i, ..., BCPni} (4)

i:1,9 correspond to BPCs. (see Table II)

BPC = {EPC1j, EPC2j, ..., EPCpj} (5)

BPC = {Stroke1j, Stroke2j, ..., Strokepj} (6)

p:Number of strokes identified in the trace.j:1,2,3,4 correspond to EPCs. (see Table I) With an adequatearrangement of elementary perceptual codes (EPCs),

Stroke/EPC = beta−point1+beta−point2+beta−point3(7)

The beta-points are three ,anchorage perceptual points [18],[19], [25],forming a stroke. The stroke goes from M1 to M2,which are the maximum curvilinear velocity and the mini-mum curvilinear velocity, respectively. It passes through Hwhich is the orthogonal projection of M2, standing for themaximum tangential velocity or the inflexion point. TheseBasic perceptual Codes (BPCs)are divided into two classes:

• A simple class contains: Valley, Left oblique shaft,Shaft, Right oblique shaft.

• A complex class is composed of: Right half occlusion,Left half occlusion, Up half occlusion, Down halfocclusion, Occlusion.

As already mentioned, handwriting is a sequence of basicperceptual codes and by the combination of simple ones,we obtain global ones. Numerous possible combination ofEPCs can form a BPC. In fact, there is a large numberof combinations that forms a BPC. Therefore, a large setof BPCs features is assumed to be available in order torecognize the corresponding BPCs of the initial script. Asmentioned above, the Basic Perceptual Codes are a set of

Page 7: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 7

sequence of Elementary perceptual Codes. The same BPCcan be obtained by different sets of EPCs. The BPC ”Valley”can be obtained by a minimum set of three successiveEPCs, corresponding to Valley and at a maximum set of sixValley EPCs. Those difficulties which are the sequences ofEPCs, can vary in length and be comprised of a very largecombination of one BPC, and may require a model to learnthe long-term context or dependencies between BPCs in theinput sequence of initial script. Thus, the problem of thesuitable choice of corresponding BPC can be consequentlyregarded as related to sequence classification. Sequence clas-sification is a predictive modeling problem, there is somesequence of inputs over space or time and the task consistin predicting a category for the sequence. To improve the

Fig. 8. Some samples of BPC’s combination.

performance of our network, we use the CTC as output layerafter softmax layer in system 1 which pertains to PerceptualCodes-LSTM for Online handwriting recognition. In fact,it is a layer of labelling. The CTC algorithm employs amany-to-one sequence-to-sequence mapping function thatconverts the sequence of labeling with timing informationinto the shorter sequence of labels, by means of removingtiming and alignment information [32]. Indeed, the mainpoint is to introduce the additional CTC blank label forthe sequence that has timing information, then remove theblank labels and merge repeating labels do as to obtainthe unique corresponding sequence. Based on our OnHSR-LSTM system,the output of the final LSTM layer is passedthrough a softmax layer that yields the predictable charac-ters for each time step. The output of the LSTM layers ateach time step is located into a softmax layer in order toobtain a probability distribution words, digits or characters,as well as a blank label added by CTC.

3.2 Convolutional LSTM for Handwriting RecognitionOur second proposed system OnHR-convLSTM, is pre-sented in the Figure 9 with the architecture of convLSTMcell. It is a generative model that has as input the onlinesignal (x, y, z) of the script and predicts characters as wellas words from trained model. This generative system excelswith sequence learning tasks and can align unsegmenteddata with their corresponding ground truth. LSTM networkmodels are a type of recurrent neural network capable oflearning and remembering over long sequences of inputdata. In fact, they may be convenient for such problem.The model can support multiple parallel sequences of inputdata, such as each axis of the (x, y, z) data. The model learnsto extract features from sequences of observations and how

to map the internal features to various types of activity.The merit of using LSTMs for sequence classification liesin the fact that they can learn from the raw time series data,directly, and in turn, do not necessitate domain expertise tomanually engineer input features. The model can learn aninternal representation of the time series data, thus ideallyachieving comparable performance to models fit on a ver-sion of the data-set with engineered features. Convolutional-LSTM model is a different approach, wherein the imagepasses through the convolution layers and the result consistsin a set flattened to a 1D array with the obtained features.Repeating this process to all images in the time set, give riseto a set of features over time, which is, in fact, the LSTMlayer input. The ConvLSTM layer output is a combinationof a Convolution and LSTM output. The other ConvLSTMparameters are derived from the Convolution and the LSTMlayer. The The Convolutional LSTM architecture includesusing Convolutional Neural Network (CNN) layers forfeature extraction on input data combined with LSTMs, inorder to support sequence prediction. It was developed forvisual time series prediction problems and the applicationof generating textual descriptions from sequences of images(e.g. videos). A Convolutional LSTM can be defined byadding CNN layers on the front end followed by LSTMlayers with a Dense layer on the output. This architectureis considered helpful by defining two sub-models: the CNNModel for feature extraction and the LSTM Model for in-terpreting the features across time steps. Unlike an LSTMthat reads the data directly in order to calculate internalstate and state transitions, and unlike the CNN LSTM that isinterpreting the output from CNN models, the ConvLSTMuses convolutions directly as part of reading input intothe LSTM units themselves. Both Convolutional NeuralNetworks (CNNs) and Long Short-Term Memory (LSTM)have shown improvements over Deep Neural Networks(DNNs) across a wide range of speech recognition tasks.CNNs, LSTMs and DNNs are complementary with respectto their modeling capabilities. Indeed, CNNs are efficient atreducing frequency variations, LSTMs are good at temporalmodeling, and CNNs are appropriate for mapping featuresto a more separable space. Therefore, we combine them intoone unified architecture.

4 REGULARIZATION AND DATA AUGMENTATION

To improve the performance of our system, we used sometechniques, as illustrated above.

4.1 Dropout

The main goal when using dropout is to regularize theneural network that, we train. The technique involves ofdropping neurons randomly with some probability p. Thoserandom modifications of the networks structure are believedto avoid co-adaptation of neurons by making it impossiblefor two subsequent neurons to rely solely on each other.In fact, dropout is implicitly bagging, at test time, a largenumber of neural networks that share parameters. Besides,dropout is used it to prevent the over-fitting of our networkfrom and to upgrade the latter’s performance, dropout isapplied. This technique consists in temporarily removing a

Page 8: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 8

Fig. 9. (a)Architecture of OnHR-convLSTM, (b) convLSTM cell.

unit from the network. The selection of such removed unitis randomly occurring only during the training stage.

4.2 Data AugmentationThe use of deformation technology, in the context of deepneural networks, aims chiefly providing shape variationand generating numerous online training data. Deep LSTMextends the data-set by applying affine transformations in-cluding scaling, rotations, and translations. Besides, strokejiggling is also used to generate local distortions to enrichour data with local diversity. Data augmentation is added inorder to prevent over-fitting, and to make the learning moreuniform. The neural network training is at the risk of over-fitting and hurting the classification robustness. One wayto deal with this problem is data augmentation where thetraining set is artificially augmented by adding replicas ofthe training samples under certain types of transformations

Fig. 10. Data Augmentation of Arabic letter ”haa” h .

that preserve the class labels. Data generated under suchlabel-preserving transformations will ameliorate the predic-tion invariance and generalization capacity pertaining to the

Page 9: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 9

neural networks. Data augmentation using label-preservingtransformations proved to be effective for neural networktraining in terms of making invariant predictions. The vari-ation in handwriting style represents a potential problemwith regards to online character. To resolve such problem,the concept of character distortion is introduced in orderto generate numerous training samples. In fact, characterdistortion is produced via applying an affine transformationand its variants to the training data. The distortion is appliedby performing flipping and rotation (changing the angleof orientation). Data augmentation maintained by differentscripts emanating from different writers who do not thesame style of writing.

Fig. 11. Distortion of digit 7.

4.3 Training NetworkTo improve the performance of two our proposed systems,various techniques are used that we will illustrate below.

4.3.1 Framewise/Crisper TrainingWe opt this technique of training with two proposed sys-tems OnHRS-LSTM and OnHR-convLSTM which each seg-mented trace or not segmented trace are belong only toone class(one character,one word). Each data belongs to one

Algorithm 1 Crisper Training Algorithmrepeat

for alldoonline script data (x, y, z) not segmented or

segmented into basic perceptual codes: each data belongsto one class

Use LSTM for traininguntil data=0Use the trained network to predict the labels

label. It means that Framewise method computes the forcedalignments of the input sequence with target output of thecorrect word/char sequence. We train the network to clas-sify each sequence individually, based on cross-entropy costfunction. [33] Algorithm 1 describes our method of training,which consists in the fact that one sequence correspondsonly to one predicted class.

4.3.2 Training with Fuzzy Ground TruthTo perform the network performance of our first proposedmodel, OnHRS-LSTM, each segmented data belongs to allclasses with certain membership. To define the value of

membership, we calculate the segmented data which has thelongest series of elliptic-strokes {li}. Then, we calculate theEuclidian distance between the length of class of referenceand the actual data segmented {lr} and {li}.

Algorithm 2 Training with Fuzzy Ground Truth Algorithmrepeat

for alldoonline script data (x, y, z) segmented into BE-

parameters, define {lr}Euclidian distance between {lr} and {li}Use LSTM for training

until data=0Use the trained network to predict the labels

TABLE 3Common Characters Confusions in Latin/Arabic Script.

Latin ScriptP m a n p i ep n c u b l c

Arabic Script� ¨ X �

¬ p P

���

¨

X

�� h ð

5 SINGLE CHARACTER EXPERIMENTS

The pipeline of our system 1 , OnHSR-LSTM, consists inobtaining the online script, segmenting it into strokes, andclassifying them into EPCs. After acquiring EPCs, we applyLSTM in order to obtain BPCs, and then eventually weapply LSTM in order to identify the character or shape orword corresponding of the initial script. The evaluation ofthe performance pertaining to our two proposed systems,OnHSR-LSTM & OnHR-convLSTM, relies to 5 multilingualdatabases.

5.1 Arabic DatabaseFactors do not generalize very well to a one , thus makingthe task of recognition difficult. Although Arabic is rankedthe fourth most spoken language in the world after Chinese,English and Spanish, to the best of our knowledge, there isworks in the field of Arabic text recognition, which pavethe way for exploration of such favorable area of research.Arabic script, written from right to left, is a cursive script.Arabic script, written from right to left is a cursive script inboth printed and handwritten versions. Comparing to Latinscripts, it involves of strokes and dots above and underletters, along with, the connectivity of characters. There are28 letters and the shape of those letters change dependingon their position in the word: preceded and/or followedby other letters or isolated. Arabic letters can take fourdifferent shapes depending on their position in the word(first, middle, last and isolated) as illustrated in the Figure 1.Some letters can take different forms, for example the letterGhayen (�

« , �

ª� ,

¨ ,

©�). We find that some Arabic letters

sets share the same shapes and only a dot makes a differencebetween them such as or (

�� , �) or (h. , p , h). Arabic

Page 10: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 10

TABLE 4Comparison of recognition rate of System Perceptual Codes-LSTM with other systems.

System Database Methods RR Framewise RR Fuzzy Ground TruthTagougui2014 LMCA+ manually segmented characters form ADAB DNN 82.12 % -Tagougui2014 LMCA+ manually Segmented characters form ADAB MLP NN /HMM system 96.45% -OnHSR-LSTM LMCA+ manually segmented characters form ADAB BCP+LSTM 91.50% 97.50%OnHSR-LSTM MAYASTROUN BCP+LSTM 94.50 % 98.5 %OnHSR-LSTM ADAB BCP+LSTM 90.00 % 96.00 %

writing recognition, it is a hard task since numerous lettersposses different numbers of positions and dots. In mostcases, Arabic writing is devoid of vowels. The meaningof the word is frequently determined by the context ofthe sentence. Thus, all vowels are not considered in ourwork. However, Latin script, is considered as the easiestscript, it serves the largest family of a total of 59 languages[3]. It composes of 26 characters. Our work based on thetheory of perception i.e.: all we perceive is a form includinghandwriting scripts. Handwriting Recognition is feasible,even if their constitutions is missing based on the RBC ap-proach [25], [34]. Handwriting is a visual scene perceived inorder to decode the contained message as any visual scenepresenting illusions. The existence of perceptual illusionsin handwriting makes its recognition, identification andunderstanding difficult. The main assumption of the pro-posed theory is that handwriting involves a concatenation ofperceptual codes grouped together so as to obtain a shape, acharacter or a digit. Hence, the perceptual segmentation2makes recognition easier by perceiving a character as agroup of BPC. In order to compare our proposed works,

Fig. 12. Arabic Alphabets and their forms at different positions.

we need to use the same lexicon which is Arabic in thissection, extracted from MAYSTROUN [34] , LMCA andmanually segmented from ADAB Database. MAYSTROUN,is composed of 400 letters, 300 digits, 200 words and 200Arabic texts written by 15 writers [28]. Kherallah et al. [35]used Circular trajectory modeling and NN for Arabic digitsrecognition, wherein each digit is repeated 1000 times, andthey obtained 95% as recognition rate. Compared to theresults of [12], we obtained better recognition by achieving98.5% of Arabic digit recognition. In Table 2 we compareour results with the results achieved by Tagougui et al. [2],[1]. In fact, with reference to Table 2, LMCA and manually

segmented from ADAB Database display noticeable perfor-mances as compared to other already developed methods.Indeed, LMCA (Lettres Mots Chiffres Arabes) database [19],[20] contains 30.000 shapes for ten digits, 100.000 shapes for56 Arabic letters and 500 Arabic words. 55 respondents werehosted to participate in the development of the handwrittenLMCA. We used 54 shapes instead of 56 Shapes. Some lettersare manually segmented Letters and ligatures, extractedfrom the four sets (a,b,c and d) of ADAB database [36]to recognize isolated online Arabic handwritten characters.To test our system OnHSR-LSTM, we use 24861 samplesfor training and test, such as digits and Arabic letters(beginning, middle and end of the word) incorporatingvarious styles of handwriting. The over-fitting can be re-duced considerably by randomly omitting half of the hiddenunits on each training case by using Dropout. Randomdropout yields remarkable improvements to our proposed.We perform our system 1 with Fuzzy Ground Truth in therange 94%to 98.2% as recognition rate. We attempt to usedifferent combinations of one BPC or character in orderto prevent the distortion of the model during the trainingstep. Table 6 reveals that OnHSR-LSTM, describes bothstatic and dynamic aspects of handwriting and by meansof perceptual codes we codify the scripts into sequence ofcodes, in the form of an equation. Moreover, it demonstratesthat the method of training fuzzy Ground Truth improvedthe recognition rate RR, yielding competitive results com-pared to similar works. Then, those results indicate thefact that the results increase intensively by applying dataaugmentation and dropout mechanisms. As shown in Table7, our Perceptual Codes-LSTM system achieves the bestrecognition rate, especially when we use CTC and FuzzyGround Truth methods during training. Table 7 boosts thefact that Fuzzy LSTM is endowed with a more efficientperformance,compared to framewise proposed approachesin terms of recognition rate.

5.2 Latin DatabaseThe problem with comparing online handwritten characterrecognition systems lies to the few number of availabledatabases. Accordingly, we find an old database UNIPEN[37] which is used frequently, however,its evaluation isrestricted to single Latin character recognition. It is usedin [3]. Thus, we attempt to compare it with our recognizersystem 1. The results are summarized in Table 8. We observethat the recognition rate is very competitive, especially withset 1(c), due to CTC and Fuzzy Ground Truth as well as forthe prepossessing applying in online script.So far, based on the aforementioned two precedent tables,

using fuzzy training gives rise to an enhancement interms of classification accuracy. In order to further ensure

Page 11: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 11

TABLE 5Comparison of recognition rate with Google system with UNIPEN set 1(c).

System Methods RR with Framewise RR with Fuzzy Ground Truth[3] Language model 94.9 % -

OnHSR-LSTM BCP+LSTM 90.50 % 97.5%

the strength of the proposed approaches, additional testsare performed on another database IRONOFF [38]. Thetesting results of recognition of digits are illustrated inTable 8. In fact, this table demonstrates that our system 2,OnHR-convLSTM, achieves efficient results compared toour system 1, OnHSR-LSTM. To allow a fair comparison

TABLE 6Comparison of recognition rate with IRONOFF Database.

System Methods Recognition RateOnHSR-LSTM BCP+LSTM 97.5%

OnHR-convLSTM convLSTM 98.00%

between framewise and fuzzy training, the performances ofthe two proposed recognizers is shown in the next figurewhich describes the evolution of predicted and test fuzzyground truth,illustrating that the two curves are close toeach other.

Fig. 13. Classes Predicted and Correct ones.

5.3 Our Mobile DatabaseBased on mobile application of [7], we created a mobiledatabase for acquired data from different persons with dif-ferent styles of writing in order to examine the performanceof our system. The (x, y, z) coordinates are stored in text fileof online written script.The online script can be presented as a variable lengthsequence.

input = [(x1, y1, z1), (x2, y2, z3), ..., (xn, yn, zn)] (8)

where {xi} and {yi} are the xy-coordinates of the penmovements and {zi} is 0 when the pen is up and is 1 whenthe pen is down.

Moreover, experiments were conducted on this mobiledatabase. Correspondingly, we remark that the results arevery competitive with reference to previously publishedresults, which presented in Table 8 [23] [39].

TABLE 7Comparison of recognition rate with previous results system with

Mobile Database .

System Methods Recognition RateOnHR-convLSTM ConvLSTM 96.50%

OnHSR-LSTM BCP+LSTM 98.40%

6 WORD RECOGNITION EXPERIMENTS

6.1 Word Recognition Experiments on ADABADAB is a database of Arabic online handwritten words[40]. To evaluate our two systems we used it and goodresults are achieved as describes table 10 below.Table 10shows that our system 2, OnHR-convLSTM, achieves bestresults compared to our system 1,OnHSR-LSTM .

TABLE 8Comparison of recognition rate with ADAB Database.

System Methods Recognition RateOnHSR-LSTM BCP+LSTM 96.50%

OnHR-convLSTM ConvLSTM 97.0%

6.2 Word Recognition Experiments on IRONOFFThe IRONOFF database [38] contains 31346 isolated wordfrom a 196 word lexicon (French and English). Although thedatabase contains both online and offlifne information ofthe handwriting signals, only the online information is usedfor our experiments. Table 9 reports the performance of ourrecognition systems Again, we achieve a very competitiverate with respect to other published results we are aware of.

TABLE 9Comparison of recognition rate with IRONOFF Database.

System Methods Recognition RateOnHSR-LSTM BCP+LSTM 93 %

OnHR-convLSTM ConvLSTM 98%

7 NOISE

In order to study the robustness of the proposed recognizersystem, a randomly noise signal is applied in both trainingand testing databases. Noise in handwriting can be ascribedto the support of writing, the person who write, and Parkin-son person who produces trembled script. Figure 15 illus-trates how our Beta-elliptic segmentation method eliminatestrembling from script. Table 10 demonstrates that LSTM

Page 12: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 12

Fig. 14. Interfaces of Mobile Database.

TABLE 10Recognition Rate with LMCA and manually segmented from ADAB

Database with noise .

System Methods Recognition RateOnHSR-LSTM BCP+LSTM 96.00 %

OnHR-convLSTM (x,y,z)+ConvLSTM 50.45%OnHR-convLSTM (x,y,z)+ConvLSTM +teta+4 EPCs 74.41%

with Beta-elliptic features yield best accuracy compared to(x,y,z), only because the Beta-elliptic method can improvethe online script and eliminate, displaying its robustnesswith respect to noise. Therefore, our perceptual method,based on its preprocessing step can improve the quality ofonline script via removing noise and applying sampling,which is not available with OnHR-convLSTM system thatneglects the preprocessing and improving step of the signalof the input handwritten script. However, OnHR-convLSTMwith teta+4 EPCs provides result compared to only (x,y,z).Thus, the elliptic model is not only a segmentation modelbut is also with its methods of preprocessing, capable of im-proving the quality of acquired script, thus yielding compet-itive recognition rate. Beta-elliptic model can eliminate trem-bling from script. But also can detect with its parametersvelocity and produce a different shape then a normal oneSo, Beta-elliptic a good solution to detect some assumptionsof trembling in handwriting and can assists in the decisionprocess leading to the diagnosis of some diseases related totrembles of handwriting or movements. Figure 16 illustratethe difference beta-elliptic points between normal script andtrembled script.

Fig. 15. Sample of input noise & output

Fig. 16. Sample of input noise & output word dhahab I. ëX

8 PERCEPTUAL ERRORS

The existence of perceptual illusions in handwriting makesits recognition, identification and understanding difficult.Figure 16 presents some examples of handwriting per-ceptual illusions. Those errors can be performed by ourrecognition language system. In Figure 16.(a) what do yousee? : This handwriting shape can be identified as a letter”O” or a digit ”0”, according to the context it appearsin. The presented handwriting perceptual illusions proved

Page 13: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 13

Fig. 17. Handwriting perceptual illusions : (a): the word he or be, (b): theletter B or the number 13, (e): the two Arabic words aalimon: Õæ

Ê« or

alimon : ÕæË @.

that finding evidence, whether or not humans go throughan identification process as a part of perception, has beendifficult.

9 CONCLUSION

Two systems for online handwriting recognition are pre-sented in this paper. The first is based on the fact thathandwriting is composed of a group of Perceptual Codesand the second is generative system based on convolutionalLSTM. We validate our system with three databases andmobile database, containing digits, letters and words inArabic language. Good results are obtained almost iden-tical to those produced by the human perceptual system.Encouraged obtained recognition rate is equal to 98.5%. Weplan to build and test our proposed systems recognition ina large-scale database and other scripts.The perspective ofimprovement of our method is located in order to reachthe stage of a sentence. Thanks to beta-elliptic model forhandwriting strokes generation, we obtain supplementaryinformation about dynamic domain. Furthermore, throughthis segmentation model, we can improve the quality ofacquired trace.

ACKNOWLEDGMENTS

The research leading to these results received funding fromthe Ministry of Higher Education and Scientific Researchof Tunisia under the grant agreement number LR11ES48.Moreover, this project was carried out under the MOBIDOCscheme, funded by the EU through the EMORI program andmanaged by the ANPR.

REFERENCES

[1] N. Tagougui and M. Kherallah, “Recognizing online arabichandwritten characters using a deep architecture,” Proc.SPIE, vol.10341, pp. 1– 5, 2017. [Online]. Available: https://doi.org/10.1117/12.2268419

[2] N. Tagougui, H. Boubaker, M. Kherallah, and A. M. Alimi,“A hybrid NN/HMM modeling technique for online arabichandwriting recognition,” CoRR, vol. abs/1401.0486, 2014.[Online]. Available: http://arxiv.org/abs/1401.0486

[3] D. Keysers, T. Deselaers, H. A. Rowley, L. L. Wang, and V. Car-bune, “Multi-language online handwriting recognition,” IEEETransactions on Pattern Analysis and Machine Intelligence, vol. 39,no. 6, pp. 1180–1194, June 2017.

[4] V. Ghods and M. K. Sohrabi, “Online farsi handwritten characterrecognition using hidden markov model,” JCP, vol. 11, pp. 169–175, 2016.

[5] Y. Hamdi, A. Chaabouni, B. Houcine, and A. M. Alimi, “Off-lexicon online arabic handwriting recognition using neural net-work,” pp. 1–5, 2016.

[6] W. Yang, L. Jin, Z. Xie, and Z. Feng, “Improved deep convolutionalneural network for online handwritten chinese character recogni-tion using domain-specific knowledge,” pp. 551–555, Aug 2015.

[7] J. Zhang, J. Du, and L. Dai, “A gru-based encoder-decoder ap-proach with attention for online handwritten mathematical ex-pression recognition,” 12 2017.

[8] X. Xiao, Y. Yang, T. Ahmad, L. Jin, and T. Chang, “Design of a verycompact cnn classifier for online handwritten chinese characterrecognition using dropweight and global pooling,” 2017 14thIAPR International Conference on Document Analysis and Recognition(ICDAR), vol. 01, pp. 891–895, 2017.

[9] X. Zhang, F. Yin, Y. Zhang, C. Liu, and Y. Bengio, “Drawing andrecognizing chinese characters with recurrent neural network,”IEEE Transactions on Pattern Analysis and Machine Intelligence,vol. 40, no. 4, pp. 849–862, April 2018.

[10] Y. M.Pranoto, E. Setyati, E. Pramana, Y. Kristian, and R. Budiman,“Real time handwriting recognition for mathematic expressionsusing hidden markov model,” pp. 1–6, 07 2016.

[11] V. Carbune, P. Gonnet, T. Deselaers, H. A. Rowley, A. N.Daryin, M. Calvo, L. Wang, D. Keysers, S. Feuz, andP. Gervais, “Fast multi-language lstm-based online handwritingrecognition,” CoRR, vol. abs/1902.10525, 2019. [Online]. Available:http://arxiv.org/abs/1902.10525

[12] G. Ren and V. Ganapathy, “Recognition of online handwritingwith variability on smart devices,” pp. 7605–7609, May 2019.

[13] N. Tagougui, M. Kherallah, and A. M. Alimi, “Online arabichandwriting recognition: A survey,” Int. J. Doc. Anal. Recognit.,vol. 16, no. 3, pp. 209–226, Sep. 2013. [Online]. Available:http://dx.doi.org/10.1007/s10032-012-0186-8

[14] C. C. Tappert, C. Y. Suen, , and T. Wakahara, “The state of the art inonline handwriting recognition,” IEEE Trans. Pattern Anal. Mach.Intell., vol. 12, p. 787808, August 1990.

[15] S. Jaeger, S. Manke, J. Reichert, and A. Waibel, “Online hand-writing recognition: The npen++ recognizer,” Int. J. Doc. Anal.Recognit., vol. 3, p. 169180, 2001.

[16] J. Schenk and G. Rigoll, “Novel hybrid nn/hmm modelling tech-niques for on-line handwriting recognition,” in Proc. Int. WorkshopFrontiers Handwriting Recognit., p. 619623, 2006.

[17] A. Graves, M. Liwicki, S. Fernandez, R. Bertolami, and H. B. andJ.Schmidhuber, “A novel connectionist system for unconstrainedhandwriting recognition,” IEEE Trans. Pattern Anal. Mach. Intell.,vol. 31, p. 855868, May 2009.

[18] S. Njah, H. Bezine, and A. M. Alimi, “A new encoding system:Application to on-line arabic handwriting,” pp. 451–456, 2010.

[19] S. Njah, M. Ltaief, H. Bezine, and A. M. Alimi, “The pertohs theoryfor on-line handwriting segmentation,” IJCSI, vol. 9, pp. 142–151,2012.

[20] S. Njah, H. Bezine, and A. M. Alimi, “On-line arabic handwritingsegmentation via perceptual codes: application to mayastroundatabase,” pp. 1–5, March 2011.

[21] J.R.Pinales, “Reconnaissance hors-ligne de lcriture cursive parlutilisation de modles perceptifs et neuronaux,” PhD, Universityof Paris, 2002.

[22] A.Malaviya, L.Peters, and P.Camposano, “A fuzzy online hand-writing recognition system: Fohres,” Proceedings of the InternationalConference on fuzzy theory and technology, pp. 1–15, 1993.

[23] H. Akouaydi, S. Abdelhedi, S. Njah, M. Zaied, and A. M. Alimi,“Decision trees based on perceptual codes for on-line arabic char-acter recognition,” no. 1-5, pp. 153–157, April 2017.

[24] M. Ltaief, S. Njah, H. Bezine, and A. M. Alimi, “Genetic algorithmsfor perceptual codes extraction,” International Journal of IntelligentLearning Systems and Applications, JILSA, pp. 256–265, 2012.

[25] S. Njah, H. Bezine, and A. M. Alimi, “A new approach for theextraction of handwriting perceptual codes using fuzzy logic,” pp.302–307, 2008.

[26] H.Bezine, A.M.Alimi, and N.Derbel, “A developmental theory ofthe relation between geometry and kinematics in handwriting,”IEEE International Conference Systems, Man and Cybernetics, p. 5,2002.

[27] M.Ltaief, H.Bezine, and A.M.Alimi, “A spiking neural networkmodel for complex handwriting movements generation,” Interna-

Page 14: JOURNAL OF LA Neural Architecture based on Fuzzy

JOURNAL OF LATEX CLASS FILES, VOL. , NO. , MAY 2019 14

tional Journal of Computer Science and Information Security 14, p. 7,2016.

[28] S. Njah, H. Bezine, and A. M. Alimi, “A fuzzy genetic systemfor segmentation of on-line handwriting: Application to adabdatabase,” pp. 95–102, April 2011.

[29] T.Dhieb, W.Ouarda, H.Boubaker, M.B.Hlima, and A. M. Alimi,“Online arabic writer identification based on beta-elliptic model,”pp. 74–79, 2015.

[30] T.Dhieb, W.Ouarda, H.Boubaker, and A. M. Alimi, “Beta-ellipticmodel for writer identification from online arabic handwriting,”vol. 11, p. 6, 2016.

[31] T.Dhieb, S.Njah, W.Ouarda, H.Boubaker, M.B.Ayed, and A. M.Alimi, “An online writer identification system based on beta-elliptic model and fuzzy elementary perceptual codes,” p. 12, 2018.

[32] G. Alex, F. Santiago, G. Faustino, and S. Jurgen, “Connectionisttemporal classification: Labelling unsegmented sequence datawith recurrent neural networks,” New York, NY, USA, pp.369–376, 2006. [Online]. Available: http://doi.acm.org/10.1145/1143844.1143891

[33] T. Bluche, H. Ney, J. Louradour, and C. Kermorvant, “Framewiseand ctc training of neural networks for handwriting recognition,”pp. 81–85, Aug 2015.

[34] S. Njah, B. B. Nouma, H. Bezine, and A. M. Alimi, “Mayastroun:A multilanguage handwriting database,” pp. 308–312, Sept 2012.

[35] M. Kherallah, S. Njah, A. M. Alimi, and N. Derbel, “Recognition ofon-line handwritten digits by neural networks using circular andbeta approaches,” vol. 2, pp. 164–169, Oct 2002.

[36] M.Kherallah, A.Elbaati, H.ElAbed, and A.M.Alimi, “The on/off(lmca) dual arabic handwriting database,” 2008.

[37] I. Guyon, L. Schomaker, R. Plamondon, M. Liberman, and S. Janet,“Unipen project of on-line data exchange and recognizer bench-marks,” vol. 2, pp. 29–33 vol.2, Oct 1994.

[38] V. Gaudin, P. Lallican, S. Knerr, and P. Binter, “The ireste on/off(ironoff) dual handwriting database,” p. 455 458, 1999.

[39] H.Akouaydi, S.Njah, and A. M. Alimi, “Android applicationfor handwriting segmentation using pertohs theory,” Proc.SPIE,vol. 10341, pp. 10 341 – 10 341 – 5, 2016. [Online]. Available:https://doi.org/10.1117/12.2269059

[40] H. E. Abed, M. Kherallah, V. Mrgner, and A. M. Alimi, “On-linearabic handwriting recognition competition, adab database andparticipating systems,” International Journal on Document Analysisand Recognition, vol. 14, pp. 15–23, 2011.

Hanen Akouaydi currently a PhD student incomputer science at the National School of En-gineers of Sfax. She obtained her engineeringdegree in Computer Science from Higher Insti-tute of Multimedia Arts of Manouba in 2012.Her research interests include online handwrit-ing recognition and deep learning. She is amember of the IEEE Student and the IEEE Com-puter Society and affiliate to REGIM Laboratory.

Sourour Njah currently an Assistant Professorat the High Institute of Commerce of Sfax. Sheearned the M.S. degree in Computer Sciencefrom Faculty of Economics and Management ofSfax in 2003 and the Ph.D. degree in ComputerEngineering from the National School of Engi-neers of Sfax in 2013. Her research domainsare cursive handwriting analysis and fuzzy logic.She is a member of the IEEE and affiliate toREGIM Laboratory.

Wael Ouarda is currently a Postdoctoral Re-searcher at the National School of Engineersof Sfax, Tunisia. He received earned the hisM.S. degree in Computer Science from the INSALyon, France in 2010 and the Ph.D. degree inComputer Engineering from the National Schoolof Engineers of Sfax in 2016. His research fo-cused on Bio- metric systems, Information Fu-sion and Pattern Recognition. He is a member ofthe IEEE and he is affiliate to REGIM Laboratory.

Anis Samet the director of the site SIFASTin Tunisia since 2013. He received his M.S.in IT applied to management from the Facultyof Management of Sfax in 2004. His work fo-cused on Team management, E-learning, Qual-ity and process management,Business Intelli-gence,Piloting of teams,Business Unit, site, Pi-loting projects, etc.

Thameur Dhieb presently a PhD student ofcomputer science at the Higher Institute of Com-puter Sciences and Communication Techniquesof Hammam Sousse. He obtained his M.S. de-gree in Computer Science from the Faculty ofEconomics and Management of Sfax in 2012.His research interests include writer identifica-tion and signature verification. He is a memberof the IEEE Student and the IEEE ComputerSociety and affiliate to REGIM Laboratory.

Mourad Zaied actually a Professor at the Na-tional School of Engineers of Gabes. He earnedthe Ph.D. degree and then the HDR both in Elec-trical and Computer Engineering in 2013. He isIEEE Senior member since 2000. He receivedhis HDR, and his Ph.D degrees in ComputerEngineering and the Master of science from theNational Engineering School of Sfax respectivelyin 2013,2008 and in 2003. He obtained the de-gree of Computer Engineer from the NationalEngineering School of Monastir in 1995. Since

1997 he served in several institutes and faculties in the university ofGabes as teaching assistant. His research interests include ComputerVision and Image and video analysis. These research activities arecentered on Wavelets and Wavelet networks and their applications todata classification and approximation, pattern recognition and image,audio and video coding and indexing.

Adel M. Alimi currently a Professor at the Na-tional School of Engineers of Sfax. He obtainedhis Ph.D. degree and then his HDR both in Elec-trical and Computer Engineering in 1995 and2000 respectively. His research interest includesapplications of intelligent methods and visionsystems. He is a guest editor of several specialissues of international journals like Soft Comput-ing. He is IEEE Senior member since 2000 andhe is the director of REGIM Laboratory.