Engineering Applications of Artificial Intelligence 95 (2020) 103912 Contents lists available at ScienceDirect Engineering Applications of Artificial Intelligence journal homepage: www.elsevier.com/locate/engappai A deep learning framework for text-independent writer identification Malihe Javidi, Mahdi Jampour Quchan University of Technology, Quchan, Iran ARTICLE INFO Keywords: Enhanced ResNet Deep residual networks Text-independent Handwriting writer recognition ABSTRACT Handwriting Writer Identification (HWI) refers to the process of handwriting text image analysis to identify the authorship of the documents. It has yielded promising results in various applications, including digital forensics, criminal purposes, exploring the writer of historical documents, etc. The complexity of the text image, especially in images with various handwriting makes the writer identification difficult. In this work, we propose an end-to-end system that relies on a straightforward yet well-designed deep network and very efficient feature extraction, emphasizing feature engineering. Our system is an extended version of ResNet by conjugating deep residual networks and a new traditional yet high-quality handwriting descriptor towards handwriting analysis. Our descriptor analyzes the handwriting thickness as a preliminary and essential feature for human handwriting characteristics. Our approach can also provide text-independent writer identification that we do not need to have the same handwriting content for learning our model. The proposed approach is evaluated and achieved consistent results on four public and well-known datasets of IAM, Firemaker, CVL, and CERUG-EN. We empirically demonstrate that our conjugated network outperforms the original ResNet, and it can work well for real-world applications in which patches with few letters exist. 1. Introduction Handwriting is an individual human characteristic that represents the writer’s psychological state during the writing and can prove a person’s authenticity through its pattern analysis. It has many potential applications such as digital forensics, criminal purposes, exploring the writer of historical documents, etc. Handwriting writer identification refers to finding an author from his/her handwriting documents among numerous writers’ documents. This identifying is possible due to the handwriting style, which has multiple specific features that show the author’s personality. The features are including the shape of letters, spacing between letters, the slope of letters, cursive or separated writ- ing, rhythmic repetition of the elements, the pressure to the paper, the size of letters, the thickness of letters, etc. (Khan et al., 2019). Since a human can intelligently discover a bunch of above features, modern AI systems focus on proposing approaches that can imitatively identify human from handwriting images that is an offline handwriting writer identification system. In contrast, an online writer identification system analyzes a script during the writing. Therefore, it needs expensive peripheral devices like a digital pen to record vital information such as writing speed, chaotic behavior, etc. Of course, more information helps to achieve a better identification rate. Handwriting writer identification has numerous employment. While handwriting analysis is useful for guessing the author of the historic or non-historic documents based on the recognized scripts (De Stefano Corresponding author. E-mail address: jampour@qiet.ac.ir (M. Jampour). et al., 2018), it is also pretty beneficial for forensics purposes like writer identification or verification due to carry the personality of the authors. Handwriting writer identification can also be divided into two branches of text-depended and text-independent categories. Both approaches find the most similar handwriting (author label) from a pre-defined gallery compared to the input test handwriting sample. Nevertheless, the former refers to the methods that use the same text for discovering author properties, while the latter is not sensitive to the analyzed text content (Xiong et al., 2017). This means that the content of the test and train texts must not be needed to be the same for all authors. Therefore, only the second category can be used for applications like exploring writers of historical documents, although both are useful for forensic purposes. In contrast to the identification, writer verification refers to confirming the similarity of two scripts (items). In handwriting analysis, while we know who the author of a document is, we need to verify whether the claimed person is correct or not? Similar to other verification biometrics like signature verification as well as fingerprint verification. Of course, the identification is more complicated than verification because the authentication system needs to find the similarity of input sample from thousands of examples in the gallery, which means that the identification is a process of finding one from N while verification is confirming 1 by 1. Although, several approaches achieved excellent performance on different writer identification datasets; they are still far to address this problem in more challenging datasets or real-world applications. https://doi.org/10.1016/j.engappai.2020.103912 Received 27 February 2020; Received in revised form 26 July 2020; Accepted 23 August 2020 Available online 28 August 2020 0952-1976/© 2020 Elsevier Ltd. All rights reserved.