What is it about?

To collect handwritten samples of isolated Kurdish characters, each character was printed on an A4 sheet containing a 14 × 9 grid. Each sheet carries only one printed character, so that volunteers know which character to write on that sheet. Each sheet was then scanned, spliced, and cropped with a Photoshop macro to ensure that the same process was applied to all characters. The grids were filled in mainly by student volunteers from multiple universities in Erbil.
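The authors did this cropping with a Photoshop macro. For readers who want to reproduce a similar step in code, the following is a minimal sketch only, not the authors' method. It assumes the scanned page is already deskewed and trimmed to the grid, that the grid is 14 rows by 9 columns (the orientation is an assumption), and that the page is held as a plain list of pixel rows. The function names `cell_boxes`, `crop`, and `slice_page` are hypothetical.

```python
def cell_boxes(width, height, rows=14, cols=9):
    """Return (left, top, right, bottom) pixel boxes for every grid cell.

    Boxes are listed row by row. Integer division spreads any leftover
    pixels across cells, so the boxes tile the page with no gaps.
    The 14-row by 9-column orientation is an assumption.
    """
    boxes = []
    for r in range(rows):
        top = r * height // rows
        bottom = (r + 1) * height // rows
        for c in range(cols):
            left = c * width // cols
            right = (c + 1) * width // cols
            boxes.append((left, top, right, bottom))
    return boxes


def crop(page, box):
    """Cut one cell out of a page stored as a list of pixel rows."""
    left, top, right, bottom = box
    return [row[left:right] for row in page[top:bottom]]


def slice_page(page, rows=14, cols=9):
    """Split a scanned page into rows * cols cell images, row by row."""
    height = len(page)
    width = len(page[0]) if height else 0
    return [crop(page, box) for box in cell_boxes(width, height, rows, cols)]
```

Under these assumptions, `slice_page` returns 126 cell images per sheet, one per handwritten sample. A real scan would normally be loaded with an imaging library and might need deskewing and margin removal first, which this sketch does not attempt.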

Why is it important?

Value of the Data
• The dataset is suitable for machine learning models for handwriting recognition.
• It is useful to researchers interested in deep learning and machine learning for the Kurdish, Persian, and Arabic languages.
• The data can serve as a starting point for research on the more complex problems of joined-character and word recognition for this language.
• Because it is highly standardized (that is, very carefully sized and formatted), it can be used as a benchmark of quality and usability for future work.


This is a great collection of handwritten samples of separate Kurdish characters.

Professor Tarik A. Rashid
University of Kurdistan Hewler

Read the Original

This page is a summary of: An extensive dataset of handwritten central Kurdish isolated characters, Data in Brief, December 2021, Elsevier, DOI: 10.1016/j.dib.2021.107479.