07. Anonymization of scanned documents and images

Automatically anonymize scanned documents (including handwritten ones), photos and images using OCR* (Optical Character Recognition).

Read this article in Polish.


OCR (Optical Character Recognition) is a technology that converts text stored in image format (in document scans, photos) into editable and searchable data. With OCR, we can work with documents as if they were created in a text editor.


Automatic redaction of scans, photos, and images works even when the text in the document is arranged in different directions.

Optical character recognition starts automatically as soon as the document is uploaded and is applied to the entire file, so no further action is required from you. While it runs, the document is marked with an "OCR in progress" label.


The "OCR'd" label means that the document is ready for anonymization using the automatic "Search and Redact" and redaction patterns features.

✅ Automatically anonymize scans, including low-quality ones.

✅ Redact document scans with rotated pages, regardless of the text layout.

✅ Redact files containing text in multiple languages.

✅ Automatically redact stamps in your documents.

✅ Anonymize handwritten text in 9 languages.

List of languages supported for handwritten text:

English               Chinese Simplified    French
German                Italian               Japanese
Korean                Portuguese            Spanish

List of languages supported for printed text:

Afrikaans                     Hani                          Nepali
Albanian                      Haryanvi                      Niuean
Angika (Devanagari)           Hawaiian                      Nogay
Arabic                        Hindi                         Northern Sami (Latin)
Asturian                      Hmong Daw (Latin)             Norwegian
Awadhi-Hindi (Devanagari)     Ho (Devanagari)               Occitan
Azerbaijani (Latin)           Hungarian                     Ossetic
Bagheli                       Icelandic                     Pashto
Basque                        Inari Sami                    Persian
Belarusian (Cyrillic)         Indonesian                    Polish
Belarusian (Latin)            Interlingua                   Portuguese
Bhojpuri-Hindi (Devanagari)   Inuktitut (Latin)             Punjabi (Arabic)
Bislama                       Irish                         Ripuarian
Bodo (Devanagari)             Italian                       Romanian
Bosnian (Latin)               Japanese                      Romansh
Brajbha                       Jaunsari (Devanagari)         Russian
Breton                        Javanese                      Sadri (Devanagari)
Bulgarian                     Kabuverdianu                  Samoan (Latin)
Bundeli                       Kachin (Latin)                Sanskrit (Devanagari)
Buryat (Cyrillic)             Kangri (Devanagari)           Santali (Devanagari)
Catalan                       Karachay-Balkar               Scots
Cebuano                       Kara-Kalpak (Cyrillic)        Scottish Gaelic
Chamling                      Kara-Kalpak (Latin)           Serbian (Latin)
Chamorro                      Kashubian                     Sherpa (Devanagari)
Chhattisgarhi (Devanagari)    Kazakh (Cyrillic)             Sirmauri (Devanagari)
Chinese Simplified            Kazakh (Latin)                Skolt Sami
Chinese Traditional           Khaling                       Slovak
Cornish                       Khasi                         Slovenian
Corsican                      K'iche'                       Somali (Arabic)
Crimean Tatar (Latin)         Korean                        Southern Sami
Croatian                      Korku                         Spanish
Czech                         Koryak                        Swahili (Latin)
Danish                        Kosraean                      Swedish
Dari                          Kumyk (Cyrillic)              Tajik (Cyrillic)
Dhimal (Devanagari)           Kurdish (Arabic)              Tatar (Latin)
Dogri (Devanagari)            Kurdish (Latin)               Tetum
Dutch                         Kurukh (Devanagari)           Thangmi
English                       Kyrgyz (Cyrillic)             Tongan
Erzya (Cyrillic)              Lakota                        Turkish
Estonian                      Latin                         Turkmen (Latin)
Faroese                       Lithuanian                    Tuvan
Fijian                        Lower Sorbian                 Upper Sorbian
Filipino                      Lule Sami                     Urdu
Finnish                       Luxembourgish                 Uyghur (Arabic)
French                        Mahasu Pahari (Devanagari)    Uzbek (Arabic)
Friulian                      Malay (Latin)                 Uzbek (Cyrillic)
Gagauz (Latin)                Maltese                       Uzbek (Latin)
Galician                      Malto (Devanagari)            Volapük
German                        Manx                          Walser
Gilbertese                    Maori                         Welsh
Gondi (Devanagari)            Marathi                       Western Frisian
Greenlandic                   Mongolian (Cyrillic)          Yucatec Maya
Gurung (Devanagari)           Montenegrin (Cyrillic)        Zhuang
Haitian Creole                Montenegrin (Latin)           Zulu
Halbi (Devanagari)            Neapolitan

*Note: For contracts signed before 1 June 2025, the OCR function is disabled. To activate it, please contact our team.


Do you have additional questions?


#FORDATAteam is for you.

Contact us by email at support@fordatagroup.com

or by phone:

EMEA +44 204 584 3861

APAC +852 21 582 983

Americas +1 917 779 9339
