07. Anonymization of scanned documents and images
Automatically anonymize scanned documents (including handwritten ones), photos and images using OCR* (Optical Character Recognition).
OCR (Optical Character Recognition) is a technology that converts text stored in image format (in document scans, photos) into editable and searchable data. With OCR, we can work with documents as if they were created in a text editor.
Automatic redaction of scans, photos, and images works even when the text in the document is arranged in different directions. OCR processes the entire file, ensuring full functionality without requiring any additional actions from you.
The optical character recognition is applied to the entire document automatically after it is uploaded and does not require any further actions from you in the system. The process begins automatically once the document is uploaded, and an "OCR in progress" label will appear in the system.
The "OCR'd" label means that the document is ready for anonymization using the automatic "Search and Redact" and redaction patterns features.
✅ Automatically anonymize scans, including those with low quality.
✅Redact document scans with rotated pages, regardless of the text layout.
✅ Redact files containing text in multiple languages.
✅ Automatically redact stamps in your documents.
✅ Anonymize handwritten text in 9 languages.
List of languages supported for handwritten text:
English | Chinese Simplified | French |
German | Italian | Japanese |
Korean | Portugese | Spanish |
List of languages supported for printed text:
Afrikaans | Hani | Nepali |
Albanian | Haryanvi | Niuean |
Angika (Devanagiri) | Hawaiian | Nogay |
Arabic | Hindi | Northern Sami (Latin) |
Asturian | Hmong Daw (Latin) | Norwegian |
Awadhi-Hindi (Devanagiri) | Ho(Devanagiri) | Occitan |
Azerbaijani (Latin) | Hungarian | Ossetic |
Bagheli | Icelandic | Pashto |
Basque | Inari Sami | Persian |
Belarusian (Cyrillic) | Indonesian | Polish |
Belarusian (Latin) | Interlingua | Portuguese |
Bhojpuri-Hindi (Devanagiri) | Inuktitut (Latin) | Punjabi (Arabic) |
Bislama | Irish | Ripuarian |
Bodo (Devanagiri) | Italian | Romanian |
Bosnian (Latin) | Japanese | Romansh |
Brajbha | Jaunsari (Devanagiri) | Russian |
Breton | Javanese | Sadri (Devanagiri) |
Bulgarian | Kabuverdianu | Samoan (Latin) |
Bundeli | Kachin (Latin) | Sanskrit (Devanagari) |
Buryat (Cyrillic) | Kangri (Devanagiri) | Santali(Devanagiri) |
Catalan | Karachay-Balkar | Scots |
Cebuano | Kara-Kalpak (Cyrillic) | Scottish Gaelic |
Chamling | Kara-Kalpak (Latin) | Serbian (Latin) |
Chamorro | Kashubian | Sherpa (Devanagiri) |
Chhattisgarhi (Devanagiri) | Kazakh (Cyrillic) | Sirmauri (Devanagiri) |
Chinese Simplified | Kazakh (Latin) | Skolt Sami |
Chinese Traditional | Khaling | Slovak |
Cornish | Khasi | Slovenian |
Corsican | K'iche' | Somali (Arabic) |
Crimean Tatar (Latin) | Korean | Southern Sami |
Croatian | Korku | Spanish |
Czech | Koryak | Swahili (Latin) |
Danish | Kosraean | Swedish |
Dari | Kumyk (Cyrillic) | Tajik (Cyrillic) |
Dhimal (Devanagiri) | Kurdish (Arabic) | Tatar (Latin) |
Dogri (Devanagiri) | Kurdish (Latin) | Tetum |
Dutch | Kurukh (Devanagiri) | Thangmi |
English | Kyrgyz (Cyrillic) | Tongan |
Erzya (Cyrillic) | Lakota | Turkish |
Estonian | Latin | Turkmen (Latin) |
Faroese | Lithuanian | Tuvan |
Fijian | Lower Sorbian | Upper Sorbian |
Filipino | Lule Sami | Urdu |
Finnish | Luxembourgish | Uyghur (Arabic) |
French | Mahasu Pahari (Devanagiri) | Uzbek (Arabic) |
Friulian | Malay (Latin) | Uzbek (Cyrillic) |
Gagauz (Latin) | Maltese | Uzbek (Latin) |
Galician | Malto (Devanagiri) | Volapük |
German | Manx | Walser |
Gilbertese | Maori | Welsh |
Gondi (Devanagiri) | Marathi | Western Frisian |
Greenlandic | Mongolian (Cyrillic) | Yucatec Maya |
Gurung (Devanagiri) | Montenegrin (Cyrillic) | Zhuang |
Haitian Creole | Montenegrin (Latin) | Zulu |
Halbi (Devanagiri) | Neapolitan |
*Note: For contracts signed before 1.06.2025, the OCR function is disabled. To activate it, please contact our team.
Do you have additional questions?
#FORDATAteam is for you.
Contact us via email support@fordatagroup.com
or at phone number:
EMEA +44 204 584 3861
APAC +852 21 582 983
Americas +1 917 779 9339