Version: 23.5

OCR Language Support

The Optical Character Recognition (OCR) engine is a versatile tool within Eggplant Functional built for working with dynamic text. This page outlines all languages supported for use with OCR.

Using the Language Property

Language dictionaries can be specified either in Text Preferences or inline with any OCR search in Eggplant Functional scripts by using the Language property. The OCR engine provides its own system dictionaries for the languages that have full built-in dictionary support. The property values listed in Supported OCR Language Properties are case-sensitive.

The Language property can be used for both reading and searching for text using OCR. For more on the difference between reading and searching for text, see How to Use OCR.

Examples:

// Using the Language property to find text (searching)
Click (Text:"Aubergine", Language:"French")
// Using the Language property to restrict the ReadText() function (reading).
// "TLImage" and "BRImage" are captured images that define a search rectangle
// by marking its top-left and bottom-right corners, respectively.
log ReadText(("TLImage","BRImage"), Language:"French")

Supported OCR Language Properties

Eggplant Functional comes with numerous languages out of the box. Additional languages are also available for purchase.

Custom OCR Dictionaries

In addition to selecting specific languages, you can use SenseTalk properties to customize the OCR engine dictionary. You can add specific words that you want text searches to recognize, and you can list words that you want to prohibit the OCR engine from recognizing. For complete information about creating a custom dictionary, see Customize the OCR Engine Dictionary.
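
As a rough illustration of combining a language with a custom word list, the sketch below assumes that the dictionary-related properties are named ExtraWords and ProhibitedWords and that they can be passed inline alongside Language. Those property names, and the on-screen word "Invoyce", are assumptions made for this example. Check Customize the OCR Engine Dictionary for the authoritative syntax.

```sensetalk
-- Assumption: ExtraWords adds a word the built-in dictionary does not contain.
-- "Invoyce" is a hypothetical product name shown on the system under test.
Click (Text:"Invoyce", Language:"English", ExtraWords:"Invoyce")

-- Assumption: ProhibitedWords names a word that OCR should not return.
-- "TLImage" and "BRImage" are captured images marking the search rectangle.
log ReadText(("TLImage","BRImage"), Language:"English", ProhibitedWords:"Invoice")
```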

Languages Supported

The table below lists all languages supported for use with OCR.

* Denotes Full Dictionary Support

Note: These predefined language properties are case-sensitive.

Abkhaz | Faeroese | Kurdish | Rundi
Adyghe | Fijian | Lak | Russian *
Afrikaans | Finnish * | Lappish | RussianOldSpelling *
Agul | French * | Latin * | RussianWithAccent *
Albanian | Frisian | Latvian * | Samoan
Altaic | Friulian | Lezgin | Selkup
Arabic | GaelicScottish | Lithuanian * | SerbianCyrillic
ArmenianEastern * | Gagauz | Luba | SerbianLatin
ArmenianGrabar * | Galician | Macedonian | Shona
ArmenianWestern * | Ganda | Malagasy | Sioux (Dakota)
Awar | German * | Malay | Slovak *
Aymara | GermanNewSpelling * | Malinke | Slovenian *
AzeriCyrillic | GermanLuxembourg | Maltese | Somali
AzeriLatin * | Greek * | Mansi | Sorbian
Bashkir * | Guarani | Maori | Sotho
Basque | Hani | Mari | Spanish *
Belarusian | Hausa | Maya | Sunda
Bemba | Hawaiian | Miao | Swahili
Blackfoot | Hebrew | Minankabaw | Swazi
Breton | Hungarian * | Mixed (Russian and English) * | Swedish *
Bugotu | Icelandic | Mohawk | Tabassaran
Bulgarian * | Ido | Moldavian | Tagalog
Buryat | Indonesian * | Mongol | Tahitian
Catalan * | Ingush | Mordvin | Tajik
Chamorro | Interlingua | Nahuatl | Tatar *
Chechen | Irish | Nenets | Tinpo (Jingpo)
ChinesePRC | Italian * | Nivkh | Tongan
ChineseTaiwan | Japanese * | Nogay | Tswana
Chukcha | Japanese+English * | Norweigan (NorvegianNynorsk and NorvegianBokmal) * | Tun
Chuvash | JapaneseModern | NorwegianBokmal * | Turkish *
Corsican | Kabardian | NorwegianNynorsk * | Turkmen
CrimeanTatar | Kalmyk | Nyanja | TurkmenLatin
Croatian * | KarachayBalkar | Occidental | Tuvin
Crow | Karakalpak | Ojibway | Udmurt
Czech * | Kasub | Ossetic | UighurCyrillic
Danish * | Kawa | Papiamento | UighurLatin
Dargwa | Kazakh | PidginEnglish (Tok Pisin language) | Ukrainian *
Dungan | Khakas | Polish * | UzbekCyrillic
Dutch * | Khanty | PortugueseBrazilian * | UzbekLatin
DutchBelgian | Kikuyu | PortugueseStandard * | Visayan (Cebuano)
English * | Kirgiz | Provencal | Welsh
EskimoCyrillic | Kongo | Quechua | Wolof
EskimoLatin | Korean * | RhaetoRomanic | Xhosa
Esperanto | KoreanHangul * | Romanian * | Yakut
Estonian * | Koryak | RomanianMoldavia | Zapotec
Even | Kpelle | Romany | Zulu
Evenki | Kumyk | Ruanda |
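
Because the property values are case-sensitive, it can help to copy them from the table above exactly as written. The following is a minimal sketch using two of the listed values. The search text "Entrar" is a hypothetical on-screen label, and "TLImage" and "BRImage" are assumed to be captured images marking the search rectangle.

```sensetalk
// Multi-word values are written as a single word with internal capitals
Click (Text:"Entrar", Language:"PortugueseBrazilian")
log ReadText(("TLImage","BRImage"), Language:"GermanNewSpelling")

// A value such as "portuguesebrazilian" does not match the listed spelling
```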

Eggplant Functional scripts also recognize the keywords listed in Other Supported Keywords as predefined language properties.

Other Supported Keywords

Basic | CMC7 | E13B | Pascal
C++ | Cobol | Fortran | OCRA
Chemistry | Digits | Java | OCRB
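
Since these keywords are treated as predefined language properties, one plausible way to use them is to pass a keyword to the Language property just as you would a language name. The sketch below assumes that usage with the Digits keyword. The variable name orderTotal is hypothetical, and "TLImage" and "BRImage" are assumed to be captured images marking the search rectangle.

```sensetalk
-- Assumption: a keyword from the table is passed through the Language property
put ReadText(("TLImage","BRImage"), Language:"Digits") into orderTotal
log orderTotal
```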