Industry-leading accuracy and reliability are the driving forces behind Transym. Since 2002 we’ve refined TOCR to the point where it now also offers an impressive range of benefits for integrators.
Extensive lexicon of 45 different languages
At Transym, we use a lexicon which includes words and phrases from many languages, living or dead, to provide a single source of reference offering outstanding word accuracy and reliability. Lex improves accuracy by using the context of the character and those around it.
TOCR offers up to 99% accuracy in English, French, Italian, German, Dutch, Swedish, Norwegian, Finnish, Danish, Spanish, Portuguese, Russian, Belarusian, Bulgarian, Bosnian, Catalan, Czech, Greek, Estonian, Basque, Croatian, Hungarian, Icelandic, Lithuanian, Latvian, Macedonian, Polish, Romanian, Serbo-Croatian, Slovakian, Slovenian, Albanian, Serbian, Turkish, Ukrainian, Luxembourgish, Galician, Neapolitan, Lombardian, Sicilian, Piedmontese, West Frisian, West Flemish, Limburgian and Sami. You can see a full character map here.
In addition, on very poor quality documents, or where characters have been badly reproduced, TOCR will provide up to four suggested alternatives during word accuracy checking and carry on processing. This allows the document or batch to be completed, and the checking process performed, in the quickest time possible.
Auto-Lex allows TOCR to automatically decide whether to use the lexicon function for a given image.
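To illustrate the idea of lexicon-based alternatives described above, here is a minimal Python sketch. It is not TOCR's implementation or API: the `LEXICON` list, the `suggest_alternatives` function and the similarity cutoff are all hypothetical, and the matching uses Python's standard `difflib` module rather than character-level OCR confidences.

```python
import difflib

# Hypothetical mini-lexicon; a real lexicon would hold far more words.
LEXICON = ["invoice", "involve", "invoked", "voice", "novice", "office", "notice"]


def suggest_alternatives(word, lexicon=LEXICON, max_suggestions=4, cutoff=0.6):
    """Return [word] if it is in the lexicon, otherwise up to
    max_suggestions close matches, ordered from best to worst."""
    lowered = word.lower()
    if lowered in lexicon:
        return [lowered]
    # get_close_matches ranks candidates by similarity ratio and drops
    # anything scoring below the cutoff.
    return difflib.get_close_matches(lowered, lexicon,
                                     n=max_suggestions, cutoff=cutoff)


# A word with one misread character ("0" instead of "o").
print(suggest_alternatives("inv0ice"))
```

Capping the list at four mirrors the limit mentioned above; the cutoff simply keeps unrelated words out of the suggestions.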
Optimisation for poor backgrounds
The quality of the background (for example, photocopied, faxed or crumpled documents) can also have an impact on character recognition.
TOCR is tested and enhanced using extremes of light and dark backgrounds, deformation and speckle. It is hardened using a vast source of imperfect samples to train the software to identify text as opposed to background defects.
Colour Conversion
TOCR supports colour images by using colour conversion algorithms that convert colour images to monochrome.
You can choose between 9 different options to best suit the colour conversion of your documents. These options include:
- Desaturation
- Decomposition
- Luma BT 601
- Luma BT 709
- RGB
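As a rough illustration of how such conversions differ, the Python sketch below applies the commonly used definitions of these names to a single RGB pixel. These are assumptions based on general image-processing convention, not TOCR's documented behaviour: desaturation is taken as the average of the largest and smallest channel, decomposition as the maximum or minimum channel, and the two luma options use the standard BT.601 and BT.709 weights. The function names and the fixed threshold are hypothetical.

```python
def desaturate(r, g, b):
    """Lightness: midpoint of the brightest and darkest channel."""
    return (max(r, g, b) + min(r, g, b)) / 2


def decompose_max(r, g, b):
    """Maximum decomposition: keep the brightest channel."""
    return max(r, g, b)


def decompose_min(r, g, b):
    """Minimum decomposition: keep the darkest channel."""
    return min(r, g, b)


def luma_bt601(r, g, b):
    """Weighted sum using the ITU-R BT.601 luma coefficients."""
    return 0.299 * r + 0.587 * g + 0.114 * b


def luma_bt709(r, g, b):
    """Weighted sum using the ITU-R BT.709 luma coefficients."""
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def to_monochrome(grey, threshold=128):
    """Assumed simple fixed threshold: 1 for light pixels, 0 for dark."""
    return 1 if grey >= threshold else 0


# The same pure-green pixel lands on different grey levels, and so on
# different sides of the threshold, depending on the method chosen.
pixel = (0, 255, 0)
for method in (desaturate, decompose_max, decompose_min, luma_bt601, luma_bt709):
    grey = method(*pixel)
    print(method.__name__, round(grey, 1), to_monochrome(grey))
```

This is why the choice of option matters: coloured text on a coloured background can survive one conversion and vanish under another.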
Automatic orientation detection
TOCR automatically detects which way up the image or page has been scanned and delivers the recognised text the right way up.
Deskewing
Scanned documents often become skewed (slanted) during the scanning process due to alignment or feeding issues. This is not an issue for TOCR. TOCR will deskew your documents before processing to ensure the most accurate result possible.
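For readers curious how skew can be measured, the Python sketch below uses a projection-profile search, one common textbook approach. The source does not say which method TOCR uses, so this should not be read as its algorithm; the function names, the angle range and the step size are all assumptions made for illustration.

```python
import math
from collections import Counter


def profile_score(points, angle_deg):
    """Rotate foreground pixel coordinates by angle_deg and score how
    sharply they fall into rows (sum of squared row counts)."""
    theta = math.radians(angle_deg)
    sin_t, cos_t = math.sin(theta), math.cos(theta)
    rows = Counter(round(-x * sin_t + y * cos_t) for x, y in points)
    return sum(count * count for count in rows.values())


def estimate_skew(points, max_angle=10.0, step=0.5):
    """Try candidate angles from -max_angle to +max_angle and return the
    one whose row profile is sharpest, i.e. where text lines are level."""
    steps = int(max_angle / step)
    candidates = [i * step for i in range(-steps, steps + 1)]
    return max(candidates, key=lambda a: profile_score(points, a))


# Three synthetic "text lines" slanted by 3 degrees.
slope = math.tan(math.radians(3.0))
points = [(x, y0 + x * slope) for y0 in (0, 20, 40) for x in range(400)]
print(estimate_skew(points))
```

Once the angle is known, the page can be rotated back by that amount before recognition. A real scan would need a finer step and some noise handling.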
Scalable performance
Although TOCR only requires a single-processor PC running Windows 7 or later, for large-scale solutions it can be scaled to run on up to 255 processors on a single machine. Ideal for integrators.
TWAIN interface
TOCR’s simple TWAIN interface allows users to send an image to OCR directly from their image scans, with no intermediate steps.
PDF Support
TOCR now supports PDFs. Our new API supports extracting pages from PDFs and producing a bitmap (DIB) to be processed by TOCR. The results can then be saved as an appendix, which allows text within PDF images to be searchable.
Font Independent Recognition
Because recognised text is primarily used for searching or formatting, we don’t focus on the font that has been used. Instead, we have optimised TOCR to recognise the characters and allow you to set the font in which you wish to view or export the text.
For users with data in the OCRB font, there is an additional option to flag this font for more accurate processing, at the expense of accuracy with other fonts.
TOCR is now able to return font information: the nearest font face or family name from 50 installed fonts, as well as whether the text is italic or normal.
Exceptional support and assistance
Transym offer a personal level of support that few other companies can match. While we endeavour to make sure that our system is as easy to use and reliable as possible, our support team are on hand to answer any technical or account questions that you may have.