A high-performance Optical Character Recognition engine for Khmer script — trained on 3 million text lines, powered by deep learning, runs everywhere.
KhmerOCR is like Google Lens, but built specifically for Cambodia's language. Give it a photo of any Khmer document — it gives you back editable digital text.
ព្រះរាជាណាចក្រកម្ពុជា → "Kingdom of Cambodia"Khmer script is uniquely complex — stacked consonants, vowels above and below letters, 800+ font variations. Generic OCR tools like Tesseract fail badly on Khmer. This model was trained from scratch on 3 million real Khmer text lines to solve this properly.
From raw image to clean Khmer text — 10 steps, two AI models, one clean output.
Both models are in ONNX format — run anywhere, on any device, without framework dependencies.
Non-Maximum Suppression (NMS) — After detection, many overlapping boxes cover the same word. NMS keeps only the best one.
CTC Decoding — The recognition model outputs character probabilities at every time step. CTC turns this noisy sequence into clean text.
Line Sorting — After NMS, boxes must be grouped into reading lines and sorted in natural reading order.
Image Preprocessing — Raw images must be transformed before feeding into the neural network.
How data moves through the entire system from input to output.