dawg2wordlist (1) Linux Manual Page
dawg2wordlist – convert a Tesseract DAWG to a wordlist
Synopsis
dawg2wordlist UNICHARSET DAWG WORDLIST
Description
dawg2wordlist(1) converts a Tesseract Directed Acyclic Word Graph (DAWG) to a list of words using a unicharset as key.
Options
UNICHARSET The unicharset of the language. This is the unicharset generated by mftraining(1).
DAWG The input DAWG, created by wordlist2dawg(1).
WORDLIST Plain text (output) file in UTF-8, one word per line.
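The three positional arguments can be illustrated with a small wrapper. This is only a sketch: the function name dawg_to_wordlist is hypothetical and not part of Tesseract, and the file names in the comments are illustrative. It checks that exactly three arguments were given and that both input files are readable, then invokes dawg2wordlist in the order shown in the synopsis.

```shell
#!/bin/sh
# Hypothetical helper (not part of Tesseract): validates its arguments,
# then calls dawg2wordlist with UNICHARSET DAWG WORDLIST, in that order.
dawg_to_wordlist() {
    if [ "$#" -ne 3 ]; then
        echo "usage: dawg_to_wordlist UNICHARSET DAWG WORDLIST" >&2
        return 2
    fi

    unicharset=$1   # e.g. eng.unicharset (illustrative name)
    dawg=$2         # e.g. eng.word-dawg (illustrative name)
    wordlist=$3     # output file: UTF-8 text, one word per line

    # Both inputs must exist and be readable before the tool is run.
    for f in "$unicharset" "$dawg"; do
        if [ ! -r "$f" ]; then
            echo "cannot read input file: $f" >&2
            return 1
        fi
    done

    dawg2wordlist "$unicharset" "$dawg" "$wordlist"
}
```

Assuming the illustrative files exist, a call such as dawg_to_wordlist eng.unicharset eng.word-dawg eng.wordlist would write the recovered words to eng.wordlist.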
See Also
tesseract(1), mftraining(1), wordlist2dawg(1), unicharset(5), combine_tessdata(1)
https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract
