How I can specify specific segmentation mode for tesseract that Layoutlm should use?

#19

by anemilentsau - opened Nov 4, 2023

Nov 4, 2023

Hi,
I am having an issue that default segmentation model for tesseract fails to extract properly the text from my document that leads to Layoutlm being incapable to provide correct answers. The proper segmentation mode shall be --psm 6. Is there a way to specify the tesseract segmentation mode that nlp pipeline for the question answering should use?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment