I have been working on extracting text from images of seven-segment displays using .NET. Unfortunately, my attempts with popular libraries such as Tesseract and IronOcr have been unsuccessful: they seem to work well only with standard English fonts.
Here's a brief overview of what I have tried so far:

1. Tesseract (limited to standard English fonts; unable to recognize seven-segment characters)
2. IronOcr (similar limitations; not suitable for seven-segment fonts)
3. LEADTOOLS
4. Pretrained models
5. Custom-trained models
6. Some MATLAB and Python projects from the internet
7. Some free OCR API providers

Despite these efforts, I still can't accurately extract text from images with seven-segment fonts.

Link to Image Dataset Folder
Additionally, I've experimented with image processing techniques, including:

- Cropping and zooming to the text region.
- Applying grayscale, black-and-white, and binarization filters.
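For reference, the preprocessing steps listed above can be sketched roughly as follows. This is not the code I actually ran (that was in .NET); it is a minimal, dependency-free Python illustration. The function names (`to_gray`, `crop`, `binarize`), the fixed threshold of 128, and the representation of an image as nested lists of RGB tuples are all assumptions made for the example.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each 0..255


def to_gray(image: List[List[Pixel]]) -> List[List[int]]:
    """Convert RGB pixels to grayscale using the common luminance weights."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in image
    ]


def crop(gray: List[List[int]], top: int, left: int,
         height: int, width: int) -> List[List[int]]:
    """Keep only the rectangle that contains the digits."""
    return [row[left:left + width] for row in gray[top:top + height]]


def binarize(gray: List[List[int]], threshold: int = 128) -> List[List[int]]:
    """Map each pixel to 1 (at or above threshold) or 0 (below it)."""
    return [[1 if value >= threshold else 0 for value in row] for row in gray]


if __name__ == "__main__":
    # Tiny hypothetical 2x3 image, only to show the call order.
    image = [
        [(0, 0, 0), (255, 255, 255), (255, 0, 0)],
        [(0, 255, 0), (0, 0, 255), (200, 200, 200)],
    ]
    gray = to_gray(image)
    region = crop(gray, top=0, left=1, height=2, width=2)
    print(binarize(region))
```

A fixed threshold is the simplest choice; on unevenly lit display photos it may well need to be tuned per image or replaced with an adaptive method.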