Welcome, fellow learners! In this video, we'll explore how to combine two newly released open-source models to achieve better OCR results on low-quality scanned documents. The first model, AuraSR, is a GAN-based super-resolution model that enhances the quality of scanned document images. The second model is MiniCPM-V 2.6, a recently released multimodal LLM, which we'll use to extract text from the upscaled document images.
Notebook - colab.research...
MiniCPM-V 2.6 - huggingface.co...
AuraSR-v2 - huggingface.co...
#ocr #superresolution #aurasr #minicpm #documentscanning #machinelearning #deeplearning #opensource #imageprocessing #gan #llm #generativeadversarialnetworks #lowqualityimages #4xresolution
Негізгі бет Improving OCR on Low-Quality Documents with AuraSR-v2 and MiniCPM-V 2.6
Пікірлер: 14