Dyson Dunbar

Search

❯

❯

ColPaliGemma (2024-07-25)

ColPaliGemma (2024-07-25)

Aug 03, 20241 min read

Started work on a OCR model that uses surya, PaliGemma, and ColPali to extract the text from a PDF.

The general idea is as follows:

Use Surya to identify and isolate all of the text/ paragraphs individually
use PaliGemma to get the text from the image (TODO: compare quality against surya OCR)
Use ColPali to check to see how closely the transcribed text matches the output

Model Info

End

Here’s an image for your time

Graph View

Model Info
End

Backlinks

No backlinks found

Created with Quartz v4.2.4 © 2024

GitHub
Discord Community