๐Ÿ” AIN VLM - Vision Language Model OCR

Advanced OCR using Vision Language Model (VLM) for accurate text extraction

Powered by MBZUAI/AIN - Specialized for understanding and extracting text from images

โ„น๏ธ How it works: Upload an image containing text, click "Process Image", and get the extracted text. The VLM model intelligently understands context and can handle handwritten text better than traditional OCR models.
512 4096

๐Ÿ“ Image Resolution Settings

Controls visual token range (4-16384) - balance quality vs speed

โœจ Ready to process images

๐Ÿ“š Example Images

Click on any example below to load it