AIN VLM - Vision Language Model OCR
Advanced OCR using a Vision Language Model (VLM) for accurate text extraction
Powered by MBZUAI/AIN - Specialized for understanding and extracting text from images
How it works: Upload an image containing text, click "Process Image", and get the extracted text.
The VLM uses context to interpret the image and can handle handwritten text better than traditional OCR models.
Image Resolution Settings
Controls the visual token range (4-16384): a higher range preserves more image detail, a lower range processes faster.
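
To make the quality/speed trade-off concrete, here is a minimal Python sketch of how a visual token range could translate into a pixel budget. It assumes the Qwen2-VL convention in which one visual token covers a 28x28 pixel patch; whether this demo uses exactly that mapping is an assumption, and the function names are hypothetical, not part of the app.

```python
# Assumption: one visual token covers a 28x28 pixel patch (Qwen2-VL convention).
PATCH_SIDE = 28
MIN_TOKENS, MAX_TOKENS = 4, 16384  # token range stated in the settings text


def token_range_to_pixels(min_tokens, max_tokens, patch_side=PATCH_SIDE):
    """Convert a visual token range into a (min_pixels, max_pixels) budget."""
    if not (MIN_TOKENS <= min_tokens <= max_tokens <= MAX_TOKENS):
        raise ValueError("token range must satisfy 4 <= min <= max <= 16384")
    patch_area = patch_side * patch_side
    return min_tokens * patch_area, max_tokens * patch_area


def estimate_visual_tokens(width, height, min_pixels, max_pixels,
                           patch_side=PATCH_SIDE):
    """Roughly estimate the token count after the image area is clamped
    into the pixel budget. Ignores aspect-ratio rounding."""
    area = width * height
    clamped = min(max(area, min_pixels), max_pixels)
    return max(1, round(clamped / (patch_side * patch_side)))


# Hypothetical example: a 256-1280 token range applied to a 1920x1080 image.
min_px, max_px = token_range_to_pixels(256, 1280)
tokens = estimate_visual_tokens(1920, 1080, min_px, max_px)
print(min_px, max_px, tokens)
```

Under these assumptions, a large image is scaled down to fit the upper bound, so raising the maximum keeps more detail at the cost of more tokens to process.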
Ready to process images
Example Images
Click on any example below to load it