CurateClick

Tag: OCR

5 posts found

GLM-OCR is a 0.9B-parameter multimodal OCR model for complex document understanding. Learn how it achieves SOTA performance on OmniDocBench V1.5 (94.62) with structure-first outputs (Markdown, JSON, LaTeX), handling tables, formulas, and handwriting across 100+ languages. Open Apache-2.0 weights make it suitable for research, finance, legal, and developer workflows.
Complete guide to DeepSeek OCR 2: a 3B-parameter vision-language model whose DeepEncoder V2 follows a human-like reading order. Covers deployment via vLLM/Transformers/Unsloth, fine-tuning with an 86% language understanding improvement, performance benchmarks, and SOTA document understanding.
CurateClick Team
January 28, 2026
Discover HunyuanOCR, the end-to-end OCR model from Tencent, which can handle detection, recognition, parsing, translation, and more in one unified pipeline.
CurateClick Team
November 25, 2025
Discover DeepSeek OCR's revolutionary vision-text compression technology that reduces LLM token consumption by 10-20x while maintaining high accuracy. Complete guide with benchmarks, deployment tips, and practical applications.
CurateClick Team
October 22, 2025
Discover PaddleOCR-VL-0.9B, the #1-ranked model on the OmniDocBench V1.5 leaderboard. An ultra-lightweight 0.9B-parameter model that outperforms GPT-4o and Gemini 2.5 Pro in document parsing. Supports 109 languages and recognizes tables, formulas, and handwritten text.
CurateClick Team
October 17, 2025