NVIDIA's Nemotron OCR v2: How Synthetic Data Built a Multilingual Vision PowerhouseNVIDIA just open-sourced a state-of-the-art OCR model trained almost entirely on synthetic data. Here's why that matters for the future of vision-language models.#synthetic-data#ocr#vision-language-models#multilingual#open-source