
Computer Vision - IBM Research
Jun 26, 2025 · Computer Vision Modern computer vision systems have superhuman accuracy when it comes to image recognition and analysis, but they don’t really understand what they see. At IBM …
A Model for Estimating the Economic Costs of Computer Vision …
Feb 20, 2024 · In this paper, we connect technical insights from deep learning scaling laws and transfer learning with the economics of IT to propose a framework for estimating the cost of deep learning …
Artificial Intelligence - IBM Research
Feb 9, 2026 · AI for Code AI for Supply Chain AI Testing Automated AI Causality Computer Vision Conversational AI Explainable AI Fairness, Accountability, Transparency Foundation Models …
A new way to generate synthetic data for pretraining computer vision ...
Jun 23, 2022 · A new way to generate synthetic data for pretraining computer vision models IBM's Task2Sim churns out synthetic images tailored for specific AI tasks to reduce the need for real data. …
Inspecto – Large Vision Model Inspection Service - IBM Research
General AI for computer vision is experiencing a surge of innovation fueled by the advent of vision transformers and Large Vision Models (LVMs). At IBM Research, we extended this technology to …
IBM Granite Vision tops the chart for small AI models
Jun 26, 2025 · The latest Granite vision model recently came in second on the OCRBench leaderboard, and is the best-performing small model on the chart.
Workshop on Memory and Vision for ICCV 2025 - IBM Research
Oct 19, 2025 · At the same time, advances in computer vision, especially through generative AI, have enabled models to synthesize realistic imagery and understand complex scenes with remarkable …
IBM Granite now has eyes - IBM Research
Feb 26, 2025 · IBM’s new vision-language model for enterprise AI can extract knowledge locked away in tables, charts, and other graphics, bringing enterprises closer to automating a range of document …
Multimodal AI for Healthcare and Life Sciences - IBM Research
Our research includes advanced computer vision techniques that enable the automatic extraction of diagnostically relevant features in multimodal healthcare images. We are developing machine …
Terramind: the new generative AI model for Earth observation
Apr 22, 2025 · It is the first “any-to-any” multi-modal generative AI model for Earth observation. This means it can self-generate additional training data from other modalities — a technique IBM …