Computer Vision | Videos

CV algorithm development by the masses for the masses

ByRicardo November 3, 2025November 3, 2025

CV algorithm development by the masses for the masses

Learn extra

CV algorithm development by the masses for the masses

Enjoyed this video? Why not take a look at some associated studying 👇

Artificial Intelligence Computer Vision

GLM-4.1V-Thinking: Advancing General-Purpose Multimodal Understanding and Reasoning
ByRicardo July 18, 2025

Vision-language models (VLMs) play a crucial role in today’s intelligent systems by enabling a detailed understanding of visual content. The complexity of multimodal intelligence tasks has grown, ranging from scientific problem-solving to the development of autonomous agents. Current demands on VLMs have far exceeded simple visual content perception, with increasing attention on advanced reasoning. While…

Read More GLM-4.1V-Thinking: Advancing General-Purpose Multimodal Understanding and Reasoning
Computer Vision Editors Pick

This AI Paper Introduces MMSearch-R1: A Reinforcement Learning Framework for Efficient On-Demand Multimodal Search in LMMs
ByRicardo July 14, 2025

Large multimodal models (LMMs) enable systems to interpret images, answer visual questions, and retrieve factual information by combining multiple modalities. Their development has significantly advanced the capabilities of virtual assistants and AI systems used in real-world settings. However, even with massive training data, LMMs often overlook dynamic or evolving information, especially facts that emerge post-training…

Read More This AI Paper Introduces MMSearch-R1: A Reinforcement Learning Framework for Efficient On-Demand Multimodal Search in LMMs
Computer Vision Multimodal AI

From cloud to factory – humanoid robots coming to workplaces
ByRicardo January 11, 2026

The partnership announced this week between Microsoft and Hexagon Robotics marks an inflection point in the commercialisation of humanoid, AI-powered robots for industrial environments. The two companies will combine Microsoft’s cloud and AI infrastructure with Hexagon’s expertise in robotics, sensors, and spatial intelligence to advance the deployment of physical AI systems in real-world settings. At…

Read More From cloud to factory – humanoid robots coming to workplaces
Artificial Intelligence Computer Vision

Zhipu AI Releases GLM-4.6V: A 128K Context Vision Language Model with Native Tool Calling
ByRicardo December 12, 2025

Zhipu AI has open sourced the GLM-4.6V sequence as a pair of imaginative and prescient language fashions that deal with photographs, video and instruments as first-class inputs for brokers, not as afterthoughts bolted on high of textual content. Model lineup and context size The sequence has 2 fashions. GLM-4.6V is a 106B parameter basis mannequin…

Read More Zhipu AI Releases GLM-4.6V: A 128K Context Vision Language Model with Native Tool Calling
Artificial Intelligence Computer Vision

Black Forest Labs Releases FLUX.2 [klein]: Compact Flow Models for Interactive Visual Intelligence
ByRicardo January 18, 2026

Black Forest Labs releases FLUX.2 [klein], a compact image model family that targets interactive visual intelligence on consumer hardware. FLUX.2 [klein] extends the FLUX.2 line with sub second generation and editing, a unified architecture for text to image and image to image, and deployment options that range from local GPUs to cloud APIs, while keeping…

Read More Black Forest Labs Releases FLUX.2 [klein]: Compact Flow Models for Interactive Visual Intelligence
Artificial Intelligence Computer Vision

A Hands-On Coding Tutorial on Qualcomm AI Hub Models for Classification, Object Detection, and Hardware-Aware Deployment
ByRicardo June 5, 2026

In this tutorial, we work by way of an end-to-end workflow for Qualcomm AI Hub Models. We begin by organising the required package deal, discovering the out there mannequin assortment, and loading MobileNet-V2 for native PyTorch inference. We additionally deal with an essential input-shape problem by changing NHWC picture tensors into the NCHW format anticipated…

Read More A Hands-On Coding Tutorial on Qualcomm AI Hub Models for Classification, Object Detection, and Hardware-Aware Deployment