Baidu Releases ERNIE-4.5-VL-28B-A3B-Thinking: An Open-Source and Compact Multimodal Reasoning Model Under the ERNIE-4.5 Family
How can we get giant mannequin stage multimodal reasoning for paperwork, charts and movies whereas working solely a 3B class mannequin in manufacturing? Baidu has added a brand new mannequin to the ERNIE-4.5 open supply household. ERNIE-4.5-VL-28B-A3B-Thinking is a imaginative and prescient language mannequin that focuses on doc, chart and video understanding with a small…
