Connect with us

AI

Baidu’s Groundbreaking ERNIE 5 Outperforms GPT-5 in Charts, Document Understanding, and Beyond

Published

on

Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more

Baidu Unveils ERNIE 5.0: The Next-Generation Foundation Model

In a move to solidify its position in the enterprise AI market, Baidu recently introduced ERNIE 5.0, its latest foundation model, at the Baidu World 2025 event. Unlike its predecessor, ERNIE 4.5-VL-28B-A3B-Thinking, ERNIE 5.0 is a proprietary model available exclusively through Baidu’s ERNIE Bot website and the Qianfan cloud platform API for enterprise clients. The new model is designed to process and generate content across various modalities, including text, images, audio, and video.

Alongside the launch of ERNIE 5.0, Baidu also unveiled updates to its digital human platform, no-code tools, and general-purpose AI agents, all aimed at expanding its footprint beyond China. The company introduced ERNIE 5.0 Preview 1022, a variant optimized for text-intensive tasks, as well as a general preview model that balances performance across different modalities.

Baidu claims that ERNIE 5.0 outshines competitors like GPT-5 and Gemini 2.5 Pro in tasks such as multimodal reasoning, document understanding, and image-based question answering. The model’s ability to handle joint inputs and outputs across modalities sets it apart from other foundation models in the market. Additionally, ERNIE 5.0 demonstrated strong performance in areas like visual tasks, audio and speech tasks, and language tasks, showcasing its broad capability footprint.

In terms of pricing, ERNIE 5.0 is positioned at the premium end of Baidu’s model pricing structure. Compared to other models in the market, ERNIE 5.0 offers competitive pricing for its capabilities, making it an attractive option for enterprise customers looking for high-performance AI solutions.

As part of its global expansion strategy, Baidu is introducing a range of products and platforms to international markets. GenFlow 3.0, Famou, MeDo, and Oreate are just a few of the offerings that are now available globally, catering to different use cases and user needs. Baidu’s digital human platform and autonomous ride-hailing service, Apollo Go, are also part of the company’s international push.

See also  Breaking the Data Bottleneck: How Google's Watch & Learn Framework Revolutionizes Training Computer-Use Agents

In addition to ERNIE 5.0, Baidu also released an open-source vision-language model, ERNIE-4.5-VL-28B-A3B-Thinking, under the Apache 2.0 license. This model offers a cost-effective and efficient solution for organizations looking to leverage multimodal AI capabilities without licensing restrictions.

Overall, Baidu’s ERNIE 5.0 represents a significant advancement in the global foundation model landscape. With its competitive performance, pricing strategy, and global expansion efforts, Baidu is positioning itself as a key player in the enterprise AI market. The company’s focus on developer communication and open-source offerings further enhances its appeal to a wide range of users. As the landscape of AI deployment continues to evolve, Baidu’s ERNIE models are poised to play a crucial role in shaping the future of artificial intelligence.

Trending