Early this morning, Google officially launched Gemini 2.5 Pro Experimental, the "strongest reasoning model", which performed quite well in a number of tests: it ranked No. 1 in the LMSYS Arena, with a score 40 points higher than the Grok-3 and GPT-4.5; it was ranked No. 1 in all the categories (general ability, coding, math, etc.); and it excelled in a number of benchmark tests, including Hard Prompts w/ Style Control and Multi-Turn. Gemini 2.5 Pro ranked #1 in all categories (General Skills, Coding, Math, etc.), especially in Hard Prompts w/ Style Control and Multi-Turn; Gemini 2.5 Pro took the top spot for overall performance in the individual benchmarks. Gemini 2.5 Pro is the best overall performer in all benchmarks, leading in Science, Code Generation, Visual Reasoning (MMMU), and Long Text Comprehension (MRCR); and in the so-called toughest test, "Human's Last Exam," Gemini 2.5 Pro outperforms a number of large models such as OpenAI o3-mini, GPT-4.5, DeepSeek-R1, and so on.
❯
Search
Scan to open current page
Top
Checking in, please wait
Click for today's check-in bonus!
You have earned {{mission.data.mission.credit}} points today!
My Coupons
-
¥CouponsLimitation of useExpired and UnavailableLimitation of use
before
Limitation of usePermanently validCoupon ID:×Available for the following products: Available for the following products categories: Unrestricted use:Available for all products and product types
No coupons available!
Unverify
Daily tasks completed:
