New Blog by Lilian Weng, Former VP of Safety at OpenAI: Why We Think

Lilian Weng examines the value of "thinking time" for large models, arguing that performance on complex tasks can be improved substantially by increasing test-time compute (e.g., chain-of-thought reasoning, pause tokens). She identifies two main strategies for model "thinking": parallel sampling (generating multiple outputs simultaneously) and sequential revision (iteratively revising the previous round's output); in practice, thinking time must be balanced against computational cost. She also finds that optimizing the chain of thought with reinforcement learning can lead to reward hacking, where the model hides its true intentions inside the chain of thought, a problem left to future research.
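A minimal sketch contrasting the two test-time strategies described above, parallel sampling (best-of-N) versus sequential revision. The `generate`, `revise`, and scoring functions here are hypothetical stubs standing in for model and verifier calls, not an API from the blog post:

```python
import random


def generate(prompt: str, seed: int) -> tuple[str, float]:
    """Stub for one model call; returns (answer, score).
    In practice the score would come from a verifier or reward model."""
    rng = random.Random(seed)
    return f"draft-{seed}", rng.random()


def parallel_sampling(prompt: str, n: int = 4) -> tuple[str, float]:
    """Best-of-N: sample n candidates independently, keep the highest-scoring."""
    candidates = [generate(prompt, seed) for seed in range(n)]
    return max(candidates, key=lambda c: c[1])


def revise(prompt: str, previous: str, step: int) -> tuple[str, float]:
    """Stub revision call: conditions on the previous draft."""
    rng = random.Random(step)  # deterministic stand-in for a model call
    return f"{previous}+rev{step}", rng.random()


def sequential_revision(prompt: str, steps: int = 4) -> tuple[str, float]:
    """Iteratively revise the last draft, keeping the best answer seen so far."""
    answer, score = generate(prompt, seed=0)
    best = (answer, score)
    for step in range(1, steps):
        answer, score = revise(prompt, answer, step)
        if score > best[1]:
            best = (answer, score)
    return best
```

Parallel sampling spends its compute budget on independent tries (easy to parallelize), while sequential revision spends it on depth, conditioning each attempt on the last; the trade-off between the two is exactly the thinking-time vs. cost balance the post discusses.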
