OpenAI releases CoT monitoring to prevent malicious behavior of big models - LianPR

Scan with WeChat Online Consultation

Call for Consultation
13392791745

Email for Consultation
lemon@lianpr.com

Share to：

News center > 7*24H News > Featured > Context

OpenAI releases CoT monitoring to prevent malicious behavior of big models

Editor

2 hours ago 8,174

According to Golden Finance, OpenAI has released the latest research, using CoT (thinking chain) monitoring method, it can prevent malicious behaviors such as talking nonsense and hiding true intentions, and it is also one of the effective tools to supervise super models. OpenAI uses the newly released cutting-edge model o3-mini as the monitored object and uses the weaker GPT-4o model as the monitor. The test environment is a coding task that requires AI to implement functions in the code base to pass unit testing. The results show that the CoT monitor performed excellently in detecting systemic "reward hackers" behavior, with a recall rate of up to 95%, far exceeding 60% of the monitoring behavior only.

Keywords： Bitcoin

Share to：

Related News

Golden Morning Post | Tesla fell 15% BTC fell below $77,000

50 mins ago 6,374
332 BTCs transferred from unknown wallet to Mt.Gox address

50 mins ago 5,002
Bitwise: Why is the market responding incorrectly? What is the only important issue in Bitcoin

50 mins ago 7,855
In order to avoid liquidation, a giant whale sold 25,800 ETH, and lost $31.75 million.

50 mins ago 2,256
Citigroup: Downgrade U.S. stock market rating

50 mins ago 5,892
Solana Network's total transaction fees last week were 53,800 SOL, down 10% month-on-month

50 mins ago 686

Editorial Guidelines Terms & Conditions Site Map

Copyright © 2024 LianPR (Copyright Reserved. Unauthorized Copying Forbidden.) Contact US：lemon@lianpr.com