Register now for better personalized quote!

HOT NEWS

DeepSeek unveils new approach to improve AI reasoning

Apr, 08, 2025 Hi-network.com

Chinese AI firm DeepSeek has unveiled a new method to improve LLM reasoning skills, claiming it offers more accurate and faster responses than current technologies. The approach, developed with researchers from Tsinghua University, combines generative reward modeling (GRM) with a self-principled critique tuning technique.

The method aims to refine how AI LLMs respond to general queries by better aligning their outputs with human preferences. According to a paper published on the arXiv scientific repository, the resulting DeepSeek-GRM models showed stronger performance than existing methods and proved competitive against widely accepted public reward models.

DeepSeek has announced intentions to release these models as open source, though no release date has been set. The move follows increased global interest in the company, which had earlier gained attention for its V3 foundation model and R1 reasoning model.

tag-icon Hot Tags : Artificial Intelligence Digital access Development DeepSeek publish

Copyright © 2014-2024 Hi-Network.com | HAILIAN TECHNOLOGY CO., LIMITED | All Rights Reserved.
Our company's operations and information are independent of the manufacturers' positions, nor a part of any listed trademarks company.