250407 Lab Seminar

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

발표자 : 윤소영