Yonsei Univ. ICL

CoT

SEAL: Steerable Reasoning Calibration of Large Language Models for Free

26 March 2026

SEAL: Steerable Reasoning Calibration of Large Language Models for Free

COLM'25

💡너무 길고 복잡한 reasoning 경향을 완화하자!⇒ reasoning process를 세단계로 분류하고, 그 중에 어떤 걸 줄여야 할지 분석하자

CoT PROBING research

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

26 March 2026

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

COLM'25

💡Mathematical Reasoning Task 를 할 때, RL을 간접적으로 구현하여 간단하게 풀어보자.(= 강화학습 형태로 수학문제를 효과적으로 풀어보자 !)

CoT Mathematical Reasoning RL research