Blog

14 January 2026

S1: Simple Test-time Scaling

EMNLP'25

💡 How can we improve performance at inference time rather than at training time? ⇒ For math and reasoning problems, start by adjusting the number of tokens.

염규환
14 January 2026

Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models

NeurIPS'25

๐Ÿ’ก๋ชจ๋ธ์— ๋…ธ์ด์ฆˆ๋ฅผ ์ฃผ์ž…ํ–ˆ์„ ๋•Œ ์„ฑ๋Šฅ์ด ๋น„์ •์ƒ์ ์œผ๋กœ ํ–ฅ์ƒ๋˜๋ฉด, ์ด๋Š” ์ƒŒ๋“œ๋ฐฐ๊น… ํ˜„์ƒ์„ ์•”์‹œํ•œ๋‹ค!

14 January 2026

Let LRMs Break Free from Overthinking via Self-Braking Tuning

NeurIPS'25

๐Ÿ’ก๋ชจ๋ธ ๋‚ด์žฌ์ ์œผ๋กœ ๋ถˆํ•„์š”ํ•œ ์ถ”๋ก (์˜ค๋ฒ„ ๋ตํ‚น)์„ ๋ง‰์ž!