Blog

이승환
07 January 2026

Scaling Laws for Precision

ICLR'25

💡 Systematically analyzes how numerical precision during language model training and inference affects model performance and cost, and proposes precision-aware scaling laws that can predict these effects.

염규환
07 January 2026

Layer by Layer: Uncovering Hidden Representations in Language Models

ICML'25

💡 In language models trained autoregressively, the intermediate layers hold the richest representations!

07 January 2026

How Do Large Language Monkeys Get Their Power (Laws)?

ICML'25

💡 The reason the repeated-sampling performance of LLMs looks like a power law is not the model's reasoning ability. Each individual problem is already being solved exponentially fast in the number of attempts, and the overall average only looks like a power law because a small number of extremely hard problems remain unsolved until the very end. ⇒ The power law is not a law of the model but a consequence of the distribution of problem difficulty.
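
The aggregation argument in this summary can be checked numerically. The sketch below is only an illustration under stated assumptions, not the paper's code: it assumes each problem has a fixed single-attempt success probability p, so its failure rate after k independent attempts is (1 - p)^k, and it assumes (hypothetically) that p follows a Beta(a, b) distribution across problems, for which the average failure rate has the closed form B(a, b + k) / B(a, b). The names `log_single_failure`, `log_mean_failure`, and `local_slope`, and the values of `a`, `b`, and `p`, are all made up for the example.

```python
import math


def log_single_failure(k, p):
    """log of (1 - p)^k: failure rate of ONE problem after k attempts."""
    return k * math.log1p(-p)


def log_mean_failure(k, a, b):
    """log of E[(1 - p)^k] when p ~ Beta(a, b), i.e. log(B(a, b + k) / B(a, b))."""
    return (math.lgamma(a + b) + math.lgamma(b + k)
            - math.lgamma(b) - math.lgamma(a + b + k))


def local_slope(log_f, k):
    """Slope of log f versus log k, measured between k and 2k."""
    return (log_f(2 * k) - log_f(k)) / math.log(2)


# Hypothetical difficulty distribution: a < 1 puts a lot of mass near p = 0,
# i.e. a long tail of extremely hard problems.
a, b = 0.3, 2.0
p = 0.05  # hypothetical success probability of one fixed problem

for k in (10, 100, 1000, 10000):
    one_problem = local_slope(lambda n: log_single_failure(n, p), k)
    all_problems = local_slope(lambda n: log_mean_failure(n, a, b), k)
    print(f"k={k:>6}  single-problem slope={one_problem:10.2f}  "
          f"aggregate slope={all_problems:6.3f}")
```

For a single problem the log-log slope keeps getting steeper as k grows, which is the signature of exponential decay. For the average over problems the slope settles near -a, which is what a power law k^(-a) looks like, even though every individual problem in the mix is decaying exponentially.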