26 March 2026

Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games

๐Ÿ’กํ˜„์žฌ์˜ ์ถ”๋ก  ์ตœ์ ํ™”๊ฐ€ ํ˜‘๋ ฅ์„ ๋ณ„๋„๋กœ ์ •๋ ฌ์‹œํ‚ค์ง€ ์•Š๋Š”๋‹ค๋ฉด, ํ˜‘๋ ฅ์ด ์•„๋‹Œ ํ•ฉ๋ฆฌ์  ์ด๊ธฐ์ฃผ์˜๋ฅผ ํ‘œ๋ฐฉํ•˜๋Š” ๊ฐœ์ธ์ฃผ์˜ ๋ชจ๋ธ์ด ํƒ„์ƒํ•  ์ˆ˜ ์žˆ๋‹ค!์ฆ‰, ์ถ”๋ก  ๋Šฅ๋ ฅ๊ณผ, ํ˜‘์—… ๋Šฅ๋ ฅ(๋น„์šฉ ๊ฐ์ˆ˜ ์ธก๋ฉด)์€ ๋ณ„๊ฐœ๋‹ค!

Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games

Review

๋‹‰๋„ค์ž„ ํ•œ์ค„ํ‰๋ณ„์  (0/5)
๋Œ“์ธ ๋…ธ๋…ธ โ€ข ์žฅ์ : ํ˜‘๋ ฅ์— ๊ด€ํ•œ ๋ณด์žฅ,์ฒ˜๋ฒŒ,ํ–‰๋™ํŒจํ„ด์„ ๋ถ„์„ํ•จ / ๋ชจ๋ธ ๋ณ„ ํ˜‘๋ ฅ์ •๋„์— ๊ด€ํ•œ ๋ถ„์„ ์ œ๊ณต
โ€ข ๋‹จ์ /๋ณด์™„์ : so what?
3
์•„์ด๋ฆฌ์Šค์žฅ์ : ์ง„์งœ ๋‚˜๊ฐ™์€ ์‚ฌ๋žŒ์ด ์“ด ๋…ผ๋ฌธ์ธ ๊ฒƒ ๊ฐ™์Œ. ๊ฐœ์ธ์ ์œผ๋กœ๋Š” ๊ณ ๋ คํ•ด์•ผํ•  ๋ฌธ์ œ์ด๋ฉด์„œ, ๊ถ๊ธˆํ•œ ์ฃผ์ œ์ž„. ์‚ฌํšŒ์ ์œผ๋กœ ์•ˆ์ „ํ•œ ๋ชจ๋ธ์„ ์œ„ํ•ด์„œ ๊ณ ๋ คํ•ด์•ผํ•˜๋Š” ๊ด€์ ์„ ํ’“์–ด๋‚ด๋Š” ๋…ผ๋ฌธ์œผ๋กœ ์ข‹์€ ์ฐธ๊ณ ๊ฐ€ ๋ ๋“ฏ.
๋‹จ์ : ์‹คํ—˜ ๋ฐฉ์‹์ด ์ปดํ“จํ„ฐ๊ณผํ•™์ด ์•„๋‹Œ ๊ฒƒ ๊ฐ™์Œ.. ๊ทธ๋ƒฅ ์‚ฌํšŒ์‹ฌ๋ฆฌํ•™์„ ๊ฐ€์ ธ๋‹ค ๋ถ™์ธ ๋А๋‚Œ์ด๊ณ , ํ•ด์„๋„ ๋งŽ์ด ์•„์‰ฌ์›€.
๋ณด์™„์ : ํ•ด๊ฒฐํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•ด์•ผํ•œ๋‹ค๊ณ  ์ƒ๊ฐํ•จ. ๋‚ด๊ฐ€ ์ƒ๊ฐํ•ด๋ณผ ์˜์—ญ์ด๋ผ๊ณ ๋„ ์ƒ๊ฐํ•จ.
4.0
ํ•ธ๋“œํฌ๋ฆผโ€ข ์žฅ์ : MoE ํ™˜๊ฒฝ์—์„œ LRM ๊ฐ„ ํ˜‘๋ ฅ์ด๋ผ๋Š” ์ƒˆ๋กœ์šด ์ฃผ์ œ๋ฅผ ๋ถ„์„
โ€ข ๋‹จ์ : ์ƒˆ๋กœ์šด ๋ถ„์„ ๊ฒฐ๊ณผ๋Š” ํฅ๋ฏธ๋กญ์ง€๋งŒ, ์™œ ์ด๋ ‡๊ฒŒ ๋™์ž‘ํ•˜๋Š” ๊ฒƒ์ธ์ง€ ํ•ด์„์ด ๋ถ€์กฑํ•จ
โ€ข ๋ณด์™„์ : ํ•ด๊ฒฐ์ฑ…์ด๋‚˜ ์‹ฌํ™” ๋ถ„์„
3.3
3์›” โ€ข ์žฅ์ : ์‹คํ—˜ ์„ค๊ณ„๊ฐ€ ์ฐธ์‹ ํ•˜๊ณ  ์žฌ๋ฐŒ๋‹ค. ๊ธฐ๊ด€ ์„ ํƒ์„ ๊ณต๊ณต์žฌ ๊ฒŒ์ž„์œผ๋กœ ๊ฐ„์ฃผํ•˜์—ฌ ํ˜„์‹ค์ ์ธ ์‚ฌํšŒ์  ๋”œ๋ ˆ๋งˆ๋ฅผ ํ‘œํ˜„ํ•˜๋Š” ์‹œ๋‚˜๋ฆฌ์˜ค๋ฅผ ๊ตฌํ˜„ํ•จ
โ€ข ๋‹จ์ : LLM์˜ ์ถ”๋ก  ๋Šฅ๋ ฅ์ด ํ˜‘๋ ฅ์„ ์ €ํ•ดํ•˜๋Š” ์ฃผ์žฅ์€ ํŠน์ • prompt์ด ์ฃผ์–ด์งˆ ๋•Œ ๊ทธ๋ ‡๊ฒŒ ํ–‰๋™ํ–ˆ์„ ๋ฟ์ธ๊ฑฐ๊ฐ™์€๋ฐ... ์ด๊ฒŒ ๋ชจ๋ธ์˜ ๋ณธ์งˆ์  ํŠน์„ฑ์ด๋ผ๊ณ  ๋ณด๊ธฐ๋Š” ์–ด๋ ค์›Œ๋ณด์ž„
โ€ข ๋ณด์™„์ : Alignment prompt๋ฅผ ๋ช…์‹œ์ ์œผ๋กœ ์ฃผ์ž…ํ•ด์„œ ์—ฌ์ „ํžˆ ์ถ”๋ก  ๋ชจ๋ธ์ด ๋ฐฐ์‹ ํ•˜๋Š”์ง€ ํ™•์ธํ•ด๋ณด๊ธฐ
3.5
ํ™”์ดํŠธ๋…ธ์ด์ฆˆ โ€ข ์žฅ์ : ์—์ด์ „ํŠธ๊ฐ€ ๋Œ€์„ธ๋ผ ๊ทธ๋Ÿฐ์ง€ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ํ™˜๊ฒฝ์„ ๋‹ค๋ฃจ๋Š” ๋…ผ๋ฌธ์ด ๋งŽ์ด๋ณด์ด๋Š” ๊ฒƒ ๊ฐ™๋‹ค. ์—ญ์‹œ ๊ณตํ•™์  ์‚ฌ๊ณ  ๋ฟ ์•„๋‹ˆ๋ผ ์ฒ ํ•™์ ์ธ ์‚ฌ๊ณ ๋„ ์ค‘์š”ํ•  ๊ฒƒ ๊ฐ™์Œ!
โ€ข ๋‹จ์ : LLM ์ด ์™œ ๊ทธ๋ ‡๊ฒŒ ์ถ”๋ก ์„ ํ–ˆ๋Š”์ง€์— ๋Œ€ํ•œ why๊ฐ€ ๋ถ€์กฑํ•จ
โ€ข ๋ณด์™„์ : ์ฝ์œผ๋ฉด์„œ ๋‹ค์ค‘ ์—์ด์ „ํŠธ ํ™˜๊ฒฝ์—์„œ ๋™์กฐ ํ˜„์ƒ ๋ฌธ์ œ๋ฅผ ๋‹ค๋ฃจ๋Š” (์—ฌ๋Ÿฌ ์—์ด์ „ํŠธ๊ฐ€ ๋งž๋‹คํ•˜๋ฉด ์–ด์ฉ” ์ˆ˜ ์—†์ด ๋™์กฐํ•˜๊ฒŒ ๋˜๋Š” ํ˜„์ƒ) Do as We Do, Not as You Think: the Conformity of Large Language Models (ICLRโ€™25 Oral) ๋…ผ๋ฌธ์ด ์ƒ๊ฐ๋‚ฌ๋Š”๋ฐ ์ด ๋…ผ๋ฌธ๊ณผ ๋น„์Šทํ•œ ํ™˜๊ฒฝ์—์„œ ์‹คํ—˜์„ ํ•ด๋ณด๋ฉด ์žฌ๋ฐŒ์„ ๊ฒƒ ๊ฐ™์Œ!
3.5
์—๋„ˆ์ง€ โ€ข ์žฅ์  : Public good game์˜ ํ™˜๊ฒฝ์„ ์„ค์ •ํ•ด, ์‹ค์ œ LLM์˜ reasoning ๋Šฅ๋ ฅ๊ณผ ํ˜‘๋ ฅ ๋Šฅ๋ ฅ์˜ ๊ด€๊ณ„์„ฑ์„ ๋ณด์—ฌ์ฃผ๋Š” ์—ฐ๊ตฌ ๋…ผ๋ฌธ.
โ€ข ์•ฝ์  : ์—ฐ๊ตฌ ์ฃผ์ œ๋Š” ์ฐธ์‹ ํ•˜์ง€๋งŒ ๋‹จ์ˆœํžˆ ํ•ด์„(?)์— ๊ทธ์น˜๋Š” ๊ฒƒ ๊ฐ™์Œ.
โ€ข ๋ณด์™„์  : ์ดํ›„ ์›์ธ ํŒŒ์•…์ด๋‚˜ ์ถ”๊ฐ€ ๋ถ„์„, ํ•ด๊ฒฐ ๋ฐฉ์•ˆ ๊ฐ™์€๊ฒŒ ์žˆ์œผ๋ฉด ์ข‹์„ ๊ฒƒ ๊ฐ™์Œ. ๋˜ํ•œ ๊ผญ ํ˜‘์—…์ด ์ข‹์€ ๊ฒƒ์ธ๊ฐ€? ์ƒ๊ฐ์ด ๋“ฆ. (ํ˜‘์—…์ด ์ข‹๊ณ  ๋‚˜์˜๋‹ค๋ฅผ ํ™•์‹คํžˆ ์ •์˜ํ•œ ๊ฒƒ ๊ฐ™์ง„ ์•Š์ง€๋งŒ, ๋…ผ๋ฌธ์—์„œ๋Š” ํ˜‘์—…์„ ์ข‹๊ฒŒ ์ƒ๊ฐํ•˜๋Š” ๊ฒƒ ๊ฐ™์•„์„œ)
3.1
ํ”ผ์ฆˆ์น˜์ž โ€ข ์žฅ์ : multi-agent ํ™˜๊ฒฝ์—์„œ ํ˜‘๋ ฅ์„ ์ค‘์‹ฌ์œผ๋กœ ๋ถ„์„ํ•จ. ์—์ด์ „ํŠธ ์—ฐ๊ตฌ์— ์ฐธ๊ณ ํ•  ์ˆ˜๋Š” ์žˆ์„๊ฒƒ ๊ฐ™์Œ
โ€ข ๋‹จ์ : ๊ธฐ์กด์—๋„ 'LLM ์ง‘๋‹จ์—์„œ ์ƒํ˜ธ์ž‘์šฉ์ด ํ–‰๋™์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ'์„ ๋ณด๋Š” ์—ฐ๊ตฌ๋Š” ๋งŽ๋Š”๋ฐ ์ด๊ฒƒ๋„ ๋ง๋งŒ '๊ณต๊ณต์žฌ~' ๋А๋‚Œ์œผ๋กœ๋งŒ ์ข€ ๋ฐ”๊พผ๊ฑฐ๊ฐ™์Œ
โ€ข ์ œ์•ˆ: ์–ด๋–ป๊ฒŒ ์–ด๋–คํ™˜๊ฒฝ์—์„œ ํ˜‘๋ ฅ์ด๋‚˜ ๋น„ํ˜‘๋ ฅ์„ ์œ ๋„ํ•˜๋Š”์ง€ ๋” ์ •๊ตํ•˜๊ฒŒ ๋ถ„์„ํ•  ์ˆ˜ ์žˆ์„ ๊ฒƒ ๊ฐ™์Œ
3.5
์ฐฝ๋ฐฑ์นด์ธ„์žฅ์ : ๋‚ด ์„ธ์ƒ์ด ๋ฌด๋„ˆ์ง. ์ถฉ๊ฒฉ์ ์ธ(๋†€๋ผ์šด) ๊ฒฐ๊ณผ์ž„
๋‹จ์ : ์‹คํ—˜ํ•˜๊ณ  ์‹ค์ œํ•˜๊ณ  ์–ผ๋งˆ๋‚˜ align๋˜๋Š”์ง€ ๋ชจ๋ฅด๊ฒ ์–ด์„œ ์ด๊ฒŒ ์œ ํšจํ• ์ง€๋Š” ๋ฏธ์ง€์ˆ˜์ž„
์ œ์•ˆ์ : ์‹ค์ œ ๋ฉ€ํ‹ฐ์—์ด์ „ํŠธ ์‹œ์Šคํ…œ์€ context ์ฃผ๊ณ  ๋ฐ›๋Š”๋ฐ, ๊ทธ๋Ÿฐ ์„ค์ •์—์„œ๋„ ํ•ด๋ด์•ผ ํ•œ๋‹ค๊ณ  ์ƒ๊ฐํ•จ
3.8
์ œ๋กœ์ฝœ๋ผ โ€ข ์žฅ์ : ์ถ”๋ก  ๋Šฅ๋ ฅ์ด ๊ฐ•ํ•ด์งˆ์ˆ˜๋ก ์˜คํžˆ๋ ค ํ˜‘๋ ฅ์„ ์•ˆ ํ•œ๋‹ค๋Š” ๊ฒฐ๊ณผ๊ฐ€ ํฅ๋ฏธ๋กœ์›€.
โ€ข ๋‹จ์ : ์ถ”๋ก  ๋ชจ๋ธ์ด ํ˜‘๋ ฅ์„ ์•ˆ ํ•œ๋‹ค๋Š” ๊ฒฐ๊ณผ๋Š” ๋ณด์—ฌ์ฃผ๋Š”๋ฐ, ์™œ ๊ทธ๋Ÿฐ ์„ ํƒ์„ ํ•˜๋Š”์ง€์— ๋Œ€ํ•œ ๋ถ„์„์ด ๋ถ€์กฑํ•œ๊ฒƒ ๊ฐ™์Œ.
โ€ข ๋ณด์™„์ : ํ˜‘๋ ฅ์„ ๋ช…์‹œ์ ์œผ๋กœ ์œ ๋„ํ•˜๋Š” ์ง€์‹œ๋ฅผ ํ”„๋กฌํ”„ํŠธ์— ์ถ”๊ฐ€ํ–ˆ์„ ๋•Œ๋„ ์ถ”๋ก  ๋ชจ๋ธ์ด ์—ฌ์ „ํžˆ ๋ฌด์ž„์Šน์ฐจํ•˜๋Š”์ง€ ํ™•์ธํ•ด๋ณด๋ฉด ์ข‹์ง€ ์•Š์„๊นŒ
3.6

TL; DR

๐Ÿ’ก

ํ˜„์žฌ์˜ ์ถ”๋ก  ์ตœ์ ํ™”๊ฐ€ ํ˜‘๋ ฅ์„ ๋ณ„๋„๋กœ ์ •๋ ฌ์‹œํ‚ค์ง€ ์•Š๋Š”๋‹ค๋ฉด, ํ˜‘๋ ฅ์ด ์•„๋‹Œ ํ•ฉ๋ฆฌ์  ์ด๊ธฐ์ฃผ์˜๋ฅผ ํ‘œ๋ฐฉํ•˜๋Š” ๊ฐœ์ธ์ฃผ์˜ ๋ชจ๋ธ์ด ํƒ„์ƒํ•  ์ˆ˜ ์žˆ๋‹ค!
์ฆ‰, ์ถ”๋ก  ๋Šฅ๋ ฅ๊ณผ, ํ˜‘์—… ๋Šฅ๋ ฅ(๋น„์šฉ ๊ฐ์ˆ˜ ์ธก๋ฉด)์€ ๋ณ„๊ฐœ๋‹ค!

Summary

Background

  • ๋” ๋˜‘๋˜‘ํ•œ ๋ชจ๋ธ์„ ๋งŒ๋“ค๋ฉด(์ถ”๋ก  ๋“ฑ) ๋‹ค์ค‘ ์—์ด์ „ํŠธ ํ™˜๊ฒฝ์—์„œ๋„ ๋” ์ข‹์€ ์‚ฌํšŒ์  ํ–‰๋™์„ ํ• ๊นŒ?
    • ๋‚˜๋งŒ ํฌ๊ฒŒ ์ด๋“๋ณด๊ธฐ vs ๋‹ค๊ฐ™์ด ์กฐ๊ธˆ ์ด๋“๋ณด๊ธฐ
    • The Competitive Advantage of Sanctioning Institutions (Scienceโ€™2006)

Motivation

  • LLM์˜ ์ถ”๋ก ์„ ๊ฐ•ํ™”ํ•˜๋Š” ๊ฒƒ์ด ๋” ๋‚˜์€ ์˜์‚ฌ ๊ฒฐ์ •์œผ๋กœ ์ด์–ด์งˆ ์ˆ˜ ์žˆ์„๊นŒ?
    • ๊ฐœ์ธ ์ด์ต vs ์ง‘๋‹จ ์ด์ต์˜ ์ถฉ๋Œ ์ƒํ™ฉ (social dilema)
    • ๋‚ด๊ฐ€ ์กฐ๊ธˆ ์†ํ•ด๋ด๋„, ์ „์ฒด์ ์œผ๋กœ ์ด๋“์ด ๋˜๋Š” ์ƒํ™ฉ
  • ๋น„์šฉ์„ ๋“ค์—ฌ ๊ทœ๋ฒ”์„ ์ง‘ํ–‰ํ•˜๋Š” ํ˜‘๋ ฅ ์ƒํ™ฉ์„ ์‹คํ—˜ํ•ด๋ณด์ž!
  • ํ˜‘๋ ฅ์ด main? X
  • LLM์ด ํ˜‘๋ ฅ์„ ์œ ์ง€ํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ์ธ์˜ ์ž์›์„ ์‚ฌ์šฉํ•ด๊ฐ€๋ฉฐ sanction(๋ณด์ƒ/์ฒ˜๋ฒŒ)ํ•˜๋Š”๊ฐ€?
    • ๋ชฐ๋ผ์„œ ์ฐพ์•„๋ด„ sanction: ์ œ์žฌ/์ฒ˜๋ฒŒ/๋ณด์ƒ ๋ฌธ๋งฅ์— ๋”ฐ๋ผ ๋‹ค๋ฆ„
  • Public Good Game ํ™œ์šฉ

    ์—ฌ๊ธฐ์— ๋”ํ•ด์„œ, ๊ทœ์ • ์ดํ–‰ โ‡’ ๋ณด์ƒ / ๊ทœ์ • ๋ถˆ์ดํ–‰ โ‡’ ์ฒ˜๋ฒŒ ๋„์ž…

Idea

  • ์ˆœ์ฐจ์ ์œผ๋กœ ์„ ํƒํ•˜๊ฒŒ ํ•ด๋ณด์ž!
  1. ๊ทœ์ •์— ๋”ฐ๋ฅธ ์ฒ˜๋ฒŒ/๋ณด์ƒ์„ ํ• ๋ž˜ ๋ง๋ž˜?
  1. ์–ผ๋งˆ๋‚˜ ๊ธฐ์—ฌํ• ๋ž˜?
  1. ๋‹ค๋ฅธ ์—์ด์ „ํŠธ ์ฒ˜๋ฒŒ/๋ณด์ƒ ํ• ๋ž˜ ๋ง๋ž˜?
  • WHY?
    • ๊ทœ์ • ์ฒ˜๋ฒŒ/๋ณด์ƒ์€ ๋˜๋‹ค๋ฅธ ๋น„์šฉ์„ ์•ผ๊ธฐํ•จ
      • ์ฒ˜๋ฒŒ ์ˆ˜์ค€, ๋ณด์ƒ ์ˆ˜์ค€, ์‹ค์ œ ๋ณด์ƒ ์ฒ˜๋ฒŒ ์ง‘ํ–‰ ๋“ฑโ€ฆ
      • ๊ฐœ์ธ ์ž…์žฅ์—์„œ๋Š” ์ถ”๊ฐ€ ์ฒ˜๋ฆฌ๋ฅผ ํ•ด์•ผํ•˜๋Š” ์—…๋ฌด

โ‡’ Main Question: ๊ทธ๋Ÿผ์—๋„, ๊ทœ์ • ์ดํ–‰ ๋ฐ ๋ชจ๋‘์˜ ์ด์ต์„ ์œ„ํ•ด, ๋‚ด๊ฐ€ ํ•˜๊ฒ ๋‹ค ํ•˜๋Š” ์—์ด์ „ํŠธ๊ฐ€ ์žˆ์„๊นŒ? ์žˆ๋‹ค๋ฉด, ๋ˆ„๊ตฌ์ผ๊นŒ? ๊ทธ๊ฒŒ ์ถ”๋ก  ์„ฑ๋Šฅ๊ณผ ์–ด๋–ค ์—ฐ๊ด€์ด ์žˆ์„๊นŒ?

Method

  • ๋Œ€ํ™” ์—†์ด, ์ด์ „ ๋‹จ๊ณ„์˜ ๊ฒฐ์ •๋งŒ ๋ณด๊ณ  ๋‹ค์Œ ๋ผ์šด๋“œ ์ง„ํ–‰!

Experiment

  • ์ „ํ†ต์ (์ถ”๋ก  ์•ฝํ•จ) LLM์ด ๋” ํ˜‘๋ ฅ์ ์ด๋‹ค
    • LLAMA๋Š” ์ธ๊ฐ„๊ณผ ๋น„์Šทํ•œ ์ˆ˜์ค€
    • o1-mini๋Š” ๊ธฐ์—ฌ๋Ÿ‰์ด ๋‚ฎ๊ณ , ๋ฌด์ž„์Šน์ฐจํ•˜๋ ค๊ณ  ํ•จ
    • ์ถ”๋ก ์„ ํ•˜๊ธฐ ์‹œ์ž‘ํ•˜๋ฉด ๋‚˜๋น ์ง„๋‹ค!
  • ํ–‰๋™ ํŒจํ„ด์€ 4๊ฐ€์ง€ (์ง„ํ–‰ ๋‹จ๊ณ„์— ๋”ฐ๋ผ)
    • ์ ์  ํ˜‘๋ ฅ ์ˆ˜์ค€ ํ–ฅ์ƒ
    • ์ ์  ํ˜‘๋ ฅ ๋ฌด๋„ˆ์ง€๊ณ  ๋ชจ๋‘๊ฐ€ ๋ฌด์ž„์Šน์ฐจํ•˜๋ ค๊ณ  ํ•จ
    • ์ค‘๊ฐ„ ์ „๋žต ๋ฐ˜๋ณต
    • ํ˜‘๋ ฅํ–ˆ๋‹ค๊ฐ€, ๋ฐฐ์‹ ํ–ˆ๋‹ค๊ฐ€

    Traditional LLM โ‡’ ์ ์  ํ˜‘๋ ฅ

    Reasoning LLM โ‡’ 2,3,4

  • ์ธ๊ฐ„๊ณผ ๊ฒฐ๊ณผ๋Š” ๋น„์Šทํ•  ์ˆ˜ ์žˆ์–ด๋„, ๊ทธ ๊ทผ๊ฐ„์ด ๋‹ค๋ฅด๋‹ค! (์ธ๊ฐ„์€ ์ฒ˜๋ฒŒ ์„ ํ˜ธ, LLM์€ ๋ณด์ƒ ์„ ํ˜ธ)
  • ์ถ”๋ก ์ด ๊ฐ•ํ•œ ๋ชจ๋ธ์ด ํ˜‘๋ ฅ์— ๊ณ„์† ์‹คํŒจํ•˜๋ฉด, ๊ฒŒ์ž„ ์ด๋ก ์— ๊ฐ€๊นŒ์›Œ์ง

Insights

  • ํ˜‘์—…์ด ์ธ๊ฐ„์˜ ๋•๋ชฉ์ธ๊ฐ€? ๋ผ๋Š” ๊ฒƒ์€ ์ž˜ ๋ชจ๋ฅด๊ฒ ์Œ.
  • ์ธ๊ฐ„๋„ ์˜คํžˆ๋ ค ์ง€์‹œ๋ฅผ ๋ช…ํ™•ํžˆ ๋‚ด๋ ค์ฃผ๋Š” ๊ฒƒ์ด ๋” ์ž˜ํ•˜์ง€ ์•Š๋‚˜? ํ˜‘์—…๋„ ๊ทธ๋Ÿฐ ๋ฐฉ์‹์ด๋ผ๊ณ  ์ƒ๊ฐํ•จ.
  • ๊ฐœ์ธ์—ฐ๊ตฌ๋ฐฉํ–ฅ์— ์ถ”๊ฐ€ํ•˜๊ณ ์ž ํ•˜๋Š” ๊ฒƒโ‡’ MoE๋ฅผ ๊ทธ๋ƒฅ ํ†ต๊ณผ์‹œํ‚ค๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ, Planner๊ฐ€ ํ†ต์ œํ•ด์„œ ํ†ต๊ณผ์‹œํ‚ค๋Š” ๊ฒƒ
  • ์ง€๊ธˆ์€ LLM ๊ฐ„ ํ˜‘์—…๋งŒ ๊ณ ๋ คํ•˜๋Š”๋ฐ, ๊ฒฐ๊ตญ ์‚ฌ๋žŒ์ด ๋ผ๋ฉด ํ˜‘์—… ๊ณผ์ •์—์„œ ์‚ฌ๋žŒ์˜ ๊ฐ์ •/์ด๋“์„ ์šฐ์„ ์‹œํ• ๊นŒ?
    • ์ด๊ฒƒ๋„ ๊ณ ๋ คํ•ด๋ณผ ํฌ์ธํŠธ ๊ฐ™์Œ
    • LLM์€ ์†Œ์‹œ์˜คํŒจ์Šค์ผ๊นŒ?

Categories

ALIGNMENT research