Discussion about this post

User's avatar
Daniel Popescu / ⧉ Pluralisk's avatar

Insightful. This piece adeptly builds on your prior analyses, underscoring how RLHF's reliance on human fine-tunning intrinsically perpetuates pre-existing biases in LLMs.

Expand full comment

No posts