BitGarden
RLHF
Mar 31, 2023
ai
Resources
Illustrating Reinforcement Learning from Human Feedback (RLHF)
ChatGPT/InstructGPT Explained in Detail - Zhihu