BitGarden
Search
Search
Dark mode
Light mode
Explorer
Recent Notes
工作流设计
Feb 27, 2026
260206-llm-token-compression-survey
Feb 06, 2026
250122-OpenCode
Feb 06, 2026
ai
See 583 more →
Home
❯
1 Inputs
❯
Article
❯
RLHF
RLHF
share
Mar 31, 2023
1 min read
ai
资料
Illustrating Reinforcement Learning from Human Feedback (RLHF)
ChatGPT/InstructGPT详解 - 知乎
Graph View
Backlinks
2023-03-31
ChatGPT 的训练原理科普