BitGarden

Recent Notes

  • Home

    Sep 01, 2025

    • 真需求

      Aug 28, 2025

      • note
    • 用户画像是什么

      Aug 28, 2025

      See 562 more →

      Home

      ❯

      1 Inputs

      ❯

      Article

      ❯

      RLHF

      RLHF

      shareMar 31, 20231 min read

      • ai

      资料

      • Illustrating Reinforcement Learning from Human Feedback (RLHF)
      • ChatGPT/InstructGPT详解 - 知乎

      Graph View

      Backlinks

      • 2023-03-31
      • ChatGPT 的训练原理科普

      Created with Quartz v4.5.0 © 2025

      • GitHub
      • Discord Community