BitGarden

Recent Notes

  • 251012-你好面试官

    Oct 12, 2025

    • 游戏化设计的八大驱动力

      Sep 23, 2025

      • 股市投资

        Sep 19, 2025

        See 564 more →

        Home

        ❯

        1 Inputs

        ❯

        Article

        ❯

        RLHF

        RLHF

        shareMar 31, 20231 min read

        • ai

        资料

        • Illustrating Reinforcement Learning from Human Feedback (RLHF)
        • ChatGPT/InstructGPT详解 - 知乎

        Graph View

        Backlinks

        • 2023-03-31
        • ChatGPT 的训练原理科普

        Created with Quartz v4.5.0 © 2025

        • GitHub
        • Discord Community