BitGarden

Recent Notes

  • hackintosh-opencore

    Apr 14, 2026

    • hackintosh

      Apr 14, 2026

      • mysql-gap-lock

        Apr 14, 2026

        See 640 more →

        Home

        ❯

        01 Source

        ❯

        2303

        ❯

        RLHF

        RLHF

        shareMar 31, 20231 min read

        • ai

        资料

        • Illustrating Reinforcement Learning from Human Feedback (RLHF)
        • ChatGPT/InstructGPT详解 - 知乎

        Graph View

        Backlinks

        • 2023-03-31

        Created with Quartz v4.5.0 © 2026

        • GitHub
        • Discord Community