README Template

🎉 Introduction | 🌟 Methods Reproduced | 📝 Reproduced Results
☄️ How to Use | 👨‍🏫 Acknowledgments | 🤗 Contact


🎉 Introduction

🌟 Methods Reproduced

📝 Reproduced Results

☄️ How to Use

👨‍🏫 Acknowledgments

🤗 Contact

icon

🚀 🤗 👨‍🏫 🔎 🔑 🗂️ 🕹️ ☄️ 📝 🌟 🎉 📌 📁 ✅ ❌ 📚 📢 💡 🎈 💭 👀 🛠️ 💫 🔥 📣

:arrow_down: :white_check_mark:

notice

[!NOTE]
For the “base” models, the template argument can be any of default, alpaca, vicuna, etc. For the “instruct/chat” models, make sure to use the corresponding template.

[!TIP]
The implementation details of PPO can be found in this blog.

[!IMPORTANT]
Installation is mandatory.

shields.io

visitors

GitHub last commit

GitHub

GitHub release (latest by date including pre-releases)

GitHub issues

GitHub pull requests
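
The badge names above can be turned into Markdown image links with a small helper. The following is a minimal sketch, not part of the template itself: the function name `badge_markdown` and the `owner`/`repo` values are hypothetical, and mapping the plain "GitHub" badge to the shields.io license badge is an assumption. The visitors badge is left out because it is usually served by a third-party counter that this sketch does not assume.

```python
from urllib.parse import quote

SHIELDS = "https://img.shields.io"

# shields.io path templates keyed by the badge names listed above.
# Assumption: the plain "GitHub" badge is the repository license badge.
BADGE_PATHS = {
    "GitHub last commit": "github/last-commit/{owner}/{repo}",
    "GitHub": "github/license/{owner}/{repo}",
    "GitHub release (latest by date including pre-releases)":
        "github/v/release/{owner}/{repo}?include_prereleases",
    "GitHub issues": "github/issues/{owner}/{repo}",
    "GitHub pull requests": "github/issues-pr/{owner}/{repo}",
}


def badge_markdown(owner: str, repo: str) -> list[str]:
    """Return one Markdown image link per badge for the given repository."""
    lines = []
    for alt_text, template in BADGE_PATHS.items():
        # Percent-encode the owner and repo so unusual characters stay valid in the URL.
        path = template.format(owner=quote(owner), repo=quote(repo))
        lines.append(f"![{alt_text}]({SHIELDS}/{path})")
    return lines


if __name__ == "__main__":
    # "your-name" and "your-repo" are placeholders; substitute real values.
    print("\n".join(badge_markdown("your-name", "your-repo")))
```

Paste the printed lines under the shields.io heading of the README and replace the placeholders with the actual repository owner and name.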

File Directory

  • 📁 mlx :
  • 📁 docs :
  • 📁 examples :
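
For a larger repository, the skeleton of this list could be generated rather than typed by hand. The sketch below is one possible approach under stated assumptions: the function name `directory_outline` is hypothetical, only top-level folders are listed, and hidden folders (names starting with a dot) are skipped. The description after each colon still has to be written manually.

```python
from pathlib import Path


def directory_outline(root: str = ".") -> list[str]:
    """List top-level folders of `root` in the bullet style used above."""
    names = sorted(
        entry.name
        for entry in Path(root).iterdir()
        # Keep directories only, and skip hidden ones such as .git.
        if entry.is_dir() and not entry.name.startswith(".")
    )
    # Each line ends with " :" so a description can be filled in afterwards.
    return [f"  • 📁 {name} :" for name in names]


if __name__ == "__main__":
    print("\n".join(directory_outline(".")))
```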

Images