人工智能知识库

标签: 大模型训练

此标签下有3条笔记。

2026年4月18日
DPO深度指南
2026年4月18日
ORPO对齐
2026年4月18日
PPO训练详解

Created with Quartz v4.5.2 © 2026

GitHub