S
sijuntan
Article URL: DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL | Notion
Comments URL: DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL | Hacker News
Points: 212
# Comments: 87
Continue reading...
Comments URL: DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL | Hacker News
Points: 212
# Comments: 87
Continue reading...