lumpenspace
WMTP 2: DeepSeek, RL:DTF, and the weirdest ever cake.
Lumpen Space Princeps
Feb 17
"Reinforcement Learning through Deterministic Truthful Feedback" of course.
1 Comment
Séb Krier
Feb 17
Liked by Lumpen Space Princeps
excellent as always