Reinforcement Learning to Train Large Language Models to Explain Human Decisions (arxiv.org)
13 points by PaulHoule 12 hours ago | 0 comments
21113 points by PaulHoule 12 hours ago | 0 comments
2116 points by gdubs 13 hours ago | 0 comments
21265 points by geox 13 hours ago | 46 comments
2133 points by jk_tech 13 hours ago | 0 comments
2143 points by Teever 13 hours ago | 1 comment
2154 points by teleforce 13 hours ago | 1 comment
2164 points by teleforce 13 hours ago | 0 comments
2179 points by k1m 13 hours ago | 0 comments
2183 points by jszymborski 13 hours ago | 0 comments
2193 points by graeme 13 hours ago | 1 comment
2203 points by ohjeez 13 hours ago | 1 comment
2215 points by walterbell 13 hours ago | 1 comment
2223 points by a_void_sky 13 hours ago | 3 comments
2233 points by landonxjames 13 hours ago | 0 comments
2245 points by thisisauserid 13 hours ago | 1 comment
2254 points by edwinjm 13 hours ago | 0 comments
2263 points by SbNn6uJO5wzPww 13 hours ago | 0 comments
2272 points by emrah 13 hours ago | 0 comments
2284 points by FeepingCreature 13 hours ago | 5 comments
2294 points by frays 13 hours ago | 1 comment
2304 points by bookofjoe 14 hours ago | 0 comments
2313 points by gulaydin 14 hours ago | 2 comments
2329 points by ColinWright 14 hours ago | 2 comments
2334 points by fi-le 14 hours ago | 0 comments
2346 points by kyleomalley 14 hours ago | 2 comments
2353 points by Vermin2000 14 hours ago | 0 comments
2363 points by thunderbong 14 hours ago | 0 comments
23720 points by jonbaer 14 hours ago | 1 comment
23898 points by jonbaer 14 hours ago | 11 comments
23996 points by Kaibeezy 14 hours ago | 135 comments
240