Tweeted by aaronjmar
Tweeted by aaronjmars@1368626890802659330
fascinating paper from Microsoft good validation of my gut feeling that skills should self-evolve in a competitive environnement, related to a specific KPI
'lifts GPT–5.5 by +23.5 points on average over no skill in direct chat and by +24.8/ + 19.1 points under Codex and Claude https://t.co/32vcW9Eq1Y https://t.co/fMaQ9KfE7j