David Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · 2 days agoOpenAI o3 beats FrontierMath — because OpenAI funded the test and had access to the questionspivot-to-ai.comexternal-linkmessage-square10fedilinkarrow-up165cross-posted to: fuck_ai@lemmy.world
arrow-up165external-linkOpenAI o3 beats FrontierMath — because OpenAI funded the test and had access to the questionspivot-to-ai.comDavid Gerard@awful.systemsM to TechTakes@awful.systemsEnglish · 2 days agomessage-square10fedilinkcross-posted to: fuck_ai@lemmy.world
minus-squareBigMuffin69@awful.systemscakelinkfedilinkEnglisharrow-up6·1 day ago has data access to much but not all of the dataset. Huh! I wonder what part of the dset had the 25% of questions they got right in it 🙃
Huh! I wonder what part of the dset had the 25% of questions they got right in it 🙃