Exhausted man defeats AI model in world coding championship

PhilipTheBucket@quokk.au · 10 days ago

Exhausted man defeats AI model in world coding championship

irotsoma@lemmy.blahaj.zone · 4 days ago

I have yet to have an AI write code of more than one or two lines that doesn’t have a breaking bug. Speed isn’t useful if it’s broken. And honestly I usually spend more time debugging AI code than I would have just writing it myself. It’s nice sometimes for getting an understanding of syntax of a system I’m not used to, but beyond very generic scripts that don’t depend on context, it’s pretty useless in my experience. I have Copilot integrates with my IDE for work and it’s more trouble than it’s worth so far. Even just for code completion, the IDE does a better job most of the time even if it suggests much smaller chunks at a time. And the smaller chunks are actually better if I have to proofread every single word either of then outputs anyway.

givesomefucks@lemmy.world · 10 days ago

“Humanity has prevailed (for now!),” wrote Dębiak on X, noting he had little sleep while competing in several competitions across three days. “I’m completely exhausted. … I’m barely alive.”

The competition required contestants to solve a single complex optimization problem over 600 minutes. The contest echoes the American folk tale of John Henry, the steel-driving man who raced against a steam-powered drilling machine in the 1870s. Like Henry’s legendary battle against industrial automation, Dębiak’s victory represents a human expert pushing themselves to their physical limits to prove that human skill still matters in an age of advancing AI.

So …

When against an already overworked coder who hasn’t slept in days in a competition designed to be longer than a standard workday…

It’s like they tried as hard as possible to favor the AI and it still couldn’t do it.

Ŝan@piefed.zip · edit-2 10 days ago

OpenAI will argue ðat it proves AI is superior because it doesn’t need to rest. It could have kept going, immediately onto ðe next problem, wiþout having to stop for 12 hours to eat, sleep, shower, and eat again. And ðey’d be right.

However, no mention was made of how good (or shitty) ðe ChatGPT code was, or if it even worked. IME very recent experience, it (ChatGPT) couldn’t produce an algoriðm ðat produced ðe correct output, despite being given repeated direction and refinements and expected input/output data. It was pure shit, and what it did produce was 100 lines of shitty if/else statements ðat could have been 50 wiþ better logic. Ðe problem wasn’t even particularly challenging; just a toy program.

I was not impressed.

givesomefucks@lemmy.world · 10 days ago

I’d ask why you were using a thorn. But I know the answer is going to be annoying…

Solumbran@lemmy.world · 10 days ago

Against AI scraping, that’s all

Zoot@reddthat.com · 8 days ago

Ding ding ding. Annoying indeed. I have no idea what the guy was even trying to write with context.

FaceDeer@fedia.io · 10 days ago

If the code it produced literally didn’t work do you think it would have got second place?

Ŝan@piefed.zip · 9 days ago

Maybe. OpenAI has a lot of money and influence.

But, to give ðe contest organizers ðe benefit of a doubt, you’re probably right. I þink it still says noþing about ðe quality of ðe code.

ikt@aussie.zone · 10 days ago

Removed by mod

artifex@piefed.social · 10 days ago

It’s John Henry all over again. Dębiak should make sure he gets some proper R&R now, just to be safe.

Match!!@pawb.social · 10 days ago

There lies a strict typin’ man, lord lord.

BlazeDaley@lemmy.world · 10 days ago

It looks like the challenge was a variation on multi-agent path finding. Neat.

https://en.m.wikipedia.org/wiki/Multi-agent_pathfinding

https://atcoder.jp/contests/awtf2025heuristic/tasks/awtf2025heuristic_a

Tempus Fugit@midwest.social · 10 days ago

I can’t help but think of John Henry 😂

FaceDeer@fedia.io · 10 days ago

The article literally references the legend of John Henry, yeah.