Communist

Communist@lemmy.frozeninferno.xyz · 8 hours

I am not, you inaccurately said that the math olympiad was not bested by llm’s because they had a tool that told them if they were close but incorrect and can just try an infinite number of times. This is incorrect, they had a number of tries with python. This just isn’t a true statement. I think them besting it with use of python is equally significant and still counts as them besting it, and saying they can’t do math work is absurd.

Communist@lemmy.frozeninferno.xyz · 9 hours

I don’t know what you mean, I wasn’t the one who claimed they couldn’t do something they clearly can.

Communist@lemmy.frozeninferno.xyz · 13 hours

You aren’t, and that’s exactly what I’m saying, it’s capable of doing these things with tools, therefore it’s capable of doing these things.

Communist@lemmy.frozeninferno.xyz · 15 hours

I’m academically interested, what I mean when I say I’m not interested is that I just don’t see the significance when we’re talking about if it’s capable of the task.

Communist@lemmy.frozeninferno.xyz · 19 hours

The calculator does not tell them if they’re getting closer? This isn’t how anything works. No I can’t say I’m very interested in whether or not the llm has access to python/a calculator as long as it completes the task, that doesn’t matter.

Communist@lemmy.frozeninferno.xyz · 19 hours

It does the math, it just uses a calculator.

Communist@lemmy.frozeninferno.xyz · 1 day

That doesn’t change the fact that llm’s are capable of acing math olympiads. So what if it uses tools? You probably would too. I doubt anybody there did it without a calculator.

https://www.nature.com/articles/d41586-025-02343-x

Communist@lemmy.frozeninferno.xyz · 1 day

It is totally irrelevant that the model calls tools to do the math. That is still a success.

Communist@lemmy.frozeninferno.xyz · 2 days

No.

https://www.nature.com/articles/d41586-025-02343-x

It’s lying

Communist@lemmy.frozeninferno.xyz · 2 days

Whether they use tools to do it or not is entirely unimportant, that’s just how they do it?

Communist@lemmy.frozeninferno.xyz · 2 days

They regularly win olympiad mathematics up from not standing a chance and just created a novel solution to the erdos conjecture, them counting the r’s in strawberry is inconsequential but also something they can do even if you just use the raw api or a local model.

Communist@lemmy.frozeninferno.xyz · 2 days

I get that you hate AI but there’s no reason to lie about its capabilities.