

I don’t know what you mean, I wasn’t the one who claimed they couldn’t do something they clearly can.
I’m an anarchocommunist, all states are evil.
Your local herpetology guy.
Feel free to AMA about picking a pet/reptiles in general, I have a lot of recommendations for that!


I don’t know what you mean, I wasn’t the one who claimed they couldn’t do something they clearly can.


You aren’t, and that’s exactly what I’m saying, it’s capable of doing these things with tools, therefore it’s capable of doing these things.


I’m academically interested, what I mean when I say I’m not interested is that I just don’t see the significance when we’re talking about if it’s capable of the task.


The calculator does not tell them if they’re getting closer? This isn’t how anything works. No I can’t say I’m very interested in whether or not the llm has access to python/a calculator as long as it completes the task, that doesn’t matter.


It does the math, it just uses a calculator.


That doesn’t change the fact that llm’s are capable of acing math olympiads. So what if it uses tools? You probably would too. I doubt anybody there did it without a calculator.


It is totally irrelevant that the model calls tools to do the math. That is still a success.


Whether they use tools to do it or not is entirely unimportant, that’s just how they do it?


They regularly win olympiad mathematics up from not standing a chance and just created a novel solution to the erdos conjecture, them counting the r’s in strawberry is inconsequential but also something they can do even if you just use the raw api or a local model.


I get that you hate AI but there’s no reason to lie about its capabilities.
I am not, you inaccurately said that the math olympiad was not bested by llm’s because they had a tool that told them if they were close but incorrect and can just try an infinite number of times. This is incorrect, they had a number of tries with python. This just isn’t a true statement. I think them besting it with use of python is equally significant and still counts as them besting it, and saying they can’t do math work is absurd.