Anthropic says Alibaba must be punished for largest Claude cloning attack

jaxxed@lemmy.world · 2 hours

New Qwen release incoming!

Q: if you steal a stolen thng, is it stealing?

flango@lemmy.eco.br · 53 minutes

What’s the science behind cloning?

duckCityComplex@lemmy.world · 54 minutes

The article is not clear on what a “distillation attack” is… what exactly is Alibaba supposed to be getting away with here? The article mentions using many different connections through obfuscation networks and proxies… so that would get them around rate limiting, and maybe enable them to submit many queries on free accounts… just spin up a new account whenever you hit the token limit of an unpaid account. So basically it’s a terms of service violation?

I don’t see why it’s necessarily a huge leg up for a competitor… they are just using the outputs of another model as training data. They still need to train their model, which is the expensive and energy intensive part.

It sounds to me like Anthropic just wants the US Government to help enforce its TOS internationally and force Alibaba to pay for those precious tokens? Because apart from that piece, the “attack” just seems like normal use of the service. If Anthropic’s service has an inherent vulnerability, that’s their problem.

Of course all the other comments about how they stole all their training data in the first place are spot on.

boonhet@sopuli.xyz · 4 hours

Claude’s still there, seems Alibaba’s attack wasn’t really all it’s cracked up to be.

Now the US gov’s attack seems to be working since Claude Fable 5 is still not there.

vrighter@discuss.tchncs.de · 9 hours

you can’t just call anything you don’t like “an attack”

weimaraner_of_doom@piefed.social · 10 minutes

How about “terrorism” or “national security threat”?

kleber_gueriero@lemmy.world · 33 minutes

Exactly!

Still, stop attacking me.

SkaveRat@discuss.tchncs.de · 4 hours

I declare an attack!

madcaesar@lemmy.world · 4 hours

You cannot just declare it. That doesn’t do anything…

ILikeTraaaains@lemmy.world · 8 hours

Stop attacking me!

Pika@sh.itjust.works · 14 hours

oh no, the data I stole is being stolen, whatever shall I do.

In other news, does anyone know a good source for crocodile tears? I ran out.

ChaosMonkey@lemmy.dbzer0.com · 2 hours

Sounds like Google complaining about scraping.

small violin

100_kg_90_de_belin@feddit.it · 8 hours

Enclosure or inclosure[a] is a term, used in English landownership, that refers to the appropriation of “waste”[b] or “common land”[c], enclosing it, and by doing so depriving commoners of their traditional rights of access and usage.

uuj8za@piefed.social · 19 hours

bigbangdangler@reddthat.com · 18 hours

Lol corporate thieves bitching about other corporate thieves is the funniest part of 2026

Zarxrax@lemmy.world · 20 hours

Nooooo, you can’t train on OUR data! That’s illegal!!!1

terabyterex@lemmy.world · 11 hours

i am not defending anyone here just a correction. if allibava just wanted data they would get it the same way anthropic did. Alibaba is distilling the model. they are cloning claudes capabilities. basically they are using claude to teach a model to behave the way claude does.

youmaynotknow@lemmy.zip · 4 minutes

Why the fuck would they do that if Anthropic is being kind enough to just give them that data (regardless of how it makes them be butthurt)?

FooBarrington@lemmy.world · 2 hours

If every AI company steals the public data separately, it means massively increased costs for everyone who is getting their data stolen. If the AI companies “steal” from each other it’s much better for everyone else.

TheBlackLounge@lemmy.zip · 5 hours

No difference. Distillation is a valid and useful way of generating data to improve or make new models. It’s still just example data to be trained on. Anthropic is doing the same with their own models, and inadvertently every other model through web scraping.

The legal difference is that this data is uncopyrightable. At most it’s a TOS breach, nothing major.

halcyoncmdr@piefed.social · 11 hours

Seems like it’s up to Anthropic to teach it’s AI model not to pimp itself out.

Holytimes@sh.itjust.works · 5 hours

Ai is already dog shit, there is a level of concern if we start getting ai incest and the absolute fucking retards shoving this shit everywhere goes from using unethical ai to unethical incest ai.

There’s no way this isn’t just going to make everything worse.

I have zero sympathy for anthropic but can we not make a shit situation worse and just be ok with that cause the first dude is Hitler and the second dude is mega Hitler.

Ok the flip side if this some how creates a more efficient and power conservative model that doesn’t fuck over consumers and the environment as hard.

PIRATE MORE ALIBABA YOU DA CHAMP

mememuseum@lemmy.world · 5 hours

I welcome it getting worse. The worse it gets the faster it will collapse.

[object Object]@lemmy.ca · 19 hours

Okay, so Anthropic distills MY copywriter data and it’s fine.

Alibaba distills Anthropic non-copywritable and that demands retaliation at the nation state level.

Fuck off. The rules are abundantly clear.

whaleross@lemmy.world · 19 hours

They stole to monetize without paying in money or attribution what we stole to monetize without paying money or attribution!

Hackworth@piefed.ca · 19 hours

Well, they did pay, just after the fact.

melroy@kbin.melroy.org · 17 hours

Well they didn’t pay me. But still used all my open source mit licensed code to train their model. And now I need to rent their compute back.

timochka@lemmy.zip · 10 hours

I mean sure, Anthropic are pricks, but “they did exactly what the license I put on my code said they could” is probably not the way to highlight that.

Womble@piefed.world · 16 hours

You chose to publish under that essentially says “do whatever, I dont care”. I can understand people who wrote GPL code being peeved, but writing stuff under MIT is pretty much designed to let companies take it and not give back.

brsrklf@jlai.lu · 20 hours

Is there a scenario in which they both lose? I’ll take that.

Franconian_Nomad@feddit.org · 19 hours

Alibabas Qwen were among the first open weights models that were actually useful and can be run on consumer hardware without too much difficulties.

If they continue with that, they will hurt the business model of the big AI companies significantly, accelerating the burst of the bubble.

Barbecue Cowboy@lemmy.dbzer0.com · 18 hours

I heard there was some new AI model that was so amazing for cyber security that they had to limit access to it. It’s just too bad Anthropic couldn’t use that.

toiletobserver@lemmy.world · 19 hours