@TwilightKiddy I can see how you can get there, but the MITM would need to know the hashing algo, you canāt really just un-hash something, at least not reliably
But your original statement was that the hashing was the privacy violation, and thatās the part I took issue with, hashing is a generally accepted security measure, it is not inherently a privacy violation
@zbyte64 where am I wrong? The process is effectively the same: you get a set of training data (a textbook) and a set of validation data (a test) and voila, Iām trained
To learn how to draw an image of a thing, you look at the thing a lot (training data) and try sketching it out (validation data) until itās right
How the data is acquired is irrelevant, I can pirate the textbook or trespass to find a particular flower, that doesnāt mean Iām learning differently than someone who paid for it
@zbyte64 data quality, again, was out of the scope of what I was talking about originally
Which, again, was that legal precedent would suggest that the how is largely irrelevant in copyright cases, theyāre mostly focused on why and the scale of the operation
Iām not getting sued for copyright infringement by the NYT because I used inspect element to delete content to read behind their paywall, OpenAI is
@zbyte64 1) In no way is quality a part of that equation and 2) In what other contexts is quality ever a part of the equation? I mean I can go look at some Monets and paint some shitty water lillies, is that somehow problematic?
@zbyte64 from what I understand, youāre referring to the process at scaleāthe amount of information the AI can take in is inhumanāwhich Iām not disagreeing with
None of which is relevant to my original point: the scale of their operations, which has already been used countless times in copyright law
The scale at which they operate and their intention to profit is the basis for their infringement, how theyāre doing it would be largely irrelevant in a copyright case, is my point
@zbyte64 with everything you see you are scraping data from your environment whether you want to or not
How does a child learn what pain is? How does a teenager learn what heartbreak is? Itās certainly not because they made the decision to find that out themselves
@Subverb that is, quite impressively, the opposite of what I said
Is a person infringing on copyright by producing content? No. Itās about intent and scale. Humans donāt just sit on this knowledge, they do something with it
There is nothing illegal about WHAT itās doing, there is everything illegal about HOW and WHY
I very clearly stated that OpenAIās intent and their scale at which they operate are blatant copyright infringement and that it has been backed up with decades of precedents
@Pika@flopleash973 This is largely my thoughts on the whole thing, the process of actually training the AI is no different from a human learning
The thing about that, is that there's likely enough precedent in copyright law to actually handle that, with most copyright law it's all about intent and scale and I think that's likely where this will all go
Here the intent is to replace and the scale is astronomical, whereas an individual's intent is to add and the scale is minimal
@technomad Iāve been keeping a lot of my friends aware of whatās going on in the fediverse (especially around Threads, which theyāre more aware of) through a discord channel I run on our server thatās dedicated to whatās going on in the world of tech
Conceptually I prefer using the email analogy for how it actually works since thatās pretty close
Gatekeeping the fediverse isnāt good for it, get people to join up, you have control over what you see
@TwilightKiddy I can see how you can get there, but the MITM would need to know the hashing algo, you canāt really just un-hash something, at least not reliably
But your original statement was that the hashing was the privacy violation, and thatās the part I took issue with, hashing is a generally accepted security measure, it is not inherently a privacy violation