Even_Adder

11mo ago

NVIDIA: Copyrighted Books Are Just Statistical Correlations to Our AI Models * TorrentFreak

I think it's really disingenuous to mention the DeviantArt/Midjourney/Runway AI/Stability AI lawsuit without talking about how most of the infringement claims were dismissed by the judge.

11mo ago

YouTube creator sues Nvidia and OpenAI for ‘unjust enrichment’ for using their videos for AI training

Jump

This isn't about research into AI, what some people want will impact all research, criticism, analysis, archiving. Please re-read the letter.

11mo ago

NVIDIA: Copyrighted Books Are Just Statistical Correlations to Our AI Models * TorrentFreak

Jump

Damn, this article is so biased.

11mo ago

YouTube creator sues Nvidia and OpenAI for ‘unjust enrichment’ for using their videos for AI training

Jump

You should read this letter by Katherine Klosek, the director of information policy and federal relations at the Association of Research Libraries.

Why are scholars and librarians so invested in protecting the precedent that training AI LLMs on copyright-protected works is a transformative fair use? Rachael G. Samberg, Timothy Vollmer, and Samantha Teremi (of UC Berkeley Library) recently wrote that maintaining the continued treatment of training AI models as fair use is “essential to protecting research,” including non-generative, nonprofit educational research methodologies like text and data mining (TDM). If fair use rights were overridden and licenses restricted researchers to training AI on public domain works, scholars would be limited in the scope of inquiries that can be made using AI tools. Works in the public domain are not representative of the full scope of culture, and training AI on public domain works would omit studies of contemporary history, culture, and society from the scholarly record, as Authors Alliance and LCA described in a recent petition to the US Copyright Office. Hampering researchers’ ability to interrogate modern in-copyright materials through a licensing regime would mean that research is less relevant and useful to the concerns of the day.

11mo ago

[Serious] What do you think Disney's Recess characters are up to now, as adults?

Jump

It was a different word when this show aired. https://youtu.be/rMoDslz0EtI

11mo ago

dysphoria rule

Jump

One can imagine Grape-kun died happy.

11mo ago

Pokémon Mystery Dungeon: Red Rescue Team Heads To Nintendo Switch Online This Week | Retro Dodo

Jump

Yeah, I really don't get why this is news I have to keep hearing about.

11mo ago

due soon

Jump

Oh my god...

11mo ago

Olympic anime

Jump

That title looks like a typical sports anime title.

Haikyu!!
Kuroko's Basketball
Blue Lock
Hajime no Ippo
Ace of Diamond
Slam Dunk
Major
The Prince of Tennis
Yowamushi Pedal
One Outs
Baby Steps

I could go on, but I think I've made my point.

12mo ago

Can AI even be open source? It's complicated

Jump

Have you read this article by Cory Doctorow yet?

12mo ago

[Karl Jobst] Monster Hunter Has A Cheating Problem

Jump

He blames Monster Hunter being available on PC for cheating, but Monster Hunter has always had tools and cheats. An absolute trash take.

12mo ago

Say it.

Jump

She can't be fully 2 without the paws and face though.

12mo ago

Say it.

Jump

But without the face she isn't quite there yet.

12mo ago

Say it.

Jump

I'd say you're mostly safe since we're not even reaching step two.

12mo ago

my one (piece) rule

Jump

You should read One Piece.

12mo ago

AI Music Generator Suno Admits It Was Trained on ‘Essentially All Music Files on the Internet’

Jump

It should be fully legal because it's still a person doing it. Like Cory Doctrow said in this article:

Break down the steps of training a model and it quickly becomes apparent why it's technically wrong to call this a copyright infringement. First, the act of making transient copies of works – even billions of works – is unequivocally fair use. Unless you think search engines and the Internet Archive shouldn't exist, then you should support scraping at scale: https://pluralistic.net/2023/09/17/how-to-think-about-scraping/

Making quantitative observations about works is a longstanding, respected and important tool for criticism, analysis, archiving and new acts of creation. Measuring the steady contraction of the vocabulary in successive Agatha Christie novels turns out to offer a fascinating window into her dementia: https://www.theguardian.com/books/2009/apr/03/agatha-christie-alzheimers-research

The final step in training a model is publishing the conclusions of the quantitative analysis of the temporarily copied documents as software code. Code itself is a form of expressive speech – and that expressivity is key to the fight for privacy, because the fact that code is speech limits how governments can censor software: https://www.eff.org/deeplinks/2015/04/remembering-case-established-code-speech/

That's all these models are, someone's analysis of the training data in relation to each other, not the data itself. I feel like this is where most people get tripped up. Understanding how these things work makes it all obvious.

12mo ago

MKBHD worst reviewed AI boogaloo rule

Jump

Watching that made me sick.

12mo ago

Rule

Jump

I didn't expect to see a Megaman Battle Network reference here.

12mo ago

Sam Altman urges formation of US-led AI freedom coalition • The Register

Jump

Yeah, I'm fine with a coalition being formed as long as there are no people like Sam Altman in it.

12mo ago

Reddit changes have blocked all search engines except Google amid AI 'misuse' [U]

Jump

They don't train on random social media posts. Everything is sorted and approved.