Posts: 3 · Comments: 544 · Joined: 2 yr. ago

  • Oh, yeah, in that case I agree.

    Either way, the "you have to move the cursor to purposefully hover on the ad, and then wait multiple seconds, and then not click anything" opt-in behavior on the ads hasn't changed since day one.

    So seeing news articles pretending this is anything new at all really just goes to show how shit "journalism" has gotten over the years. Literally like, 5s of looking this up and you'll find out this isn't anything new.

    SMH

  • I bought CCwGTV when it first came out, I don't recall the adbar at the top ever not being present.

    It was just strictly ads for TV shows / movies for the first while, and "external" ads for stuff like McDonald's only showed up later.

    But I'm pretty sure I remember ads for Black Widow on it front and center, and that was only 5? 6? months after it came out. (The ads were showing up 1-2 months prior to theatrical release iirc?)

  • So you think it's okay to spread misinformation if it's about a giant corporation?

    Misinformation is never okay, as it muddies the waters and makes it hard to know what you can trust. If we idly stand by and let a lemmy instance degrade to the point where garbage posts like this are commonplace, it becomes difficult to sift apart the actual news and stuff that matters from the shit deluge of misinfo.

    Which means, yes, calling out misinformation / shit posts even if it's about a megacorp, because shit in the water is still shit in the water.

  • I bought the Chromecast with Google TV (older version, not the newer 4K one, which I also own now) when it first came out.

    The home screen hasn't changed since day 1 when I plugged it in.

    It always had an ad at the top, and you always have had to purposefully move the cursor onto the ad itself and not touch anything for multiple seconds before it played.

    I owned the OG chromecast and chromecast 2 before that, and you are right, that one didn't have ads, but it also didn't have much of anything really.

  • It’s one thing to advertise a show or an app / service that is in the App Store but another to show actual ads.

    Most of the time it's this; only rarely does McDonald's or Harvey's have an ad like this, which you need to intentionally hover over for several seconds to play, and people turn it into rage bait garbage posts.

    90% of the time it's just an ad for a TV show or movie, and you still have to purposefully hover over it, unmoving, to start it playing. It's pretty opt-in.

  • I don't even notice it, the ad starts out small at the top and your cursor starts out on the Apps row, you have to very intentionally trigger the ad.

    90% of the time the ads are for movies or TV shows on the streaming services you have installed (and presumably an account for) anyways, so there's been a non-zero number of times where I did go and hover the ad to watch it cuz I was like "oh hey I actually wanna watch that, is it coming out soon? No shit!"

    The other 10% of the time it's McDonald's or Harvey's or whatever. I barely notice it, as I spend pretty much all of my time with the Google TV "inside" an app.

    Very little time gets spent on the home screen, it's a glorified Start menu to pick an app and open it up, so I don't, to be blunt, give much of a shit that for half a second I can see a Big Mac at the top of my TV screen before I click the 1 button to open Netflix.

    Also more often than not I use my phone app to push to the TV, so my process is:

    1. TV is turned off atm, I open an app on my phone (Netflix, Disney Plus, Crunchyroll, Amazon Prime, YouTube, etc)
    2. I click the cast button on my phone
    3. TV auto detects activity, starts turning on, meanwhile my Chromecast is already loading up the app and booting into it
    4. By the time my TV screen flips on, the app is opened as well and my content starts to play, so I never even saw the home screen in the first place

    End result: I rarely even see the app realistically anywho.

  • Nope.

    That's just how the ad at the top looks and always has, and yes, it plays if you hover over it, they always have, and yes, it expands out if you keep watching it and don't touch anything.

    If you use the UI normally the ad doesn't play, the person in that video explicitly played the ad.

    plays ad

    ad plays

    SurprisedPikachu.jpeg

    Get this clickbait shit outta here. It's literally an AI-generated article that stole content off a reddit post as its "source". Have some standards, people.

  • NYT hasn't actually won that case yet, so it's pointless to bring up. OpenAI has publicly stated that NYT has heavily misrepresented their findings.

    OpenAI's value would plummet and crash if they gained a reputation for using illegal material to train their AI on, investors would drop them so fast.

    This is just a simple fact. LLM providers' reputation is heavily staked on the legality of their data.

    So far the courts have ruled in these companies' favor.

    But it's extremely likely illegally scraped data from reddit would not pass the sniff test and would devastate an offending company's reputation.

    If you don't understand why, you have to do some brushing up on why these LLM services are worth so much and who is using them and for what. Once you understand that, it becomes extremely apparent why legally owning the entire history of every reddit post ever would be extremely valuable, and why a 5bil price tag is actually not that crazy.

  • Scraped data isn't legal to resell, scraping isn't even legal in the first place.

    Just because you can scrape the data doesn't mean it's worth anything.

    Companies like MS, Google, OpenAI, FB make money by selling the usage of their LLM services to other companies, who then use that service to make their own products.

    If it came to light that MS/Google/OAI/FB were using illegal training data for their LLMs, it would get all those other companies hit in the crossfire.

    So these companies have to do a shit tonne of diligence to assure their investors and clients that their LLMs are purely trained on legally obtained data and are safe to use.

    And you know what is a super easy way to assure them of that?

    If they literally own the original data themselves

  • Do you actually think this has any impact? That's silly.

    Reddit's servers have the original copy of every single post made, undoubtedly, and every time you edit your post, they store that copy too.

    So not only has everyone "poisoned" their data ineffectively, they literally have created training data of "before" vs "after" poisoning to compare the two for training the LLM against poisoned data.

    Whoever buys the right to that is going to have a pretty huge goldmine, and perhaps they will rent it out, or perhaps they'll use it themselves.
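
    To make the "before vs after" point concrete, here's a rough sketch of how stored edit history could be turned into paired examples. Everything in it is made up for illustration: the `PostRevision` record and `build_before_after_pairs` helper are hypothetical, and nothing here reflects how reddit actually stores or exposes revisions.

```python
from dataclasses import dataclass


@dataclass
class PostRevision:
    """Hypothetical record of one stored version of a post."""
    post_id: str
    revision: int  # 0 = original text, higher numbers = later edits
    text: str


def build_before_after_pairs(revisions):
    """Return (original_text, latest_text) for every post whose text changed."""
    by_post = {}
    for rev in revisions:
        by_post.setdefault(rev.post_id, []).append(rev)

    pairs = []
    for revs in by_post.values():
        revs.sort(key=lambda r: r.revision)
        original, latest = revs[0], revs[-1]
        # Only posts with a real edit give a usable before/after pair.
        if len(revs) > 1 and original.text != latest.text:
            pairs.append((original.text, latest.text))
    return pairs
```

    Posts that were never edited drop out, and each edited post yields one (original, overwritten) pair, which is the kind of labeled data the comment above is talking about.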

  • Not legally / free.

    And yes, that very very much matters if you intend to actually sell the service to companies that don't themselves want to get hit in the crossfire of potential lawsuits for building their products on top of stolen info.

    So if you can own the data itself (via buying reddit), you now have an ENORMOUS quantity of prime training data that your investors and potential customers know is legally clean, because you literally own it.

  • Most LLMs have tonnes of NSFW data in their training.

    Typically, if this needs to be blocked, a secondary RAG or LoRA is run on top to act as a filtering mechanism to catch, block, and regenerate explicit responses.

    Furthermore, output allowed lexicon is a whole thing.

    Unfiltered LLMs without these layers added on are actually quite explicit and very much capable of generating extremely NSFW output by default.
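
    The catch / block / regenerate loop described above looks roughly like this. It's a minimal sketch, not any vendor's actual pipeline: `generate_with_filter`, `toy_is_explicit`, and `make_toy_generator` are hypothetical names, and the keyword blocklist is just a stand-in for whatever secondary model does the real classification.

```python
from typing import Callable, Iterable


def generate_with_filter(
    generate: Callable[[str], str],
    is_explicit: Callable[[str], bool],
    prompt: str,
    max_attempts: int = 3,
    refusal: str = "[response blocked]",
) -> str:
    """Regenerate until the filter accepts an output or attempts run out."""
    for _ in range(max_attempts):
        candidate = generate(prompt)
        if not is_explicit(candidate):
            return candidate  # passed the filter, hand it to the user
    return refusal  # every attempt was caught, so block the response


# Toy stand-ins so the sketch runs on its own (assumptions, not real models).
BLOCKLIST = {"explicit"}


def toy_is_explicit(text: str) -> bool:
    """Stand-in classifier: flags text containing a blocklisted word."""
    return any(word in BLOCKLIST for word in text.lower().split())


def make_toy_generator(outputs: Iterable[str]) -> Callable[[str], str]:
    """Stand-in LLM that returns canned outputs in order."""
    it = iter(outputs)
    return lambda prompt: next(it)
```

    The point is that the base model is untouched; the filtering lives in a layer wrapped around it, which is why a model without that layer behaves so differently.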

  • I 100% can see it easily selling for that much.

    You want to know why it's worth that much?

    Petabytes of raw training data for LLMs. Arguably atm reddit is one of the better gold mines of LLM training data on the internet: bazillions of posts already formatted as post-response chains, which is the exact type of format an LLM wants to train on.

    Can you imagine how valuable those servers loaded with posts are to a company like OpenAI, Google, or Microsoft?

    5 billion is quite reasonable to harvest every reddit post that has ever been made ever and cut it off from your competitors.

  • writes several paragraphs of white nationalist racism and bigotry totally unprompted across multiple tweets

    gets called out for it

    "It was taken out of context"

    I don't think this person knows what the phrase "out of context" means, they are just parroting it.

  • If you wanna end up with billions of dead baby girls, sure, cuz that's what happens.

    Instead, the best way to curb overpopulation is to just improve education.

    Also we still have tonnes and tonnes of room for growth if we just stopped being incredibly wasteful. We produce multiple times the food to feed everyone on earth already, but almost all of it ends up thrown out and wasted.

    We need a fundamental shift in priorities, and better push on legislation to get food into people's mouths.

  • Ferrets are not completely domesticated, and are indeed classified as exotic. It's extremely difficult to insure them (in many places impossible), it's primarily exotic specialized vets that can help with them, and though they are extremely friendly and fun, they are at best "half" domesticated. They are also banned in many countries and states.

    Source: I have four ferrets and love them, but I also know exactly how putting your hand near a young ferret plays out every single time, and it takes months of training to get them to stop wanting to draw blood <3

  • That's a very very long list...

    Debian + Cinnamon desktop, which includes the countless tools that come with that stack.

    • Termux on my phone
    • Zsh as my debian shell
    • OpenSSH
    • OpenVPN
    • tmux + tmuxinator
    • neovim, and dozens of plugins/tools with that
    • dart
    • flutter
    • large chunks of Node.js and the npm ecosystem
    • dotnet framework and countless nuget packages
    • lazygit
    • stable diffusion
    • llama.cpp, and many tools built on top of that
    • k3OS running Rancher
    • my entire selfhosted stack on the above which includes but is not limited to:
      • Shinobi
      • Bitwarden
      • Gogs