Skip Navigation

InitialsDiceBearhttps://github.com/dicebear/dicebearhttps://creativecommons.org/publicdomain/zero/1.0/„Initials” (https://github.com/dicebear/dicebear) by „DiceBear”, licensed under „CC0 1.0” (https://creativecommons.org/publicdomain/zero/1.0/)BJ
Posts
0
Comments
275
Joined
2 yr. ago

  • they did reproduce Tesla's claim in 2 of their 1/8 mile runs.

    Their claim was that it would win on the 1/4 mile

    Electric vehicles have no/single speed transmissions. Of course they can accelerate faster, because they don't have to have the driver operating the clutch and cycling through all the gears before they get up to pace.

    What they did is basically say Usain Bolt is the world's fastest marathon runner by filming his 100m race and "extrapolating"

  • Reddit has way more data than you would have been exposed to via the API though - they can look at things like user ARN (is it coming from a datacenter), whether they were using a VPN, they track things like scroll position, cursor movements, read time before posting a comment, how long it takes to type that comment, etc.

    no one at reddit is going to hunt these sophisticated bots because they inflate numbers

    You are conflating "don't care about bots" with "don't care about showing bot generated content to users". If the latter increases activity and engagement there is no reason to put a stop to it, however, when it comes to building predictive models, A/B testing, and other internal decisions they have a vested financial interest in making sure they are focusing on organic users - how humans interact with humans and/or bots is meaningful data, how bots interact with other bots is not

  • To compare every comment on reddit to every other comment in reddit's entire history would require an index

    You think in Reddit's 20 year history no one has thought of indexing comments for data science workloads? A cursory glance at their engineering blog indicates they perform much more computationally demanding tasks on comment data already for purposes of content filtering

    you need to duplicate all of that data in a separate database and keep it in sync with your main database without affecting performance too much

    Analytics workflows are never run on the production database, always on read replicas which are taken asynchronously and built from the transaction logs so as not to affect production database read/write performance

    Programmers just do what they're told. If the managers don't care about something, the programmers won't work on it.

    Reddit's entire monetization strategy is collecting user data and selling it to advertisers - It's incredibly naive to think that they don't have a vested interest in identifying organic engagement

  • Look at the picture above - this is trivially easy. We are talking about identifying repost bots, not seeing if users pass/fail the Turing test

    If 99% of a user's posts can be found elsewhere, word for word, with the same parent comment, you are looking at a repost bot

  • I know everyone here likes to circle jerk over "le Reddit so incompetent" but at the end of the day they are a (multi) billion dollar company and it's willfully ignorant to infer that there isn't a single engineer at the company who knows how to measure string similarity between two comment trees (hint: import difflib in python)

  • A couple of them fall into the "technically true, but misleading territory" - I'm sure the person handing this out couldn't identify which though - broken clock right twice a day and all

    "Can you reverse effects" - no you can't make your immune system forget how to work. Probably not what they are going for here though.

    "Risk of [...] or other side effects?" - yeah the vaccines generally give people a headache and short lived fever symptoms

    "Have there been deaths?" - The astrazeneca vaccine had like a 0.000001% mortality risk (more likely to die driving to the pharmacy), and was pulled in many countries because that was deemed too dangerous. Person handing out the flyer has likely been parroting "mRNA vaccines cause blood clots" nonsense for years while being completely unaware that AZ was a traditional viral vector vaccine

    "Are there doctors recommending NOT taking it" - yeah, there are many notable anti-vaccine doctors, what they typically have in common is they earned their doctorate in computer science, social studies, or some other field that gives them no qualifications to talk about immunology

  • They aren't talking about system administrators. They are talking about 3rd party software presenting a privilege escalation prompt (administrator access) and changing your default browser without you knowing about it

  • You could set it up in docker whilst still on windows, and then all you need to do is copy/paste your compose file onto your new Linux machine, that way you aren't struggling to learn two things at the same time (alleviates the "I don't know if the problem is with my docker config or my host OS")

  • Actually, I‘m just excluding companies like yours because they are making way too much revenue on the basis of FOSS without giving back

    You don't know anything about my company? You don't know what proportion of FOSS vs proprietary software we use, nor how much we give back lol.

    It would completely break the locked down proprietary software model and break walled gardens wide open.

    This is very pie in the sky. Your license idea only penalizes small to medium sized businesses. Alphabet's 1% would just go to Chromium/AOSP, and Meta's 1% would just go to React/Torch

  • You are probably better off setting up a non-profit and running traditional license fees through it into your payment union then. I can't emphasize how much of a non-starter 1% of revenues is for any business (it's my company's entire IT budget, including salary) - you are basically just saying "personal use only" with more words.