Skip Navigation

InitialsDiceBearhttps://github.com/dicebear/dicebearhttps://creativecommons.org/publicdomain/zero/1.0/„Initials” (https://github.com/dicebear/dicebear) by „DiceBear”, licensed under „CC0 1.0” (https://creativecommons.org/publicdomain/zero/1.0/)BI
Posts
6
Comments
293
Joined
2 yr. ago

  • I like and use NixOS, but if this is the first time OP is using Linux, I'd recommend sticking with something like Pop. When something goes wrong, there's a pretty good chance that there will already be a SO post for Pop or Ubuntu on how to fix it.

  • I imagine they're referencing the season 6 finale of Rick & Morty, where one is dropped and if it reaches the Earth's core the planet will blow up for sciencey plot reasons.

  • Also, at an old workplace I put that in my username in our chat app and it completely screwed up the UI for people that were using terminal apps for chat as soon as my username appeared and I had to take it out.

    You can also do things like use confusable characters to point someone at the wrong domain like pаypal.com.

  • Did one of your annoying nerdy friends hide some weird unicode in a message they sent to you? Copy/paste into this site to see exactly what dickery they're up to:

    https://babelstone.co.uk/Unicode/whatisit.html

    I say this because I'm that annoying friend that likes to slip a unicode right-to-left character in just to see what happens.

  • The article doesn't give enough information to know, unsurprisingly given that it's a spy agency. It could be something mundane like outlier detection for finding unusual events of some sort. It might be something more exciting like using an LLM to conduct targeted psy-ops campaigns. We'll probably never know, and the article isn't really that interesting.

  • The scary temperatures you see in news headlines are basically unaffected by the fires. Wikipedia has a good overview:

    https://en.m.wikipedia.org/wiki/Global_surface_temperature

    The overall issue with global warming is not that one place gets super hot once and sets a record. Otherwise I could make news headlines by setting my house on fire and getting "hottest temperature ever! (at my house)". Those local hotspots of fire will affect the average global temp only a tiny bit, because the earth is a big place and there's lots of places not currently on fire. The thing to worry about is the reverse actually: because the earth is warming, fires are increasing everywhere, and then everybody will be next to a fire on that blessed record-setting day.

  • Here's a transformer (GPT-4) describing this advancement in transformer architecture:

    Sure, let's break this down like you're five years old!

    Imagine you're playing with a big set of Lego blocks (think of these as your data). You have a big task: you need to sort all of these blocks based on their colors, size, and shapes (this is similar to processing information in a machine learning model).

    At first, you try to do it all by yourself (this is like the original Transformer model). It takes a long time, and your hands get really tired (this means the model takes a lot of time and computer memory).

    Then, your older brother comes in with a new plan. He tells you, "Hey, why don't we divide the work?" He helps you sort the blocks by colors, and you sort them by size and shapes (this is like the 'work partitioning' in FlashAttention-2).

    Also, your older brother suggests, "Let's do it together instead of one after another". So, while he's sorting by color, you're sorting by size and shape at the same time (this is the 'parallelism' in FlashAttention-2).

    This makes the job get done faster and you both don't get as tired (meaning FlashAttention-2 is faster and uses less memory).

    The new way of sorting is also super smart because it knows how to best use your toy boxes to keep your sorted Lego blocks (the 'IO-aware' part, with the toy boxes being the different levels of memory in the computer).

    FlashAttention-2 is like your older brother's plan, but for computer models. It helps them process big amounts of data more quickly and efficiently. This is great because it allows us to create even bigger and more exciting models that can understand things like language, pictures, and even generate music and videos.

    And just like you needed to have enough space and the right kind of Lego blocks to play this sorting game, to use FlashAttention-2, you need some specific computer parts (like certain types of GPUs), and programming stuff (like CUDA and PyTorch).

    This is a big deal because it makes our computer models better and faster. They can learn more and do more cool things, just like how you and your brother can sort more Lego blocks and have more fun with your new sorting plan!

  • Can you comment or is it documented somewhere how "active" you and the other admins are planning on being RE mods? i.e. do you see community management as being more hands off, and if there's a bad mod then people should make a new community, or would you want to step in and try to fix things? Reddit mostly took the approach of being hands off, which had some nasty side effects and is why a lot of the comments in this thread are wary of powermods. IMO if you and the other admins were to be proactive in modding the mods, that would probably solve a lot of people's worries.