Posts: 0 · Comments: 92 · Joined: 2 yr. ago

  • That uses a similar approach to the wake word technology, but slightly differently applied.

    I am not a computer or ML scientist but this is the gist of how it was explained to me:

    When your phone is idle, a low-powered chip stays connected to the microphone and runs a small local AI model (this is why it works offline) that asks one thing: is this music or not? Once that model decides it's music, it wakes up the main CPU, which looks up a snippet of that audio against a database of audio snippets corresponding to popular/likely songs and then displays a song match.
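
    A rough sketch of that two-stage flow, as I understand it. Everything here is made up for illustration: `toy_music_gate`, `toy_fingerprint`, `identify`, and `LOCAL_DB` are hypothetical names, the "gate" is just a loudness check standing in for a small trained model, and the "fingerprint" is a crude energy pattern rather than whatever real song ID systems use.

```python
from typing import Callable, Dict, Optional, Sequence, Tuple

Frame = Sequence[float]          # a short run of audio samples in [-1.0, 1.0]
Fingerprint = Tuple[bool, ...]   # toy signature of a frame


def toy_music_gate(frame: Frame, threshold: float = 0.1) -> bool:
    """Stage 1 (assumption): cheap always-on check, 'music or not music'.

    A real gate would be a tiny trained model on a low-power chip;
    this stand-in only checks average loudness.
    """
    if not frame:
        return False
    mean_abs = sum(abs(s) for s in frame) / len(frame)
    return mean_abs > threshold


def toy_fingerprint(frame: Frame, blocks: int = 4) -> Fingerprint:
    """Toy signature: did the energy rise between consecutive blocks?"""
    size = len(frame) // blocks
    energies = [
        sum(s * s for s in frame[i * size:(i + 1) * size])
        for i in range(blocks)
    ]
    return tuple(energies[i + 1] > energies[i] for i in range(blocks - 1))


def identify(
    frame: Frame,
    gate: Callable[[Frame], bool],
    local_db: Dict[Fingerprint, str],
) -> Optional[str]:
    """Stage 2 only runs if the gate fires; otherwise the 'main CPU' stays asleep."""
    if not gate(frame):
        return None  # nothing music-like heard, so no lookup happens
    # Offline lookup against the small on-device database.
    return local_db.get(toy_fingerprint(frame))


# Hypothetical on-device database of signatures for popular/likely songs.
LOCAL_DB: Dict[Fingerprint, str] = {
    (True, True, True): "Song A",
    (False, False, False): "Song B",
}
```

    Calling `identify(frame, toy_music_gate, LOCAL_DB)` returns `None` both when the gate never fires and when the signature isn't in the local database. The second case is roughly where an app could offer to send the snippet to an online database instead.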

    To answer your questions about how it's different:

    • the song ID happens with system-level access, so it doesn't go through the normal audio permission system, and thus wouldn't trigger the microphone access notification.
    • because it is using a low-powered detection system rather than always having the microphone on, it can run with much less battery usage.
    • As I understand it, it's a lot easier to tell whether audio sounds like music than whether it contains a specific intelligible word that you may or may not be looking for, which you then have to process into language that's linked to metadata, etc.
    • The initial size of the database is fairly small, as what is downloaded is a selection of audio patterns that the audio snippet is compared against. This database gets rotated over time, and the song ID apps often also allow you to send your audio snippet to the online megadatabases (Apple's music library/Google's music library) for better detection, but overall the data transfer isn't very noticeable. Searching for arbitrary hot words cannot be nearly as optimized as assistant activations or music detection, especially if it's not built into the system.

    And that's about it... for now.

    All of this is built on current knowledge of researchers analysing data traffic, OS functions, ML audio detection, mobile computation capabilities, and traditional mobile assistants. It's possible that this may change radically in the near future, where arbitrary audio detection/collection somehow becomes much cheaper computationally, or generative AI makes it easy to extrapolate conversations from low quality audio snippets, or something else I don't know yet.

  • That you apparently have the privilege to not be affected by the consequences of a Republican government doesn't invalidate the choices of those who would be, and voted accordingly.

    We are all complicit in the same way we are all complicit in the war crimes committed by America in the Middle East: most of us did not have a choice in the matter whatsoever. All we can do is demand that they stop.

    I'm not going to judge someone whose choices are "genocide" and "genocide even more, and even more local genocide" and picks the former.

  • I would hazard that 1/3 to 1/2 of that last third genuinely doesn't have any resources to stay aware and engaged in the electoral process, because they're already used up on surviving, and on their individual scale it doesn't feel like either choice makes a meaningful difference. The liberal establishment does itself no favors (and I say this as a leftist who votes Democrat).

  • They're targeting Gen Alpha though (and the one after that). Gen Z is just about out of K-12, but their younger siblings have to deal with stuff like PragerU being part of the official curriculum. I hope the teachers can work around it, but it looks grim.

  • It's a bit tiring that every single infringement on people's rights to exist has to be combatted via 1A because the only thing that trumps dAsTaRdLy BeHaViOr In FrOnT oF cHiLdReN is free speech.

    Is that literally the only framework US law sees? Can't it be illegal for lawmakers to force their views on people because they're hateful bigots?