Posts: 1
Comments: 21
Joined: 2 yr. ago

  • One thing I would do differently is set up LDAP and OIDC so you can use the same authentication credentials for different apps (at least the ones that support them). I use LLDAP and Authelia for this purpose.

  • I found a VRAM calculator for LLMs here: https://huggingface.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator

    Wow, it seems that a 128K context size does need a lot of VRAM (~55 GB). Qwen 72B will take up ~39 GB, so you would need either 4x 24 GB Nvidia cards or a Mac Pro with 192 GB of RAM. The cheapest option would probably be to deploy GPU instances on a service like Runpod. I think you would have to do a lot of processing before you reach the breakeven point of owning your own machine.
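
The arithmetic behind the "4x 24 GB" estimate can be sketched as below. This is only a rough sum of the two approximate figures quoted above (~55 GB for the 128K context and ~39 GB for the model); the variable names are made up for illustration, and real usage will vary with quantization and runtime overhead.

```python
import math

# Approximate figures quoted in the comment above (from the VRAM calculator).
CONTEXT_GB = 55  # ~55 GB for a 128K context
MODEL_GB = 39    # ~39 GB for Qwen 72B
CARD_GB = 24     # VRAM on one 24 GB card

# Total memory needed is the model plus the context cache.
total_gb = CONTEXT_GB + MODEL_GB

# Round up, since a fraction of a card still means buying a whole card.
cards_needed = math.ceil(total_gb / CARD_GB)

print(f"Total: ~{total_gb} GB, which needs {cards_needed} x {CARD_GB} GB cards")
```

With these numbers the total is about 94 GB, which just fits in the 96 GB provided by four 24 GB cards.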

  • The context cache doesn't take up too much memory compared to the model. The main benefit of having a lot of VRAM is that you can run larger models. I think you're better off buying a 24 GB Nvidia card from a cost and performance standpoint.

  • I wasn't sure if it was AI or not. According to the description on GitHub:

    Utilizes state-of-the-art algorithms to identify duplicates with precision based on hashing values and FAISS Vector Database using ResNet152.

    Isn't ResNet152 a neural network model? I was careful to say neural network instead of AI or machine learning.

  • I migrated using the CLI tool to upload before external libraries came out, and it worked well for me. I think I would have preferred to use external libraries for things like my wedding photos, so there aren't duplicates on my server, but it's not a big deal.

  • I've always used a fingertip grip, even with small hands, and never thought it was weird. I just realized it might be because I played piano growing up. It feels more natural to have a C shape with my hand and space between the mouse and my palm.

  • Yeah, the power prices in my city are really high (USA). They're even higher than in Hawaii, from what I've heard. That's why I'm leaning towards the mini PC and SBC options, even if used server/desktop parts have better performance for the price.

  • Is your NAS in an old tower PC?

    I think I had the misconception that USB was slower than SATA, but USB-C is actually just as fast. And anything USB 3.0+ should be faster than 1 Gigabit Ethernet, I guess?
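
A quick way to sanity-check that comparison is to line up the nominal link rates. The sketch below uses the published signaling rates for each interface; real-world throughput is lower because of encoding and protocol overhead, and USB-C is only a connector, so the actual speed depends on which USB generation the port and enclosure support. The `faster_than` helper is a made-up name for illustration.

```python
# Nominal signaling rates in Gbit/s, before encoding and protocol overhead.
NOMINAL_GBIT_PER_S = {
    "Gigabit Ethernet": 1.0,
    "USB 3.0 (USB 3.2 Gen 1)": 5.0,
    "SATA III": 6.0,
    "USB 3.1 Gen 2 (USB 3.2 Gen 2)": 10.0,
}


def faster_than(a: str, b: str) -> bool:
    """Return True if interface a has a higher nominal rate than interface b."""
    return NOMINAL_GBIT_PER_S[a] > NOMINAL_GBIT_PER_S[b]


# Print the interfaces from slowest to fastest, with a rough MB/s equivalent.
for name, rate in sorted(NOMINAL_GBIT_PER_S.items(), key=lambda item: item[1]):
    print(f"{name}: {rate:g} Gbit/s (about {rate * 1000 / 8:g} MB/s before overhead)")
```

On paper, USB 3.0 at 5 Gbit/s sits slightly below SATA III at 6 Gbit/s, while 10 Gbit/s USB ports exceed it, and all of them are well above 1 Gigabit Ethernet. For a NAS accessed over a 1 Gbit/s network, the network link would be the bottleneck either way.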

  • Selfhosted @lemmy.world

    Best current hardware solution for selfhosting?