Probably better to ask on !localllama@sh.itjust.works. Ollama should be able to give you a decent LLM, and RAG (Retrieval Augmented Generation) will let it reference your dataset.
The only issue is that you asked for a smart model, which usually means a larger one, and the RAG portion consumes even more memory on top of that, which may be more than a typical laptop can handle. Smaller models also have a higher tendency to hallucinate, i.e. produce confident but incorrect answers.
Short answer - yes, you can do it. It's just a matter of how much RAM you have available and how long you're willing to wait for an answer.
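To make the RAG part concrete, here's a minimal sketch of the retrieval step: embed your documents, embed the question, and stuff the closest chunk into the prompt. The vectors below are toy stand-ins for what a real embedding model (e.g. one served by Ollama) would return, and the document text is made up for illustration.

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Toy "embeddings" standing in for real embedding-model output.
docs = {
    "Invoices are due within 30 days.": [0.9, 0.1, 0.2],
    "The server room is on floor 3.":   [0.1, 0.8, 0.3],
}

def retrieve(query_vec, top_k=1):
    # Rank stored chunks by similarity to the query embedding.
    ranked = sorted(docs, key=lambda d: cosine(query_vec, docs[d]), reverse=True)
    return ranked[:top_k]

# A query embedding close to the first document's vector.
context = retrieve([0.85, 0.15, 0.25])[0]
prompt = f"Answer using this context:\n{context}\n\nQuestion: When are invoices due?"
print(context)  # -> Invoices are due within 30 days.
```

In a real setup the embeddings come from a model and the final prompt goes to the LLM; the retrieval logic itself is this simple, it's the embedding model and the vector store that eat the extra RAM.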
Also, to @projectmoon@lemm.ee, you might want to wait and see what gets announced at Computex next month. Hopefully they announce some new stuff and the current gen prices drop.
One thing to keep in mind about adding RAM: your speed could drop depending on how many slots you populate. For me, I have a 5700G, and with 2x16GB it runs at 3200MHz, but with 4x16GB (the same exact product) it only runs at 1800MHz. In my case, RAM speed has a huge effect on tokens/sec if a model has to use some system RAM.
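The reason RAM speed matters so much: LLM inference is usually memory-bound, so a rough upper bound on tokens/sec is memory bandwidth divided by the bytes read per token (roughly the model size). Here's a back-of-the-envelope sketch; the model size and the dual-channel DDR4 figures are my own ballpark assumptions, not measurements.

```python
def max_tokens_per_sec(bandwidth_gb_s, model_size_gb):
    # Memory-bound inference reads roughly the whole model once per token,
    # so bandwidth / model size gives a rough tokens/sec ceiling.
    return bandwidth_gb_s / model_size_gb

# Dual-channel DDR4: 2 channels * 8 bytes per transfer * transfer rate (MT/s).
ddr4_3200 = 2 * 8 * 3200 / 1000  # ~51.2 GB/s
ddr4_1800 = 2 * 8 * 1800 / 1000  # ~28.8 GB/s

model = 4.0  # assumed ~4 GB, e.g. a 7B model at 4-bit quantization

print(max_tokens_per_sec(ddr4_3200, model))  # ~12.8 tok/s ceiling
print(max_tokens_per_sec(ddr4_1800, model))  # ~7.2 tok/s ceiling
```

So dropping from 3200 to 1800 nearly halves the ceiling, which matches my experience whenever a model spills out of VRAM.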
You can check AMD's spec page for your processor, but they don't really document a lot of this stuff.
I guess "It’s not for everyone" is the real takeaway here. I'm not a phone guy in general, but I've been using cards since BK was still selling 99¢ Whoppers. I'm guessing both of us are ready to pay before the cashier has our order rung up.
To each their own. (I'm finally admitting that I'm fighting a losing battle on writing checks though.)
Have you tried the guide on AMD's site? It looks like it's for Windows, and I don't know what you're running. Plus, I use Ollama, so I probably can't be of much help.
For programming, my favorite is Dolphin-Mixtral, but I've had good results with Dolphin-Mistral and Llama2.
Same here. I guess I should have pointed out that I'm not really much of a phone guy to begin with. I don't install many apps, and I stay logged out of Google. To me, losing a phone really just means losing my pictures and videos. The most expensive phone I've ever had was $200.
Using a phone sounds inconvenient to me. I usually just pull my card out of my wallet, wave it over the terminal until I hear a beep, and that's it. Worst case scenario, I have to insert it into the chip reader or, God forbid, swipe it through the slot like some kind of Neanderthal.
I'm kidding, but seriously, that's easier than screwing around with a phone, to me.
I'm slightly pissed about the shrinkage, but really pissed that they don't come in packages anymore (at least not where I live). It's bad enough I have to scan my own groceries, but now I have to scan 8-12 individual eggs in a row. What's next?
I pirated a certain 'crash cars and shoot'em up' game because, even though I own it on Steam, the gameplay (especially the launcher) absolutely sucks.
No more automatically downloading online content when I don't even play online and no more updates breaking my mods. It's worked out so well that I'm looking at pirating other games I already own.
I really hope those patches make their way into the other distros. I've got a few Linux machines and the Steam Deck is the only one that wakes from sleep without locking up. It's also the only one that allocates VRAM for the iGPU automatically when a game needs more.