Audalin

3mo ago

[April 2025] Which model are you using?

That's the ones, the 0414 release.

3mo ago

[April 2025] Which model are you using?

QWQ-32B for most questions, llama-3.1-8B for agents. I'm looking for new models to replace them though, especially the agent one.

Want to test the new GLM models, but I'd rather wait for llama.cpp to definitely fix the bugs with them first.

5mo ago

Anyone found "optimal" settings for llama.cpp partial offload?

Jump

What I've ultimately converged to without any rigorous testing is:

using Q6 if it fits in VRAM+RAM (anything higher is a waste of memory and compute for barely any gain), otherwise either some small quant (rarely) or ignoring the model altogether;
not really using IQ quants - afair they depend on a dataset and I don't want the model's behaviour to be affected by some additional dataset;
other than the Q6 thing, in any trade-offs between speed and quality I choose quality - my usage volumes are low and I'd better wait for a good result;
I load as much as I can into VRAM, leaving 1-3GB for the system and context.

11mo ago

is there a genre of written work specifically concerned with the conception and procedure of literary works, or is it all random interviews or annotated guides?

Jump

For Tolkien's work, there is the twelve volume "The Complete History of Middle Earth" which is about as inside baseball as you can get for Tolkien.

I'd replace HoME with Parma Eldalamberon, Vinyar Tengwar and other journals publishing his early materials here.

11mo ago

is there a genre of written work specifically concerned with the conception and procedure of literary works, or is it all random interviews or annotated guides?

Jump

Recommending Italo Calvino's Six Memos for the Next Millennium, the lectures he has been preparing shortly before his death.

Not an assembly guide for a work of literature, but it'll help your own process if it's already ongoing and you want to improve.

The lectures also have some comments on what Calvino himself was doing here and there and why.

11mo ago

Are there any good open source text-to-music models, preferably with lyrical abilities?

Jump

ChatMusician isn't exactly new and the underlying dataset isn't particularly diverse, but it's one of the few models made specifically for classical music.

Are there any others, by the way?

12mo ago

What is the secret to making LED light bulbs last as long as the package says?

Jump

The Phoebus cartel strikes again!

12mo ago

Why are weather apps so bad at telling you the current weather?

Jump

Because we have tons of ground-level sensors, but not a lot in the upper layers of the atmosphere, I think?

Why is this important? Weather processes are usually modelled as a set of differential equations, and you want to know the border conditions in order to solve them and obtain the state of the entire atmosphere. The atmosphere has two boundaries: the lower, which is the planet's surface, and the upper, which is where the atmosphere ends. And since we don't seem to have a lot of data from the upper layers, it reduces the quality of all predictions.

12mo ago

Youtube link doesn't suggest video title and the descriptor is in what looks like Russian. I don't live in Russia or use their version of YT. What's going on??

Jump

Given the fact that there was an unintentional DDOS when federated Lemmy instances were requesting the same preview around the same time, it must be one of LW's servers, not anything on your side.

The only sure way to get rid of this effect is to use an instance entirely hosted on servers in anglophone countries, I think.

12mo ago

Youtube link doesn't suggest video title and the descriptor is in what looks like Russian. I don't live in Russia or use their version of YT. What's going on??

Jump

I know Google likes to localise their websites based on IP addresses. Perhaps the preview was requested from a Russian IP? (not necessarily yours, could be a VPN if you use one or one of LW's servers)

1y ago

Introverts use more concrete language than extraverts | BPS

Jump

This is not to say that Jung wasn't a genius. Jung was THE BOMB DIGGIDITY (which, by the way, I wish was an official term in the Oxford dictionary).

If they love Jung so much (which I agree they should because Jung was amaaaaazing), why don't they honor him by using the spelling he actually used?

Love etymological articles with unreliable narrators.

1y ago

Cloudflare is bad. Youre right.

Jump

It would. But it's a good option when you have computationally heavy tasks and communication is relatively light.

1y ago

Cloudflare is bad. Youre right.

Jump

Once configured, Tor Hidden Services also just work (you may need to use some fresh bridges in certain countries if ISPs block Tor there though). You don't have to trust any specific third party in this case.

1y ago

Researchers claim GPT-4 passed the Turing test

Jump

If config prompt = system prompt, its hijacking works more often than not. The creators of a prompt injection game (https://tensortrust.ai/) have discovered that system/user roles don't matter too much in determining the final behaviour: see appendix H in https://arxiv.org/abs/2311.01011.

1y ago

best foss app for OCR?

Jump

Like Firefox ScreenshotGo? (I think it only supports English though)

1y ago

[Paper] Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in SOTA Large Language Models

Jump

Don't know much of the stochastic parrot debate. Is my position a common one?

In my understanding, current language models don't have any understanding or reflection, but the probabilistic distributions of the languages that they learn do - at least to some extent. In this sense, there's some intelligence inherently associated with language itself, and language models are just tools that help us see more aspects of nature than we could earlier, like X-rays or a sonar, except that this part of nature is a bit closer to the world of ideas.

1y ago

If human skin was sometimes completely patterned (eg. spots, stripes, etc.), what pattern (if any) would you want to have?

Jump

https://arxiv.org/abs/2305.17743

1y ago

Yesterday, the temperature was in the 80's.

Jump

The temperature here was very interesting for a second or two until I remembered some people use °F.

1y ago

Chrome: 72 hours to update or delete your browser.

Jump

xkcd.com is best viewed with Netscape Navigator 4.0 or below on a Pentium 3±1 emulated in Javascript on an Apple IIGS at a screen resolution of 1024x1. Please enable your ad blockers, disable high-heat drying, and remove your device from Airplane Mode and set it to Boat Mode. For security reasons, please leave caps lock on while browsing.

https://xkcd.com

1y ago

Chrome: 72 hours to update or delete your browser.

Jump

CVEs are constantly found in complex software, that's why security updates are important. If not these, it'd have been other ones a couple of weeks or months later. And government users can't exactly opt out of security updates, even if they come with feature regressions.

You also shouldn't keep using software with known vulnerabilities. You can find a maintained fork of Chromium with continued Manifest V2 support or choose another browser like Firefox.