Skip Navigation

sweng @ sweng @programming.dev

Posts

1
Comments

206
Joined

2 yr. ago

1y ago

Android app for GitHub

Do you hapen to know where? Searching seems to give no results.

1y ago

Open Source Initiative tries to define Open Source AI

In theory, if you have the inputs, you have reproducible outputs, modulo perhaps some small deviations due to non-deterministic parallelism. But if those effects are large enough to make your model perform differently you already have big issues, no different than if a piece of software performs differently each time it is compiled.

1y ago

Open Source Initiative tries to define Open Source AI

The analogy works perfectly well. It does not matter how common it is. Pstching binaries is very hard compared to e.g. LoRA. But it is still essentially the same thing, making a derivative work by modifying parts of the original.

1y ago

Open Source Initiative tries to define Open Source AI

I don't see your point? What is the "source" for Mona Lisa I would use? For LLMs I could reproduce them given the original inputs.

Creating those inputs may be an art, but so could any piece of code. No one claims that code being elegant disqualifies it from being open source.

1y ago

Open Source Initiative tries to define Open Source AI

How is that different then e.g. patching a closed-sourced binary? There are plenty of community patches to old games to e.g. make them work on newer hardware. Architectural independence seems irrelevant, it's no different than e.g Java bytecode.

1y ago

Open Source Initiative tries to define Open Source AI

It would depend on the format what is counted as source, and what isn't.

You can create a picture by hand, using no input data.

I challenge you to do the same for model weights. If you truly just sit down and type away numbers in a file, then yes, the model would have no further source. But that is not something that can be done in practice.

1y ago

Open source LaTeX book first release

"Open source" and "source available" are different things. See e.g. https://opensource.org/osd and https://opensource.com/article/18/2/coining-term-open-source-software

1y ago

Proton Mail Discloses User Data Leading to Arrest in Spain

Pgp does not encrypt the whole email, only part of it.

1y ago

Should we replace democracy with science?

Sounds like a wildly unscientific statement, considering e.g ~10% of the US population works in STEM.

1y ago

Should we replace democracy with science?

How about the current system where we vote and do science?

1y ago

Should we replace democracy with science?

You forget a piece: "Given these observations, these objectives, and this bit of sound reasoning, ..."

Without objectives, no amount of reasoning will tell you what to do. Who sets the objectives?

1y ago

Someone got Gab's AI chatbot to show its instructions

The second LLM could also look at the user input and see that it look like the user is asking for the output to be encoded in a weird way.

1y ago

Someone got Gab's AI chatbot to show its instructions

Can you explain how you would jailbfeak it, if it does not actually follow any instructions in the prompt at all? A model does not magically learn to follow instructuons if you don't train it to do so.

1y ago

Someone got Gab's AI chatbot to show its instructions

You are using the LLM to check it's own response here. The point is that the second LLM would have hard-coded "instructions", and not take instructions from the user provided input.

In fact, the second LLM does not need to be instruction fine-tuned at all. You can jzst fine-tune it specifically for the tssk of answering that specific question.

1y ago

Someone got Gab's AI chatbot to show its instructions

Wouldn't it be possible to just have a second LLM look at the output, and answer the question "Does the output reveal the instructions of the main LLM?"

1y ago

WhatsApp's interoperability agreement doesn't allow GNU (A|L)GPL licenses

They actually did not. They clearly state (at least in the text posted by the OP) that you are not allowed to license under a version or derivative of the GPL if it would end up copyleft. The main condition is that it is licensed under a version of the GPL.

(To be clear, I'm talking about the second quote, about combining)

1y ago

WhatsApp's interoperability agreement doesn't allow GNU (A|L)GPL licenses

Just dual-license your software under the TNGPL (Totally Not GPL) license that just so happens to afford the same protections.

1y ago

Linking parts of the codebase such that changing one forces reviewing the other ?

By input coverage I just mean that you test with different inputs. It doesn't matter if you have 100% code coverage, if you only tested with the number "1", and the code crashes if you give it a negative number.

If you can prove that your code can't crash (e.g. using types), it's a lot more valuable then spending time thinking about potentially problematic inputs and writing individual tests for them (there ate tools thst help with this, but they are not perfect).

1y ago

Linking parts of the codebase such that changing one forces reviewing the other ?

Test coverage alone is meaningless, you need to think about input-coversge as well, and that's where you can spend almost an infinite amount of time. At some point you also have to ship stuff.

1y ago

Linking parts of the codebase such that changing one forces reviewing the other ?

Wouldn't static type checking solve most of these issues?