Akisamb

11mo ago

Mozilla wants you to love Firefox again

They've got Thunderbird, which is, as far as I know, the only serious alternative to Outlook.

1y ago

neither will moving to the cloud

Now instead of just querying the goddamn database, a one line fucking SQL statement, I have to deal with the user team

Exactly, you understand very well the purpose of microservices. You can submit a patch if you need that feature now.

Funnily enough I'm the technical lead of the team that handles the user service in an insurance company.

Because people accessed our data directly without consulting us, we're now facing legal issues: they were using addresses to guess where people lived instead of going through our endpoints.

I guess some people really hate the validation that service layers have.

1y ago

Israel says its strike that killed aid workers was a mistake. Rights groups say it was no anomaly

Hamas claims 6000 of their militants were killed.

https://www.wionews.com/world/hamas-official-says-over-6000-fighters-killed-during-war-in-gaza-691701/amp

1y ago

Permanently Deleted

people who've never been laid

That was unnecessary. I know that people with poor social skills have more trouble with romance, but implying that all virgins are socially inept is a harmful stereotype. Luck is a big factor in finding relationships.

1y ago

Elon Musk's X pushed a fake headline about Iran attacking Israel. X's AI chatbot Grok made it up.

It's absolutely amazing, but it is also literally and technologically impossible for that to spontaneously coalesce into reason/logic/sentience.

This is not true. If you train these models on games of Othello, they keep an internal state of the board and use it to predict the next move played (1). To do addition and multiplication, they execute an algorithm on which they were not explicitly trained (although the GPT family is surprisingly bad at it, due to a badly designed tokenizer).

These models are still pretty bad at most reasoning tasks. But training them to predict the next word is a perfectly valid strategy. After all, the best way to predict what comes after the "=" in 1432 + 212 = is to do the addition.

1y ago

Israel says its strike that killed aid workers was a mistake. Rights groups say it was no anomaly

More than 33,000 Palestinians have been killed in Israel’s offensive, around two-thirds of them women and children, according to Gaza’s Health Ministry. Its count doesn’t distinguish between civilians and combatants.

The 33,000 figure includes Hamas combatants.

I'd say at least 20000 innocent civilians killed since the start of the conflict. Probably more as Israel seems to be quite trigger happy on civilians.

1y ago

Microsoft unbundles Office and Teams globally in new attempt to appease antitrust regulators

Now let's look at Office. Open an Excel spreadsheet with tables in any app other than excel. Tables are something that's just a given in excel, takes 10 seconds to setup, and you get automatic sorting and filtering, with near-zero effort. No, I'm not setting up a DB in an open-source competitor to Access. That's just too much effort for simple sorting and filtering tasks, and isn't realistically shareable with other people.

Am I missing something, or isn't it exactly the same thing in LibreOffice?

1y ago

Microsoft unbundles Office and Teams globally in new attempt to appease antitrust regulators

I don't believe that there are solutions as complete as Teams. For video and voice calls, it's among the best.

But it's so bad for text! Why do I have to wait for a second when I change channels? Why does it not support Markdown (the partial implementation that it has is arguably worse than no implementation at all)? Why is the search so bad?

1y ago

Rule

And cow feed is also grown with tons of pesticides, and you need much more of it for less material at the end.

I have a hard time seeing clothing with a bigger environmental impact than leather.

1y ago

EU moving towards total monetary surveillance and banning all anonymous payments

This is not true in France. Politicians proven to have committed fraud are arrested and charged. In France we have Sarkozy, Cahuzac and Fillon, who were all charged with crimes.

They were president, minister and presidential candidate respectively. I'd be surprised if it was different in the USA. I'm seeing that Trump is also being charged, so the system seems to be working.

1y ago

Using AI to spot edible mushrooms could kill you | AI tools are good for some things, but don’t trust your health to apps that make frequent mistakes

Convolutional neural networks and plant-identifying apps came before ChatGPT. Beyond both relying on neural networks, they don't have much in common.

1y ago

In Cringe Video, OpenAI CTO Says She Doesn’t Know Where Sora’s Training Data Came From

Don't know why you are downvoted; it's a good question.

As a matter of fact, it almost happened for search engines in France. Newspapers argued that snippets were leading people not to visit their ad-infested sites, thus losing them revenue.

https://techcrunch.com/2020/04/09/frances-competition-watchdog-orders-google-to-pay-for-news-reuse/

1y ago

White House: Future Software Should Be Memory Safe

Why would Java have an impact on battery performance? Pretty much all credit cards run Java for their encryption algorithms, and they need pretty much no power to run.

1y ago

Autonomous murderbot incinerated by SF Chinese New Year street partiers

I don't agree. Curvy roads are dangerous, but there are many more conflicts in cities. You're not going to have many pedestrians on curvy mountain roads.

That said, you are right that the ideal comparison would be in the same city. But I'm not sure that the data exists; I'll have to look this afternoon.

Even if my data is not perfect, it's much better than taking one accident and saying that self-driving cars are dangerous. They are not going to be magically better than humans, since driving is a difficult task, but we should at least crunch the numbers before dismissing them.

1y ago

Autonomous murderbot incinerated by SF Chinese New Year street partiers

You can't take one accident and use that to generalize.

You need to take into account all accidents and see how much worse humans are.

https://arstechnica.com/cars/2023/12/human-drivers-crash-a-lot-more-than-waymos-software-data-shows/

Cars are inherently dangerous. Robot cars are going to cause deaths no matter what. That does not make them bad if they lead to fewer cars and fewer accidents. Taxis, if done properly, can help a public transport system.

1y ago

Can I just convert to Judaism tomorrow and get a free vacation to Israel?

They gave them a birth control shot without properly informing them of what it was. Still scandalous, but not what you are saying.

1y ago

GPT told me to break my system

These models do not see letters but tokens. For the model, violet is probably two symbols: viol and et. Unless it learns by heart the number of letters in each token, the model has no way of knowing the number of letters in a word.

This is also why the GPT family sucks at addition: their tokenizer has symbols for common numbers like 14. This means that to do 14 + 1, the model cannot use the knowledge that 4 + 1 is 5, as it cannot see the link between the token 4 and the token 14. The Llama tokenizer fixes this, and is thus much better at basic arithmetic even with much smaller models.
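
To make the tokenizer point concrete, here is a toy sketch. It uses a greedy longest-match tokenizer and two made-up vocabularies; these are my assumptions for illustration, not the real GPT or Llama vocabularies or merge rules.

```python
def tokenize(text, vocab):
    """Greedy longest-match tokenizer over a toy vocabulary."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest vocabulary entry that matches at position i.
        for length in range(len(text) - i, 0, -1):
            piece = text[i:i + length]
            if piece in vocab:
                tokens.append(piece)
                i += length
                break
        else:
            # Nothing in the vocabulary matched: emit the single character.
            tokens.append(text[i])
            i += 1
    return tokens


# Hypothetical vocabularies, chosen only to mirror the examples above.
digit_vocab = set("0123456789+=")
merged_vocab = digit_vocab | {"14", "viol", "et"}

merged = tokenize("14+1=", merged_vocab)  # ['14', '+', '1', '=']
split = tokenize("14+1=", digit_vocab)    # ['1', '4', '+', '1', '=']
word = tokenize("violet", merged_vocab)   # ['viol', 'et']
```

With the merged vocabulary, the model receives 14 as one opaque symbol and never sees the 4 inside it. With the digit-level vocabulary, the 4 is visible, so what the model knows about 4 + 1 can carry over.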

2y ago

AI girlfriend bots are already flooding OpenAI’s GPT store

Yes to your question, but that's not what I was saying.

Here is one of the most popular training datasets: https://pile.eleuther.ai/

If you look at the PDF describing the dataset, you'll find that these documents are fairly short, with the mean length being less than 20 KB (20,000 characters) for most documents.

You are asking for a model to retain a memory for the whole duration of a discussion, which can be very long. If I chat for one hour, I'll type approximately 8,400 words, or around 42 KB. That is longer than most documents in the training set. If I chat for 20 hours, it'll be longer than almost all the documents in the training set. The model needs to learn how to extract information from a long context, and it can't do that well if the documents on which it trained are short.
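
Here is the back-of-the-envelope arithmetic as a small sketch. The 5 bytes per word is my assumption (roughly four letters plus a space), picked because it is what turns 8,400 words into 42 KB; the 20 KB reference is the mean document length mentioned above.

```python
WORDS_PER_HOUR = 8400        # typing rate used in the comment above
BYTES_PER_WORD = 5           # assumption: ~4 letters plus one space
MEAN_DOCUMENT_KB = 20        # upper bound on mean document length quoted above


def chat_size_kb(hours):
    """Approximate size of a chat transcript in kilobytes (1 KB = 1000 bytes)."""
    return hours * WORDS_PER_HOUR * BYTES_PER_WORD / 1000


one_hour_kb = chat_size_kb(1)       # 42.0
twenty_hours_kb = chat_size_kb(20)  # 840.0
```

So a single hour of chat already exceeds the 20 KB mean, and 20 hours is more than 40 times larger.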

You are also right that during training the text is cut off. A value I often see is 2k to 8k tokens. This is arbitrary; some models are trained with a cutoff of 200k tokens. You can use models on context lengths longer than what they were trained on (with some caveats), but performance falls off badly.
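
A minimal sketch of what cutting off at a fixed context length looks like. This is a simplification I am assuming for illustration: real training pipelines differ in the details (packing several documents together, overlapping windows, and so on), and `split_into_windows` is a hypothetical helper, not a function from any library.

```python
def split_into_windows(token_ids, context_length):
    """Cut a token sequence into consecutive windows of at most context_length."""
    return [
        token_ids[start:start + context_length]
        for start in range(0, len(token_ids), context_length)
    ]


document = list(range(10))  # stand-in for 10 token ids
windows = split_into_windows(document, context_length=4)
# windows == [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
```

Each window is seen by the model on its own, so nothing in the second window can refer back to the first during training.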

2y ago

AI girlfriend bots are already flooding OpenAI’s GPT store

There are two issues with large prompts. One is linked to the current language technology, where the computation time and memory usage scale badly with prompt size. This is being solved by projects such as RWKV or Mamba, but these remain unproven at large sizes (more than 100 billion parameters). Somebody will have to spend some millions to train one.

The other issue will probably be harder to solve. There is less high-quality long-context training data. Most datasets were created for small-context models.
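
To illustrate the first issue, here is a rough sketch of the scaling. It assumes standard self-attention, where every token is compared with every other token, so the score matrix has one entry per pair of tokens. The prompt sizes are arbitrary examples, and real implementations have optimizations this ignores.

```python
def attention_score_entries(prompt_tokens: int) -> int:
    """Entries in one attention score matrix: one per (query, key) token pair."""
    return prompt_tokens * prompt_tokens


short_prompt = 2_000
long_prompt = 8_000

# A prompt 4 times longer needs 16 times as many score entries.
growth = attention_score_entries(long_prompt) / attention_score_entries(short_prompt)
# growth == 16.0
```

That quadratic growth applies per attention head and per layer, which is why long prompts get expensive quickly.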

2y ago

Hydroxychloroquine could have caused 17,000 deaths during COVID, study finds

Didier Raoult, for a large part. He was the one who published the paper that really started this whole mess. His shoddy research practices and lack of respect for patients did plenty of harm.

Good thing that they've forced his retirement.