I couldn't figure out how I'm supposed to use my pen for scrolling (I decided to stop using mouse and shelved it) so I bought a macropad with 3 knobs and ascended to godhood.
My script had batch processing of the data with the batch sizes being set to 200. With that the result of communities for this month I was getting was 125. After setting the batch size to 1 making it process each community separately it changed to 313 so who knows if there are any other problems with the script. :p
My script may or may not have picked up the data correctly. I thought I had it working but during writing down the data I noticed that the new 196 blahaj community wasn't showing up in communities from this month and was categorised as 2023.06. After tinkering with the script a bit and rerunning the script it's now correctly categorised as from this month but who knows if there are any other errors. And due to writing this down manually I could have made some mistakes myself too. :p
New communities this month by instance: (potential typos) (I just pasted it into the lemmy ui only to realise it doesn't have a monofont ;-; (Edit: it seems to actually get rid of additional spaces anyway.))
The political stuff is leaking into any other community though. I've already blocked every politics specific and news specific community and a handful other communities and I still get a lot of that. I'm also blocking a few keywords as well, this stuff just gets through regardless of what you do and in no small amount either.
Lemmy is full of bad news, hate and toxicity while pixelfed is full of artists and people sharing cool pictures and their passion. Visiting my pixelfed account from 2023 a few days ago set such a contrast with what I see on lemmy that I started using it regularly. It really felt weird to visit such a wholesome place after using lemmy for a long time. Now I use pixelfed to balance out the bad stuff from lemmy for maintaining my mental health and to appreciate the art which is not common on lemmy.
This is a link detection and not an image one. Two wildly different things. It also wouldn't handle images with slight differences like an edit here and there because again, doesn't handle images. Same goes for varying levels of compression. In fact it wouldn't even detect the exact same images with different sources or when reuploaded by users. Even if there were people who source images from the same place it would still be irrelevant without an overwhelming share of the users doing that to make the feature actually relevant. And EVEN if there was this high coordination then any trackers, shorteners, arguments, etc. varying the link to the same source they would be treated as a different links without recognising them as a duplicate like with youtube for example. So users would need to be a literal mindhive to coordinate on this level and at this point the tools would be pointless because the knowledge would be shared between everyone anyway.
Having this feature would help immensely both as a poster and as a mod to handle the images with high probability of being a repost. But at the same time I know it isn't feasible due to image processing requiring quite a bit of computing power so it will continue to be a dream.
I use a bridge to matrix for private messages to the bot accounts, reports for posts for which there are multiple bot accounts on different instances because federation is broken for reports, and new posts to the communities (where the last one was merged just few hours ago). We are also contemplating getting ourselves the functionality to automatically message users when we take action on their post/comment.
It's crazy how far we have to go to make moderating stuff easier/more pleasant to do. I hope lemmy improves in that by a lot at some point.
My another gripe is no ability to detect image reposts because in image heavy communities they're very common and remembering what was posted and when is a massive pita. That would fall under a bot category and not integrated feature (but would be cool if it was deeply integrated into lemmy so situations where it would tell you if it's a repost BEFORE you even post it could be possible) but it's still something that makes it harder to moderate. Same goes for posting to other communities because you need to check if it was posted recently or not if you aren't chronically online to know that already.
I will just mention that including the direct link is friendly for mobile app users. Voyager for example keeps me on my account when clicking links that are in this format: https://lemmy.dbzer0.com/post/36114134
E: Lemmyverse.link takes me outside the app to the browser.
I have a vague feeling of seeing a movie like that years ago. What was the title?