this post was submitted on 05 Jul 2023
91 points (94.2% liked)

Fediverse


A community to talk about the Fediverse and all its related services using ActivityPub (Mastodon, Lemmy, KBin, etc).


I made this tool to help self-hosters, new admins, or smaller instances have more global and updated content on their instances.

This is similar to Lemmy Community Seeder but is designed to be run periodically to capture new communities, and to include EVERYTHING by default.

EDIT: As noted in the comments, this is an admin tool. Please do not run it as a user if you don't know what you are doing. If you want a better "All," ask your admin first! That said, lemmony in no way constitutes abuse! You can cause a DoS with curl, but that's not what curl was written for. This tool is meant to use an API legitimately to enhance our experience. Admins who want to accommodate high volume on a public service will not know this tool is running against, or on, their instances. If it causes performance issues, that is unfortunate. They are free to throttle, ban or block API access to their instance in a multitude of ways.

EDIT 2: Donate to your instance/admin if you like Lemmy!

[–] [email protected] 11 points 1 year ago* (last edited 1 year ago) (2 children)
[–] [email protected] 6 points 1 year ago (1 children)

Quite a bit of space could be saved with database compression. The database side of things has lower-hanging fruit right now, though.

[–] [email protected] 2 points 1 year ago* (last edited 1 year ago) (1 children)
[–] [email protected] 3 points 1 year ago (2 children)

Images are not federated, they only live on the hosting instance.

Thumbnails might be copied though, I'm not sure.

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago) (1 children)
[–] [email protected] 1 points 1 year ago (1 children)

There is some discussion. https://github.com/LemmyNet/lemmy/issues/2947

I am still fairly confident that it shouldn't be storing images, but I'll admit my pict-rs directory is growing quite fast compared to the database. Have to keep a close eye on this.
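For anyone else wanting to keep an eye on this, here is a minimal sketch of one way to measure how big a directory is, so you can log it over time and compare against your database size. It only walks the filesystem and knows nothing about pict-rs internals. The default path `./volumes/pictrs` is an assumption for illustration; point it at wherever your install actually keeps its pict-rs data.

```python
import os
import sys


def dir_size_bytes(root: str) -> int:
    """Sum the sizes of all regular files under root, skipping symlinks."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue
            try:
                total += os.path.getsize(path)
            except OSError:
                # File vanished or is unreadable; skip it.
                pass
    return total


def human(n: float) -> str:
    """Format a byte count using binary units."""
    for unit in ("B", "KiB", "MiB", "GiB"):
        if n < 1024:
            return f"{n:.1f} {unit}"
        n /= 1024
    return f"{n:.1f} TiB"


if __name__ == "__main__":
    # Hypothetical default path; pass your real one as the first argument.
    root = sys.argv[1] if len(sys.argv) > 1 else "./volumes/pictrs"
    print(root, human(dir_size_bytes(root)))
```

Run it from cron once a day and append the output to a file, and you get a rough growth curve without touching the instance itself.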

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago) (1 children)
[–] [email protected] 1 points 1 year ago

I'm not convinced either one of us knows what the software is SUPPOSED to do, and I am pretty sure nobody knows what it's actually doing. Here's another thread: https://github.com/LemmyNet/lemmy/issues/3163

[–] [email protected] 2 points 1 year ago (1 children)

Does the image get stored on the poster's instance or the instance hosting the community they're posting to?

[–] [email protected] 3 points 1 year ago
[–] [email protected] 2 points 1 year ago* (last edited 1 year ago) (2 children)

EVERYTHING by default. Also working on "discover only" for searching without the subscribe-to-everything. That said: from what I can see, it's far less than 3GB per day for EVERYTHING, plus you don't HAVE to keep it forever. Were you pulling in something other than text?

[–] [email protected] 3 points 1 year ago (1 children)

Do you have a link to documentation on retention/cleanup for instances?

[–] [email protected] 1 points 1 year ago

I don't. I haven't looked yet either because I haven't crossed that bridge. I think there were some admins on Matrix chatting about it though. It will become an issue for large instances in the near term, so I suspect someone will tackle it very soon, if they haven't already.

[–] [email protected] 0 points 1 year ago* (last edited 1 year ago) (1 children)
[–] [email protected] 4 points 1 year ago (1 children)

They're not supposed to, and don't call me friend, buddy.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago)