[GH-ISSUE #80] Scaled instances and the deletion problem #60
Originally created by @geek-at on GitHub (Dec 29, 2018).
Original GitHub issue: https://github.com/HaschekSolutions/pictshare/issues/80
Now that the codebase has been rewritten, we can start thinking about the problem with scaling pictshare: deleting content.

Imagine two pictshare servers connected through a shared folder (ALT_FOLDER):

- An image is requested frequently, so both servers have a local copy and there is a copy in the shared folder.
- If the user wants to delete the image, it's deleted from the server that received the request and from the shared folder.
- The second server never gets any information about the deleted hash, so it keeps its copy.
Possible solutions:
@thomasjsn commented on GitHub (Mar 8, 2019):
I'm really loving that this app doesn't require a database; a centralized database will introduce some complexity. A list of deleted hashes in all storage controllers and a cron job is quite simple and would do the job. I'm guessing instant deletion is not really required.
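As a rough illustration of this idea (not pictshare's actual code; the file name and function name below are made up), each storage location could keep a plain-text blacklist that the delete path appends to, with the cron-driven comparison sketched further down the thread:

```python
# Minimal sketch: every storage location keeps a plain-text list of
# deleted hashes, and each processed delete appends to it.
# BLACKLIST_FILE and record_deletion are illustrative names only.
import os

BLACKLIST_FILE = "deleted_hashes.txt"

def record_deletion(storage_dir: str, hash_: str) -> None:
    """Append a deleted hash to the blacklist kept in one storage location."""
    path = os.path.join(storage_dir, BLACKLIST_FILE)
    with open(path, "a", encoding="utf-8") as f:
        f.write(hash_ + "\n")
```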
@cwilby commented on GitHub (Nov 19, 2019):
Just some thoughts:

- Each server maintains a list of peers.
- The first server is created (0), then the second server is created (1) and pointed to 0. Both 0 and 1 update their lists to `[0, 1]`.
- For each server added after 1, the new server (M) is pointed to any existing server (N). N iterates through every server in its list except itself (if N is 1, this subset is `[0]`) and sends an HTTP message telling each of them to add M to their list, making the list `[0, 1, M]` on every server.
- With this in place, when a server receives a delete request, it performs the delete, then sends a delete signal via HTTP to each server on its list (which should be up to date, given the above works).
TL;DR - I agree with making nodes communicate.
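For illustration only, a minimal sketch of that peer-list and delete-broadcast flow might look like the following. The `Node` class and the `/peers` and `/delete` endpoints are hypothetical, not part of pictshare:

```python
# Hypothetical sketch of the peer-broadcast idea described above.
import requests

class Node:
    def __init__(self, own_url: str):
        self.own_url = own_url
        self.peers: set[str] = set()  # URLs of every other known node

    def join(self, existing_node_url: str) -> None:
        """Register with one known node, which replies with the full peer list."""
        resp = requests.post(f"{existing_node_url}/peers",
                             json={"url": self.own_url}, timeout=5)
        self.peers = set(resp.json()["peers"]) - {self.own_url}

    def delete(self, hash_: str) -> None:
        """Delete locally, then tell every peer to do the same."""
        self._delete_local(hash_)
        for peer in self.peers:
            try:
                requests.post(f"{peer}/delete", json={"hash": hash_}, timeout=5)
            except requests.RequestException:
                pass  # an unreachable peer keeps its copy, the same old problem

    def _delete_local(self, hash_: str) -> None:
        print(f"{self.own_url}: removing {hash_} from local storage")
```

The obvious cost is that every node now has to know about and reach every other node, which is the complexity the next comment objects to.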
@geek-at commented on GitHub (Jan 5, 2020):
The problem with all nodes talking to each other is that it would complicate the whole project by a landslide.
I think the easiest way to implement it would be to have a list of deleted hashes that won't get re-used by chance, and this list should be copied and checked by all storage providers.
@cwilby commented on GitHub (Jan 6, 2020):
Sounds good. Where would the deleted hashes be stored? If each node has a copy, it would be similarly complex.
@geek-at commented on GitHub (Jan 7, 2020):
The easiest implementation would be a simple file where deleted hashes are stored.
This file should then be compared with the list on every storage controller, and every pictshare instance should periodically check this file for hashes to delete and check the storage controllers for updated hashes to add to its local list.
It's just a simple blacklist system. I think that could work.
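A hedged sketch of that reconciliation step, assuming a plain `deleted_hashes.txt` file in each storage location (all names and paths are illustrative, and a real script would actually remove the files instead of printing):

```python
# Periodic blacklist reconciliation: merge the deleted-hash lists from all
# storage locations, drop matching local content, and push the merged list back.
import os

BLACKLIST_FILE = "deleted_hashes.txt"

def read_list(directory: str) -> set[str]:
    path = os.path.join(directory, BLACKLIST_FILE)
    if not os.path.exists(path):
        return set()
    with open(path, encoding="utf-8") as f:
        return {line.strip() for line in f if line.strip()}

def write_list(directory: str, hashes: set[str]) -> None:
    with open(os.path.join(directory, BLACKLIST_FILE), "w", encoding="utf-8") as f:
        f.write("\n".join(sorted(hashes)) + "\n")

def reconcile(local_data_dir: str, storage_dirs: list[str]) -> None:
    """Pull deletions from every storage controller and converge all lists."""
    merged = read_list(local_data_dir)
    for d in storage_dirs:
        merged |= read_list(d)

    # Remove any local content that appears on the merged blacklist.
    for h in merged:
        local_copy = os.path.join(local_data_dir, h)
        if os.path.exists(local_copy):
            print(f"deleting local copy of {h}")  # os.remove()/shutil.rmtree() here

    # Write the merged list back so every location ends up with the same blacklist.
    write_list(local_data_dir, merged)
    for d in storage_dirs:
        write_list(d, merged)
```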
@cwilby commented on GitHub (Jan 7, 2020):
Yep, that sounds like it could work. Each node can be configured to communicate with a service to add/read deleted hashes. Would the service be the root pictshare instance, or something else?
@geek-at commented on GitHub (Jan 7, 2020):
I'm thinking a cron job, so admins can set their own intervals for comparing the blacklist, and deletions can take as much time as they need.
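A hypothetical cron wrapper, assuming the reconciliation logic sketched above lives in a standalone script; the script name, paths, and interval below are examples only:

```python
#!/usr/bin/env python3
# Hypothetical cron entry point, e.g. run every 15 minutes:
#   */15 * * * * /usr/bin/python3 /opt/pictshare/sync_deletions.py

def reconcile(local_data_dir: str, shared_dir: str) -> None:
    # Merge blacklists and remove blacklisted local copies, as sketched in
    # the earlier example; body omitted to keep this wrapper short.
    print(f"reconciling deletions between {local_data_dir} and {shared_dir}")

if __name__ == "__main__":
    reconcile("/opt/pictshare/data", "/mnt/shared/pictshare")
```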