mirror of
https://github.com/s3fs-fuse/s3fs-fuse.git
synced 2026-04-25 13:26:00 +03:00
[GH-ISSUE #1391] After running for some time, s3fs starts failing with "failed: mkdir /var/lib/rexray/volumes/<bucket>: file exists" #743
Labels
No labels
bug
bug
dataloss
duplicate
enhancement
feature request
help wanted
invalid
need info
performance
pull-request
question
question
testing
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/s3fs-fuse#743
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @Makeshift on GitHub (Sep 11, 2020).
Original GitHub issue: https://github.com/s3fs-fuse/s3fs-fuse/issues/1391
Additional Information
I'm attempting to use s3fs along with the rexray/s3fs plugin on ECS in AWS. This is the userdata I'm using:
Everything seems fine for a little while, with tasks spawning and correctly connecting to volumes. After a little while (I'm not sure of the exact trigger yet), containers will fail to launch with the error:
"Handler for POST /v1.25/containers/<hash>/start returned error: error while mounting volume '': VolumeDriver.Mount: docker-legacy: Mount: <bucket>: failed: mkdir /var/lib/rexray/volumes/<bucket>: file exists"That folder doesn't exist on the host - so I'm assuming it's within the volume plugin.
From what I can tell, this happens on all buckets from that have previously been mounted after a period of time. Note in this case I do have multiple containers utilizing the same bucket - but this hasn't been an issue previously.
I'm shortly going to try building master instead of using the EPEL release to see if it changes anything.
If there's any other troubleshooting steps I can take, please let me know.
Version of s3fs being used (s3fs --version)
Amazon Simple Storage Service File System V1.87 (commit:unknown) with OpenSSL (Current version in AML2 EPEL)This was wrong - I was using the one provided by the official rexray plugin. I've since rebuilt the plugin using the latest s3fs version and am now testing with it.
Version of fuse being used (pkg-config --modversion fuse, rpm -qi fuse, dpkg -s fuse)
Kernel information (uname -r)
4.14.193-149.317.amzn2.x86_64GNU/Linux Distribution, if applicable (cat /etc/os-release)
s3fs syslog messages (grep s3fs /var/log/syslog, journalctl | grep s3fs, or s3fs outputs)
@Makeshift commented on GitHub (Sep 11, 2020):
I'm having quite a bit of trouble understanding the interactions between docker plugins/s3fs/rexray/libstorage/etc so I may be being really stupid and not realizing that the plugin isn't actually using the on-host s3fs. If that's the case, please let me know that building master won't do anything for me :D
@Makeshift commented on GitHub (Sep 11, 2020):
Ah, looks like this may be a rexray issue: https://github.com/rexray/rexray/issues/1336
@Makeshift commented on GitHub (Sep 11, 2020):
I've rebuilt the rexray plugin with the current master of s3fs (available here). Unfortunately now I'm getting a more unhelpful error:
:(
@gaul commented on GitHub (Sep 12, 2020):
Can you try running s3fs with
-d -fto see if there are any debug logs?@Makeshift commented on GitHub (Sep 14, 2020):
This seems to be some specific bug with repeatedly mounting and unmounting the same bucket(s) using Rexray and some craziness with how docker reuses plugins. I'm going to say this probably isn't an s3fs issue and is much more likely to be rexray, and because of the sheer amount of wrapping rexray does with s3fs, it's pretty hard to debug and get logs out of it.
My workaround is to simply mount the bucket on the host and pass it through to the ECS containers. Not quite as elegant, but it seems to work brilliantly!
@Poweranimal commented on GitHub (Mar 7, 2021):
@Makeshift I'm having the exact same problem.
It's super annoying that rexray for s3fs performs that badly.
Also, as you said, debugging it yourself is super difficult because of the lack of helpful logs.
If it helps anyone, I could resolve the issue by restart docker.
This, of course is not the solution and even a good workaround for the problem, but maybe someone can afford doing this.
I'd propose reopening this issue, because it doesn't seem to be resolved.
@himalreddy commented on GitHub (Mar 8, 2022):
I am also facing same issue. any solution for this problem?
@gaul commented on GitHub (Jun 12, 2022):
We need more information to debug further. If rexray does not offer the option to dump debug logs please open an issue against that project.