mirror of
https://github.com/s3fs-fuse/s3fs-fuse.git
synced 2026-04-25 05:16:00 +03:00
[GH-ISSUE #2294] Copy from S3 bucket is slow #1143
Originally created by @vitalyk-multinarity on GitHub (Aug 27, 2023).
Original GitHub issue: https://github.com/s3fs-fuse/s3fs-fuse/issues/2294
Additional Information
Version of s3fs being used (s3fs --version): V1.93 (commit:82107f4) (the same results with v1.90)
Version of fuse being used (pkg-config --modversion fuse, rpm -qi fuse or dpkg -s fuse): 2.9.9-5ubuntu3
Kernel information (uname -r): 6.2.0-1009-aws
GNU/Linux Distribution, if applicable (cat /etc/os-release): PRETTY_NAME="Ubuntu 22.04.2 LTS"
How to run s3fs, if applicable
Used command line
s3fs syslog messages (grep s3fs /var/log/syslog, journalctl | grep s3fs, or s3fs outputs)
Details about issue
Copying from an S3 bucket is very slow: I have a test bucket with 3.7GB of data. "aws s3 cp --recursive s3://my-test/ /tmp" takes 1 minute, but "cp -r my-mount-point /tmp" takes more than 10 minutes.
Interestingly, the first 1.3GB was copied in 1 minute, but after that copying became much slower...
@gaul commented on GitHub (Aug 27, 2023):
If the bucket contains many files then AWS CLI will be faster since it copies in parallel while cp copies serially.
@vitalyk-multinarity commented on GitHub (Aug 28, 2023):
@gaul - thank you!
Yes, I have a few big files (movies), plus thousands of small images.
Somehow I was sure that s3fs is multithreaded.
@gaul commented on GitHub (Aug 28, 2023):
s3fs is multithreaded but if the application does not copy in parallel then s3fs cannot operate in parallel. So if you use something like https://superuser.com/a/536643 s3fs performance should improve.
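To illustrate what "copy in parallel" could look like from the application side, here is a minimal Python sketch. It is not taken from the linked answer or from s3fs itself; the function name parallel_copy and the default of 16 workers are assumptions made for this example. It walks the mounted directory and hands each file to a thread pool, so several reads are in flight against the mount at once instead of one at a time as with cp -r.

```python
import shutil
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def parallel_copy(src_root, dst_root, workers=16):
    """Copy every regular file under src_root into dst_root using a thread pool.

    Hypothetical helper for illustration; `workers` is an arbitrary default,
    not a value recommended by s3fs.
    """
    src_root, dst_root = Path(src_root), Path(dst_root)
    # Collect the file list up front; directories are created on demand below.
    files = [p for p in src_root.rglob("*") if p.is_file()]

    def copy_one(src):
        dst = dst_root / src.relative_to(src_root)
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dst)  # copies data plus basic metadata
        return dst

    # Each worker thread issues its own read against the mount, so many
    # small files can be fetched concurrently.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(copy_one, files))
```

For the case in this issue, a call such as parallel_copy("my-mount-point", "/tmp/copy") would be the rough equivalent of the cp -r command. Whether it approaches the AWS CLI's speed depends on file sizes, the worker count, and the s3fs mount options, so the worker count is worth tuning rather than taking as given.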