mirror of
https://github.com/s3fs-fuse/s3fs-fuse.git
synced 2026-04-25 21:35:58 +03:00
[GH-ISSUE #1608] s3fs not parallel upload small file #845
Originally created by @dayiguizhen on GitHub (Mar 22, 2021).
Original GitHub issue: https://github.com/s3fs-fuse/s3fs-fuse/issues/1608
I deployed MinIO (4 nodes, 2 compute with 4 disks) using Docker Swarm.
The docker-compose file is below.
Then I used s3fs to mount MinIO locally.
s3fs -o passwd_file=/etc/.passwd-s3fs -o url=http://10.42.0.115:9001 -o allow_other -o sigv2 -o nonempty -o no_check_certificate -o use_path_request_style -o umask=000 -o parallel_count=100 -o multipart_size=100 -o max_write=131072 -o big_writes -o use_cache=/dev/shm -o enable_noobj_cache marvel /minio/
I use dd to benchmark the speed of minio/nfs against local disk. This is my Python script:
But when I write small files such as 512K, the uploads appear to run serially rather than in parallel.
What should I configure to make these uploads parallel instead of serial?
@gaul commented on GitHub (Mar 22, 2021):
You have set -o multipart_size=100 so s3fs will serially upload files smaller than 100 MB. You can set this value as small as 5 MB and the default is 10 MB. Changing this should improve performance for the 16 MB and larger columns but 512 KiB and 1 MB will remain the same. At these small sizes, creating the zero-byte object also becomes a bottleneck #1013.
For multi-file parallelism I am not sure that s3fs limits curl in any way? It may be worth experimenting to find out.
Also which version of s3fs do you use? Later versions improve performance.
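To make the effect of multipart_size concrete, here is a minimal Python sketch of the rule described in the comment above. It is a simplified model and not s3fs's actual code: the function name plan_upload is hypothetical, the value is treated as MiB, and the assumption is that any file smaller than one part goes up as a single request while larger files are split into parts of that size.

```python
import math

MIB = 1024 * 1024


def plan_upload(file_size: int, multipart_size_mb: int = 10) -> int:
    """Return how many upload requests a file needs under the simplified model.

    Assumption (not taken from s3fs source): files smaller than one part are
    sent as a single request, so there is nothing to parallelize.
    """
    if multipart_size_mb < 5:
        raise ValueError("multipart_size must be at least 5 MB")
    part_size = multipart_size_mb * MIB
    if file_size < part_size:
        return 1
    return math.ceil(file_size / part_size)


if __name__ == "__main__":
    sizes = [("512 KiB", 512 * 1024), ("1 MiB", MIB), ("16 MiB", 16 * MIB)]
    for label, size in sizes:
        # Compare the reporter's setting (100) with the default (10).
        print(label, plan_upload(size, 100), plan_upload(size, 10))
```

Under this model a 16 MiB file becomes two parts with the default of 10 but stays a single request with 100, while 512 KiB and 1 MiB files are a single request either way, which matches the expectation that lowering multipart_size does not help the smallest sizes.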
@gaul commented on GitHub (May 30, 2021):
Closing since there is nothing actionable here. Note that #1640 improves performance of small files by not creating the unneeded zero-byte object.
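For anyone who wants to run the multi-file parallelism experiment suggested earlier in the thread, the following is a rough Python sketch. It is not the reporter's benchmark script; the helper names and the BENCH_DIR environment variable are hypothetical. It writes a batch of 512 KiB files to a directory with one worker and then with several workers, and reports the elapsed time for each. Point BENCH_DIR at the s3fs mount point to test the mount; without it the script falls back to a local temporary directory.

```python
import os
import tempfile
import time
from concurrent.futures import ThreadPoolExecutor


def write_file(path: str, size: int) -> None:
    """Write `size` zero bytes and force them out before closing."""
    with open(path, "wb") as f:
        f.write(b"\0" * size)
        f.flush()
        os.fsync(f.fileno())


def timed_writes(directory: str, count: int, size: int, workers: int) -> float:
    """Write `count` files using `workers` threads; return elapsed seconds."""
    paths = [
        os.path.join(directory, f"bench_{workers}_{i}.bin") for i in range(count)
    ]
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # list() drains the iterator so any write error is raised here.
        list(pool.map(lambda p: write_file(p, size), paths))
    elapsed = time.perf_counter() - start
    for p in paths:
        os.remove(p)  # clean up; on an s3fs mount this deletes the objects
    return elapsed


if __name__ == "__main__":
    # BENCH_DIR is a hypothetical variable name used only by this sketch.
    target = os.environ.get("BENCH_DIR") or tempfile.mkdtemp()
    for workers in (1, 8):
        seconds = timed_writes(target, count=8, size=512 * 1024, workers=workers)
        print(f"workers={workers}: {seconds:.3f}s")
```

If the multi-worker run is not noticeably faster than the single-worker run on the mount, that would suggest uploads of separate small files are being serialized somewhere; comparing results across s3fs versions would also show how much the newer releases help.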