http://cwlota.com instead of the primary endpoint https://cwobject.com. That is the single most important change for performance from inside a cluster.
Once you are on the LOTA endpoint, follow these practices documented in Performance best practices:
- Hash your object key prefixes to avoid hot-spotting on sequential keys.
- Aim for read sizes of at least 15 MB. Performance degrades below 1 MB. Consolidate small files into TAR, WebDataset, or TFRecord archives.
- Maximize parallelism. Start at roughly 300 concurrent operations per Node and tune. Raise
max_pool_connectionsabove the Boto3 default of 10. - Use multipart uploads for large objects with a minimum part size of 50 MB so LOTA can distribute parts across Nodes.
Administrator