In CoreWeave AI Object Storage, data is stored as objects consisting of the data itself, its metadata, and a unique identifier. These objects are organized into containers called buckets, which provide a namespace for objects to ensure that object names are unique within a bucket. Additionally, buckets can have specific configurations and policies to manage the data lifecycle and security. Creating and deleting buckets are both straightforward operations that can be performed programmatically using the S3-compatible Object Storage API or various S3-compatible tools. This guide explains how to manage CoreWeave AI Object Storage buckets with Cloud Console and using three popular clients: S3cmd, AWS CLI, and Boto3.Documentation Index
Fetch the complete documentation index at: https://docs.coreweave.com/llms.txt
Use this file to discover all available pages before exploring further.
Prerequisites
This guide assumes that you have followed the steps in Get started with AI Object Storage. Before you begin, ensure that you have completed the following tasks:- Set up Workload Identity Federation for your organization.
- Create Access Keys with either SAML/SSO assertions or Cloud Console tokens.
- Create a bucket to store data.
- Set organization and bucket access policies to control access, lifecycle, and auditing.
- Set up your endpoints to ensure you’re using the required virtual-hosted addressing style, and the correct endpoints for AI Object Storage.
About LOTA (Local Object Transport Accelerator)
To use LOTA, clients must point their requests to the LOTA endpoint instead of the primary endpoint. No other changes are required for S3-compatible clients.- Use the LOTA endpoint,
http://cwlota.com, when running inside a CoreWeave cluster. The LOTA endpoint routes to the LOTA path for best performance. - Use the primary endpoint,
https://cwobject.com, when running outside of a CoreWeave cluster.
Bucket considerations
Bucket creation
You cannot create a bucket without a valid API Access key. Anonymous requests are never allowed to create buckets. To obtain an API Access key, see How-To: Manage Access Keys. When you create a bucket, you automatically become the bucket owner. The bucket owner has full control over the bucket and its contents.Bucket naming
Bucket names must be globally unique, and adhere to the following rules:- Length: 3 to 63 characters.
- Characters: Only lowercase letters (
a-z), numbers (0-9), and hyphens (-). No dots, uppercase letters, underscores, spaces, or other special characters. - Start and end: Must begin and end with a letter or number. Cannot start or end with a hyphen (
-). - Prohibited patterns: Cannot start with
xn--.- Reserved: Must not begin withcw-,vip-, orlog-stitcher-ch-. Must not be the exact nameint. These are reserved for internal use.
Endpoint configuration
All requests to CoreWeave AI Object Storage endpoints must be in DNS (virtual-hosted) style, where the bucket name is part of the domain name in the URL.- Use
http://[BUCKET-NAME].cwlota.comif using the LOTA endpoint. - Use
https://[BUCKET-NAME].cwobject.comif using the primary endpoint. - Set your S3 configuration to specify a virtual addressing style.
Manage buckets with Cloud Console
The Buckets section allows you to manage your buckets. You can create and delete buckets, and view their total size and the number of objects in them. The Grafana link at the top of the page leads to a Grafana dashboard for all buckets in your organization.
Availability Zones
S3cmd, AWS CLI, and Boto3 use the term “region” to define the location of the object storage service. However, at CoreWeave, are larger constructs that are organized into . When configuring your client, use a CoreWeave Availability Zone in place of the traditional AWS “region”. When creating a new bucket, you must specify the Availability Zone in which the bucket will be created.- When creating a bucket with S3cmd, use
--bucket-locationwith the Availability Zone. - When creating a bucket with AWS CLI or Boto3, use
LocationConstraint.
Manage buckets programmatically
The following examples show how to manage buckets with the AWS CLI, s3cmd, and Boto3.- AWS CLI is a unified tool for managing AWS services from the command line. It provides a consistent interface for interacting with S3-compatible services, including CoreWeave AI Object Storage. Follow these steps to install and configure the AWS CLI, then use it to create and delete buckets in CoreWeave AI Object Storage.
- S3cmd is a command-line tool for managing objects in S3-compatible object storage services. It can be used to create, delete, and manage buckets, as well as upload, download, and manage objects within those buckets. S3cmd is available for Linux and macOS.
- Boto3 is the AWS SDK for Python. It allows you to interact with AWS services programmatically using Python. Follow these steps to install and configure Boto3, then use it to create and delete buckets in CoreWeave AI Object Storage.
- For high-performance and bulk operations, you can use the CoreWeave fork of s5cmd. See Migrate data to AI Object Storage for installation and usage.
Install tools
- AWS CLI
- s3cmd
- Boto3
AWS CLI supports macOS, Linux, and Windows. See the official installation procedures for your operating system in the AWS CLI User Guide.
Configure tools
- AWS CLI
- s3cmd
- Boto3
To set up your AWS configuration for CoreWeave AI Object Storage, create a new credentials file and profile in your CoreWeave configuration directory. Using a separate profile for CoreWeave AI Object Storage is recommended to avoid conflicts with your other AWS profiles and S3-compatible services.
-
Create a new profile called
cwand a credentials file located at~/.coreweave/cw.credentials:Configure a CoreWeave credentials file and profile -
When prompted for information, provide the following values:
Field Value AWS Access Key ID The CoreWeave AI Object Storage Access Key. AWS Secret Key The CoreWeave AI Object Storage Secret Key. Default region name Optional. To set a default region, refer to the CoreWeave Availability Zones. Default output format Use jsonfor JSON output. -
Set the endpoint URL to the appropriate endpoint for your use case:
Endpoint URL Description https://cwobject.comThe primary endpoint for CoreWeave AI Object Storage. Use this when running outside of a CoreWeave cluster. http://cwlota.comThe LOTA endpoint, which routes to the LOTA cache for best performance. Always use the LOTA endpoint when running inside a CoreWeave cluster. Set the LOTA endpoint URL in your CoreWeave configuration fileSet the primary endpoint URL in your CoreWeave configuration file -
Set the default addressing style to
virtualin your CoreWeave configuration file. This is required for CoreWeave AI Object Storage, as it uses virtual-hosted style URLs.Set virtual addressing style in your CoreWeave configuration file
Create a bucket
- AWS CLI
- s3cmd
- Boto3
Use the
aws s3api command to create a new bucket. For example:Replace [BUCKET-NAME] with your desired bucket name and [AVAILABILITY-ZONE] with a CoreWeave Availability Zone.Create a bucket with AWS CLI
Delete a bucket
Only empty buckets can be deleted. Remove all objects, object versions, and delete markers before deleting a bucket.- AWS CLI
- s3cmd
- Boto3
To delete a bucket, use the
delete-bucket sub-command as shown. Only empty buckets can be deleted. Remove all objects, object versions, and delete markers before deleting a bucket.Replace [BUCKET-NAME] with the name of the bucket to delete.Delete a bucket with AWS CLI
Report usage
- AWS CLI
- s3cmd
- Boto3
To report the usage of a bucket, use
aws s3 with the ls sub-command. Use the --recursive, --human-readable, and --summarize parameters to view information in an easier-to-understand format.Replace [BUCKET-NAME] with the name of your bucket.Report bucket usage with AWS CLI