Documentation Index
Fetch the complete documentation index at: https://docs.coreweave.com/llms.txt
Use this file to discover all available pages before exploring further.
Requirements
| Repository | Name | Version |
|---|---|---|
| file://../library | library | 0.1.0 |
| https://cybozu-go.github.io/moco/ | moco | 0.20.0 |
| oci://ghcr.io/coreweave/k8s-device-plugin/charts | nvidia-device-plugin | 0.17.0-5c8a50df |
Parameters
| Key & Description | Type | Default |
|---|---|---|
| imagePullSecrets Image pull secrets to configure if using custom private images. | list | |
| moco Options for the MOCO MySQL Operator. | object | See individual settings below. |
| moco.enabled Enable the MOCO MySQL Operator. | bool | |
| moco.image.repository The repository for the MOCO image. | string | |
| moco.image.tag The tag for the MOCO image. | string | |
| moco.imagePullSecrets Image pull secrets to configure if using custom private images. | list | |
| moco.monitoring.podMonitors.enabled Enable monitoring via the Prometheus operator PodMonitor CRD | bool | |
| moco.monitoring.vmPodScrapes.enabled Enable monitoring via the VictoriaMetrics operator VMPodScrape CRD | bool | |
| moco.priorityClassName The priority class name for the MOCO pod. | string | |
| moco.replicaCount The number of replicas of the MOCO instance to run. | int | |
| moco.resources Resources for the MOCO container. | object | |
| nvidia-device-plugin Options for the Coreweave fork of the Nvidia device plugin chart. This chart builds on the default configuration provided by Nvidia, and uses these default chart values. | object | See default chart values. |
| operator.affinity The affinity for the operator deployment. | object | |
| operator.config.operator.nodeSet.failedPodsBackoffGCInterval The time that has to pass before next iteration of backoff GC is run for checking failed pods. | string | |
| operator.config.operator.nodeSet.forceScalingDeleteKnownConditionTimeout The delay to wait before deleting a pod during nodeset scaling with a known Slurm state. “0” disables this feature. | string | |
| operator.config.operator.nodeSet.forceScalingDeleteUnknownConditionTimeout The delay to wait before deleting a pod during nodeset scaling with a unknown Slurm state. “0” disables this feature. | string | |
| operator.config.operator.nodeSet.maxBurstReplicas A rate limiter for booting pods when there are a lot of pods. A too high of a value can cause registry DoS issues. | int | |
| operator.config.operator.nodeSet.scaleDownPriorityOrdering Enable priority ordering for scale down. This ensures the following deletion order non-ready pods, drained pods without running workloads, idle pods, draining pods with running workloads, then running pods. | bool | |
| operator.config.operator.nodeSet.statusUpdateBackoffGCInterval The time that has to pass before next iteration of backoff GC is run for checking node status updates. | string | |
| operator.config.operator.nodeSlice.maxNodesPerNodeSlice The maximum number of nodes that can be in a single nodeSlice. | int | |
| operator.image The image to use for the operator. | object | |
| operator.leaderElection.enabled This forces the operator to use leader election even if the number of replicas is set to 1. Useful if planning to scale after deployment. | bool | |
| operator.leaderElection.leaderElectionID The string value to use as the leader election id. | string | |
| operator.logLevel The log level. Uses integers or zap log level strings:
| string | |
| operator.maxConcurrentReconciles | int | |
| operator.podMonitor.enabled Enable monitoring via the Prometheus operator PodMonitor CRD. | bool | |
| operator.priorityClassName The priority class name for the operator. | string | |
| operator.replicas The number of replicas of the operator pod to run. Leader election will be enabled if this is greater than 1 or leader election is explicitly enabled. | int | |
| operator.resources The resource to request for the operator. | object | |
| operator.tolerations The tolerations for the operator deployment. | list | |
| operator.vmPodScrape.enabled Enable monitoring via the VictoriaMetrics operator VMPodScrape CRD. Note: To enable this, podMonitor must be disabled in addition to setting vmPodScrape.enabled to true. | bool | |
| priorityClass.enabled Enable the priority class for the control plane components. | bool | |
| priorityClass.value The value of the priority class, generally should be high relative to other priority classes as these are critical components. | int | |
| scheduler.podMonitor.enabled Enable monitoring via the Prometheus operator PodMonitor CRD. | bool | |
| scheduler.vmPodScrape.enabled Enable monitoring via the VictoriaMetrics operator VMPodScrape CRD. Note: To enable this, podMonitor must be disabled in addition to setting vmPodScrape.enabled to true. | bool | |
| syncer.podMonitor.enabled Enable monitoring via the Prometheus operator PodMonitor CRD. | bool | |
| syncer.vmPodScrape.enabled Enable monitoring via the VictoriaMetrics operator VMPodScrape CRD. Note: To enable this, podMonitor must be disabled in addition to setting vmPodScrape.enabled to true. | bool |