Data Sources
Data Sources connect external storage to your Run:AI workloads. They appear as mounted directories inside your containers.
Pre-configured Data Sources
Your lab environment includes these data sources:
| Name | Type | Mount Path | Contents |
|---|---|---|---|
| lab-datasets | PVC | /mnt/datasets | Pre-loaded training datasets |
| shared-models | PVC | /mnt/models | Pre-trained model checkpoints |
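
To confirm from inside a running container that these data sources are mounted, a quick check like the one below may help. It is a minimal sketch, not part of Run:AI: the function name check_mounts and the report layout are hypothetical, and the paths are simply the mount paths listed in the table above.

```python
from pathlib import Path

# Mount paths copied from the table above; adjust them if your environment differs.
DEFAULT_MOUNTS = {
    "lab-datasets": "/mnt/datasets",
    "shared-models": "/mnt/models",
}


def check_mounts(mounts=DEFAULT_MOUNTS, sample=5):
    """Report, per data source, whether its mount path is a directory
    and list up to `sample` entry names found there (sorted by name)."""
    report = {}
    for name, path in mounts.items():
        directory = Path(path)
        if directory.is_dir():
            entries = sorted(child.name for child in directory.iterdir())[:sample]
            report[name] = {"mounted": True, "entries": entries}
        else:
            # Path is absent or not a directory: the data source is likely not attached.
            report[name] = {"mounted": False, "entries": []}
    return report


if __name__ == "__main__":
    for name, info in check_mounts().items():
        status = "ok" if info["mounted"] else "missing"
        print(f"{name}: {status} {info['entries']}")
```

If a data source shows as missing, it is probably not attached to the workload; see the attachment steps later on this page.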
Creating a Custom Data Source
- Go to Data Sources in the Run:AI UI
- Click New Data Source
- Choose your storage type:
  - PVC - Kubernetes Persistent Volume Claim
  - S3 - Amazon S3 bucket
  - NFS - Network File System
  - Git - Clone a Git repository
Attaching Data Sources to Workloads
When creating a workspace or training job:
- Scroll to the Data Sources section
- Click Add Data Source
- Select the data source and specify the mount path
Data sources are mounted read-only by default. To write results, use your project's home directory at /home/user/.
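
The read-from-mount, write-to-home pattern can be sketched as follows. This is an illustrative example under stated assumptions: the results subdirectory, the summary.json filename, and both function names are hypothetical, and /mnt/datasets is only used because it is one of the pre-configured mount paths.

```python
import json
from pathlib import Path


def summarize_dataset(data_dir):
    """Read-only pass over a mounted data source: count files and total size."""
    files = [p for p in Path(data_dir).rglob("*") if p.is_file()]
    return {
        "file_count": len(files),
        "total_bytes": sum(p.stat().st_size for p in files),
    }


def save_results(results, out_dir="/home/user/results", filename="summary.json"):
    """Write results as JSON under the writable home directory.

    `out_dir` and `filename` are hypothetical defaults, not Run:AI conventions.
    """
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    target = out_path / filename
    with target.open("w", encoding="utf-8") as f:
        json.dump(results, f, indent=2)
    return target


# Example usage inside a workload (uncomment to run in the lab environment):
# summary = summarize_dataset("/mnt/datasets")
# print(save_results(summary))
```

Keeping all writes under the home directory means the code does not need to change if a data source stays read-only.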