Using DataChain Commands
DataChain is a command-line tool for wrangling unstructured AI data at scale. Use datachain -h to list all available commands.
Typical DataChain Workflow
-
Authentication with Studio
-
Use
datachain auth loginto authenticate with Studio -
Set your default team with
datachain auth team -
View your token with
datachain auth token -
Log out from Studio with
datachain auth logout
-
-
Job Management
-
Run jobs in Studio with
datachain job run -
Monitor job logs with
datachain job logs -
Cancel running jobs with
datachain job cancel -
Check for the clusters available for jobs
datachain job clusters
-