DeepSeek-R1 model deployment
Hardware Considerations
$ apolo run --preset <your-preset> ubuntu -- nvidia-smi
Using preset '<your-preset>'
Using image 'ubuntu:latest'
√ Job ID: job-7f36c1c3-f21e-4d14-9e59-8a69079bae22
- Status: pending Creating
- Status: pending Scheduling
- Status: pending ContainerCreating
√ Status: running Restarting
√ Http URL: https://job-7f36c1c3-f21e-4d14-9e59-8a69079bae22.jobs.cluster.org.apolo.us
√ The job will die in a day. See --life-span option documentation for details.
√ =========== Job is running in terminal mode ===========
√ (If you don't see a command prompt, try pressing enter)
√ (Use Ctrl-P Ctrl-Q key sequence to detach from the job)
Tue Feb 25 21:13:10 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.54.14 Driver Version: 550.54.14 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA A100-SXM4-80GB Off | 00000000:81:00.0 Off | 0 |
| N/A 25C P0 64W / 500W | 13MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+------------------------+----------------------+
| 1 NVIDIA A100-SXM4-80GB Off | 00000000:C1:00.0 Off | 0 |
| N/A 22C P0 57W / 500W | 13MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| No running processes found |
+-----------------------------------------------------------------------------------------+
√ Job job-7f36c1c3-f21e-4d14-9e59-8a69079bae22 finished successfullySoftware Consideration
Deploy with Apolo CLI
Query the model
Deploy with Apolo Flow
Query the model

PreviousDeepSeek-R1 distilled modelsNextTeaching Models To Reason - Training, Fine-Tuning, and Evaluating Models with LLaMA Factory on Apolo
Last updated
Was this helpful?