Skip to main content
Version: 0.1.0

Serve a Model from an NFS Volume

Connect a model that already exists on NAS or an NFS path as a NuFi Volume, register it in Model Artifacts, and deploy it as a Serving. Use this path when you want to reuse an existing storage location without downloading model files again.

Prerequisites

  • An administrator must register an NFS StorageClass. See StorageClass.
  • The NFS share path must be the parent directory of the model folder.

1. Create an NFS Volume

In the left sidebar, click Volumes and then Create. Select the NFS StorageClass registered by the administrator and connect the existing path.

FieldExample
Nametutorial-volume-nfs
Storage ClassNFS StorageClass
Mount PathNAS path containing the model folder

NAS Volume creation form

After creation, confirm that the status is Bound and that the model folder is visible in the Files tab.

NFS file check

2. Register the Model

In the left sidebar, click Model Artifacts and run Register Model.

FieldExample
Model Nameqwen-instruct-tutorial-nfs
Versionv1
Volumetutorial-volume-nfs
PathModel folder inside the NFS Volume
FormatSafeTensors

Add artifact - NFS

When validation succeeds, register the model version.

3. Choose a Serving Path

For GPU serving, run Quick Deploy from the model detail page.

For NPU serving, compile the source artifact first in Model Compilations. When compilation reaches Succeeded, run Quick Deploy with the generated NPU artifact.

4. Create the Serving

In the Quick Deploy dialog, confirm the model, version, and artifact, then enter a Serving name.

FieldExample
Service Namenfs-model-serving
Versionv1
ArtifactSource artifact for GPU serving, compiled artifact for NPU serving

The deployment is complete when the Serving status becomes Running.

Next Steps

To check the serving model's response, continue to Test Responses in Playground.

To check device and node metrics, continue to Check Metrics in Monitoring.