Skip to main content
Version: 0.1.0

Model Artifacts

Register models trained in MLflow into NuFi, manage per-version artifacts, and deploy them directly as inference services.

Models are managed as a 3-tier structure.

TierDescription
RegisteredModelTop-level group, named by model
ModelVersionAn individual version of the model. Multiple versions can be registered under one model.
ModelArtifactThe actual file path linked to a version (Volume + Path). Includes the rngd artifact added automatically after NPU compilation.

Prerequisites

  • A NuFi project must already be created.
  • The Volume (PVC) where the model files are stored must be mountable in the project.
  • To use MLflow integration, you need an accessible MLflow Tracking Server.

Model List

In the left sidebar, click Models.

Model list

NumberDescription
Go to the MLflow Webhook and Import History management page.
Go to the form for manually registering a new model version.
Open the dialog to instantly deploy the selected model. Disabled for models with no versions.

Columns

ColumnDescription
NameRegistered model name
VersionsNumber of registered versions
DescriptionModel description
CreatedTime of first registration

Model Detail — Version List

Click a model row in the model list to navigate to the version list page.

Model detail — version list

ColumnDescription
VersionVersion string (e.g., v1, 1.0.0)
StateVersion state (Active, etc.)
AuthorName of the user who registered the version
SourceSource type of the version (e.g., mlflow, manual)
ArtifactsNumber of artifacts linked to this version
DescriptionVersion description
CreatedVersion registration time

Click the + Add Version button at the top of the version list to add a new version to the current model.


Version Detail — Artifact List

Click a version row to navigate to the artifact list page.

Version detail — artifact list

No.Description
Quickly run an NPU compilation job for this artifact
Open the dialog to instantly deploy this artifact

Columns

ColumnDescription
NameArtifact name (e.g., original)
FormatModel format (e.g., safetensors, gguf, rngd, other)
PlatformTarget execution platform (rngd, gpu, etc.)
URIArtifact file location (pvc://{volume}/{path} format)
SizeArtifact size
CompilationWhether this is a compilation output / linked compilation info

Register a Model

A model is structured as model → version → artifact, and you register in that order.

1. Register the Model

Click the Register Model button in the upper right of the model list page.

Register a model

FieldDescription
Model NameModel group name (e.g., llama-3-ko-chat)
DescriptionDescription note for the model
AuthorName of the user registering the model

2. Register a Version

In the model list, click the row of the registered model to go to the model detail page, then click the + Add Version button.

Register a version

FieldDescription
VersionVersion string (e.g., v1, 1.0.0). Must be unique within the same model.
VolumeSelect the Volume (PVC) where the model files are stored.
PathPath to the model file or directory inside the Volume. Click the folder icon to the right of the input to show a list of selectable models.
FormatModel file format. Auto-detection is attempted when Path is filled in.
DescriptionDescription note for the version
AuthorName of the user registering the version

3. Register an Artifact

In the version list, click the row of the registered version to go to the version detail page, then click the + Add Artifact button.

Register an artifact

FieldDescription
Artifact NameArtifact name (e.g., furiosa-rngd)
FormatModel format (e.g., safetensors, gguf, rngd)
PlatformTarget execution platform (rngd, gpu, etc.)
VolumeSelect the Volume (PVC) where the artifact file is stored.
PathPath to the artifact file or directory inside the Volume. Click the folder icon to the right of the input to show a list of selectable models.

Quick Deploy

From any of the model list, version list, or artifact list, you can click Quick Deploy (rocket) to open the instant deployment dialog.

Per-Entry-Point Disable Conditions

Entry PointDisabled when
Model listThe model has no versions
Version listThe version has no artifacts

Quick Deploy Dialog

The Quick Deploy dialog lets you select model/version/artifact through a Cascading dropdown, and based on platform, automatically picks the accelerator, inference engine, and container image.

FieldDescription
ModelSelect the model to deploy
VersionSelect the version to deploy (enabled after selecting a model)
ArtifactSelect the artifact to deploy (enabled after selecting a version)
Service NameName of the Serving to be created
Inference EngineSelect the inference engine
AcceleratorSelect the accelerator
ImageContainer image

Advanced section (optional):

FieldDescription
CPUContainer CPU request
MemoryContainer memory request
Additional ArgumentsCustom arguments to pass to the inference engine (e.g., --max-model-len 4096)
Environment VariablesEnvironment variables to inject into the container

MLflow Integration

NuFi integrates with MLflow in two ways.

ModeDescription
Pull (manual Import)Pull artifacts directly from a specific Run on the MLflow Tracking Server.
Push (Webhook auto Import)When a model is registered in MLflow, the event is delivered automatically to NuFi via a Webhook. Register the Receive URL provided by NuFi as a Webhook in MLflow.
MLflow Integration detailed guide

For the Webhook form fields, how to register the issued Receive URL in MLflow (with a MlflowClient.create_webhook example), and how to check Import History — see the step-by-step procedure in Tutorial: Model Registration — Method C: MLflow Integration.

Click the Integration button at the top of the model list to navigate to the integration management page.

Integration Page

MLflow Integration

ButtonDescription
① Import from MLflowGo to the Pull Import form
② Add WebhookOpen the new Webhook registration dialog

In the Webhook list tab, you can check the URL and status (Active/Inactive) of registered Webhooks, the most recent received events, and delete them.

How to register a Webhook:

  1. Click the Add Webhook button. NuFi's Receive URL is displayed.
  2. Copy the shown Receive URL and register it in MLflow's Webhooks settings.
  3. From then on, whenever a model version is registered in MLflow, it is automatically delivered to NuFi.

In the Import History tab, you can query the status (Pending / Running / Succeeded / Failed) and logs of past Import jobs.

Handling abnormal states:

StatusWhat to check
Import FailedRe-verify that the MLflow Tracking URL and Run ID are correct.
Webhook InactiveCheck that the URL entered when registering the Webhook is reachable from MLflow.

Pull Import Form

MLflow Pull Import

FieldDescriptionRequired
MLflow Tracking URLMLflow Tracking Server address (e.g., http://mlflow.example.com)
Run IDUUID of the MLflow Run to import
Artifact Path (optional)Path to a specific artifact inside the Run. If empty, the entire Run is pulled.-
Target ModelThe model name to be registered in NuFi
Target VersionThe version string to be registered in NuFi
Storage PVCSelect the Volume (PVC) to store the artifacts

When you click the Import button, a background Import job starts. You can check progress under Integration > Import History.