Deploying Embedding Models on DKubeX

Embedding models registered in the DKubeX LLM Catalog can be deployed on the DKubeX platform using either local resources or cloud resources through SkyPilot. The steps to deploy them are given below.

  • To list all embedding models registered in the DKubeX LLM Catalog, use the following command.

    d3x emb list
    

To deploy an embedding model using local resources, use the command given below. Replace the following fields in the command with the appropriate values:

  • <name of the deployment>: Unique name for the embedding model deployment

  • <emb Name>: Name of the embedding model, as shown in the output of the d3x emb list command

Note

Use the --publish flag to make the deployment available to all users on the same setup.

d3x emb deploy --name <name of the deployment> --model <emb Name> --publish
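As a minimal sketch, the placeholders can be filled in with a small shell helper that prints the resulting command for review before you run it. The helper name build_deploy_cmd and the values demo-emb and example-model are hypothetical and are not part of the d3x CLI or the catalog. The sketch also assumes that --publish can simply be left out for a deployment that is not shared, as the note above suggests.

```shell
# Print the deploy command for review instead of running it.
# Usage: build_deploy_cmd <deployment-name> <emb-name> [publish]
# build_deploy_cmd is a hypothetical helper, not a d3x subcommand.
build_deploy_cmd() {
    name="$1"
    model="$2"
    cmd="d3x emb deploy --name $name --model $model"
    # Append --publish only when the third argument is "publish".
    if [ "${3:-}" = "publish" ]; then
        cmd="$cmd --publish"
    fi
    printf '%s\n' "$cmd"
}

# "demo-emb" and "example-model" are placeholder values, not real catalog entries.
build_deploy_cmd demo-emb example-model
build_deploy_cmd demo-emb example-model publish
```

Once the printed command looks right, replace the placeholder values with a name reported by d3x emb list and run it directly.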