Deploying Embedding Models on DKubeX
Embedding models registered in the DKubeX LLM Catalog can be deployed on the DKubeX platform using either local resources or cloud resources through SkyPilot. The steps for both are given below.
To list all embedding models registered in the DKubeX LLM Catalog, use the following command.
d3x emb list
To deploy an embedding model using local resources, use the command given below, replacing the following fields with the appropriate values:
<name of the deployment>: Unique name for the embedding model deployment
<emb Name>: Name of the embedding model, as shown in the list above
Note
Use the --publish flag if you want to make the deployment available for any user on the same setup to access and use.
d3x emb deploy --name <name of the deployment> --model <emb Name> --publish
d3x emb deploy -n bge-large --model BAAI--bge-large-en-v1-5 --publish
To deploy an embedding model using cloud resources via SkyPilot, use the command given below, replacing the following fields with the appropriate values:
<name of the deployment>: Unique name for the embedding model deployment
<emb Name>: Name of the embedding model, as shown in the list above
Note
Use the --publish flag if you want to make the deployment available for any user on the same setup to access and use.
d3x emb deploy --name <name of the deployment> --model <emb Name> -sky --publish
d3x emb deploy -n bge-large --model BAAI--bge-large-en-v1-5 -sky --publish
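The local and SkyPilot commands differ only in the -sky flag, so it can help to assemble the command in one place and review it before running it. The following is a minimal sketch, not part of the d3x CLI: build_deploy_cmd is a hypothetical helper name, and it uses only the flags shown above (--name, --model, -sky, --publish). It prints the command rather than executing it.

```shell
#!/usr/bin/env bash
# Hypothetical helper (an assumption for illustration, not a d3x feature).
# Prints the deploy command for review instead of running it.
# Args: <deployment name> <emb name> [local|sky] [yes|no (publish)]
build_deploy_cmd() {
  local name="$1"
  local model="$2"
  local target="${3:-local}"
  local publish="${4:-yes}"

  if [ -z "$name" ] || [ -z "$model" ]; then
    echo "usage: build_deploy_cmd <deployment name> <emb name> [local|sky] [yes|no]" >&2
    return 1
  fi

  local cmd="d3x emb deploy --name $name --model $model"

  # -sky selects cloud resources via SkyPilot, as in the command above.
  if [ "$target" = "sky" ]; then
    cmd="$cmd -sky"
  fi

  # --publish makes the deployment available to other users on the same setup.
  if [ "$publish" = "yes" ]; then
    cmd="$cmd --publish"
  fi

  echo "$cmd"
}

# Print the local and the SkyPilot variants for the example model.
build_deploy_cmd bge-large BAAI--bge-large-en-v1-5 local
build_deploy_cmd bge-large BAAI--bge-large-en-v1-5 sky
```

Once the printed command looks right, it can be copied into the terminal and run as is. Passing no as the fourth argument omits --publish and keeps the deployment private to the deploying user.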