List of Embedding Models in DKubeX Embedding Model Catalog

Serving Models and Open-source LLMs, Deploying Embedding Models on DKubeX, Deploying Embedding Models with SkyPilot

The following list contains the embedding models which are currently registered with the DKubeX embedding model catalog on the current version of DKubeX (DKubeX v0.8.7.1).

Embedding Models registered on the DKubeX Embedding Model Catalog

Embedding Model Name

Accelerator Type

Deployment Config

Maximum total tokens

Number of GPUs needed

BAAI--bge-large-en-v1-5

A10

revision: refs/pr/5

512

0

BAAI--bge-m3

A10

revision: main

512

0

thenlper--gte-large

A10

512

0

These embedding models can be deployed on the DKubeX platform without using any config files provided the resource and permission requirements are met. You can deploy these embedding models on DKubeX using local resources or using cloud resources using SkyPilot. For more information on deploying embedding models, go to the appropriate page provided below.

Deploying Embedding Models using Local Resources
./serving/emb_deploy.html
Deploying Embedding Models using SkyPilot
./skypilot/emb-deployment-with-skypilot.html