Supported Models

HyperDex supports a range of generative Transformer models from the HuggingFace Transformers library. The table below lists the model architectures that HyperDex currently supports, along with some popular models that use each one.

Architecture Models Example HuggingFace Models Orion Hybrid
CohereForCausalLM Cohere CohereForAI/c4ai-command-r-v01, etc.
ExaoneForCausalLM EXAONE LGAI-EXAONE/EXAONE-3.0-7.8B, etc.
FalconForCausalLM Falcon tiiuae/falcon-7b, tiiuae/falcon-40b, etc.
GemmaForCausalLM Gemma google/gemma-2b, google/gemma-7b, etc.
GPT2LMHeadModel GPT-2 gpt2, gpt2-xl, etc.
GPTBigCodeForCausalLM StarCoder, SantaCoder, WizardCoder bigcode/starcoder, etc.
GPTJForCausalLM GPT-J EleutherAI/gpt-j-6b, etc.
GPTNeoXForCausalLM GPT-NeoX, Pythia, StableLM EleutherAI/gpt-neox-20b, EleutherAI/pythia-12b, etc.
InternLM2ForCausalLM InternLM2 internlm/internlm2-7b, etc.
LlamaForCausalLM LLaMA, LLaMA-2, LLaMA-3 meta-llama/Llama-2-7b-hf, 01-ai/Yi-6B, etc.
MistralForCausalLM Mistral, Mistral-Instruct mistralai/Mistral-7B-v0.1, etc.
OPTForCausalLM OPT facebook/opt-1.3b, facebook/opt-66b, etc.
OlmoForCausalLM OLMo allenai/OLMo-7B, etc.
OrionForCausalLM Orion OrionStarAI/Orion-14B-Base, etc.
PhiForCausalLM Phi microsoft/phi-1_5, microsoft/phi-2, etc.
Phi3ForCausalLM Phi-3 microsoft/phi-3, etc.
Qwen2ForCausalLM Qwen2 Qwen/Qwen2-beta-7B, Qwen/Qwen2-beta-7B-Chat, etc.
StableLmForCausalLM StableLM stabilityai/stablelm-3b-4e1t, etc.
StarCoder2ForCausalLM StarCoder2 bigcode/starcoder2-15b, etc.

Note

Models marked in the “Hybrid” column have been validated for use in the LPU-GPU Hybrid system. For GPU validation, the models were tested on NVIDIA’s Ampere, Hopper, and Ada Lovelace product lines.
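
As a quick way to check a model against the table above, the sketch below compares the `architectures` field of a HuggingFace `config.json` with the architecture names listed here. The helper name `is_supported` and the inline sample config are illustrative assumptions and are not part of HyperDex; the check says nothing about the Orion or Hybrid validation status of a model.

```python
import json

# Architecture names copied from the table above.
SUPPORTED_ARCHITECTURES = {
    "CohereForCausalLM", "ExaoneForCausalLM", "FalconForCausalLM",
    "GemmaForCausalLM", "GPT2LMHeadModel", "GPTBigCodeForCausalLM",
    "GPTJForCausalLM", "GPTNeoXForCausalLM", "InternLM2ForCausalLM",
    "LlamaForCausalLM", "MistralForCausalLM", "OPTForCausalLM",
    "OlmoForCausalLM", "OrionForCausalLM", "PhiForCausalLM",
    "Phi3ForCausalLM", "Qwen2ForCausalLM", "StableLmForCausalLM",
    "StarCoder2ForCausalLM",
}


def is_supported(config: dict) -> bool:
    """Hypothetical helper: True if the config names a listed architecture."""
    # "architectures" may be missing or null in some configs.
    architectures = config.get("architectures") or []
    return any(name in SUPPORTED_ARCHITECTURES for name in architectures)


# Minimal stand-in for the contents of a model's config.json.
sample = json.loads('{"architectures": ["LlamaForCausalLM"], "model_type": "llama"}')
print(is_supported(sample))
```

In practice the dictionary would come from the `config.json` file shipped in the model repository rather than from an inline string.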