Supported Models

HyperDex supports a variety of generative Transformer models in HuggingFace Transformers. The following is the list of model architectures that are currently supported by HyperDex. Alongside each architecture, we include some popular models that use it.

Architecture	Models	Example HuggingFace Models
`CohereForCausalLM`	Cohere	`CohereForAI/c4ai-command-r-v01`, etc.
`ExaoneForCausalLM`	EXAONE	`LGAI-EXAONE/EXAONE-3.0-7.8B`, etc.
`FalconForCausalLM`	Falcon	`tiiuae/falcon-7b`, `tiiuae/falcon-40b`, etc.
`GemmaForCausalLM`	Gemma	`google/gemma-2b`, `google/gemma-7b`, etc.
`GPT2LMHeadModel`	GPT-2	`gpt2`, `gpt2-xl`, etc.
`GPTBigCodeForCausalLM`	StarCoder, SantaCoder, WizardCoder	`bidcode/starcoder`, etc.
`GPTJForCausalLM`	GPT-J	`EleutherAI/gpt-j-6b`, etc.
`GPTNeoXForCausalLM`	GPT-NeoX, Pythia, StableLM	`EleutherAI/gpt-neox-20b`, `EleutherAI/pythia-12b`, etc.
`LlamaForCausalLM`	Llama, Lllama-2, Llama-3, Midm	`meta-llama/Llama-2-7b-hf`, `K-intelligence/Midm-2.0-Base-Instruct`, etc.
`MistralForCausalLM`	Mistral Mistral-Instruct	`mistralai/Mistral-7B-v0.1`, etc.
`Qwen2ForCausalLM`	Qwen2, Qwen2.5, A.X-4.0	`Qwen/Qwen2.5-7B-Instruct`,`skt/A.X-4.0-Light`, etc.
`OPTForCausalLM`	OPT	`facebook/opt-1.3b`, `facebook/opt-66b`, etc.
`OlmoForCausalLM`	OLMo	`allenai/OLMo-7B`, etc.
`PhiForCausalLM`	Phi	`microsoft/phi-1_5`, `microsoft/phi-2`, etc.
`Phi3ForCausalLM`	Phi3	`microsoft/phi-3`, etc.
`StableLmForCausalLM`	StableLM	`stabilityai/stablelm-3b-4e1t/`, etc.
`StarCoder2ForCausalLM`	StarCoder2	`bigcode/starcoder2-15b`, etc.

Note

Models marked under the “Hybrid” tab have been validated for use in the LPU-GPU Hybrid system. For GPU validation, the models have been tested with NVIDIA’s Ampere, Hopper, and Ada Lovelace product lines, ensuring compatibility and performance across these architectures.