Do you use Qwen 1.5 MoE?

What is Qwen 1.5 MoE?

Qwen1.5-MoE-A2.7B is a small mixture-of-expert (MoE) model with only 2.7 billion activated parameters yet matches the performance of state-of-the-art 7B models like Mistral 7B and Qwen1.5-7B.

Maker shoutouts

Arch — Build fast, hyper-personalized agents with intelligent infra

302 upvotes

•

2mo ago

"Highly performant base models that can be used for task-specific training. Such as the function calling experience built into Arch"

•

•View Launch

View all

Recent launches

Qwen 1.5 MoE

Qwen1.5-MoE-A2.7B is a small mixture-of-expert (MoE) model with only 2.7 billion activated parameters yet matches the performance of state-of-the-art 7B models like Mistral 7B and Qwen1.5-7B.

10mo ago