Qwen1.5-MoE-A2.7B: A Small MoE Model with Only 2.7B Activated Parameters yet Matching the Performance of State-of-the-Art 7B Models (www.marktechpost.com)

posted 6 months ago by Akisamb@programming.dev in machine_learning@programming.dev

0 comments — no comments yet!