Skip to content
View giangdip2410's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report giangdip2410

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. HyperRouter HyperRouter Public

    Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"

    Python 33 3

  2. CompeteSMoE CompeteSMoE Public

    Code for this paper "CompeteSMoE - Effective Sparse Mixture of Experts Training via Competition"

    Python 6 3

  3. DANet DANet Public

    DANets (a family of neural networks) for tabular data classification/ regression.

    Python 5 6

  4. SimSMoE SimSMoE Public

    Code for this paper "SimSMoE: Toward Efficient Training Mixture of Experts via Solving Representational Collapse".

    Python 5

  5. Brainformer-SMOE Brainformer-SMOE Public

    Brainformer SMOE

    Python 3

  6. VQMoE VQMoE Public

    Code for this paper "On the Role of Discrete Representation in Sparse Mixture of Experts".

    Python 3