Deep Mixture of Experts Reproduction
Open-sourced the first public implementation of “Deep Mixture of Experts via Shallow Embedding,” a computer vision paper that previously had no available code.
Built with: PyTorch, Computer Vision
GitHub: RyanKim17920/Deep-Mixture-of-Experts-via-Shallow-Embedding
- Reproduced the full model architecture directly from the original paper with no reference implementation
- Published the first public code release for this paper, making the architecture accessible to the research community
- Built and validated the computer vision model end-to-end in PyTorch
July – November 2024