Muon fsdp 2
☆55Aug 8, 2025Updated 7 months ago
Alternatives and similar repositories for muon_fsdp_2
Users that are interested in muon_fsdp_2 are comparing it to the libraries listed below
Sorting:
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- ☆13Dec 12, 2025Updated 2 months ago
- Stick-breaking attention☆62Jul 1, 2025Updated 8 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 6 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Jun 11, 2025Updated 8 months ago
- ☆50Aug 21, 2025Updated 6 months ago
- ☆20Oct 10, 2025Updated 4 months ago
- Research work aimed at addressing the problem of modeling infinite-length context