[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models
☆34Nov 4, 2025Updated 4 months ago
Alternatives and similar repositories for Outlier-Safe-Pre-Training
Users that are interested in Outlier-Safe-Pre-Training are comparing it to the libraries listed below
Sorting:
- Learning from Negative samples for Biomedical Generative Entity Linking☆17May 25, 2025Updated 9 months ago
- ☆27Mar 29, 2025Updated 11 months ago
- ☆18Oct 26, 2024Updated last year
- Pytorch implementation of our UniQ method, IEEE Access -- Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric …☆11Apr 7, 2021Updated 4 years ago
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering☆38Sep 20, 2024Updated last year
- ☆35Mar 12, 2025Updated 11 months ago
- This project compares the performance of Swin-Transformer v2 implemented in JAX and PyTorch.☆12Jun 8, 2022Updated 3 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- ☆12Jul 8, 2024Updated last year
- Research sources on graph-based anomaly detection☆13Nov 29, 2022Updated 3 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 6 months ago
- Traditional methods for volatility forecast of multiscale and high-dimensional data like foreign-exchange and stock market volatility ha…☆11Jun 1, 2017Updated 8 years ago
- Implementation of Diffusion Policy☆13Dec 13, 2024Updated last year
- Homework of CER☆11Apr 12, 2021Updated 4 years ago
- Deep Learning Framework with a specialisation aimed for Binarized Neural Networks.☆11Jan 9, 2022Updated 4 years ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- Let GPT-4 run your Minecraft server!☆10Apr 15, 2023Updated 2 years ago
- 个人学习中总结的 Rust 思维导图☆10Feb 2, 2024Updated 2 years ago
- ☆13Feb 20, 2026Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆13Mar 30, 2024Updated last year
- ☆11May 19, 2021Updated 4 years ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- [ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design☆22Jul 4, 2025Updated 8 months ago
- A sample app to debug and validate cellular modems on balena devices☆13Jun 5, 2019Updated 6 years ago
- make your statistical research faster☆12Jul 7, 2023Updated 2 years ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- 팡요랩 자료☆11May 31, 2019Updated 6 years ago
- ☆16Jul 29, 2025Updated 7 months ago
- Limit Orderbook Replay/Analysis Library☆10Nov 19, 2018Updated 7 years ago
- ☆15Apr 26, 2025Updated 10 months ago
- VeighNa框架的LevelDB数据库接口☆13Apr 23, 2023Updated 2 years ago
- Generic build server☆64May 25, 2014Updated 11 years ago
- A miniture AI training framework for PyTorch☆43Feb 1, 2025Updated last year
- ☆51Jan 28, 2024Updated 2 years ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆47Oct 21, 2025Updated 4 months ago
- 🚀 Automated deployment stack for AMD MI300 GPUs with optimized ML/DL frameworks and HPC-ready configurations☆12Nov 30, 2024Updated last year
- A pure Julia wrapper for TD Ameritrade APIs☆11Apr 2, 2023Updated 2 years ago
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆14Sep 30, 2024Updated last year
- [ICML 2025] Improving Planning of Agents for Long-Horizon Tasks☆24Oct 2, 2025Updated 5 months ago