Spico197 / random-luck
Automatically select the best random seed based on the ancient Chinese I Ching. Good luck and best wishes!
☆45Updated 3 years ago
Alternatives and similar repositories for random-luck
Users that are interested in random-luck are comparing it to the libraries listed below
- Server GPU monitoring program that sends a notification via WeChat when GPU attributes meet preset conditions☆32Updated 3 years ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆43Updated 3 weeks ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆39Updated 10 months ago
- Transformer model based on the Gated Attention Unit (preview version)☆98Updated 2 years ago
- A paper list about diffusion models for natural language processing.☆182Updated last year
- The code and data for the paper JiuZhang3.0☆45Updated last year
- Tips for paper writing and research: notes and summaries of experience in scientific paper writing☆135Updated 3 years ago
- Token level visualization tools for large language models☆81Updated 4 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆32Updated 5 months ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆50Updated 10 months ago
- exploring whether LLMs perform case-based or rule-based reasoning☆28Updated last year
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆85Updated 2 years ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 5 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆39Updated last month
- ☆49Updated 3 weeks ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆18Updated 7 months ago
- ☆54Updated 2 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆26Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated last year
- Open-Pandora: On-the-fly Control Video Generation☆34Updated 6 months ago
- self-adaptive in-context learning☆45Updated 2 years ago
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆86Updated 6 months ago
- Arena Competition 3: example code and baseline implementation for the large-scale pre-training fine-tuning competition☆38Updated 2 years ago
- Our code will be public soon.☆26Updated 2 years ago
- ☆36Updated last month
- A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning☆32Updated this week
- Code for paper "Patch-Level Training for Large Language Models"☆86Updated 6 months ago