Spico197 / random-luck
Automatically select the best random seed based on ancient Chinese I Ching. Good luck and best wishes!
☆45 Updated 3 years ago
Alternatives and similar repositories for random-luck
Users that are interested in random-luck are comparing it to the libraries listed below
- ☆46 Updated this week
- Our code will be public soon. ☆26 Updated 2 years ago
- Transformer model based on the Gated Attention Unit (preview version) ☆98 Updated 2 years ago
- Feeling confused about super alignment? Here is a reading list ☆42 Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning" ☆26 Updated last year
- Server GPU monitoring program that sends a notification via WeChat when GPU attributes meet preset conditions ☆32 Updated 3 years ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models ☆17 Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”. ☆34 Updated 5 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement) ☆49 Updated last year
- This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation" ☆38 Updated 8 months ago
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar ☆89 Updated 2 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆61 Updated 6 months ago
- Exploring whether LLMs perform case-based or rule-based reasoning ☆29 Updated last year
- The official repository for "Rongsheng Wang's Arxiv Template" ☆33 Updated last month
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆81 Updated last year
- OpenReview Submission Visualization (ICLR 2024/2025) ☆151 Updated 8 months ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models" ☆52 Updated 10 months ago
- The code and data for the paper JiuZhang3.0 ☆47 Updated last year
- ☆46 Updated last month
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales ☆32 Updated last year
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling ☆86 Updated 2 years ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated 3 weeks ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆38 Updated last year
- Use the tokenizer in parallel to achieve superior acceleration ☆16 Updated last year
- ☆29 Updated 5 months ago
- ☆17 Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train… ☆57 Updated last year
- ☆20 Updated last year
- ☆54 Updated 3 months ago
- ☆22 Updated 11 months ago