Spico197 / random-luck
Automatically select the best random seed based on ancient Chinese I Ching. Good luck and best wishes !
☆45Updated 3 years ago
Alternatives and similar repositories for random-luck:
Users that are interested in random-luck are comparing it to the libraries listed below
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Updated 2 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆84Updated 2 years ago
- 😎 A simple and easy-to-use toolkit for GPU scheduling.☆43Updated 3 years ago
- [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)☆121Updated last year
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆46Updated 8 months ago
- AI Alignment: A Comprehensive Survey☆133Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆36Updated last year
- Token level visualization tools for large language models☆79Updated 3 months ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆45Updated 5 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆37Updated 6 months ago
- The code and data for the paper JiuZhang3.0☆43Updated 11 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆64Updated last week
- ☆62Updated 2 months ago
- ☆26Updated 3 months ago
- ☆74Updated this week
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Updated 2 months ago
- ☆39Updated last year
- ICLR2024 statistics☆47Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 3 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆48Updated last year
- exploring whether LLMs perform case-based or rule-based reasoning☆28Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 10 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆39Updated 9 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- ☆29Updated 6 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆61Updated 6 months ago
- Lion and Adam optimization comparison☆61Updated 2 years ago
- ☆52Updated this week