kevinwu23 / StanfordFineTuneBenchLinks
☆31Updated 10 months ago
Alternatives and similar repositories for StanfordFineTuneBench
Users that are interested in StanfordFineTuneBench are comparing it to the libraries listed below
Sorting:
- ☆49Updated 7 months ago
- Simple GRPO scripts and configurations.☆59Updated 7 months ago
- An introduction to LLM Sampling☆79Updated 9 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆63Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆69Updated 2 months ago
- ☆135Updated 3 weeks ago
- ☆80Updated last week
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆49Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆110Updated 9 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- ☆155Updated 9 months ago
- ☆54Updated 10 months ago
- code for training & evaluating Contextual Document Embedding models☆197Updated 4 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆163Updated 4 months ago
- Datamodels for hugging face tokenizers☆47Updated this week
- Storing long contexts in tiny caches with self-study☆179Updated last week
- Python library to use Pleias-RAG models☆61Updated 4 months ago
- ☆58Updated 4 months ago
- ☆23Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆271Updated last year
- NLP with Rust for Python 🦀🐍☆64Updated 4 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆102Updated last year
- The first dense retrieval model that can be prompted like an LM☆86Updated 4 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 4 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆69Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Lightweight tools for quick and easy LLM demo's☆28Updated 11 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 6 months ago