google-deepmind / onetwoLinks
☆259Updated last month
Alternatives and similar repositories for onetwo
Users that are interested in onetwo are comparing it to the libraries listed below
Sorting:
- ☆144Updated 3 months ago
- Website for hosting the Open Foundation Models Cheat Sheet.☆269Updated 7 months ago
- Let's build better datasets, together!☆265Updated 11 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆173Updated 2 weeks ago
- ☆213Updated this week
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆292Updated 9 months ago
- Inference-time scaling for LLMs-as-a-judge.☆316Updated last month
- Draw more samples☆196Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 7 months ago
- ☆160Updated last year
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆503Updated 10 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆143Updated 8 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆299Updated last month
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year
- Automating enterprise workflows with multimodal agents☆113Updated last year
- The history files when recording human interaction while solving ARC tasks☆118Updated last week
- Modular, scalable library to train ML models☆178Updated last week
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆297Updated last year
- Multi-backend recommender systems with Keras 3☆149Updated this week
- Super basic implementation (gist-like) of RLMs with REPL environments.☆280Updated last month
- ☆232Updated 2 weeks ago
- ☆31Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 2 months ago
- ☆148Updated last year
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆157Updated last week
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆110Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆277Updated last year
- Extract full next-token probabilities via language model APIs☆248Updated last year
- code for training & evaluating Contextual Document Embedding models☆201Updated 7 months ago