google-deepmind / onetwo
☆228 Updated 4 months ago
Alternatives and similar repositories for onetwo
Users who are interested in onetwo are comparing it to the libraries listed below.
- Source code for the collaborative reasoner research project at Meta FAIR. ☆95 Updated 3 months ago
- ☆134 Updated 3 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet. ☆267 Updated 2 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. ☆163 Updated this week
- ☆186 Updated this week
- ☆154 Updated 7 months ago
- Multi-backend recommender systems with Keras 3 ☆131 Updated 3 weeks ago
- ☆40 Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆282 Updated 4 months ago
- ☆145 Updated last year
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t… ☆454 Updated 5 months ago
- ☆56 Updated last week
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆123 Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆73 Updated this week
- Let's build better datasets, together! ☆260 Updated 6 months ago
- Code for ExploreTom ☆84 Updated 3 weeks ago
- Simple UI for debugging correlations of text embeddings ☆287 Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆71 Updated 7 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 Updated 4 months ago
- Discovering Data-driven Hypotheses in the Wild ☆99 Updated last month
- ☆124 Updated 8 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆265 Updated last year
- ☆210 Updated 3 weeks ago
- Inference-time scaling for LLMs-as-a-judge. ☆251 Updated this week
- Mixing Language Models with Self-Verification and Meta-Verification ☆106 Updated 7 months ago
- Automating enterprise workflows with multimodal agents ☆108 Updated 9 months ago
- Fine-tune an LLM to perform batch inference and online serving. ☆112 Updated last month
- ☆188 Updated 3 weeks ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆101 Updated last year