Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generation.
☆270Mar 20, 2026Updated this week
Alternatives and similar repositories for matrix
Users that are interested in matrix are comparing it to the libraries listed below
Sorting:
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Apr 17, 2025Updated 11 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆27Oct 10, 2025Updated 5 months ago
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated 10 months ago
- ☆37Nov 14, 2025Updated 4 months ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".☆55Updated this week
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- xLSTMAD - Powerful xLSTM based Method for Anomaly Detection☆15Mar 1, 2026Updated 3 weeks ago
- ☆15Feb 23, 2026Updated 3 weeks ago
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆16Nov 24, 2024Updated last year
- ☆19May 3, 2025Updated 10 months ago
- Data recipes and robust infrastructure for training AI agents☆111Updated this week
- Example of applying CUDA graphs to LLaMA-v2☆12Aug 25, 2023Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆15Jul 17, 2025Updated 8 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 5 months ago
- Official PyTorch implementation of CD-MOE☆12Updated this week
- ☆47May 20, 2025Updated 10 months ago
- Agentic Research and Evaluation Suite☆83Updated this week
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Oct 15, 2025Updated 5 months ago
- ☆21Jul 21, 2025Updated 8 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- High performance GPT-OSS MLX implementation☆37Aug 6, 2025Updated 7 months ago
- Implementation of Logic-RAG☆16May 16, 2025Updated 10 months ago
- On-the-fly Definition Augmentation of LLMs for Biomedical NER☆14Apr 14, 2025Updated 11 months ago
- ☆45Nov 1, 2025Updated 4 months ago
- Scaling Agentic Environments Automatically.☆54Jan 22, 2026Updated 2 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 5 months ago
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Oct 14, 2024Updated last year
- ☆20Sep 6, 2025Updated 6 months ago
- Unofficial Implementation of Selective Attention Transformer☆21Oct 31, 2024Updated last year
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Mar 26, 2024Updated last year
- ☆35Jan 25, 2026Updated last month
- The code implementation of Symbolic-MoE☆46Sep 2, 2025Updated 6 months ago
- ☆21Sep 7, 2025Updated 6 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 9 months ago