allenai / SERA
Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.
☆117 Updated last week
Alternatives and similar repositories for SERA
Users who are interested in SERA are comparing it to the repositories listed below.
- A tool to use the Ai2 Open Coding Agents Soft-Verified Efficient Repository Agents (SERA) model with Claude Code ☆220 Updated this week
- Training Proactive and Personalized LLM Agents ☆100 Updated 3 weeks ago
- ☆160 Updated last month
- Data recipes and robust infrastructure for training AI agents ☆94 Updated this week
- Stable-DiffCoder is a family of lightweight open-source code DLLMs (diffusion large language models) comprising base and instruct models, … ☆68 Updated 3 weeks ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆61 Updated 9 months ago
- Frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆225 Updated 5 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆41 Updated 4 months ago
- Pivotal Token Search ☆145 Updated last month
- ☆37 Updated 6 months ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval ☆23 Updated 7 months ago
- ☆39 Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 Updated last year
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise" ☆127 Updated 3 weeks ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆92 Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆261 Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆90 Updated 2 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 9 months ago
- ☆188 Updated 6 months ago
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆56 Updated 6 months ago
- Codebase for FinePDFs ☆176 Updated last month
- ☆97 Updated 2 weeks ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆69 Updated 2 months ago
- ☆43 Updated 3 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆30 Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆24 Updated last year
- ☆19 Updated 11 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents! ☆137 Updated 4 months ago
- CLI for Recursive Language Models ☆42 Updated 2 weeks ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆169 Updated 5 months ago