Just-Curieous / CurieLinks
❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents
☆231Updated last week
Alternatives and similar repositories for Curie
Users that are interested in Curie are comparing it to the libraries listed below
Sorting:
- s3 - ⚡ Efficient Yet Effective Search Agent Training via RL for RAG☆508Updated last week
- From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery☆214Updated last week
- (ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators☆191Updated this week
- Multi-Faceted AI Agent and Workflow Autotuning. Automatically optimizes LangChain, LangGraph, DSPy programs for better quality, lower exe…☆249Updated 2 months ago
- MAKGED is the first multi-agent framework for collaborative error detection in knowledge graphs.☆29Updated 3 weeks ago
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆479Updated this week
- ☆668Updated this week
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆56Updated last month
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆117Updated 3 weeks ago
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆53Updated 3 weeks ago
- Recipes to train the self-rewarding reasoning LLMs.☆224Updated 5 months ago
- DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approac…☆308Updated 3 months ago
- The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM…☆206Updated 2 weeks ago
- (NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning☆219Updated 2 months ago
- "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"☆704Updated this week
- Scaling Data for SWE-agents☆342Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆325Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆176Updated 5 months ago
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement☆122Updated 5 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆288Updated last month
- A lightweight framework for building research agents designed for developers☆102Updated this week
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆185Updated 2 weeks ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆168Updated this week
- ☆55Updated 2 months ago
- accompanying material for sleep-time compute paper☆102Updated 3 months ago
- A curated list of awesome leaderboard-oriented resources for foundation models☆278Updated last month
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆210Updated 2 weeks ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆72Updated 4 months ago
- A server-client framework to train any AI agent with rollouts and feedbacks☆207Updated this week
- A benchmark for LLMs on complicated tasks in the terminal☆358Updated this week