zechenzhangAGI / AI-research-SKILLsLinks
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepower. Maintained by Orchestra Research.
☆89Updated this week
Alternatives and similar repositories for AI-research-SKILLs
Users that are interested in AI-research-SKILLs are comparing it to the libraries listed below
Sorting:
- "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"☆472Updated 3 weeks ago
- ☆356Updated 4 months ago
- Marco Search Agent for Realistic and Challenging Agentic Search☆235Updated 3 weeks ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆425Updated last month
- ☆288Updated 4 months ago
- ☆218Updated 5 months ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆236Updated 2 months ago
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆82Updated 2 weeks ago
- An AI-powered multi-agent platform for automated investment research — combining LLM reasoning, RAG retrieval, and real-time market data …☆142Updated 2 weeks ago
- ☆197Updated last month
- ☆166Updated this week
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c…☆98Updated 8 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆60Updated 8 months ago
- Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic alignment bottlenec…☆111Updated 2 months ago
- Selective Prompt Anchoring☆93Updated last week
- [TMLR'25] The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆51Updated 7 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆234Updated 3 months ago
- Flexible RAG tools, Features semantic search, document indexing, and intelligent reranking with minimal intrusion design.☆89Updated 2 months ago
- [BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.☆450Updated this week
- (EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam☆92Updated 3 months ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆195Updated last year
- Create your self-hosted, open-source Operator model.☆11Updated 7 months ago
- ☆174Updated 2 months ago
- Valuation of tokens corresponding to influential individuals on social platforms through AI algorithms☆228Updated 2 months ago
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation☆57Updated last month
- 🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support☆482Updated 5 months ago
- ☆186Updated 2 weeks ago
- This project is designed to evaluate the effectiveness of DeepClaude and other combination models.☆41Updated 8 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆122Updated last month
- 🔐 企业级 AI API 安全代理 - 安全访问 DeepSeek API,无需在前端暴露密钥;🔐 Enterprise-grade AI API security proxy - Securely access DeepSeek API without exposin…☆57Updated 3 months ago