QuantaAlpha / GitTaskBenchLinks
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
☆219Updated this week
Alternatives and similar repositories for GitTaskBench
Users that are interested in GitTaskBench are comparing it to the libraries listed below
Sorting:
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆309Updated 8 months ago
- ☆286Updated 3 months ago
- Tokenize The Virtual Agents Onchain☆243Updated 3 months ago
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆184Updated this week
- ☆137Updated 3 months ago
- The 1st dynamic phishing kit dataset☆201Updated 7 months ago
- a multiscale multimodal large language models for radiology report generation (RRG) tasks☆262Updated last month
- [COLM 2025] Assessing Judging Bias in Large Reasoning Models: An Empirical Study https://arxiv.org/abs/2504.09946☆164Updated this week
- React Secure State☆171Updated 2 months ago
- A project aims to improve LLMs' pixel reasoning ability.☆81Updated 3 weeks ago
- ☆85Updated 7 months ago
- F²-Gen - A open source Financial Fraud Detection Data Generator Web Application☆363Updated last month
- ☆160Updated last month
- An open platform for building, extending, and experimenting with scientific agents.☆347Updated last month
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆171Updated 11 months ago
- Valuation of tokens corresponding to influential individuals on social platforms through AI algorithms☆155Updated last week
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆140Updated this week
- A database operations and data analysis AI agent☆317Updated 3 weeks ago
- ☆100Updated 8 months ago
- 小而美的Vue3异步处理解决方案,让复杂的异步逻辑变得简单优雅,让重复的样板代码成为历史☆222Updated 2 weeks ago
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆175Updated 2 months ago
- An MCP service that automates data analysis through IPython sessions.☆160Updated last month
- ☆143Updated 4 months ago
- (LLM) A Sparse Activation Architecture for Green Artificial Intelligence: The Energy Efficiency Optimization Language Model AliceSkyGarde…☆165Updated 2 months ago
- Workflow runner engine for argo framework☆99Updated 7 months ago
- ☆130Updated 3 months ago
- ☆203Updated last year
- Enhanced Benchmark Creation Tool: Automates dataset profiling, model benchmarking, and performance visualization for streamlined evaluati…☆110Updated 4 months ago
- ☆162Updated 5 months ago
- Intelligent job recommendation platform using Java + MySQL + Redis. Supports location-based search, AI keyword extraction, and personaliz…☆207Updated 3 weeks ago