sade-adrien / SteloCoderLinks
☆16Updated 2 years ago
Alternatives and similar repositories for SteloCoder
Users that are interested in SteloCoder are comparing it to the libraries listed below
Sorting:
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆71Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Updated 2 years ago
- codebase release for EMNLP2023 paper publication☆19Updated 4 months ago
- Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆29Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- Gentopia Agent Zoo and Agent Benchmark☆31Updated 2 years ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- ☆56Updated last year
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.☆174Updated this week
- ☆139Updated 2 years ago
- ☆17Updated 10 months ago
- Universal text classifier for generative models☆24Updated last year
- ☆21Updated 2 years ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆63Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago
- entropix style sampling + GUI☆27Updated last year
- ☆32Updated 2 years ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs☆21Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Updated 6 months ago
- ☆105Updated last year
- RepoQA: Evaluating Long-Context Code Understanding☆128Updated last year
- ☆35Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆21Updated 2 years ago
- Harness used to benchmark aider against SWE Bench benchmarks☆79Updated last year