amazon-science / CodeSage
CodeSage: Code Representation Learning At Scale (ICLR 2024)
☆82Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for CodeSage
- Codebase accompanying the Summary of a Haystack paper.☆71Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Just a bunch of benchmark logs for different LLMs☆113Updated 3 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆91Updated 4 months ago
- ☆111Updated last month
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆91Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆97Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆61Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆128Updated this week
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆73Updated last month
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆143Updated 8 months ago
- ☆127Updated 2 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆99Updated last week
- Evaluating LLMs with fewer examples☆133Updated 6 months ago
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆97Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆119Updated 2 weeks ago
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆77Updated 4 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆62Updated last week
- ☆54Updated 5 months ago
- ☆65Updated last month
- Code accompanying "How I learned to start worrying about prompt formatting".☆92Updated last month
- Enhancing AI Software Engineering with Repository-level Code Graph☆90Updated 2 months ago
- ☆149Updated 10 months ago
- evol augment any dataset online☆55Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆105Updated last week
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆71Updated 10 months ago
- Evaluating tool-augmented LLMs in conversation settings☆72Updated 5 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆57Updated 5 months ago