Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
☆15Sep 4, 2024Updated last year
Alternatives and similar repositories for experiments
Users that are interested in experiments are comparing it to the libraries listed below
Sorting:
- The application of multimodal RAG for Sustainable finance☆24Jul 22, 2024Updated last year
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…☆15Mar 16, 2024Updated 2 years ago
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Mar 12, 2026Updated last week
- ☆12Jun 12, 2024Updated last year
- [EMNLP 2024] Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction☆17Nov 9, 2024Updated last year
- Airtable Eazydocs Block by Superblocks.at☆14Feb 13, 2022Updated 4 years ago
- The source code of paper "Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph" in KDD2022.☆15Jan 9, 2023Updated 3 years ago
- ☆11Oct 16, 2023Updated 2 years ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- ☆26May 30, 2023Updated 2 years ago
- Phoenix LiveView + HeadlessUI React web components☆13Nov 6, 2024Updated last year
- Create interactive tables from JSON on the command-line☆26Dec 13, 2018Updated 7 years ago
- Ecto extensions to support auditing data changes in your Schema.☆10Dec 4, 2017Updated 8 years ago
- 克劳德share,一个让你呼吸顺畅的claude4.5 ,支持artifacts☆19Mar 8, 2026Updated last week
- "The purest form of giving is from anonymous to anonymous" - Jay Z☆10Jan 6, 2021Updated 5 years ago
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)☆27Sep 18, 2025Updated 6 months ago
- The tutorial for how to render a map in python and graph data based off coordinates☆17Dec 8, 2022Updated 3 years ago
- PyTorch implementation for our proposed CFIE in EMNLP 2021 paper "Uncovering Main Causalities for Long-tailed Information Extraction".☆26Jan 5, 2022Updated 4 years ago
- Demo repository showcasing how to use reusable workflows to build artifact attestations☆15Mar 9, 2026Updated last week
- Framework Tutorials Repo☆28Jan 29, 2025Updated last year
- ☆41Dec 7, 2025Updated 3 months ago
- GitLab CI/CD templates to automatically connect Gradle/Maven builds to Develocity☆13Mar 11, 2026Updated last week
- ☆27Mar 13, 2024Updated 2 years ago
- Play with neural network calculator!☆14Aug 1, 2025Updated 7 months ago
- Chrome extension that adds a repl.it link to npm packages' pages.☆11Apr 8, 2019Updated 6 years ago
- GPI-Space: Memory Driven Computing and Big Data☆10Jan 2, 2025Updated last year
- MCP server for ROS to control robots via topics, services, and actions.☆30Aug 19, 2025Updated 7 months ago
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆26Jul 22, 2025Updated 7 months ago
- A new repo to demonstrate tutorials for using HuggingFace on Graphcore IPUs.☆12May 3, 2023Updated 2 years ago
- A benchmark of programming tasks for LLMs that supports almost any programming language.☆13Jun 30, 2025Updated 8 months ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- Mahabharata text compiled from multiple sources, split into chunks, parsed into CSV files with metadata. Named entities recognised and in…☆37Apr 27, 2024Updated last year
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆13Oct 20, 2024Updated last year
- PSI-MOD ontology for modified and unmodified amino acid residues☆14Jan 8, 2026Updated 2 months ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 4 months ago
- A Node task which reformats and adds metadata to raw data☆12Mar 12, 2026Updated last week
- Interactive map of Hack Club’s global club network.☆11Jul 27, 2022Updated 3 years ago
- We are exploring the potential impact of Generative AI on Nesta's Missions and work to uncover opportunities and risks that can inform Ne…☆28Jun 24, 2024Updated last year
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Feb 13, 2023Updated 3 years ago