PinchBench is a benchmarking system for evaluating LLM models as OpenClaw coding agents. Made with 🦀 by the humans at https://kilo.ai
★1,172 · May 14, 2026 · Updated last week
Alternatives and similar repositories for skill
Users that are interested in skill are comparing it to the libraries listed below.
- Claw-Eval is an evaluation harness for evaluating LLMs as agents. All tasks verified by humans. ★593 · May 17, 2026 · Updated last week
- CCS 2023 | Explainable malware and vulnerability detection with XAI in paper "FINER: Enhancing State-of-the-art Classifiers with Feature …" ★12 · Aug 20, 2024 · Updated last year
- Python/JupyterHub implementation of this UofT classic ★10 · Apr 18, 2024 · Updated 2 years ago
- Survey of small language models ★18 · Jul 23, 2024 · Updated last year
- ★56 · Mar 13, 2026 · Updated 2 months ago
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm… ★43 · Apr 12, 2026 · Updated last month
- Harness the scientific methods of Sydney Brenner using AI Agents ★81 · May 15, 2026 · Updated last week
- A collection of Summoner clients and agents featuring example implementations and reusable templates ★24 · Feb 19, 2026 · Updated 3 months ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners" ★45 · Apr 30, 2023 · Updated 3 years ago
- Official repository for Flash Local Linear Attention ★23 · Apr 23, 2026 · Updated last month
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Grap… ★13 · Oct 30, 2024 · Updated last year
- A pre-trained model with multi-exit transformer architecture. ★56 · Dec 10, 2022 · Updated 3 years ago
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu… ★93 · Mar 6, 2026 · Updated 2 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently ★39 · Feb 4, 2026 · Updated 3 months ago
- ★11 · Oct 29, 2024 · Updated last year
- Explanation Optimization ★13 · Oct 16, 2020 · Updated 5 years ago
- The 4th-ranked system for SemEval-2021 Task 4. ★10 · May 7, 2022 · Updated 4 years ago
- Code for unsupervised entity resolution ★10 · Apr 26, 2019 · Updated 7 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens ★19 · Feb 29, 2024 · Updated 2 years ago
- ★11 · Mar 1, 2024 · Updated 2 years ago
- Guidelines for our secondary layer of annotation adding multi-sentence AMR links ★12 · Sep 6, 2017 · Updated 8 years ago
- Demo code for Gemini Live Integration ★13 · Jul 29, 2025 · Updated 9 months ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is} ★14 · Jun 18, 2023 · Updated 2 years ago
- SDLC enforcement for Claude Code: hooks, skills, and wizard setup in one command. TDD, planning, self-review, CI shepherd. ★24 · Updated this week
- ★10 · Aug 19, 2023 · Updated 2 years ago
- A simple ReAct agent that has access to LlamaIndex docs and to the internet to provide you with insights on LlamaIndex itself. ★11 · Feb 23, 2025 · Updated last year
- ★116 · Sep 13, 2025 · Updated 8 months ago
- Rex Ying's Ph.D. Thesis, Stanford University ★42 · Jun 16, 2022 · Updated 3 years ago
- ★27 · Apr 14, 2025 · Updated last year
- Evals for agents ★15 · Dec 4, 2024 · Updated last year
- Code for paper: "PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification", IEEE S&P 2024. ★35 · Aug 10, 2024 · Updated last year
- A package for fine-tuning of pretrained NLP transformers using semi-supervised learning ★14 · Oct 27, 2021 · Updated 4 years ago
- Official repository for the AAAI-21 paper 'Explainable Models with Consistent Interpretations' ★18 · Apr 5, 2022 · Updated 4 years ago
- The theory of mind module for the SWE agent ★104 · May 12, 2026 · Updated last week
- ★14 · Aug 10, 2023 · Updated 2 years ago
- Code for Horizontal Federated Learning blog around Credit Scoring ★10 · Sep 14, 2020 · Updated 5 years ago
- ★12 · Apr 27, 2020 · Updated 6 years ago
- Hugo SEO Module ★10 · Jan 7, 2026 · Updated 4 months ago
- ★19 · Aug 23, 2025 · Updated 9 months ago