AbanteAI / LoCoDiff-benchLinks
☆25Updated 2 months ago
Alternatives and similar repositories for LoCoDiff-bench
Users that are interested in LoCoDiff-bench are comparing it to the libraries listed below
Sorting:
- ☆151Updated 3 weeks ago
- ☆19Updated 10 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 8 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆133Updated this week
- Resources regarding evML (edge verified machine learning)☆20Updated last year
- Because it's there.☆16Updated last year
- Repository to create traveling waves integrate special information through time☆56Updated 5 months ago
- Lego for GRPO☆30Updated 7 months ago
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆312Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Updated last year
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆22Updated 2 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆240Updated this week
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…☆20Updated 6 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Updated 9 months ago
- Codebase from our first release.☆32Updated this week
- ☆37Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- Editor with LLM generation tree exploration☆81Updated 11 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 4 months ago
- ☆40Updated last year
- Simple orchestration for EC2 spot containers☆19Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- ScalarLM - a unified training and inference stack☆94Updated last month
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Updated 6 months ago
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Updated 6 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- Pivotal Token Search☆142Updated 3 weeks ago
- alternative way to calculating self attention☆18Updated last year