osoleve / glitchlingsLinks
Enemies for your LLM
☆33Updated last week
Alternatives and similar repositories for glitchlings
Users that are interested in glitchlings are comparing it to the libraries listed below
Sorting:
- ☆107Updated 3 months ago
- ☆68Updated 8 months ago
- Verifiers for LLM Reinforcement Learning☆81Updated 4 months ago
- look how they massacred my boy☆63Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- Marketplace ML experiment - training without backprop☆27Updated 4 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- ☆36Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- ☆19Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆61Updated last year
- ☆50Updated 5 months ago
- ☆37Updated 5 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Updated 3 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- ☆94Updated last year
- Agentic Research and Evaluation Suite☆50Updated this week
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆44Updated last year
- An automated tool for discovering insights from research papaer corpora☆137Updated last year
- Verbosity control for AI agents☆66Updated last year
- iMessage RAG MCP Server from Anthropic MCP Hackathon (NYC)☆14Updated 10 months ago
- ☆15Updated last month
- ☆114Updated 6 months ago
- Lego for GRPO☆30Updated 8 months ago
- Embed anything.☆27Updated last year
- Metadspy: The framework for specifying—not programming—language models☆88Updated 7 months ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆42Updated 6 months ago
- Deep research agents using MiniMax M2.1 interleaved thinking☆194Updated last month