ArturTanona / grpo_unsloth_docker
☆57 Updated 2 months ago
Alternatives and similar repositories for grpo_unsloth_docker:
Users who are interested in grpo_unsloth_docker are comparing it to the repositories listed below
- ☆131 Updated 2 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆76 Updated last month
- Simple examples using Argilla tools to build AI ☆52 Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 3 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆60 Updated 8 months ago
- LLM reads a paper and produces a working prototype ☆52 Updated 3 weeks ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems" ☆50 Updated 2 months ago
- tiny_fnc_engine is a minimal Python library that provides a flexible engine for calling functions extracted from an LLM. ☆38 Updated 7 months ago
- ☆101 Updated 8 months ago
- Very minimal (and stateless) agent framework ☆43 Updated 3 months ago
- Code for ScribeAgent paper ☆57 Updated 2 months ago
- ☆36 Updated 3 months ago
- ☆24 Updated 3 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆67 Updated 6 months ago
- ☆112 Updated 4 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆93 Updated 4 months ago
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆91 Updated 10 months ago
- ☆51 Updated 9 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆22 Updated last month
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ☆28 Updated 5 months ago
- ☆30 Updated 10 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆60 Updated 9 months ago
- ☆56 Updated 5 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆105 Updated 3 weeks ago
- Jina DeepSearch UI ☆101 Updated this week
- LLM-as-SERP ☆64 Updated 2 months ago
- ☆85 Updated 7 months ago
- CursorCore: Assist Programming through Aligning Anything ☆121 Updated 2 months ago
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a… ☆36 Updated 3 weeks ago
- Evaluation of bm42 sparse indexing algorithm ☆65 Updated 9 months ago