HarleyCoops / OneShotAquaRATLinks
One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.
☆23Updated 2 months ago
Alternatives and similar repositories for OneShotAquaRAT
Users that are interested in OneShotAquaRAT are comparing it to the libraries listed below
Sorting:
- ☆68Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Updated 9 months ago
- Train your own SOTA deductive reasoning model☆107Updated 10 months ago
- ☆80Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 11 months ago
- Simple GRPO scripts and configurations.☆59Updated 11 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 4 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 4 months ago
- Simple examples using Argilla tools to build AI☆57Updated last year
- ☆124Updated 3 months ago
- ☆104Updated 9 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆126Updated 11 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆123Updated last year
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆86Updated last year
- Function Calling Benchmark & Testing☆92Updated last year
- A user interface for DSPy☆208Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆81Updated last year
- Automating enterprise workflows with multimodal agents☆114Updated last year
- ☆80Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆256Updated last week
- ☆107Updated 2 months ago