AI-Maker-Space / RLXF-Community-Sessions
☆15Updated last year
Alternatives and similar repositories for RLXF-Community-Sessions:
Users that are interested in RLXF-Community-Sessions are comparing it to the libraries listed below
- ☆47Updated last year
- Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..☆64Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- ☆24Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆57Updated last year
- Fine-Tuning LLM and embedding models☆27Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆38Updated 3 weeks ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆23Updated 3 months ago
- ☆15Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Building GPT ...☆17Updated 4 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated 2 weeks ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- ☆26Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆103Updated 4 months ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Updated 6 months ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆44Updated 7 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- Sample notebooks and prompts for LLM evaluation☆124Updated 4 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆108Updated 2 months ago
- ☆22Updated last year
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆13Updated this week
- Playground for Transformers☆48Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆100Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 6 months ago
- PyTorch implementation for MRL☆18Updated last year
- ☆87Updated last year