astorfi / LLM-Alignment-Project
A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learning, and more. Build your own customizable LLM alignment solution with ease.
☆36Updated last year
Alternatives and similar repositories for LLM-Alignment-Project
Users who are interested in LLM-Alignment-Project are comparing it to the libraries listed below
- ☆39Updated last year
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆58Updated 10 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆60Updated 11 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- Tools for formatting large language model prompts.☆13Updated 2 years ago
- Training Proactive and Personalized LLM Agents☆98Updated 2 weeks ago
- A benchmark for conversational bargaining by language models. In each 20‑round match one LLM plays buyer, one plays seller, and both hold…☆33Updated 5 months ago
- Pivotal Token Search☆144Updated last month
- Very minimal (and stateless) agent framework☆44Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- Viewer for text datasets in formats like HuggingFace, JSONL, etc.☆15Updated 11 months ago
- ☆24Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated 2 years ago
- Example implementation of Iteration of Thought - Give a star if you like the project☆41Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Fast-track AI apps to production with LLaMA 3, Mistral, and other top LLMs!☆21Updated last year
- A framework for hosting and scaling AI agents.☆39Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Updated 2 years ago
- A Python micro framework for creating LLM-driven agents☆23Updated 8 months ago
- Streamlit app for recommending eval functions using prompt diffs☆30Updated 2 years ago
- LLM reads a paper and produces a working prototype☆60Updated 9 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Updated last year
- ☆57Updated 2 weeks ago
- Open source static analysis toolkit for LLM agent plans☆13Updated 5 months ago