GPT-Alternatives / gpt_alternatives
☆75Updated last year
Alternatives and similar repositories for gpt_alternatives:
Users that are interested in gpt_alternatives are comparing it to the libraries listed below
- ☆139Updated 7 months ago
- AI Alignment: A Comprehensive Survey☆133Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- ☆88Updated last month
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated 10 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆136Updated 7 months ago
- This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"☆205Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆49Updated 11 months ago
- ☆98Updated 2 months ago
- ☆81Updated 10 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆245Updated 5 months ago
- Light local website for displaying performances from different chat models.☆85Updated last year
- The Roadmap for LLMs☆85Updated last year
- ☆96Updated 10 months ago
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆21Updated 11 months ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- LongQLoRA: Extent Context Length of LLMs Efficiently☆163Updated last year
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆49Updated last month
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆148Updated this week
- ☆36Updated 5 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆75Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 4 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆153Updated 8 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 8 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆217Updated this week
- ☆89Updated 2 months ago
- ☆111Updated 7 months ago
- Reformatted Alignment☆114Updated 4 months ago
- ☆48Updated 11 months ago