aiwaves-cn / Dive-into-LLMs
The official github repo for the open online courses: "Dive into LLMs".
☆10Updated last year
Alternatives and similar repositories for Dive-into-LLMs:
Users that are interested in Dive-into-LLMs are comparing it to the libraries listed below
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆12Updated 5 months ago
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆39Updated 4 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆57Updated last month
- A light-weight tool for evaluating LLMs in rule-based ways.☆46Updated 2 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆101Updated last month
- The rule-based evaluation subset and code implementation of Omni-MATH☆19Updated 4 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆26Updated 4 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆88Updated 2 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Updated 7 months ago
- Evaluate the Quality of Critique☆34Updated 10 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆94Updated last week
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Updated last year
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- ☆25Updated 2 years ago
- A framework for evolving and testing question-answering datasets with various models.☆14Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆48Updated 10 months ago
- ☆22Updated 4 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 10 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆24Updated 6 months ago
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics☆17Updated 2 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆54Updated 6 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆20Updated 2 months ago
- ☆35Updated last year
- a-m-team's exploration in large language modeling☆49Updated 3 weeks ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆32Updated 4 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆16Updated 5 months ago