☆52Jun 14, 2024Updated last year
Alternatives and similar repositories for Infinity-Instruct
Users that are interested in Infinity-Instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated 2 years ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 7 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆23Aug 18, 2024Updated last year
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆14Aug 8, 2025Updated 10 months ago
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆26Feb 1, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆27Jul 20, 2024Updated last year
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14May 4, 2024Updated 2 years ago
- An AI-powered coding assistant plugin for the Eclipse IDE.☆14Oct 28, 2025Updated 7 months ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆192Feb 17, 2025Updated last year
- ☆11Oct 25, 2024Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆35Nov 1, 2025Updated 7 months ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated 2 years ago
- ☆15Jun 25, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆46May 19, 2026Updated 3 weeks ago
- Implementation of Gradient Information Optimization (GIO) for effective and scalable training data selection☆14Jun 22, 2023Updated 2 years ago
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated 2 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- a library of works related to Large Language Models (LLMs) based Agent Hallucination☆58Oct 30, 2025Updated 7 months ago
- The code implementation of MuScleLoRA (Accepted in ACL 2024)☆10Dec 1, 2024Updated last year
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆16Sep 20, 2025Updated 8 months ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆12Apr 27, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Procedural data generators suite for synthetic pretraining and formal reasoning☆41Jun 2, 2026Updated last week
- ☆18Feb 20, 2024Updated 2 years ago
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated 2 years ago
- ☆153Apr 16, 2024Updated 2 years ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆226Nov 27, 2025Updated 6 months ago
- Simple Telegram bot Framework☆10Apr 8, 2017Updated 9 years ago
- ☆26Jun 10, 2025Updated 11 months ago
- A vLLM plugin built on the FlagOS unified multi-chip backend.☆54Updated this week
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆23May 21, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- FlagScale is a large model toolkit based on open-sourced projects.☆517Jun 1, 2026Updated last week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated last year
- Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"☆334Oct 19, 2024Updated last year
- The official repo of INF-34B models trained by INF Technology.☆34Jul 25, 2024Updated last year
- ☆129May 27, 2023Updated 3 years ago
- Collections of IR Research☆37May 18, 2025Updated last year
- ☆47Apr 9, 2025Updated last year