CONE-MT / CONE
☆13 · Updated last year
Alternatives and similar repositories for CONE:
Users who are interested in CONE are comparing it to the repositories listed below:
- 1.4B sLLM for Chinese and English - HammerLLM🔨 ☆44 · Updated 10 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion ☆49 · Updated 11 months ago
- A collection of instruction data and scripts for machine translation. ☆20 · Updated last year
- OpenLLMDE: An open source data engineering framework for LLMs ☆17 · Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆29 · Updated last week
- [EMNLP 2023 Demo] CLEVA: Chinese Language Models EVAluation Platform ☆61 · Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train… ☆56 · Updated 10 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆83 · Updated last year
- Reasoning by Communicating with Agents ☆24 · Updated 4 months ago
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding ☆48 · Updated last year
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co… ☆55 · Updated this week
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆34 · Updated last month
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 · Updated 11 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆44 · Updated last month
- Light local website for displaying performances from different chat models. ☆85 · Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆53 · Updated last week
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆49 · Updated 4 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆33 · Updated last year
- A summary of all open-source large language models and low-cost replication methods for ChatGPT. ☆135 · Updated last year
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆28 · Updated 10 months ago
- MultilingualShareGPT, the free multi-language corpus for LLM training ☆72 · Updated last year
- [NeurIPS2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆92 · Updated 2 months ago
- This repository combines the CPO and SimPO methods for better reference-free preference learning. ☆49 · Updated 6 months ago