Implementation of MixCE method described in ACL 2023 paper by Zhang et al.
☆20May 29, 2023Updated 2 years ago
Alternatives and similar repositories for MixCE-acl2023
Users that are interested in MixCE-acl2023 are comparing it to the libraries listed below
Sorting:
- Generating Sentences from Disentangled Syntactic and Semantic Spaces☆11Jun 24, 2019Updated 6 years ago
- Official implementation of EMNLP 2021 Paper "Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables"☆12May 15, 2023Updated 2 years ago
- Replication package for evaluation of code generation metrics☆16Nov 24, 2025Updated 3 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆20Dec 4, 2023Updated 2 years ago
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- ☆34Aug 23, 2023Updated 2 years ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- ☆36Jul 7, 2025Updated 7 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- A Large-Scale Dataset for Empathetic Response Generation☆44Apr 22, 2024Updated last year
- code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"☆39Apr 22, 2020Updated 5 years ago
- FocusLLM: Scaling LLM’s Context by Parallel Decoding☆44Dec 8, 2024Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis☆13Dec 26, 2024Updated last year
- OpenVLA for AIRBOT☆15Aug 15, 2024Updated last year
- 赵纯想个人网站☆11Nov 3, 2024Updated last year
- Optical flow library, based on NumPy arrays☆11Nov 30, 2021Updated 4 years ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆44May 10, 2023Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆42Mar 13, 2023Updated 2 years ago
- Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆12Sep 5, 2023Updated 2 years ago
- ☆11Jul 28, 2021Updated 4 years ago
- ☆22Feb 3, 2026Updated last month
- Layers, datasets and utilities for PyTorch☆10Nov 22, 2023Updated 2 years ago
- Generic build server☆64May 25, 2014Updated 11 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed☆27Jul 26, 2025Updated 7 months ago
- Code for using the Grasp Affordance Reasoning dataset☆10Sep 17, 2019Updated 6 years ago
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release☆11Jan 5, 2023Updated 3 years ago
- This Node.js script automates the process of downloading and extracting source maps from websites. It uses Puppeteer to navigate web page…☆18Dec 17, 2025Updated 2 months ago
- Official GraphQLBlog repository. Add your blog posts as pull request!☆13Jan 11, 2023Updated 3 years ago
- 3D Scene Annotation and Dataset Toolkit☆10Jun 11, 2023Updated 2 years ago
- Code for RA-L paper "One-shot Learning for Task-oriented Grasping"☆12May 9, 2024Updated last year
- ☆11Oct 20, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- ☆12Apr 30, 2019Updated 6 years ago