Shawn-Guo-CN / Lossless_Text_Compression_with_TransformerLinks

This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.

☆14

Alternatives and similar repositories for Lossless_Text_Compression_with_Transformer

Users that are interested in Lossless_Text_Compression_with_Transformer are comparing it to the libraries listed below

Sorting:

swj0419 / in-context-pretraining
☆54Updated last year
KwanWaiChung / M4LE
Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
☆23Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆63Updated last year
FranxYao / Retrieval-Head-with-Flash-Attention
Efficient retrieval head analysis with triton flash attention that supports topK probability
☆13Updated last year
RZFan525 / Awesome-ScalingLaws
A curated list of awesome resources dedicated to Scaling Laws for LLMs
☆79Updated 2 years ago
hanxuhu / SeqIns
The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…
☆29Updated 11 months ago
Re-Align / AlignTDS
Analyzing LLM Alignment via Token distribution shift
☆17Updated last year
ChengpengLi1003 / DotaMath
☆30Updated 10 months ago
LLaMafia / SFT_function_learning
Explore what LLMs are really leanring over SFT
☆29Updated last year
sail-sg / symbolic-instruction-tuning
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆66Updated 2 years ago
RZFan525 / NLP-PhD-Application-In-The-World
The information of NLP PhD application in the world.
☆37Updated last year
swtheing / PF-PPO-RLHF
☆34Updated last year
shuyhere / about-super-alignment
Feeling confused about super alignment? Here is a reading list
☆43Updated last year
October2001 / ProLong
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆57Updated last year
siyuyuan / coscript
Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning
☆36Updated 2 years ago
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75Updated 5 months ago
Yifan-Song793 / GoodBadGreedy
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆30Updated last year
FranxYao / FlanT5-CoT-Specialization
Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.
☆132Updated 2 years ago
Zanette-Labs / SpeculativeRejection
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
☆52Updated last year
F2-Song / ICDPO
The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…
☆16Updated last year
OpenLMLab / ParallelTokenizer
Use the tokenizer in parallel to achieve superior acceleration
☆20Updated last year
yegcjs / mixinglaws
☆106Updated 3 months ago
HKUNLP / STRING
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆78Updated 11 months ago
Shark-NLP / self-adaptive-ICL
self-adaptive in-context learning
☆45Updated 2 years ago
Hunter-DDM / stablemoe
Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"
☆50Updated 3 years ago
RUCAIBox / CARP
☆17Updated 2 years ago
hemingkx / SpecDec
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
☆44Updated last year
PlusLabNLP / Active-IT
Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"
☆25Updated last year
gmftbyGMFTBY / Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆37Updated 2 years ago
princeton-nlp / CEPE
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆165Updated last year