☆47May 25, 2025Updated 9 months ago
Alternatives and similar repositories for Critical-Tokens-Matter
Users that are interested in Critical-Tokens-Matter are comparing it to the libraries listed below
Sorting:
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- [EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…☆12Dec 15, 2024Updated last year
- ☆17Apr 7, 2025Updated 11 months ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated 10 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 3 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Oct 1, 2024Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Dec 8, 2024Updated last year
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆131Dec 19, 2025Updated 2 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning☆72Nov 24, 2024Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆52May 12, 2025Updated 9 months ago
- ☆10Feb 2, 2021Updated 5 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆11Jul 7, 2020Updated 5 years ago
- Rad-cGAN v1.0: Radar-based precipitation nowcasting model with conditional Generative Adversarial Networks for multiple dam domains☆11Jul 22, 2022Updated 3 years ago
- EANN(Pytorch)☆10Mar 12, 2022Updated 3 years ago
- ☆10Oct 9, 2025Updated 4 months ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆26Feb 12, 2026Updated 3 weeks ago
- (In Progress) JavaScript implementation of RNCryptor☆14Feb 4, 2017Updated 9 years ago
- Just another yet FlappyBird-style game.☆25Jan 15, 2025Updated last year
- Depression detection from social media☆10Dec 19, 2020Updated 5 years ago
- Python implementation of a simple neural network, including AND, OR, and XOR demos.☆11Jun 13, 2019Updated 6 years ago
- Task scheduler with high availability.☆12Jul 26, 2021Updated 4 years ago
- A collection of OCR'd and machine-corrected Greek texts. This base repository contains Git submodules for the different works and an inve…☆11Nov 18, 2014Updated 11 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 5 months ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆13Dec 16, 2024Updated last year
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- 爬虫:用于爬取百度百科中英语料、东方财富网财报、医 学NER中英语料,可实现Deepl多语自动翻译☆12Jul 7, 2021Updated 4 years ago
- Groovy Neural Network library☆10Apr 22, 2017Updated 8 years ago
- Code of the paper "Synthesizing Aspect-Driven Recommendation Explanations from Reviews", IJCAI'20☆10Apr 5, 2024Updated last year
- Python and Scala APIs for enhanced Spark analytics☆12Mar 15, 2017Updated 8 years ago
- A very small implementation of process.nextTick for the browser using requestAnimationFrame☆12Apr 23, 2015Updated 10 years ago
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- Mirror for Java and PHP libraries and text resources to facilitate the use of Inuktitut in its written form on computers and the web