yuanzhoulvpi2017 / mamba4transformers
☆13Feb 18, 2024Updated last year
Alternatives and similar repositories for mamba4transformers
Users who are interested in mamba4transformers are comparing it to the libraries listed below
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated 11 months ago
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025☆33Nov 15, 2025Updated 3 months ago
- A course for learning CUDA from scratch☆13Nov 3, 2024Updated last year
- EXL2 quantization generalized to other models.☆10Mar 17, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- trending repositories and news related to AI☆10Mar 22, 2019Updated 6 years ago
- Automated bottleneck detection and solution orchestration☆19Updated this week
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆11May 28, 2024Updated last year
- GuessWhat?! is a challenging task-oriented visual dialogue problem. TensorFlow code for the papers, <Visual Dialogue State Tracking f…☆11May 16, 2024Updated last year
- implementation of XPBD: Position-Based Simulation of Compliant Constrained Dynamics☆11Mar 18, 2019Updated 6 years ago
- ☆12Apr 12, 2024Updated last year
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated last year
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- ☆18Mar 11, 2025Updated 11 months ago
- ☆28Feb 4, 2026Updated last week
- A PyTorch tutorial of Conditional Flow Matching[Lipman22] using MNIST dataset.☆27Aug 26, 2025Updated 5 months ago
- A large dataset (500+ images) of past wildfire from Copernicus EMS using Sentinel-2 images in the period 2017- 2023☆15Oct 20, 2023Updated 2 years ago
- ☆22Nov 29, 2024Updated last year
- ☆14Feb 12, 2024Updated 2 years ago
- Increasing the scale and diversity of chart de-rendering data.☆12Mar 13, 2024Updated last year
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model☆16Feb 24, 2025Updated 11 months ago
- ☆13Apr 15, 2024Updated last year
- A C++ implementation of "Deep Learning from Scratch ❸" (ゼロから作るDeep Learning ❸). A self-study repository.☆16Aug 12, 2020Updated 5 years ago
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated last year
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated last year
- Online Preference Alignment for Language Models via Count-based Exploration☆17Jan 14, 2025Updated last year
- ☆16Mar 13, 2025Updated 11 months ago
- ☆25Nov 13, 2025Updated 3 months ago
- Implementation of Recursive Language Model paper from scratch☆37Feb 10, 2026Updated last week
- An Interactive Causal Analysis Tool☆19Jun 16, 2023Updated 2 years ago
- codebase release for EMNLP2023 paper publication☆19Sep 18, 2025Updated 4 months ago
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Feb 26, 2024Updated last year
- The codes for training sparsity predictor on LLaMA.☆18May 12, 2024Updated last year
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag…☆22Apr 8, 2024Updated last year
- Repository of the paper "CritiQ: Mining Data Quality Criteria from Human Preferences". Code for CritiQ Flow & Training CritiQ Scorer.☆23Dec 11, 2025Updated 2 months ago
- LaTeX template for the Peking University postdoctoral research report☆22Mar 13, 2023Updated 2 years ago
- Emotional First Aid Raw Dataset, a raw corpus of psychological counseling Q&A☆21Jan 13, 2024Updated 2 years ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year