☆13Feb 18, 2024Updated 2 years ago
Alternatives and similar repositories for mamba4transformers
Users that are interested in mamba4transformers are comparing it to the libraries listed below
- Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Benchmarking Deepseek R1 API response speeds across different providers for performance comparison.☆10Feb 15, 2025Updated last year
- A course for learning CUDA from scratch.☆13Nov 3, 2024Updated last year
- A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor☆30Jan 13, 2026Updated last month
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆17Feb 25, 2025Updated last year
- Trending repositories and news related to AI☆10Mar 22, 2019Updated 6 years ago
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a…☆13Mar 6, 2025Updated last year
- EXL2 quantization generalized to other models.☆10Mar 17, 2024Updated last year
- Automated bottleneck detection and solution orchestration☆19Feb 24, 2026Updated last week
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025☆34Nov 15, 2025Updated 3 months ago
- ☆11May 28, 2024Updated last year
- ☆13May 12, 2025Updated 9 months ago
- GuessWhat?! is a challenging task-oriented visual dialogue problem. TensorFlow code for the papers, <Visual Dialogue State Tracking f…☆11May 16, 2024Updated last year
- ☆12Apr 12, 2024Updated last year
- ☆13Oct 29, 2021Updated 4 years ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- To assess long-text capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated 2 years ago
- ☆18Mar 11, 2025Updated 11 months ago
- This is a project using neural-network reinforcement learning to solve the 8 puzzle problem (or even N puzzle)☆11Mar 24, 2018Updated 7 years ago
- Increasing the scale and diversity of chart de-rendering data.☆12Mar 13, 2024Updated last year
- A large dataset (500+ images) of past wildfires from Copernicus EMS using Sentinel-2 images in the period 2017-2023☆16Oct 20, 2023Updated 2 years ago
- ☆22Nov 29, 2024Updated last year
- ☆14Feb 12, 2024Updated 2 years ago
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model☆16Feb 24, 2025Updated last year
- A PyTorch tutorial on Conditional Flow Matching [Lipman22] using the MNIST dataset.☆27Aug 26, 2025Updated 6 months ago
- Online Preference Alignment for Language Models via Count-based Exploration☆17Jan 14, 2025Updated last year
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated 2 years ago
- A C++ implementation of "ゼロから作るDeep Learning ❸" (Deep Learning from Scratch 3). A self-study repository.☆16Aug 12, 2020Updated 5 years ago
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated last year
- LaTeX template for Peking University postdoctoral research reports☆22Mar 13, 2023Updated 2 years ago
- Codebase release for an EMNLP 2023 paper☆19Sep 18, 2025Updated 5 months ago
- The codes for training sparsity predictor on LLaMA.☆18May 12, 2024Updated last year
- ☆26Nov 13, 2025Updated 3 months ago
- Repository of the paper "CritiQ: Mining Data Quality Criteria from Human Preferences". Code for CritiQ Flow & Training CritiQ Scorer.☆23Dec 11, 2025Updated 2 months ago
- Implementation of Recursive Language Model paper from scratch☆38Feb 10, 2026Updated last month
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag…☆23Apr 8, 2024Updated last year
- ☆49Sep 26, 2025Updated 5 months ago
- Emotional First Aid Raw Dataset, a raw corpus of psychological counseling questions and answers☆21Jan 13, 2024Updated 2 years ago