p1nksnow/MoICE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/p1nksnow/MoICE)

p1nksnow / MoICE

Official implementation for "Mixture of In-Context Experts Enhance LLMs’ Awareness of Long Contexts" (Accepted by Neurips2024)

☆14

Alternatives and similar repositories for MoICE

Users that are interested in MoICE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PositionalHidden / PositionalHidden
View on GitHub
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆12Jun 18, 2024Updated 2 years ago
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
viettmab / SA-DPM
View on GitHub
☆16Jan 28, 2024Updated 2 years ago
TaiMingLu / know-dont-tell
View on GitHub
☆19Oct 14, 2024Updated last year
Trustworthy-ML-Lab / ThinkEdit
View on GitHub
[EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…
☆19Dec 17, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
srivatsan88 / sector
View on GitHub
☆18Dec 6, 2024Updated last year
RUCBM / DelTA
View on GitHub
Code for Paper 'DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards'
☆17May 21, 2026Updated 2 months ago
LCBHSStudent / fvck-this-term-collection-BUPT
View on GitHub
Collection of course design during the 2nd term of GRADE 2 in CS BUPT
☆13Sep 11, 2020Updated 5 years ago
safety-research / inverse-scaling-ttc
View on GitHub
Inverse Scaling in Test-Time Compute
☆26Dec 3, 2025Updated 7 months ago
trestad / Factual-Recall-Mechanism
View on GitHub
The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.
☆13Apr 10, 2024Updated 2 years ago
khainb / CSW
View on GitHub
A novel variant of sliced Wasserstein based on a new slicing technique that utilizes the convolution operator.
☆12Jan 14, 2023Updated 3 years ago
HuyNguyen-hust / flash-attn-101
View on GitHub
☆22Sep 3, 2024Updated last year
TARGET-SIDE-DATA-AUG / TSDASG
View on GitHub
Source Code for <Target-Side Data Augmentation for Sequence Generation>
☆12Oct 6, 2021Updated 4 years ago
shrimai / Focused-Attention-Improves-Document-Grounded-Generation
View on GitHub
☆21Sep 10, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xchhuang / pytorch_sliced_wasserstein_loss
View on GitHub
An unofficial PyTorch implementation of "A Sliced Wasserstein Loss for Neural Texture Synthesis" paper [CVPR 2021].
☆14Nov 10, 2021Updated 4 years ago
brson / being-rust
View on GitHub
Intro to Rust talk
☆15Dec 7, 2022Updated 3 years ago
zdou0830 / crosslingual_summarization_semantic
View on GitHub
☆10Jun 13, 2020Updated 6 years ago
mala-lab / OpenCIL
View on GitHub
Official code for paper "OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning"
☆13Jun 19, 2024Updated 2 years ago
tingyu215 / TS-LLaVA
View on GitHub
TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models
☆17Jan 2, 2025Updated last year
Jometeorie / MultiHopShortcuts
View on GitHub
Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"
☆14Jun 1, 2024Updated 2 years ago
alexcrawford0927 / cyclonetracking
View on GitHub
Lagrangian cyclone tracking algorithm originally developed while at the National Snow and Ice Data Center; modified while at the College …
☆21Jul 2, 2026Updated 3 weeks ago
webis-de / set-encoder
View on GitHub
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
☆19May 23, 2025Updated last year
Cardinalere / Batch-ICL
View on GitHub
Code for paper 'Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning'
☆18Apr 19, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ChangyuChen347 / MaskedThought
View on GitHub
[ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
☆27Jul 9, 2024Updated 2 years ago
2282588541a / HiRAG
View on GitHub
code for paper Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering
☆14Aug 13, 2024Updated last year
JorgeGtz / TextureNets_implementation
View on GitHub
☆12Jun 3, 2020Updated 6 years ago
VinAIResearch / JointIDSF
View on GitHub
BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)
☆88Jul 22, 2024Updated 2 years ago
GaoxiangLuo / LLM-BioMed-NER-RE
View on GitHub
[npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction
☆13May 1, 2024Updated 2 years ago
TOM-tym / APG
View on GitHub
Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”
☆16Jan 8, 2024Updated 2 years ago
VinAIResearch / RecGPT
View on GitHub
RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)
☆42Sep 22, 2024Updated last year
UNITES-Lab / Occult
View on GitHub
[ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…
☆13Apr 17, 2025Updated last year
cdhx / QDTQA
View on GitHub
Code for AAAI 2023 research track paper "Question Decomposition Tree for Answering Complex Questions over Knowledge Bases"
☆17Jan 3, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
VITA-Group / WeLore
View on GitHub
[ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications
☆52Oct 30, 2025Updated 8 months ago
lwaekfjlk / Personalized-Text-Generation-Papers
View on GitHub
Collect papers related to personalized text generation
☆18Sep 6, 2021Updated 4 years ago
CityU-AIM-Group / PRR-Imbalance
View on GitHub
[TMI'22] Personalized Retrogress-Resilient Federated Learning Towards Imbalanced Medical Data
☆15Jul 20, 2022Updated 4 years ago
UNITES-Lab / MoE-RBench
View on GitHub
[ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"
☆11Jul 1, 2024Updated 2 years ago
suzy0223 / STSM
View on GitHub
Official code for the paper 'Spatial-temporal Forecasting for Regions without Observations'
☆16Nov 9, 2025Updated 8 months ago
ulab-uiuc / MemReward
View on GitHub
Graph-based experience memory for LLM reward prediction with limited labels. 20% labels → 97.3% Oracle.
☆19Mar 24, 2026Updated 4 months ago
k-gyuhak / CLOM
View on GitHub
☆17Nov 3, 2022Updated 3 years ago