[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
☆77 · Jun 23, 2025 · Updated 11 months ago
Alternatives and similar repositories for Monet
Users who are interested in Monet are comparing it to the libraries listed below.
- 1st place solution for the Kwangwoon University Computer Vision AI Competition. ☆15 · Oct 5, 2022 · Updated 3 years ago
- 🥈12th place solution on G2Net Detecting Continuous Gravitational Waves🥈 ☆14 · Jan 4, 2023 · Updated 3 years ago
- Jax/Flax implementation of DeiT and DeiT-III (ViT) ☆19 · Dec 21, 2024 · Updated last year
- A Jax/Flax implementation for Korean LLM inference on TPU. ☆12 · Jun 12, 2023 · Updated 3 years ago
- 커버리스트 - an AI service for generating book covers ☆13 · Sep 11, 2022 · Updated 3 years ago
- KWU Real-time Notice Notification App for Android ☆18 · May 10, 2024 · Updated 2 years ago
- [NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage ☆16 · Sep 2, 2025 · Updated 9 months ago
- Serving large language models with transformers ☆13 · Oct 18, 2022 · Updated 3 years ago
- Deploy KoGPT with Triton Inference Server ☆14 · Nov 18, 2022 · Updated 3 years ago
- 🧪categorical tabnet research part🧪 ☆13 · Apr 12, 2024 · Updated 2 years ago
- ☆13 · May 15, 2026 · Updated 3 weeks ago
- Dataset and Evaluation Code for the K-QA Benchmark. ☆18 · May 26, 2024 · Updated 2 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing… ☆10 · Oct 7, 2024 · Updated last year
- 🥇1st place solution for the KNOW-based Job Recommendation Algorithm Competition🥇 ☆44 · Feb 15, 2022 · Updated 4 years ago
- [ICLR 2025] ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains ☆17 · Mar 4, 2025 · Updated last year
- A tiny easily hackable implementation of a feature dashboard. ☆16 · Oct 21, 2025 · Updated 7 months ago
- 🏆Excellence Award solution for the DACON AI Hackathon🏆 ☆22 · Mar 13, 2024 · Updated 2 years ago
- 🥇 1st place solution for LG-AI-Challenge 2022. ☆13 · Jun 6, 2023 · Updated 3 years ago
- Multi-Layer Sparse Autoencoders (ICLR 2025) ☆30 · Feb 6, 2026 · Updated 4 months ago
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering ☆38 · Sep 20, 2024 · Updated last year
- Repository with sample code using Apollo's suggested engineering practices ☆15 · Dec 16, 2024 · Updated last year
- 🥇1st place solution for Samsung AI Challenge 2021🥇 ☆53 · Nov 12, 2021 · Updated 4 years ago
- 🥉 Codalab-Microsoft-COCO-Image-Captioning-Challenge 3rd place solution(06.30.21) ☆23 · Apr 6, 2022 · Updated 4 years ago
- Optimize RandAugment with differentiable operations ☆25 · Jan 25, 2021 · Updated 5 years ago
- Modified to support crosscoder training. ☆27 · Feb 4, 2026 · Updated 4 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- Trains small LMs. Designed for training on SimpleStories ☆14 · Sep 15, 2025 · Updated 8 months ago
- 🥉171st place in Google brain solution🥉 ☆10 · Jul 25, 2022 · Updated 3 years ago
- Tools for optimizing steering vectors in LLMs. ☆22 · Apr 10, 2025 · Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems ☆24 · Jul 4, 2025 · Updated 11 months ago
- 🏅Toss NEXT ML CHALLENGE: model submission repository for the 5th place entry in the ad click-through rate (CTR) prediction competition🏅 ☆25 · Feb 2, 2026 · Updated 4 months ago
- Kotlin Multiplatform App for generating README with AI ☆50 · Oct 25, 2022 · Updated 3 years ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features ☆27 · Nov 20, 2024 · Updated last year
- ☆60 · Nov 19, 2024 · Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆260 · Updated this week
- The official code repo and data hub of top_nsigma sampling strategy for LLMs. ☆26 · Feb 11, 2025 · Updated last year
- A library for training crosscoders ☆17 · May 28, 2025 · Updated last year
- ☆29 · May 24, 2024 · Updated 2 years ago
- Learning from Negative samples for Biomedical Generative Entity Linking ☆18 · May 25, 2025 · Updated last year