[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
โ79Jun 23, 2025Updated last year
Alternatives and similar repositories for Monet
Users that are interested in Monet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ๐๏ธ 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition๐๏ธโ16Sep 19, 2023Updated 2 years ago
- a Jax/Flax inference code of StarCoderโ12Jun 12, 2023Updated 3 years ago
- ๊ด์ด๋ํ๊ต ์ปดํจํฐ ๋น์ AI ๊ฒฝ์ง๋ํ 1๋ฑ ์๋ฃจ์ ์ ๋๋ค.โ15Oct 5, 2022Updated 3 years ago
- ๐ฅ12th place solution on G2Net Detecting Continuous Gravitational Waves๐ฅโ14Jan 4, 2023Updated 3 years ago
- TPU์์ ํ๊ตญ์ด์ฉ LLM ์ถ๋ก ์ ์ํ Jax/Flax ๊ตฌํ์ฒด์ ๋๋ค.โ12Jun 12, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform โข AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ์ปค๋ฒ๋ฆฌ์คํธ - ๋ถ ์ปค๋ฒ ์์ฑ AI ์๋น์คโ13Sep 11, 2022Updated 3 years ago
- [NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverageโ16Sep 2, 2025Updated 9 months ago
- Serving large language model with transformersโ13Oct 18, 2022Updated 3 years ago
- Deploy KoGPT with Triton Inference Serverโ14Nov 18, 2022Updated 3 years ago
- Generate README.md with GPT-3 few-shot learningโ26Oct 19, 2022Updated 3 years ago
- KW ์๋ฆฌ๋ฏธ - ๊ด์ด๋ํ๊ต ๊ณต์ง์ฌํญ ์๋ฆผโ16Jul 23, 2022Updated 3 years ago
- ๐งชcategorical tabnet research part๐งชโ13Apr 12, 2024Updated 2 years ago
- โ13May 15, 2026Updated last month
- Inverse DALL-E for Optical Character Recognitionโ38Oct 14, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Dataset and Evaluation Code for the K-QA Benchmark.โ18May 26, 2024Updated 2 years ago
- LLM์ ํ์ฉํ ๋ํํ ์ ์ฌ ํ๋ก ๊ฒ์ ์์คํ ์ ๋๋ค.โ27Jul 3, 2023Updated 2 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementingโฆโ10Oct 7, 2024Updated last year
- ๐ฅKNOW๊ธฐ๋ฐ ์ง์ ์ถ์ฒ ์๊ณ ๋ฆฌ์ฆ ๊ฒฝ์ง๋ํ 1๋ฑ ์๋ฃจ์ ์ ๋๋ค๐ฅโ44Feb 15, 2022Updated 4 years ago
- [ICLR 2025] ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domainsโ17Mar 4, 2025Updated last year
- A tiny easily hackable implementation of a feature dashboard.โ16Oct 21, 2025Updated 8 months ago
- ๐๋ฐ์ด์ฝ AIํด์ปคํค ๋ํ ์ฐ์์ ์๋ฃจ์ ๐โ22Mar 13, 2024Updated 2 years ago
- ๐ฅ LG-AI-Challenge 2022 1์ ์๋ฃจ์ ์ ๋๋ค.โ13Jun 6, 2023Updated 3 years ago
- Multi-Layer Sparse Autoencoders (ICLR 2025)โ30Feb 6, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer โข AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answeringโ37Sep 20, 2024Updated last year
- ๐ฅSamsung AI Challenge 2021 1๋ฑ ์๋ฃจ์ ์ ๋๋ค๐ฅโ54Nov 12, 2021Updated 4 years ago
- Optimize RandAugment with differentiable operationsโ25Jan 25, 2021Updated 5 years ago
- Modified to support crosscoder training.โ27Feb 4, 2026Updated 4 months ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)โ12Oct 31, 2024Updated last year
- Trains small LMs. Designed for training on SimpleStoriesโ14Sep 15, 2025Updated 9 months ago
- EMNLP 2022: Biomedical NER for the Enterprise with Distillated BERN2 and the Kazu Frameworkโ11Aug 29, 2024Updated last year
- ๐ฅ171st place in Google brain solution๐ฅโ10Jul 25, 2022Updated 3 years ago
- ๐์ ์ฉ์นด๋ ์ฌ์ฉ์ ์ฐ์ฒด ์์ธก AI ๊ฒฝ์ง๋ํ 2๋ฑ ์๋ฃจ์ ๐โ12Dec 5, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Tools for optimizing steering vectors in LLMs.โ22Apr 10, 2025Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problemsโ24Jul 4, 2025Updated 11 months ago
- ๐ ํ ์ค NEXT ML CHALLENGE : ๊ด๊ณ ํด๋ฆญ ์์ธก(CTR) ๋ํ 5๋ฑ ๋ชจ๋ธ ์ ์ถ์ฉ ๋ ํฌ์งํ ๋ฆฌ๐โ25Feb 2, 2026Updated 5 months ago
- Approximating the joint distribution of language models via MCTSโ22Nov 3, 2024Updated last year
- Kotlin Multiplatform App for generating README with AIโ51Oct 25, 2022Updated 3 years ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Featuresโ28Nov 20, 2024Updated last year
- โ59Nov 19, 2024Updated last year