Official repo of dataset-decomposition paper [NeurIPS 2024]
☆21Jan 8, 2025Updated last year
Alternatives and similar repositories for ml-dataset-decomposition
Users that are interested in ml-dataset-decomposition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Nov 13, 2024Updated last year
- ☆20Nov 28, 2024Updated last year
- Produces a serialized hardware report of the physical infrastructure for automation☆26Jan 27, 2026Updated last month
- Control LLM☆22Apr 6, 2025Updated 11 months ago
- CodeRepoQA dataset☆15Feb 19, 2025Updated last year
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated 2 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆14Jan 24, 2026Updated last month
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆57Oct 14, 2025Updated 5 months ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- Stick-breaking attention☆63Jul 1, 2025Updated 8 months ago
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- ☆27Jul 9, 2024Updated last year
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Nov 22, 2022Updated 3 years ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 10 months ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Pragmatic approach to parsing import profiles for CI's☆12Jul 1, 2024Updated last year
- ☆124Feb 21, 2025Updated last year
- A LiDAR visualization tool for HeLiMOS dataset☆26Sep 4, 2024Updated last year
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- Progetto per la prova finale di Ingegneria del Software 2023-2024 al Politecnico di Milano☆10Oct 19, 2024Updated last year
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆224Jul 25, 2025Updated 7 months ago
- 6th Position Solution Code for Kaggle - LLM Science Exam Competition☆24Jul 8, 2024Updated last year
- HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models☆13Mar 6, 2025Updated last year
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆24Feb 3, 2026Updated last month
- LVCS@Tesla.com☆12Jan 16, 2026Updated 2 months ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- ☆34Sep 10, 2024Updated last year
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- NestJS project template, configured with prisma and ejs☆12Dec 1, 2024Updated last year
- When you want to be a brilliant man, you should write down something interesting thing for recall.☆12Dec 18, 2022Updated 3 years ago
- Demonstration of Jackknife Variational Inference for Variational Autoencoders, related to ICLR 2018 paper.☆22Feb 21, 2018Updated 8 years ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training☆262Aug 9, 2025Updated 7 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- splits videos into scenes with gpt-4o-mini and saves them separately☆12Dec 19, 2024Updated last year
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 8 months ago
- ☆15Sep 10, 2024Updated last year
- ☆12Jun 15, 2023Updated 2 years ago