☆20Feb 2, 2026Updated 5 months ago
Alternatives and similar repositories for COSMOS
Users that are interested in COSMOS are comparing it to the libraries listed below.
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆26Feb 18, 2025Updated last year
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆21Jul 18, 2025Updated 11 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- An open-source implementation of multi-grouped query attention from the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un …☆19Dec 17, 2025Updated 6 months ago
- ☆15Apr 26, 2025Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆27Nov 11, 2024Updated last year
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated 2 years ago
- [NeurIPS 2024] VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections☆22Oct 15, 2024Updated last year
- Norm-Based Curriculum Learning for Neural Machine Translation (ACL 2020)☆18Aug 1, 2020Updated 5 years ago
- [ICLR 2025] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆32Oct 9, 2025Updated 8 months ago
- gradient-free, highly parallelizable trajectory optimization☆17Nov 24, 2025Updated 7 months ago
- Coq formalization of algorithms due to Tarjan and Kosaraju for finding strongly connected graph components using Mathematical Components …☆18Mar 3, 2026Updated 3 months ago
- An experiment to see if ChatGPT can improve the output of the Stanford Alpaca dataset☆12Mar 29, 2023Updated 3 years ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- [NeurIPS 2024 D&B Track] DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation☆13Mar 5, 2025Updated last year
- ☆11Apr 16, 2026Updated 2 months ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated 2 years ago
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 9 months ago
- An `AbstractTestSet` implementation and a helper macro for test execution with auto discovery and a neater test summary.☆11Dec 6, 2024Updated last year
- CaMML: Context-Aware MultiModal Learner for Large Models (ACL 2024 SAC Award)☆15May 21, 2025Updated last year
- [CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs☆14Jun 20, 2025Updated last year
- Amazing package to compute decompositions into irreducibles of explicit group representations and the Wedderburn decomposition for endomo…☆12Updated this week
- Code and data from the paper 'Human Feedback is not Gold Standard'☆21May 5, 2026Updated last month
- ☆15Nov 23, 2023Updated 2 years ago
- Library for exact linear algebra, a C++ template-library based originally on LinBox intended for F4-like implementations☆18Dec 15, 2012Updated 13 years ago
- ☆14Apr 22, 2024Updated 2 years ago
- Lift-style CSS selector transforms based on Scalate's Scuery☆10Aug 23, 2012Updated 13 years ago
- Julia package for generating random quantum states and processes according to a number of natural distributions.☆13Dec 16, 2022Updated 3 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…☆15May 18, 2024Updated 2 years ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- A decoding algorithm for quantum error correcting codes.☆19May 18, 2026Updated last month
- Code for "Unlearning Traces the Influential Training Data of Language Models"☆13Jun 13, 2024Updated 2 years ago
- Generic lab tools in Julia☆15Apr 20, 2026Updated 2 months ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆14Oct 3, 2024Updated last year
- Some examples on how egui can be used with Bevy engine☆13Jul 16, 2024Updated last year
- This is the Python-based repository for Variational LOCC-assisted quantum circuits for long-range entangled states.☆11Nov 6, 2025Updated 7 months ago
- SpExtor: Sparse Entity Extractor☆11Feb 10, 2020Updated 6 years ago