attentionmech / dex
Pokedex for LLMs
☆13Updated 3 months ago
Alternatives and similar repositories for dex
Users who are interested in dex are comparing it to the libraries listed below.
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆18Updated last month
- ☆33Updated 7 months ago
- Minimum Description Length probing for neural network representations☆18Updated 6 months ago
- Fork of Flame repo for training of some new stuff in development☆14Updated 3 weeks ago
- ☆18Updated this week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated 9 months ago
- Lottery Ticket Adaptation☆39Updated 8 months ago
- ☆20Updated 5 months ago
- Simple repository for training small reasoning models☆32Updated 6 months ago
- ☆38Updated last year
- Collection of autoregressive model implementations☆86Updated 3 months ago
- A sample pattern for running CI tests on Modal☆18Updated 3 months ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆37Updated 5 months ago
- Code for the paper "Function-Space Learning Rates"☆23Updated 2 months ago
- aesthetic tensor visualiser☆24Updated 3 months ago
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- ☆23Updated 8 months ago
- open source alpha evolve☆66Updated 2 months ago
- alternative way of calculating self-attention☆18Updated last year
- This repository includes the code to download the curated HuggingFace papers into a single Markdown-formatted file☆14Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆31Updated last week
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆20Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 11 months ago
- ☆23Updated 3 months ago
- Training hybrid models for dummies.☆25Updated 6 months ago
- Latent Diffusion Language Models☆69Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Contains Colab notebooks that show cool use cases of different GCP ML APIs.☆10Updated 4 years ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year