TAU-VAILab / ProtoSnap
☆35Updated 3 months ago
Alternatives and similar repositories for ProtoSnap
Users who are interested in ProtoSnap are comparing it to the libraries listed below
- Training hybrid models for dummies.☆21Updated 4 months ago
- Tools for evaluating OCR performance relative to ground truth.☆10Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆19Updated 7 months ago
- Assign color hues to a collection of text fragments based on embeddings☆20Updated 11 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated this week
- An open source implementation of CLIP☆21Updated 6 months ago
- An easily-trained baby GPT that can stand in for the real thing. Based on Andrej Karpathy's makemore, but set up to mimic a llama-cpp ser…☆28Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 10 months ago
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11Updated last year
- Flow Chart Image-to-Code Generation☆32Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind☆50Updated 3 months ago
- Datasets for training and evaluating Ancient Greek sentence embedding models☆11Updated 10 months ago
- ☆14Updated 5 months ago
- ☆21Updated 2 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆24Updated 3 months ago
- Effort to open-source 10.5 trillion parameter Gemini model.☆17Updated last year
- ☆15Updated last month
- ☆74Updated 7 months ago
- LLM trunk in 2d☆10Updated last month
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆35Updated 3 weeks ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆26Updated this week
- ☆13Updated 2 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆40Updated last month
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- Latent Large Language Models☆18Updated 8 months ago
- Repository for the paper "Will GPT-4 Run DOOM?"☆22Updated 5 months ago
- ☆9Updated last year
- Visual search interface☆11Updated 3 years ago
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆33Updated 6 months ago