Latent Large Language Models
☆19Aug 24, 2024Updated last year
Alternatives and similar repositories for lllm
Users that are interested in lllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 10 months ago
- Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support)☆33Jan 13, 2025Updated last year
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- A customizable GPT in a single page, using OpenAI models text-embedding-ada-002, tts-1, whisper-1, dall-e-3, and gpt-4-vision-preview☆14Jul 9, 2024Updated last year
- ☆56Nov 6, 2024Updated last year
- Lightweight and minimal dom template and ajax helpers☆19Dec 15, 2023Updated 2 years ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- An implementation of the hammer2 filesystem for Plan 9☆19Nov 25, 2018Updated 7 years ago
- Repository for the paper Stream of Search: Learning to Search in Language☆154Feb 3, 2025Updated last year
- Results and analysis scripts for FRB121102 burst analysis.☆12Aug 16, 2021Updated 4 years ago
- 🕵 Given a user query this python module will returns a list of related searches you see on Google search results pages.☆11Sep 28, 2018Updated 7 years ago
- Simple Application Sandboxing☆23Aug 9, 2024Updated last year
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- End to End Machine Learning Pipeline with scikit learn☆12Mar 10, 2021Updated 5 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- Web UI for Bark by Suno.ai built with next.js☆12Jun 15, 2023Updated 2 years ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 7 months ago
- [ICML‘2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen☆17Sep 7, 2024Updated last year
- A standard (and generalised) rotation measures fitting package written in Python.☆12Sep 6, 2023Updated 2 years ago
- ☆26Feb 8, 2026Updated last month
- Python based Radio Frequency Interference Mitigation Routines☆13Jul 9, 2025Updated 8 months ago
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- ☆11Dec 23, 2023Updated 2 years ago
- ☆13Nov 24, 2019Updated 6 years ago
- ☆15Apr 19, 2021Updated 4 years ago
- ☆13Apr 25, 2024Updated last year
- Compression for unit-norm embedding vectors using spherical coordinates☆78Jan 23, 2026Updated 2 months ago
- ☆13Nov 30, 2020Updated 5 years ago
- Resa: Transparent Reasoning Models via SAEs☆48Sep 23, 2025Updated 6 months ago
- 🔄 GitHub Action to update a best-of list.☆15Apr 7, 2022Updated 3 years ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆286Nov 3, 2024Updated last year
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- ☆14Mar 8, 2025Updated last year
- An open source code of the GitHub Copilot Workspace☆13Jun 8, 2024Updated last year
- Compression for Foundation Models☆35Jul 21, 2025Updated 8 months ago
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely☆24Jun 26, 2024Updated last year