KV Cache & LoRA for minGPT
☆61 · Mar 4, 2026 · Updated 3 months ago
Alternatives and similar repositories for llm_efficiency
Users that are interested in llm_efficiency are comparing it to the libraries listed below.
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations · ☆19 · Oct 18, 2025 · Updated 7 months ago
- ☆11 · Feb 22, 2025 · Updated last year
- CLIP is an open source, multimodal computer vision model and it's awesome! · ☆17 · Dec 16, 2024 · Updated last year
- Few-shot Learning with Auxiliary Data · ☆31 · Dec 8, 2023 · Updated 2 years ago
- Replication package for evaluation of code generation metrics · ☆17 · Nov 24, 2025 · Updated 6 months ago
- Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + … · ☆55 · Mar 7, 2026 · Updated 3 months ago
- This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) m… · ☆32 · May 3, 2025 · Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment. · ☆27 · Nov 14, 2024 · Updated last year
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment · ☆11 · Apr 6, 2025 · Updated last year
- ☆15 · Nov 22, 2023 · Updated 2 years ago
- Code for paper Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach by Zhe Lin, Xiaojun Wan. This… · ☆14 · Aug 10, 2021 · Updated 4 years ago
- NeurIPS 2022: Tree Mover’s Distance: Bridging Graph Metrics and Stability of Graph Neural Networks · ☆37 · Aug 4, 2023 · Updated 2 years ago
- Multilingual Pre-training with Language and Task Adaptation for Multilingual Text Style Transfer (ACL 2022) · ☆10 · Sep 22, 2022 · Updated 3 years ago
- Download Web-10K data by querying Bing Image Search · ☆10 · Feb 1, 2022 · Updated 4 years ago
- ☆16 · Oct 17, 2024 · Updated last year
- Interact with GPT-3 through speech · ☆12 · Dec 12, 2022 · Updated 3 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024) · ☆10 · May 31, 2024 · Updated 2 years ago
- Training tiny models to prove hard theorems · ☆77 · Mar 5, 2026 · Updated 3 months ago
- Code for "A Bilingual Generative Transformer for Semantic Sentence Embedding" published at EMNLP 2020. · ☆10 · Nov 20, 2020 · Updated 5 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting? · ☆11 · Apr 18, 2023 · Updated 3 years ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025). · ☆72 · Sep 25, 2025 · Updated 8 months ago
- https://liuzeming01.github.io/XDailyDialog/ · ☆16 · Jun 25, 2023 · Updated 2 years ago
- ☆80 · Feb 18, 2026 · Updated 3 months ago
- ☆22 · Mar 31, 2022 · Updated 4 years ago
- Started as a Team Project for CS690D at UMass Amherst, now turning into pytorch implementation of hyperbolic neural networks using Poinca… · ☆12 · Dec 8, 2022 · Updated 3 years ago
- Implementations of the renormalization group-based diffusion model (RGDM). · ☆15 · Mar 10, 2025 · Updated last year
- ☆11 · Nov 16, 2022 · Updated 3 years ago
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling" · ☆11 · Apr 5, 2022 · Updated 4 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer" · ☆12 · Jun 7, 2021 · Updated 5 years ago
- Monophonic additive synth written in C++. VSTi version + stand-alone player included. · ☆19 · Apr 7, 2010 · Updated 16 years ago
- or the book is also hosted on GitHub at /Hands-On-GPU-Programming-with-Python 3-and-CUDA 10.2 · ☆14 · Mar 16, 2020 · Updated 6 years ago
- Discord Bot in python with rasa nlu, tensorflow, discord api · ☆10 · Oct 15, 2018 · Updated 7 years ago
- a distributed key-value store written in python · ☆14 · Oct 12, 2020 · Updated 5 years ago
- Recent papers on Graph Neural Networks-based Recommender System. · ☆12 · Aug 21, 2023 · Updated 2 years ago
- A comprehensive evaluation framework for the SEA region · ☆29 · Apr 20, 2026 · Updated last month
- Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters · ☆10 · Mar 5, 2024 · Updated 2 years ago
- Inpainting protein sequence and structure · ☆12 · Nov 10, 2023 · Updated 2 years ago
- Code and data for the CIKM2021 paper "Learning Ideological Embeddings From Information Cascades" · ☆10 · Sep 8, 2021 · Updated 4 years ago
- Pytorch Lightning seed project with hydra · ☆18 · Oct 8, 2020 · Updated 5 years ago