☆32 · Nov 4, 2024 · Updated last year
Alternatives and similar repositories for transformer_ngrams
Users that are interested in transformer_ngrams are comparing it to the libraries listed below.
- Implementation of approximate free-energy minimization in PyTorch · ☆21 · Oct 16, 2021 · Updated 4 years ago
- ☆18 · Mar 11, 2026 · Updated last week
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model · ☆24 · Oct 12, 2024 · Updated last year
- This course handbook has been prepared for the subject 188.399 Introduction to Semantic Systems (VU 2.0) 2024W at TU Wien. Most of the fu… · ☆17 · Jan 15, 2026 · Updated 2 months ago
- ☆13 · May 9, 2025 · Updated 10 months ago
- Repository for the "Chain-of-Thought Reasoning In The Wild Is Not Always Faithful" paper · ☆31 · Nov 28, 2025 · Updated 3 months ago
- LLM extensions for Sphinx Documentation · ☆15 · Feb 26, 2026 · Updated 3 weeks ago
- ☆13 · Jun 29, 2024 · Updated last year
- Summer Scheming!!!!!! · ☆11 · Aug 20, 2020 · Updated 5 years ago
- Icechunk Pilot for NASA IMPACT · ☆13 · May 31, 2025 · Updated 9 months ago
- Code related to the Low Level C# course. · ☆12 · Nov 16, 2022 · Updated 3 years ago
- FlexiTokens · ☆18 · Dec 27, 2025 · Updated 2 months ago
- Reformat datasets into zarr · ☆14 · Jun 10, 2025 · Updated 9 months ago
- A minimal implementation of Drifting Models for 2D toy data. Unlike diffusion/flow models that iterate at inference, drifting models evo… · ☆70 · Feb 13, 2026 · Updated last month
- ☆10 · Nov 15, 2023 · Updated 2 years ago
- ☆24 · Jun 24, 2025 · Updated 8 months ago
- Preprint | Previously at GenBio ICML 2025 · ☆19 · Aug 20, 2025 · Updated 7 months ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space" · ☆25 · Jul 21, 2025 · Updated 8 months ago
- Exploring Automatic Differentiation with Racket · ☆12 · Jan 9, 2022 · Updated 4 years ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻 · ☆14 · Jun 15, 2024 · Updated last year
- A unified library for interacting with various AI APIs through a standardized interface. · ☆35 · Mar 13, 2025 · Updated last year
- Reference implementation of models from Nyonic Model Factory · ☆12 · May 13, 2024 · Updated last year
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. · ☆86 · Nov 2, 2025 · Updated 4 months ago
- The original Shared Recurrent Memory Transformer implementation · ☆34 · Jul 11, 2025 · Updated 8 months ago
- Code for · ☆28 · Dec 16, 2024 · Updated last year
- ☆17 · May 8, 2024 · Updated last year
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is} · ☆14 · Jun 18, 2023 · Updated 2 years ago
- An example of how to create a .NET GraphQL server on Azure Functions that talks to CosmosDB · ☆10 · Dec 10, 2020 · Updated 5 years ago
- Code for reviewers · ☆12 · Oct 8, 2024 · Updated last year
- This package enables inference of header hierarchy in the docling PDF parsing pipeline. · ☆62 · Feb 19, 2026 · Updated last month
- Source Cooperative Web Interface & API · ☆22 · Mar 13, 2026 · Updated last week
- Clustered Compositional Embeddings · ☆11 · Oct 25, 2023 · Updated 2 years ago
- A showcase of companies and platforms leveraging CrewAI to power their AI solutions and workflows. · ☆23 · Jan 13, 2025 · Updated last year
- Residual vector quantization for KV cache compression in large language model · ☆12 · Oct 22, 2024 · Updated last year
- CosmosDB 1 day hackathon · ☆14 · May 31, 2023 · Updated 2 years ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" · ☆15 · May 10, 2024 · Updated last year
- Code for the Secure Triplet Loss approach for biometric template security. · ☆10 · Apr 22, 2021 · Updated 4 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent · ☆17 · Sep 8, 2022 · Updated 3 years ago
- Regular Chat Application with multiple chat sessions and multiple users · ☆16 · Apr 25, 2023 · Updated 2 years ago