☆32Nov 4, 2024Updated last year
Alternatives and similar repositories for transformer_ngrams
Users that are interested in transformer_ngrams are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆27Oct 12, 2024Updated last year
- BH hackathon☆14Apr 4, 2024Updated 2 years ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆45Oct 29, 2025Updated 8 months ago
- ☆54Jul 18, 2024Updated last year
- ☆15Feb 12, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Session demos for Build AI Apps at Fabric Conference 2024☆10Jul 3, 2024Updated 2 years ago
- Icechunk Pilot for NASA IMPACT☆14May 31, 2025Updated last year
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 3 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆37Apr 7, 2025Updated last year
- Reformat datasets into zarr☆14Jun 10, 2025Updated last year
- An experimental programming language for ergonomic software verification☆16Jun 27, 2026Updated last week
- Fast and scalable indexing of grids☆12Updated this week
- The Zebrafish Activity Prediction Benchmark measures progress on the problem of predicting cellular-resolution neural activity throughout…☆75Jun 14, 2026Updated 3 weeks ago
- Localize the car in a static map with a particle filter.☆12Mar 30, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [TMLR 2026 J2C Certification] Previously at GenBio ICML 2025☆22Apr 28, 2026Updated 2 months ago
- Exploring Automatic Differentiation with Racket☆11Jan 9, 2022Updated 4 years ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Jun 15, 2024Updated 2 years ago
- Physics-inspired transformer modules based on mean-field dynamics of vector-spin models in JAX☆49Dec 10, 2023Updated 2 years ago
- LLM extensions for Sphinx Documentation☆31Apr 2, 2026Updated 3 months ago
- FlexiTokens☆23Dec 27, 2025Updated 6 months ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated 2 years ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆87Nov 2, 2025Updated 8 months ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Benchmark Suite for Real-Time Robotics☆14May 3, 2023Updated 3 years ago
- ☆17May 8, 2024Updated 2 years ago
- ☆11Apr 21, 2023Updated 3 years ago
- AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regi…☆10Nov 3, 2023Updated 2 years ago
- ☆15May 26, 2026Updated last month
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- A basic pure pytorch implementation of flash attention☆17Oct 28, 2024Updated last year
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆15May 10, 2024Updated 2 years ago
- A PyTorch native platform for training generative AI models☆17Apr 21, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Apr 25, 2024Updated 2 years ago
- ☆13Jan 29, 2022Updated 4 years ago
- Code for the Secure Triplet Loss approach for biometric template security.☆10Apr 22, 2021Updated 5 years ago
- ☆13Feb 20, 2024Updated 2 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆16Sep 8, 2022Updated 3 years ago
- ☆35Apr 28, 2025Updated last year
- Collection of National Science Foundation (NSF) proposal templates: pitches, general applications, etc.☆16Nov 17, 2023Updated 2 years ago