☆52Feb 5, 2025Updated last year
Alternatives and similar repositories for transformers_zamba2
Users that are interested in transformers_zamba2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of models from the Zamba2 series.☆193Jan 23, 2025Updated last year
- MedConceptsQA: Open source medical concepts QA benchmark☆18Dec 30, 2024Updated last year
- Experimental paper writing linter.☆35Sep 2, 2024Updated last year
- The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as…☆20Sep 17, 2025Updated 8 months ago
- ☆16Dec 16, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆12Jun 1, 2026Updated last week
- REST API for Large Language Models using FastAPI, Redis and LiteLLM☆14Nov 30, 2023Updated 2 years ago
- Stream live plots to a matplotlib figure☆81Apr 18, 2025Updated last year
- Inhibits idle on Wayland when a video device is open☆11Jun 26, 2023Updated 2 years ago
- Using deep research workflow to generate datasets for finetuning LLMs.☆40Oct 9, 2025Updated 8 months ago
- ☆13Dec 12, 2023Updated 2 years ago
- Learning records for building a large language model from scratch☆58Jan 1, 2025Updated last year
- 珠算代码大模型(Abacus Code LLM)☆58Sep 26, 2024Updated last year
- Convert LaBSE model from TF Hub to PyTorch.☆15Jan 15, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Jul 8, 2024Updated last year
- Recreate a Webpack project just by providing an URL.☆11Jan 4, 2023Updated 3 years ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- ☆185Oct 13, 2023Updated 2 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Oct 22, 2023Updated 2 years ago
- Keymap for my SofleKeyboard☆10Apr 8, 2021Updated 5 years ago
- ☆24Jun 30, 2025Updated 11 months ago
- ☆68Feb 1, 2025Updated last year
- Large Language Model in Action☆344Jan 28, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Exploring how ChatGPT can be used to accelerate research in cosmology.☆13Dec 12, 2022Updated 3 years ago
- Obsidian plugin that allows to display contents of Arc sidebar right besides your notes☆14Jan 26, 2024Updated 2 years ago
- A Holistic Embodied Cognition Benchmark☆19Apr 3, 2025Updated last year
- ☆106May 21, 2024Updated 2 years ago
- all the knowledge of the astrophysics data system and the speed of the command line☆14Oct 21, 2025Updated 7 months ago
- ☆22Nov 9, 2024Updated last year
- a flying dog eating bones☆20Jun 22, 2024Updated last year
- ☆21Oct 22, 2021Updated 4 years ago
- Personal Cloud Storage and File Management Solution with Privacy and Security☆18Nov 12, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆21Apr 9, 2025Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Jan 4, 2024Updated 2 years ago
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- ForOpenAI - A Fortran library for OpenAI API.☆20Jan 10, 2024Updated 2 years ago
- Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, from Deepmind☆70Updated this week
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Mar 30, 2023Updated 3 years ago