GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!
☆20Oct 29, 2022Updated 3 years ago
Alternatives and similar repositories for Little-GPT
Users that are interested in Little-GPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Me☆30Feb 11, 2023Updated 3 years ago
- ☆10May 28, 2024Updated last year
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations☆13Sep 11, 2024Updated last year
- A Project that uses Zillow research data on Quandl, Prophet for time series forecasting, Altair for vega-lite charts and Folium for an cr…☆12Dec 8, 2022Updated 3 years ago
- ☆21May 16, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Sep 22, 2023Updated 2 years ago
- kaggle competition: https://www.kaggle.com/c/web-traffic-time-series-forecasting☆16Sep 12, 2017Updated 8 years ago
- Generic build server☆65May 25, 2014Updated 12 years ago
- Inference Llama 2 in one file of pure Cuda☆17Aug 20, 2023Updated 2 years ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 3 years ago
- A sample app to debug and validate cellular modems on balena devices☆13Jun 5, 2019Updated 6 years ago
- ☆16Jul 29, 2025Updated 9 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Caching for Graphql Resolvers☆19Nov 21, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Generate the WizardCoder Instruct from the CodeAlpaca☆21Jun 27, 2023Updated 2 years ago
- PySOM - The Simple Object Machine Smalltalk implemented in Python☆18Aug 19, 2025Updated 9 months ago
- Non-blocking concurrent hashmap for Haskell☆18Sep 29, 2017Updated 8 years ago
- this is a repository for blooket hacks on my youtube channel, Dog-tp6be SUBSCIRBE!☆71May 18, 2024Updated 2 years ago
- ☆11Feb 3, 2025Updated last year
- Encode and decode pairs of surrogate characters in Python 3☆10Mar 9, 2022Updated 4 years ago
- Free alternative of MegaHack v6-PRO☆11Jan 9, 2022Updated 4 years ago
- A simple one file python script that executes AI processes defined in YML.☆14Mar 26, 2023Updated 3 years ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆185Nov 6, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Port of Facebook's LLaMA model in C/C++☆16Jul 3, 2023Updated 2 years ago
- ☆10Feb 11, 2025Updated last year
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- MIC-CIS entry in PharmaCoNER, Bacteria Biotope (BB 2029) & SeeDev 2019 Shared Tasks in EMNLP '19☆11Feb 22, 2020Updated 6 years ago
- ☆17Apr 7, 2025Updated last year
- A curated list of awesome Deep Learning tutorials, projects and communities.☆14Oct 24, 2016Updated 9 years ago
- Question / answer AI bot for SuperTokens☆13Apr 23, 2023Updated 3 years ago
- ☆13Dec 12, 2025Updated 5 months ago
- Timber + Logger Integration. Make Logcat Prettier, show thread information and more.☆30Jun 18, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [Poster; ICLR 2026] [Oral; Neurips OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆16Apr 15, 2026Updated last month
- Best Blooket Hacks☆27Feb 10, 2023Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Sep 4, 2025Updated 8 months ago
- ☆38Jun 15, 2025Updated 11 months ago
- "Hacks" for online blooket games. Does not require outside information (ie. url, gamepin, sign in information) blooket hacks☆13Jan 25, 2024Updated 2 years ago
- # All paths in this configuration file are relative to Dynmap's data-folder: minecraft_server/dynmap/ # All map templates are defined in…☆14Aug 1, 2021Updated 4 years ago
- 命名实体识别☆12Dec 21, 2020Updated 5 years ago