joaopauloschuler/less-parameters-llm

joaopauloschuler / less-parameters-llm

This repository contains the source code for the technical report "Saving 77% of the Parameters in Large Language Models".

☆58

Alternatives and similar repositories for less-parameters-llm

Users that are interested in less-parameters-llm are comparing it to the libraries listed below.

NiuTrans / ForgettingCurve
A benchmark for testing memorization abilities of LMs
☆24 · Oct 15, 2024 · Updated last year
halixness / understanding-CLIP
Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst…
☆17 · Apr 13, 2023 · Updated 3 years ago
tianyi-lab / C3PO
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21 · Apr 9, 2025 · Updated last year
RobMulla / kaggle-ieee-fraud-detection
IEEE-CIS Fraud Detection Kaggle Competition Code
☆10 · Oct 3, 2019 · Updated 6 years ago
Kazuhito00 / RO-GAN-using-Lightweight-GAN
A GAN that generates Ragnarok Online character images using Lightweight GAN
☆12 · May 13, 2021 · Updated 5 years ago
sztistvan / anomaly_detection
Medium post example file
☆14 · Mar 18, 2023 · Updated 3 years ago
malaysia-ai / dataset
Recipes to prepare datasets!
☆15 · Jun 28, 2026 · Updated 3 weeks ago
huseinzol05 / dynamic-batch-RAG-pipeline
Dynamic batching for Document Layout and OCR, suitable for RAG, with extra tools.
☆14 · Nov 25, 2024 · Updated last year
STIInnsbruck / lkgt
ISWC 2020 Tutorial: How to build large knowledge graphs efficiently (LKGT)
☆15 · Nov 2, 2020 · Updated 5 years ago
aidatatools / LLM_Sentinel
A project (LLM Sentinel) that showcases NVIDIA's NeMo-Guardrails and LangChain for improving LLM safety
☆13 · Jan 22, 2025 · Updated last year
weizhuang1113 / Marketing_Campaign_Response_Prediction
☆12 · Aug 17, 2018 · Updated 7 years ago
gustavovargas / stocktradingrl
Stock Trading Model using Q Learning
☆10 · Dec 16, 2020 · Updated 5 years ago
oleg-panichev / WiDS-Datathon-2020-Second-place-solution
WiDS Datathon 2020 Second place solution
☆10 · Jul 6, 2023 · Updated 3 years ago
vmanita / Customer-purchase-prediction
Classification machine learning models to predict the probability of a client accepting a future marketing campaign/product release.
☆17 · Jul 27, 2020 · Updated 5 years ago
gbrlfaria / chaseconv
A Fast and Simple Grand Chase 3D Asset Converter (P3M/FRM Importer and Exporter)
☆11 · Mar 14, 2026 · Updated 4 months ago
clockworkpi / LauncherGoDev
☆38 · Dec 6, 2025 · Updated 7 months ago
NoLifeDev / nx-rs
An NX library written in Rust
☆14 · Mar 16, 2018 · Updated 8 years ago
tsunghan-wu / reverse_vlm
🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…
☆58 · Jan 22, 2026 · Updated 5 months ago
ZhangChengX / T5-Fine-Tuning-for-Question-Generation
T5 Fine-tuning on SQuAD Dataset for Question Generation
☆13 · Feb 16, 2023 · Updated 3 years ago
oneal2000 / PRAG
Code for Parametric RAG, SIGIR 2025 Full Paper
☆233 · May 1, 2025 · Updated last year
BernardOng / kaggle-consumer-credit-default
Kaggle Machine Learning Challenge to Predict Consumer Credit Default
☆14 · Sep 24, 2016 · Updated 9 years ago
runchu-tian / LongPiBench
The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14 · Dec 16, 2024 · Updated last year
Preemo-Inc / text-generation-inference
☆198 · Feb 9, 2024 · Updated 2 years ago
MenaWANG / mlarena
Algorithm-agnostic machine learning toolkit for streamlined model training, diagnostics, and optimization.
☆38 · Jun 30, 2026 · Updated 3 weeks ago
YerbaPage / MGDebugger
Multi-Granularity LLM Debugger [ICSE2026]
☆100 · Jul 6, 2025 · Updated last year
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆41 · Sep 30, 2024 · Updated last year
michael-pacheco / cv-arrow-detection
A simple attempt to use OpenCV's HoughLines to detect arrows.
☆15 · Jun 16, 2017 · Updated 9 years ago
go-skynet / go-ggml-transformers.cpp
Binding to transformers in ggml
☆65 · Jun 24, 2026 · Updated 3 weeks ago
kirudang / Automated_Text_Extraction
☆11 · Oct 9, 2023 · Updated 2 years ago
StealthyPanda / quantumcomputingsim
A library to simulate quantum computations
☆12 · Dec 30, 2023 · Updated 2 years ago
world-modelz / dreamax
A scalable Dreamer implementation in JAX
☆10 · May 22, 2022 · Updated 4 years ago
MidiyaZhu / MePO
Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization
☆13 · Jan 12, 2026 · Updated 6 months ago
leonjovanovic / keywords-extraction
Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT.
☆12 · May 22, 2023 · Updated 3 years ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆129 · Aug 7, 2025 · Updated 11 months ago
QuixiAI / agenticworker
☆24 · Nov 19, 2024 · Updated last year
Sphere-AI-Lab / fda
Implementation of <Model Merging with Functional Dual Anchors>
☆46 · Nov 23, 2025 · Updated 7 months ago
ybabakhin / kaggle-skeleton
Initial skeleton for ML competitions
☆13 · Jul 19, 2018 · Updated 8 years ago
DozerDB / genai-stack
Langchain + Docker + Neo4j
☆10 · Oct 29, 2024 · Updated last year
samborkent / uuid
Implementations of UUID v4 and v7 as defined in the latest RFC4122 draft, including a highly performant custom UUID v8 implementation.
☆12 · May 27, 2025 · Updated last year