geronimi73/accelerate_tricks

☆15

Alternatives and similar repositories for accelerate_tricks

Users who are interested in accelerate_tricks are comparing it to the libraries listed below.

FranxYao / Retrieval-Head-with-Flash-Attention
Efficient retrieval head analysis with Triton flash attention that supports topK probability
☆13 · Jun 15, 2024 · Updated 2 years ago
chenllliang / MMEvalPro
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
☆25 · Sep 26, 2024 · Updated last year
white127 / SQUAD-2.0-bidaf
☆11 · Aug 8, 2018 · Updated 7 years ago
Yifan-Song793 / GoodBadGreedy
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆31 · Jul 17, 2024 · Updated 2 years ago
vliu15 / qanet
TensorFlow QANet with ELMo
☆15 · Mar 13, 2019 · Updated 7 years ago
mstrise / dep2label-bert
Dependency Parsing as Sequence Labeling with BERT
☆13 · Nov 1, 2020 · Updated 5 years ago
Silin159 / PersonaChat-BART-PeaCoK
☆12 · Nov 10, 2023 · Updated 2 years ago
Ahren09 / SciEvo
A longitudinal dataset for academic literature, including papers, metadata, and citation graphs. Also available on 🤗 HuggingFace and Kag…
☆18 · Sep 6, 2025 · Updated 10 months ago
microsoft / chemistry-qa
☆15 · Nov 6, 2020 · Updated 5 years ago
PKU-TANGENT / ConFiguRe
Dataset and baseline for COLING 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"
☆12 · Jul 27, 2023 · Updated 2 years ago
samchengcs / IKEA-Dataset
A dataset for multimodal machine translation
☆13 · Dec 6, 2021 · Updated 4 years ago
kirianguiller / BertForDeprel
Framework for training dependency parsing models.
☆12 · Jun 12, 2024 · Updated 2 years ago
CVxTz / distill-llm
☆21 · Apr 6, 2024 · Updated 2 years ago
rish-16 / dalle2-pytorch
Unofficial PyTorch implementation of DALL-E 2 by OpenAI
☆10 · Apr 6, 2022 · Updated 4 years ago
WeiminXiong / RationaleCL
Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)
☆12 · Oct 11, 2023 · Updated 2 years ago
akashrajkn / dependency-parser
Neural graph-based dependency parser
☆13 · Dec 20, 2017 · Updated 8 years ago
sdeva14 / sustai21-counter-neural-essay-length
☆10 · Dec 15, 2021 · Updated 4 years ago
Lux0926 / ASPRM
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
☆10 · Mar 2, 2025 · Updated last year
crux82 / squad-it
A large scale dataset for Question Answering in Italian
☆28 · Nov 18, 2018 · Updated 7 years ago
lancopku / DCKD
Code and data for Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction (ECML-PKDD 22)
☆16 · Sep 6, 2022 · Updated 3 years ago
gersongerardcruz / extractive_and_abstractive_text_summarization
A combination of extractive and abstractive text summarization for summarizing long scientific texts
☆16 · Feb 7, 2023 · Updated 3 years ago
Arvid-pku / Overleaf-Bib-Helper
Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar
☆46 · Apr 14, 2025 · Updated last year
gauthierdmn / question_answering
Question Answering task using Deep Learning on SQuAD dataset
☆22 · Dec 8, 2022 · Updated 3 years ago
ljang0 / videowebarena
☆14 · Dec 25, 2024 · Updated last year
jina-ai / textbook
Distill ChatGPT coding ability into a small model (1B)
☆31 · Sep 7, 2023 · Updated 2 years ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆15 · Jun 27, 2024 · Updated 2 years ago
lancopku / MUKI
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19 · Mar 16, 2023 · Updated 3 years ago
F2-Song / ICDPO
The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…
☆16 · Feb 15, 2024 · Updated 2 years ago
anikethjr / NER_Telugu
An LSTM-CRF classifier for NER in Telugu, an Indian language.
☆15 · Sep 4, 2022 · Updated 3 years ago
coolbay / Re2TAL
Repository for the CVPR23 paper Re^2TAL
☆13 · Nov 21, 2025 · Updated 8 months ago
Yifan-Song793 / InfoCL
Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe…
☆14 · Aug 13, 2024 · Updated last year
weleen / awesome-agent
Repository about single/multi-agent, robotics, LLM/VLM/VLA, scientific discovery, etc.
☆20 · Jul 10, 2025 · Updated last year
RenShuhuai-Andy / my-tools
My commonly used tools
☆64 · Jan 7, 2025 · Updated last year
LLM-Systems-Research / orca
Our Clone of Orca used for experimentation
☆19 · Oct 15, 2024 · Updated last year
Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 · Jul 16, 2025 · Updated last year
xdd666t / flutter_ffi
Flutter FFI usage
☆12 · Dec 12, 2022 · Updated 3 years ago
edorado93 / HMM-Part-of-Speech-Tagger
An HMM-based Part of Speech Tagger
☆10 · May 30, 2018 · Updated 8 years ago
TobiasLee / VEC
Visual and Embodied Concepts evaluation benchmark
☆21 · Oct 10, 2023 · Updated 2 years ago
tristan-mcinnis / claude-code-agentic-semantic-memory-system-mcp
This guide provides complete instructions for implementing an Agentic Semantic Memory System that enables Claude agents to:
☆15 · Aug 12, 2025 · Updated 11 months ago