A diff tool for language models
☆44Dec 28, 2023Updated 2 years ago
Alternatives and similar repositories for LMdiff
Users who are interested in LMdiff are comparing it to the libraries listed below.
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- Measuring if attention is explanation with ROAR☆22Mar 3, 2023Updated 3 years ago
- Visual search interface☆11Nov 30, 2021Updated 4 years ago
- ☆19Apr 5, 2022Updated 4 years ago
- Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- ☆16Sep 10, 2024Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Personal Infrastructure for Deep Learning based on Pytorch and Tensorflow☆10Jan 10, 2019Updated 7 years ago
- ☆17Dec 11, 2023Updated 2 years ago
- Public helpers for huggingface.co. Now lives in https://github.com/huggingface/huggingface_hub☆13Jul 10, 2022Updated 3 years ago
- Making interactive visualizations fit into a programming workflow.☆16Jul 7, 2020Updated 5 years ago
- ☆22Jul 28, 2020Updated 5 years ago
- Natural language understanding benchmarks for Norwegian☆14Aug 29, 2025Updated 7 months ago
- ☆17Sep 20, 2021Updated 4 years ago
- Rough codebase for exploring initialization strategies for new word embeddings in pretrained LMs☆19Dec 10, 2021Updated 4 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- Project code for "Direct Fitting of Gaussian Mixture Models"☆18Aug 4, 2022Updated 3 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- ☆14Sep 17, 2020Updated 5 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- ☆14Mar 10, 2020Updated 6 years ago
- SPEAR: Programmatically label and build training data quickly.☆110Jun 27, 2024Updated last year
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Jul 29, 2022Updated 3 years ago
- A Python package that demonstrates arbitrary code execution during the install process of a Python package.☆11Sep 28, 2014Updated 11 years ago
- Config files for setting up Multitenant Kubeflow on AWS with spot instances☆10Sep 15, 2020Updated 5 years ago
- ☆11May 14, 2024Updated last year
- Code and pre-trained models for the paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- automatic visual data explorer for datasette☆14Apr 20, 2023Updated 2 years ago
- ☆25Jun 1, 2016Updated 9 years ago
- Advanced Semantics for Commonsense Knowledge Extraction (WWW 2021)☆25Jan 3, 2023Updated 3 years ago
- ☆47Apr 12, 2019Updated 6 years ago
- An Airflow plugin, providing an admin UI to conveniently start backfills. Usable with Airflow 1, 2 and Cloud Composer☆14Aug 16, 2022Updated 3 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Sep 4, 2024Updated last year
- a spreadsheet becomes a streamlit app. trying to figure out when it makes sense to buy a house vs. rent in an expensive city.☆14Sep 28, 2025Updated 6 months ago
- XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).☆17Jul 25, 2024Updated last year
- ☆15Sep 13, 2023Updated 2 years ago
- reviving eyebrowse☆14Oct 6, 2018Updated 7 years ago
- Code for generating Quasimodo, a commonsense knowledge base.☆20Sep 14, 2021Updated 4 years ago