[ICML 2026] Dispersion loss counteracts embedding condensation and improves generalization in small language models
☆104Jun 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for LM-Dispersion
Users that are interested in LM-Dispersion are comparing it to the libraries listed below.
- Parameterized multi-agent orchestration framework for Claude Code and Gemini☆97Updated this week
- Ever wondered how popular your GitHub repo is compared to others?☆19Feb 14, 2026Updated 4 months ago
- [ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sa…☆15May 2, 2026Updated last month
- Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors e…☆80Jun 7, 2026Updated last week
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 8 months ago
- Get progress information for an ffmpeg process.☆17Apr 26, 2026Updated last month
- The FreeCAD Robust MCP server and MCP Bridge Workbench/Addon☆119May 11, 2026Updated last month
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated last year
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆17Nov 15, 2024Updated last year
- Local AI photo scoring, culling, and gallery — score, organise, and explore your library with face recognition and semantic search. No cl…☆112Updated this week
- A local macOS HTTP API gateway that exposes Apple apps (Reminders, Messages, etc.) to AI agents☆88Apr 21, 2026Updated last month
- HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion☆12Jul 6, 2024Updated last year
- This repository contains code and data for the paper "TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured T…☆29Jun 12, 2025Updated last year
- Iterator, Result and Option written in Rust, for Python☆59Updated this week
- ☆25Dec 13, 2024Updated last year
- Research on emotion recognition for script characters based on pre-trained BERT and GAT☆13Dec 15, 2023Updated 2 years ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆31Jun 3, 2025Updated last year
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)☆10Apr 15, 2025Updated last year
- A curated list of awesome claude marketplaces and plugins☆105Updated this week
- [ECCV'24 Oral] Momentum Auxiliary Network for Supervised Local Learning☆14Aug 15, 2024Updated last year
- Voice Assistant using Google's Gemini API Key☆22Jul 25, 2024Updated last year
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Oct 18, 2022Updated 3 years ago
- Home Assistant integration for Ajax Security Systems☆93Updated this week
- ☆19Dec 20, 2024Updated last year
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆15Jun 23, 2024Updated last year
- ☆135Updated this week
- [MedIA'23] "YoloCurvSeg: You only label one noisy skeleton for vessel-style curvilinear structure segmentation".☆23Sep 17, 2023Updated 2 years ago
- A light-weight, modular, and high-performance framework for building llm agentic systems.☆99May 27, 2026Updated 2 weeks ago
- ☆15Feb 8, 2025Updated last year
- ☆27May 26, 2025Updated last year
- ☆13Oct 20, 2022Updated 3 years ago
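- UCAS information retrieval course project, news and comment search: targeted collection from at least 4 Chinese social-news websites or channels, with automatic crawling, extraction, indexing, and retrieval of their news and comment content.☆19May 11, 2026Updated last month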
- Covid-19 spread simulator with human mobility and intervention modeling.☆19May 28, 2022Updated 4 years ago
- The project is an official implementation of our paper "RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation…☆10Dec 9, 2020Updated 5 years ago
- ☆26Dec 29, 2023Updated 2 years ago
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆43Jun 24, 2025Updated 11 months ago
- ☆123Jun 7, 2026Updated last week
- [AAAI 2026 Oral] SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene Completion☆34Jan 12, 2026Updated 5 months ago
- [ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | Help your LLM make better use of context documents: a simple attention-based approach☆29Feb 17, 2025Updated last year