☆27Jan 14, 2025Updated last year
Alternatives and similar repositories for latent-gemma
Users that are interested in latent-gemma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆176Jan 16, 2025Updated last year
- Implementation for POET and POET-X for LLM pretraining☆27Mar 12, 2026Updated last month
- ☆207Apr 19, 2025Updated last year
- ☆22Aug 21, 2025Updated 8 months ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆18Nov 21, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆105Mar 6, 2026Updated 2 months ago
- Channels between coroutines in Python☆15Jan 4, 2021Updated 5 years ago
- ☆124Feb 21, 2025Updated last year
- Official implementation of T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition☆20Oct 23, 2024Updated last year
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated last year
- Tools to isolate speaker and transcribe unstructured audio clips☆11Dec 4, 2022Updated 3 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- Code for paper Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach by Zhe Lin, Xiaojun Wan. This…☆14Aug 10, 2021Updated 4 years ago
- A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.☆21Nov 26, 2025Updated 5 months ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 9 months ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- ☆14Nov 19, 2024Updated last year
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆35Apr 13, 2026Updated 3 weeks ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆39Feb 9, 2026Updated 2 months ago
- [EMNLP-2025] R1-Zero on ANY TASK☆30Nov 9, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Sep 22, 2021Updated 4 years ago
- Factored Cognition Primer: How to write compositional language model programs☆52Feb 22, 2023Updated 3 years ago
- LibreTranslate C++ bindings☆18Aug 27, 2021Updated 4 years ago
- ☆42Oct 23, 2025Updated 6 months ago
- ☆10May 17, 2024Updated last year
- ☆11Dec 15, 2025Updated 4 months ago
- 收集用于跨境电商的ChatGPT Prompt☆13Oct 14, 2025Updated 6 months ago
- Learning as you go☆14Oct 25, 2015Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 9 months ago
- python banking project with tkinter and mysql database connection☆16Jun 4, 2021Updated 4 years ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆22Nov 17, 2025Updated 5 months ago
- ☆12Jun 15, 2023Updated 2 years ago
- Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model☆276May 27, 2025Updated 11 months ago
- 🤖 A multilingual translation tool that automatically converts Hugging Face's daily AI research papers into 🇯🇵 Japanese, 🇰🇷 Korean, �…☆18Apr 29, 2026Updated last week
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year