Official implementation of "BERTs are Generative In-Context Learners"
☆32Mar 14, 2025Updated last year
Alternatives and similar repositories for bert-in-context
Users who are interested in bert-in-context are comparing it to the libraries listed below.
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆89Feb 10, 2026Updated 4 months ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆24Jul 4, 2025Updated 11 months ago
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 9 months ago
- Code for the paper "Function-Space Learning Rates"☆24Jun 3, 2025Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 9 months ago
- Deep learning models to predict enhancers in different Drosophila embryo tissues☆20Dec 10, 2023Updated 2 years ago
- ADAG: Transluce's MLP neuron-level circuit tracing library☆28Apr 10, 2026Updated 2 months ago
- ☆16May 14, 2024Updated 2 years ago
- ☆15Jun 19, 2025Updated 11 months ago
- ☆11Feb 9, 2024Updated 2 years ago
- Code repository for the study "Evaluating the representational power of pre-trained DNA language models for regulatory genomics"☆25Jun 26, 2024Updated last year
- Evolution-inspired data augmentations for PyTorch-based models for regulatory genomics☆25Jun 3, 2025Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- ☆22Oct 14, 2024Updated last year
- ☆15Dec 15, 2025Updated 5 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆141Dec 28, 2024Updated last year
- Protein representation and design under a single training scheme☆24May 17, 2026Updated 3 weeks ago
- ☆21Mar 27, 2026Updated 2 months ago
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆17Mar 26, 2025Updated last year
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 11 months ago
- Simple Scalable Discrete Diffusion for text in PyTorch☆37Sep 27, 2024Updated last year
- AI powered Virtual Desktop☆16Jun 7, 2026Updated last week
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 5 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆23Dec 14, 2025Updated 6 months ago
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated 4 months ago
- ☆14Dec 12, 2024Updated last year
- Benchmarking scripts for Gaia☆16Apr 10, 2025Updated last year
- LLM training in simple, raw C/CUDA☆15Dec 5, 2024Updated last year
- Equivariant layers for RC-complement symmetry in DNA sequence data☆13Feb 24, 2022Updated 4 years ago
- ☆12Nov 16, 2023Updated 2 years ago
- ☆17Nov 11, 2025Updated 7 months ago
- ☆48May 16, 2026Updated 3 weeks ago
- DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation☆222Feb 18, 2026Updated 3 months ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆13Jan 26, 2025Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- Realistic examples of building evals and optimizing agents with Harbor☆102Apr 23, 2026Updated last month
- ☆33Oct 22, 2024Updated last year
- ☆19Updated this week