mjbommar / gpt4-passes-the-bar

GPT-4 Passes the Bar

☆26

Alternatives and similar repositories for gpt4-passes-the-bar

Users who are interested in gpt4-passes-the-bar are comparing it to the repositories listed below.

Breakend / PileOfLaw
A dataset for pretraining language models targeted for legal tasks.
☆133 Updated 2 years ago
jsavelka / statutory_interpretation
☆27 Updated 3 years ago
HeadspaceMeditation / transformer-embeddings
Production-grade embedding generation, for any length of text, for transformer models.
☆23 Updated 2 weeks ago
JoelNiklaus / LegalDatasets
This repository serves as a collection of scrapers procuring and structuring various legal datasets
☆17 Updated 2 years ago
trusthlt / mining-legal-arguments
Mining Legal Arguments in Court Decisions - Data and software
☆68 Updated 2 years ago
MeLeLBGU / SaGe
Code for SaGe subword tokenizer (EACL 2023)
☆25 Updated 6 months ago
privateai / deid-examples
Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.
☆82 Updated last month
openlegaldata / awesome-legal-data
Collection of Datasets for Legal Text Processing
☆106 Updated 2 years ago
ICLRandD / Case2Vec
A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated…
☆26 Updated 2 years ago
SgfdDttt / sara
StAtutory Reasoning Assessment
☆13 Updated 2 years ago
273v / kelvin-public-examples
Kelvin Legal Data OS - Public Examples
☆19 Updated last year
oughtinc / primer
Factored Cognition Primer: How to write compositional language model programs
☆49 Updated 2 years ago
bamman-group / gpt4-books
Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4"
☆69 Updated 2 years ago
alea-institute / nupunkt
Next-generation Punkt sentence boundary detection with zero dependencies
☆17 Updated 2 months ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆64 Updated last year
Alignment-Lab-AI / datagen
A pipeline for using API calls to agnostically convert unstructured data into structured training data
☆30 Updated 9 months ago
Liquid-Legal-Institute / Legal-LLMs-GPTs
Large Language Models (LLMs) and Generative Pre-trained Transformers (GPTs) for Legal
☆91 Updated 2 years ago
minalee-research / coauthor-interface
☆95 Updated last year
jsavelka / sbd_adjudicatory_dec
☆18 Updated 4 years ago
reasoning-machines / prompt-lib
A set of utilities for running few-shot prompting experiments on large-language models
☆121 Updated last year
neelguha / legal-segmenter
A simple library for segmenting legal texts
☆17 Updated 2 years ago
weaviate / biggraph-wikidata-search-with-weaviate
Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine
☆31 Updated 3 years ago
MurtyShikhar / LanguagePatching
Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"
☆19 Updated 2 years ago
darrow-labs / LegalLens
☆8 Updated 11 months ago
Pleias / OCRoscope
Small Python package to measure OCR quality and other related metrics.
☆23 Updated last year
Muhtasham / summarization-eval
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆100 Updated last year
dioptra-ai / anchor-gpt
☆13 Updated 2 years ago
coastalcph / lexlms
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
☆20 Updated last year
CG80499 / trlx-with-T5
[Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆47 Updated 2 years ago
google-research-datasets / c4repset
C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs
☆10 Updated 2 years ago