SILO LANGUAGE MODELS: ISOLATING LEGAL RISK IN A NONPARAMETRIC DATASTORE

SILO IS BUILT BY (1) TRAINING A PARAMETRIC LM ON OPEN LICENSE CORPUS (OLC), A NEW CORPUS WE CURATE WITH 228B TOKENS OF PUBLIC DOMAIN AND PERMISSIVELY LICENSED TEXT AND (2) AUGMENTING IT WITH A MORE GENERAL AND EASILY MODIFIABLE NONPARAMETRIC DATASTORE (E. G., CONTAINING COPYRIGHTED BOOKS OR NEWS) THAT IS ONLY QUERIED DURING INFERENCE.

SYNJAX: STRUCTURED PROBABILITY DISTRIBUTIONS FOR JAX

THE MODELS THAT EXPLICITLY ACCOUNT FOR STRUCTURED OBJECTS, SUCH AS TREES AND SEGMENTATIONS, DID NOT BENEFIT EQUALLY BECAUSE THEY REQUIRE CUSTOM ALGORITHMS THAT ARE DIFFICULT TO IMPLEMENT IN A VECTORIZED FORM.

MEMORY TRANSFORMER

ADDING TRAINABLE MEMORY TO SELECTIVELY STORE LOCAL AS WELL AS GLOBAL REPRESENTATIONS OF A SEQUENCE IS A PROMISING DIRECTION TO IMPROVE THE TRANSFORMER MODEL.