
Kind of. You could theoretically use LoRA for this, but it probably wouldn't have enough capacity to be a proper substitute for the attention mechanism. Instead, a full MLP is trained as the input chunks get processed.
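To make the idea concrete, here's a minimal numpy sketch of training a small MLP on each incoming chunk so its weights, rather than an attention KV cache, store the chunk's key→value associations. Everything here (shapes, learning rate, the two-layer tanh MLP) is an illustrative assumption, not the actual method being discussed:

```python
import numpy as np

rng = np.random.default_rng(0)
d, hidden = 8, 32

# Hypothetical "fast-weight" MLP: its parameters play the role of the KV cache.
W1 = rng.normal(0, 0.1, (d, hidden))
W2 = rng.normal(0, 0.1, (hidden, d))

def mlp(x, W1, W2):
    h = np.tanh(x @ W1)
    return h @ W2, h

def train_on_chunk(keys, vals, W1, W2, lr=0.05, steps=50):
    # Plain gradient descent on MSE so the MLP memorizes the chunk's
    # key -> value pairs (standing in for what attention would retrieve).
    for _ in range(steps):
        out, h = mlp(keys, W1, W2)
        err = out - vals                      # (n, d)
        gW2 = h.T @ err / len(keys)
        gh = (err @ W2.T) * (1 - h**2)        # backprop through tanh
        gW1 = keys.T @ gh / len(keys)
        W1 -= lr * gW1
        W2 -= lr * gW2
    return W1, W2

# Stream two chunks; the MLP absorbs each one instead of caching it.
for _ in range(2):
    keys = rng.normal(size=(16, d))
    vals = rng.normal(size=(16, d))
    loss_before = np.mean((mlp(keys, W1, W2)[0] - vals) ** 2)
    W1, W2 = train_on_chunk(keys, vals, W1, W2)
    loss_after = np.mean((mlp(keys, W1, W2)[0] - vals) ** 2)

print(loss_before, loss_after)
```

A LoRA-style low-rank update would constrain `W1`/`W2` to a small rank-r delta, which is the capacity limitation the comment is pointing at.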

