The MAMBA Model transformer using a language modeling head on leading (linear layer with weights tied to the enter
an unlimited body of exploration has appeared on additional effective variants of focus to beat these https://k2spiceshop.com/product/liquid-k2-on-paper-online/