Deep SSMs (state space models), including the entire S4-to-Mamba saga, are a very interesting alternative to transformers. In some of my genomics use cases, Mamba has been easier than transformers to train and to scale to large context windows.

