Paper-Weekly09-Resurrecting Recurrent Neural Networks for Long Sequences

最近RWKV特别火,他号称能在线性时间内建模各种序列问题,参数量少泛化能力强,是transformer的有力竞争者。

陈沁宇
陈沁宇
Master Student@PKU

My research interests include natural language processing, machine learning and recommender systems.