Zhu, R. (ECE) – From Neuromorphic Principles to Efficient Neural Language Architectures
This dissertation investigates how neuromorphic and brain-inspired principles can guide the design of efficient neural language architectures. It addresses two central limitations of modern Transformer-based language models: memory growth with context length and high computational cost from dense matrix multiplication. Through studies of spiking neural networks, linear-recurrent language models, hybrid attention architectures, MatMul-free models, and […]