Skip to content

HassanJbara/lin-attn-bench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Linear Attention Benchmarks

Look at all of these results from different papers, supposedly of the same models on the same benchmark:

alt alt alt alt

What's up with that? This repository contains independent and reproducible benchmarking results for various linear attention mechanisms. This should hopefully help to clarify the discrepancies seen in the literature.

Currently, the repository includes (each on own branch):

  • Mechanistic Architecture Design (MAD)
  • Pre-training loss comparison

About

Independent and reproducable benchmarking of linear attention models

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published