author | Gustaf Rydholm <gustaf.rydholm@gmail.com> | 2023-01-29 21:16:49 +0100 |
---|---|---|
committer | Gustaf Rydholm <gustaf.rydholm@gmail.com> | 2023-01-29 21:16:49 +0100 |
commit | 24f9604116ac15e200567b77f1471122886783f1 (patch) | |
tree | ebe5b2ce4432027340c5a4d6ca6f76d85cf12f2b | |
parent | cf558e7146eabdf1e2c3435af31f4e87f4eb18bd (diff) |
Update readme
-rw-r--r-- | README.md | 1 |
1 files changed, 1 insertions, 0 deletions
@@ -75,6 +75,7 @@ Ideas of mine that did not work unfortunately:
 - [ ] fix linting
 - [x] Modularize the decoder
 - [ ] Add kv cache
+- [ ] Train with Laprop
 - [x] Fix stems
 - [x] residual attn
 - [x] single kv head