-
Notifications
You must be signed in to change notification settings - Fork 11
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.
whyNLP/LCKV
ErrorLooks like something went wrong!
About
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published