Hackworth
Hackworth@lemmy.world
Joined
8 posts • 211 comments
Calling what attention transformers do memorization is wildly inaccurate.
*Unless we’re talking about semantic memory.
*affected
small win
Calling what attention transformers do memorization is wildly inaccurate.
*Unless we’re talking about semantic memory.
*affected
small win