Not All Language Model Features Are Linear(huggingface.co) |
Not All Language Model Features Are Linear(huggingface.co) |
Are we only measuring the tip of the iceberg, and have coalesced towards getting better at iceberg tip measuring?
https://www.lesswrong.com/posts/BduCMgmjJnCtc7jKc/research-r...
Kind of like replacing a portion of unoptimized compiler code with hand written assembly?