A report published by DeepMind attempted to interpret a large, 70-billion-parameter Chinchilla model. Results were mixed: the performance of the identified nodes changed across many tasks. Interpreting a large model is hard.