Software Reverse Engineering and Mechanistic Interpretability
I had an interesting talk with Neel Nanda, a well known mechanistic interpretability researcher working at Anthropic, independently and soon for DeepMind. On our chat he interviewed me on paralles between software reverse engineering and neural-networks reverse engineering. You may read it here.