Questions

When NN adapts by transfer learning, what jnterpretablr neurons are removed and what aren't?

Identify component label by analogous relations

GPT-2-Small Circuits:

Superposition

Hypotheses


[ Evidence level: ? how to quanify this in comparable ways, if possible ? ]