2. Finding Important Heads for Most Recent Name Movers

most_recent_S_attn_pat.ipynb

Based on IOI findings, we expect to find:

Can be done w/ just GPT-2-small

<<<<<<

Direct Logit Attribution [IOI paper]

Was logit diff commented on? If so, how?
- Only states average logit difference X over Y examples. The rest of its info was only used in activation patching comparisons (no figures)
How was activation head patching described?
- Include the heatmap Figure

Direct Logit Attribution [IOI paper]

<<<