dotprod_size_tokens.ipynb

https://colab.research.google.com/drive/1rch6VaG9O1YFJT1wPjjbXyDgXizGT7WV

dotprod_size_tokens_GPTsmall.ipynb

https://colab.research.google.com/drive/18JcQcn7TKhN-1ULNjqQqvst9yJ6ZDhAA

Working on

Get avg of large dot prods, compare to avg of random dotprods

Future Work


Run Congruence tests

It’s still uncertain what embeddings at each intermediate output represent. Thus, we will experiment with performing vector similarity comparisons (between inputs and neuron groups) at various output areas to see if they make semantic sense. We start by comparing initial embeddings with various neurons to see if there are any significant patterns, and try to explain if these patterns are caused by things indicating how the model is representing semantic features. Some approaches might not make sense at first, but if they find patterns, they may be onto something that warrants further investigation.

Dot product tokens and feature neurons:

This is how $W_E(X)$ works: ‣