Visualising LLM Execution on a GPU?
Given that these computations are happening on a GPU, can the algorithm be visualized in any meaningful way? I'm not expecting to see a teapot rendered if the word "teapot" is found, but something more abstract - maybe even just white noise.
you can look at activations