No papers that are actually concrete. Most of it is just speculation.
I'm not a scientist, and for me personally it's enough to make me spend a bit longer thinking before immediately dismissing all insects as mindless automatons. Most probably are simple biological machines. Jumping spiders are however massive outliers in terms of insect intelligence, and a cursory Google search will provide a wealth of evidence for it.
I personally would also go as far as believing that they dream. I just don't believe there's a reasonable explanation for the REM like state other than some form of dreaming, even if rudimentary.
I'm not going to state that jumping spiders are fully conscious as 100% fact, there's not enough proof for that. But they do have a proven ability to learn, and an ability to make somewhat complex plans. And all I'm trying to say is that we need more research before dismissing them so certainly.
Not taking sides, but his argument hinged on the stable diffusion model not having CSAM in it, and using non-CSAM images in order to generate CSAM.
So he's already answered the question of what the models are trained on.
Whether those models actually are clean/safe is a different question.