Discussion about this post

Ben Recht

I don’t think this detracts from any of your excellent points here, but three things worth pointing out about Hall’s “paper” are that it wouldn’t actually get published anywhere, it used existing data in known repositories, and the instructions to Claude Code were longer than the paper itself.

That last part is critical. Because Hall could write a screenplay for a replication study, the study was instantly automatable. That this is mechanically true now is both remarkable and mundane.

Alex Tolley

Science proceeds in steps, with new experimental discoveries that can be replicated. However, while a new discovery can be published, the replication or, more importantly, the failure to replicate mostly is not. A good "recent" example is the "arsenic bacteria" paper, published in Science in 2010. After roughly 15 years of controversy, it was retracted this year (2025). Most of that failure to replicate was not published. This is a problem.

Cognitive scientist Melanie Mitchell's presentation at NeurIPS 2025 decried the fact that, while AI papers were being easily accepted and published, replication studies that did not confirm the results were very hard to publish. Yet replication is important because biases seep in and distort the research and its interpretation.

Her summary of the talk: https://aiguide.substack.com/p/on-evaluating-cognitive-capabilities

We used to worry about junk research being published. Then we worried about the many non-peer-reviewed journals polluting science with so-so or even poor papers. Now we have GenAI increasing the noise with too many "genre" papers that say very little that is new, first by making papers easier to write and now by generating "new" papers en masse. arXiv doesn't want review papers, and it already has a lot of junk papers, as do similar platforms. The incentives for academic research publications are perverse, resulting in lots of junk and increasing fraud rates. Isn't this rather like the Russian propaganda model of creating lots of false information to bury the truth amongst the lies and to turn off people trying to find the truth?
