Finally, there are, although rare, also instances of faking results. Diederik Stapel, for example, was a dutch psychologist who was suspended from Tilburg University for faking results. Psychology is not the only field. Hendrik Schön was a German physicist who was supposedly pioneering the field of semiconductors using organic materials, before being found out to be fraudulent and even having his PhD revoked.
In addition to the factors make reliable research hard, replication itself isn't easy for several reasons.
Exact replication might not be possible.
Sometimes, key elements of the method are not clear or are not included. In these cases, exact replication is not possible. When exact replication is not possible, the inability to reproduce results is often blamed on the fact that the method is not right, rather than the conclusion being wrong.
Another reason is that data about the participants is often not complete, leaving out relevant details such as age, sex, chronic illnesses, and medical conditions. There may be confounding factors that prevent others from getting the same result as the original study.
Finally, sometimes the same experiment cannot be performed due to changing circumstances. If I measured the personality of people in the 50s, there is no way to replicate the experiment in the modern age (unless I make a time machine). Another example is the famous Stanford prison experiment, in which people were placed in roles of either “guard” or “prisoner" to see how a position of authority affects human behavior. This resulted in increasingly brutal abuse of the prisoners by the guards, resulted in what became an extremely unethical experiment. With more ethical requirements in place, these kind of experiments can (luckily) no longer be repeated.
But we might not have to be so worried about the replication crisis. Bold research is always going to be risky.
A 2020 paper published in Nature used simulations and came to the conclusion that the method of publishing more risky papers and replicating it later was more efficient when compared to only publishing after replication. Perhaps the pressure to publish new theories is what has led to the modern era and the rapid amount of new discoveries.
We still, of course, want to have the best of both worlds, increasing replicability and maintaining the culture of proposing bold new theories.
There are several solutions to this issue.
Preregistration, the process of defining research questions and analysis beforehand, is starting to become more popular. This will help with p-hacking and prevent researchers from making post hoc hypotheses (after doing the experiment) and presenting them as a priori (made with no knowledge of the results).
The Replication Index is a new tool created by Ulrich Schimmack to understand replicability of studies, or even researchers themselves. It gives a numerical value to understand the credibility of research, giving an easy way to know which studies are more reliable.
There also seems to be a move towards making the data necessary to replicate studies more available, with journals such as PLOS ONE implementing this as a policy in 2014.
The most important thing we have to change, however, is our view of science, particularly of the social sciences. Although we view science as an accumulation of knowledge and facts, psychology and other less developed sciences are
different. Psychology goes through fads, a cycle where there is a lot of enthusiasm about a new theory, people try to apply it in several domains, new negative data comes out, there is confusion over inconsistent and contradictory results, people make ad hoc excuses, and finally people lose interest and move on to new theories.
Psychology isn't a clean road to knowledge. Our brain is one of the most complex things in the universe. There were, and will continue to be a lot of U-turns and dead ends in trying to understand it.
So next time you see a headline talking about the moon and brushing teeth, take it with a grain of salt, knowing that science is always evolving.