Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prediction results are not consistent #906

Open
lijing28101 opened this issue Mar 7, 2024 · 2 comments
Open

prediction results are not consistent #906

lijing28101 opened this issue Mar 7, 2024 · 2 comments

Comments

@lijing28101
Copy link

Hi, I'm using alphafold2 to predict my protein structure. I found the result models always different no matter for x y z coordinates or plddt even I used the exactly same command line and didn't change any parameter. Sometimes I can even see the 3d structure difference in the visualization.
Could you explain the reason of the variance?
I want to use alphafold to predict the structure for different mutations, so the variance is important to me.

@tcoates5
Copy link

tcoates5 commented Mar 7, 2024

alphafold2 relies on some fundamentally stochastic mechanisms. Most of the time, it finds almost exactly the same top answer, but sometimes there is greater variation, particularly when you have a lower plddt score. There are a number of options you can use to reduce the variance: increasing the recycle parameter, increasing the number of models, increasing the number of structures from each model, setting the seed ahead of time, and so on. Setting the seed in advance is the one option that shouldn't increase your run time. Additionally, anything that appears unstructured (usually long loops) should be expected to vary significantly from run to run and model to model.

@lildeadprince
Copy link

I agree with the explanation above.

But if you still wish to have a stable prediction, then please refer to --random_seed parameter (input value type is a number).
It is used in pseudo-random number generation to be able to have stable sequences of random number. Seed is a starting point for the random generator queue.

Thus if you provide same seed for the same prediction, it will generate the same exact output.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants