Re: [AMBER-Developers] Relaxed, converged and equilibrated

From: Adrian Roitberg <roitberg.ufl.edu>
Date: Fri, 5 Mar 2021 16:37:16 -0500

Ah, interesting.


The problem, which plagues every sampling method, including Markov
States, etc, is that you do not know what you don't know, and you do not
even know if there are other things you should know...

For example, if you run very good MD for a long times, many replicates,
and you never visited a conformation 'next door' but with a high
barrier, you cannot do any estimate of error or probability, unless you
already know that other conformation existed.

Dan Zuckerman has written a bit on these things.

One thing you 'could' do in 'theory', but actually doing it is hard is
to put limits to possible things you do not know about.

For example, if you ran for a certain time and you did not find a very
different conformation, you could estimate the probability of the
existence of a barrier lower than X kcal/mol.

In the limit, you could say: I ran for 1 microsecond, and nothing very
different happened. I estimate that IF there is a barrier to the
unknown, if must be larger than Y, otherwise I would have probably seen
it already.


Adrian


On 3/5/21 3:56 PM, B. Lachele Foley wrote:
> [External Email]
>
> I have thought about this, too. I wonder if we could use something equivalent to a "confidence interval" in statistics. I think any such statement should include time as a qualifier. I don't really like the following, but it's a start - "the simulation is expected to have sampled X% of ensemble-appropriate phase space in the simulated time of Y seconds at 95% confidence". Has someone already done the math on this? I expect someone has.
>
> This could help with the diamond-graphite issue. That is, one could say something like: the system is in equilibrium [at some stated conditions] at 99.999% confidence over an interval of Y million years with complete sampling occurring, on average, every Z milliseconds.
>
> This sort of statistic would be very useful when people run multiple simulations for statistical or other purposes. Running multiple simulations improves sampling. But the sampling applies to a time length defined by the individual simulations (smallest? average? median?).
>
> I do expect that someone has already done work on this. Do you know of it?
>
> :-) Lachele
>
> Dr. B. Lachele Foley (she/her/hers)
> Associate Research Scientist
> Complex Carbohydrate Research Center
> The University of Georgia
> Athens, GA USA
> lfoley.uga.edu
> https://urldefense.proofpoint.com/v2/url?u=http-3A__glycam.org&d=DwIF-g&c=sJ6xIWYx-zLMB3EPkvcnVg&r=dl7Zd5Rzbdvo14I2ndQf4w&m=ewGru8T6Xnkeb3BC2QbSdotH-IkhqW4e-KjS_hgeY2M&s=SvPhYtzBhpD_YC18O0g6AiC42KmwA15qWEiVWMINn8w&e=
> ________________________________
> From: Adrian Roitberg <roitberg.ufl.edu>
> Sent: Friday, March 5, 2021 10:37 AM
> To: amber-developers.ambermd.org <amber-developers.ambermd.org>
> Subject: Re: [AMBER-Developers] Relaxed, converged and equilibrated
>
> [EXTERNAL SENDER - PROCEED CAUTIOUSLY]
>
>
> Not quite...
>
> Feynman' example was for graphite vs diamond.
>
> You can look at diamond essentially forever, with its own phonons
> 'equilibrated', but at longer time scales, probably > 10^15 years, it
> MUST turn into graphite.
>
> So, his fast and slow were for what we would call non-ergodic systems.
>
> In our day to day simulations, if you have a protein that can
> fold/unfold in milliseconds, the time scale separation gets trickier.
>
>
> Adrian
>
>
> On 3/5/21 5:41 AM, Romain M. Wolf wrote:
>> [External Email]
>>
>> I have no strong opinion on this either, but fully agree with Lachele's classification in a strict sense.
>> Just thinking about the (funny?) definition of equilibrium from Richard Feynman's Statistical Mechanics course:
>>
>> "A system is in equilibrium when “all the fast things have happened but the slow things have not.”
>>
>> Applying this to an MD "equilibration" phase, we might argue that the system is "more or less in equilibrium" when all *very* fast things have happened, and all *slower* things are far from having happened.
>>
>> ...regards...romain
>>
>>> On 3 Mar 2021, at 08:31, B. Lachele Foley <lfoley.ccrc.uga.edu> wrote:
>>>
>>> I have no strong opinions on this, but others, notably Adrian, do. I want to make sure I have the same understanding of these words as others.
>>>
>>> The following definitions are what I think others mean and are intended as a starting point for better definitions. Do you agree with them? If not, what should they be? Or what words would you give for these definitions? I tried to avoid deep stat-mech/thermo terminology.
>>>
>>> Relaxed (commonly called 'equilibrated'):
>>> In practical terms, the system has entered a stationary phase with respect to bulk properties that are expected to be in stationary phase per the simulation setup but that are not being held constant. These will typically include one or more of total energy, pressure, volume, density, and temperature. With respect to physics, this means that the simulated system has probably begun to sample configurations of mass and energy (momentum) in proportions consistent with the simulated ensemble (not necessarily the reality being modeled).
>>>
>>> Converged:
>>> For some property, not necessarily a bulk property, a stationary phase, which might unimodal or multimodal, has been sampled enough that meaningful statistical descriptions might be made about it. The system has sampled very well, in ensemble-appropriate proportions, a persistent or metastable subset of the phase space.
>>>
>>> Equilibrated:
>>> The entire system has sampled all available configurations sufficiently that meaningful statistics can be made about any system property. That is, the system has thoroughly sampled an ensemble-appropriate portion of the phase space, in ensemble-appropriate proportions, multiple times.
>>>
>>> Most simulations of any significant complexity can only hope to attain 'converged' with respect to whatever properties are being tracked. It is very difficult to know for certain that other behaviors would not be observed were the simulation to be run longer. I think if there were an easy way to tell, then we would not very often need to do simulations. This problem also impacts our ability to know if 'relaxed' is truly 'relaxed', etc.
>>>
>>> :-) Lachele
>>>
>>> Dr. B. Lachele Foley (she/her/hers)
>>> Associate Research Scientist
>>> Complex Carbohydrate Research Center
>>> The University of Georgia
>>> Athens, GA USA
>>> lfoley.uga.edu
>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__glycam.org&d=DwIGaQ&c=sJ6xIWYx-zLMB3EPkvcnVg&r=dl7Zd5Rzbdvo14I2ndQf4w&m=blvyHE3Kysn_8gKQFX5r42fwC1_ZX38vrSktxkFMmDU&s=JCSpBAobP4SY4xAwB2wzD2u-LrX2ozIoZ1eF-OuBV14&e=
>>> _______________________________________________
>>> AMBER-Developers mailing list
>>> AMBER-Developers.ambermd.org
>>> https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.ambermd.org_mailman_listinfo_amber-2Ddevelopers&d=DwIGaQ&c=sJ6xIWYx-zLMB3EPkvcnVg&r=dl7Zd5Rzbdvo14I2ndQf4w&m=blvyHE3Kysn_8gKQFX5r42fwC1_ZX38vrSktxkFMmDU&s=IgSsfHDoq2c5g63lIivEgQkswz7d97UIy8Mb4j2NRQI&e=
>> _______________________________________________
>> AMBER-Developers mailing list
>> AMBER-Developers.ambermd.org
>> https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.ambermd.org_mailman_listinfo_amber-2Ddevelopers&d=DwIGaQ&c=sJ6xIWYx-zLMB3EPkvcnVg&r=dl7Zd5Rzbdvo14I2ndQf4w&m=blvyHE3Kysn_8gKQFX5r42fwC1_ZX38vrSktxkFMmDU&s=IgSsfHDoq2c5g63lIivEgQkswz7d97UIy8Mb4j2NRQI&e=
> --
> Dr. Adrian E. Roitberg
> V.T. and Louise Jackson Professor in Chemistry
> Department of Chemistry
> University of Florida
> roitberg.ufl.edu
> 352-392-6972
>
>
> _______________________________________________
> AMBER-Developers mailing list
> AMBER-Developers.ambermd.org
> https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.ambermd.org_mailman_listinfo_amber-2Ddevelopers&d=DwIF-g&c=sJ6xIWYx-zLMB3EPkvcnVg&r=dl7Zd5Rzbdvo14I2ndQf4w&m=ewGru8T6Xnkeb3BC2QbSdotH-IkhqW4e-KjS_hgeY2M&s=Ubm3qJU0w8Yf4mwGucJwFznE5291vBxHMdgo7tNF_aw&e=
> _______________________________________________
> AMBER-Developers mailing list
> AMBER-Developers.ambermd.org
> https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.ambermd.org_mailman_listinfo_amber-2Ddevelopers&d=DwIF-g&c=sJ6xIWYx-zLMB3EPkvcnVg&r=dl7Zd5Rzbdvo14I2ndQf4w&m=ewGru8T6Xnkeb3BC2QbSdotH-IkhqW4e-KjS_hgeY2M&s=Ubm3qJU0w8Yf4mwGucJwFznE5291vBxHMdgo7tNF_aw&e=

-- 
Dr. Adrian E. Roitberg
V.T. and Louise Jackson Professor in Chemistry
Department of Chemistry
University of Florida
roitberg.ufl.edu
352-392-6972
_______________________________________________
AMBER-Developers mailing list
AMBER-Developers.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber-developers
Received on Fri Mar 05 2021 - 14:00:02 PST
Custom Search