Content deleted Content added
created page with some initial information |
Koca Çingene (talk | contribs) m Small typo Tags: Visual edit Mobile edit Mobile web edit |
||
(24 intermediate revisions by 20 users not shown) | |||
Line 1:
'''Adaptive sampling''' is an approach to [[Sampling (statistics)|sampling]] that uses heuristics to provide [[Efficiency (statistics)|efficiency]]. The term ''adaptive sampling'' represents a general approach to the problem of sampling, rather than being a special method itself, meaning it can be combined with suitable other approaches/methods.
In some real world problems, sampling is implicitly/explicitly needed and used to obtain practical solutions. The sampling process will need resources and efficient usage of these resources is usually crucial. This is why there are multiple sampling methods instead of the brute-force approach.
==Background==▼
Proteins spend a large portion;– nearly 96% in some cases<ref name="10.1016/j.sbi.2011.12.001"/> – of their [[protein folding|folding]] time "waiting" in various [[thermodynamic free energy]] minimas. Consequently, a straightforward simulation of this process would spend a great deal of computation to this state, with the transitions between the states – the aspects of protein folding of greater scientific interest – taking place only rarely.<ref name="Simulation FAQ"/> Adaptive sampling exploits this property to simulate the protein's [[phase space]] in between these states. Using adaptive sampling, molecular simulations that previously would have taken decades can be performed in a matter of weeks.<ref name="10.1016/j.sbi.2010.10.006"/>▼
Let f(x) be a function that is to be sampled. For simplicity, let C(x,'''s''') be the cost for sample x given the previous set of samples '''s''' (For simplicity, we can assume that C(x,'''s''') is constant since sampling cost usually does not depend on the previous samples and the sampling input x to the function. In time-critical systems, where the cost for each sample is strongly related to computation time; usually there are other parameters to the function C like the current time...); and G(x, '''s''') be the gain (anti-cost) from sampling the function at x, given the set of previous samples '''s'''. For example, it can be assumed that G(x, '''s''')=0 if x has already been sampled. The sampling problem is then maximizing our cumulative gain minus cumulative cost. Which usually comes down to sampling the function n times until the next sample's estimated/deterministic cost C(x,s) is larger than the gain G(x,s) of that sample.
==Theory==▼
Adaptive sampling then assumes that given necessary knowledge about the problem, there is a theoretically optimal sequence '''s''' of samples that will maximize the information (gain) induced by that sample; and it is possible to estimate '''s''' using [[Heuristic|heuristics]]. Adaptive sampling usually focuses on estimating the next optimal sample input x, given the previous set of samples. Thus, being adaptive to the current knowledge about the function.
== Computational Molecular Biology ==
In computational [[molecular biology]], adaptive sampling is used to efficiently simulate [[protein folding]] when coupled with molecular dynamics simulations.
▲=== Background ===
▲Proteins spend a large portion
▲=== Theory ===
If a protein folds through the [[metastable state]]s A -> B -> C, researchers can calculate the length of the transition time between A and C by simulating the A -> B transition and the B -> C transition. The protein may fold through alternative routes which may overlap in part with the A -> B -> C pathway. Decomposing the problem in this manner is efficient because each step can be simulated in parallel.<ref name="10.1016/j.sbi.2010.10.006"/>
=== Applications ===
Adaptive sampling is used by the [[Folding@home]] distributed computing project in combination with [[Hidden
=== Disadvantages ===
While adaptive sampling is useful for short simulations, longer trajectories may be more helpful for certain types of biochemical problems.<ref name="10.1145/1364782.1364802"/><ref name="10.1146/annurev-biophys-042910-155245"/>
=== See also ===
* [[Folding@home]]
* [[Hidden
* [[Computational biology]]
* [[Molecular biology]]
Line 24 ⟶ 33:
| refs =
<ref name="10.1016/j.sbi.2011.12.001">{{cite journal | author = Robert B Best | title = Atomistic molecular simulations of protein folding | journal = Current Opinion in Structural Biology | year = 2012 |
<ref name="Simulation FAQ">{{cite web |
<ref name="10.1016/j.sbi.2010.10.006">{{cite journal |
<ref name="10.1145/1364782.1364802">{{cite journal | author = David E. Shaw
<ref name="10.1146/annurev-biophys-042910-155245">{{cite journal | title = Biomolecular Simulation: A Computational Microscope for Molecular Biology |
}}
[[Category:Molecular modelling]]
Line 46 ⟶ 52:
[[Category:Computational chemistry]]
[[Category:Hidden Markov models]]
|