diff options
author | Vratko Polak <vrpolak@cisco.com> | 2019-02-14 13:09:17 +0100 |
---|---|---|
committer | Vratko Polak <vrpolak@cisco.com> | 2019-02-14 13:09:17 +0100 |
commit | 8940a28e292d715c92f46d7a509d9e4ac0b18f2a (patch) | |
tree | 980ebb563ed34dcee825bdc29652738c02a8b203 /docs/report/introduction/methodology_plrsearch.rst | |
parent | 295553e33d059532cd7c91409e9c38687c003af2 (diff) |
Update PLRsearch methodology.
+ Add a graphical example.
+ Change title underlying characters.
+ Fix incorrect formulations in Caveats.
Change-Id: I3409f539973601433e6b8630f7ed10a4dd9d6154
Signed-off-by: Vratko Polak <vrpolak@cisco.com>
Diffstat (limited to 'docs/report/introduction/methodology_plrsearch.rst')
-rw-r--r-- | docs/report/introduction/methodology_plrsearch.rst | 47 |
1 files changed, 27 insertions, 20 deletions
diff --git a/docs/report/introduction/methodology_plrsearch.rst b/docs/report/introduction/methodology_plrsearch.rst index b21fad3677..cc81088e62 100644 --- a/docs/report/introduction/methodology_plrsearch.rst +++ b/docs/report/introduction/methodology_plrsearch.rst @@ -13,7 +13,7 @@ Eventually, a better description of the abstract search algorithm will appear at this IETF standard: `plrsearch draft`_. Motivation ----------- +`````````` Network providers are interested in throughput a device can sustain. @@ -27,7 +27,7 @@ actually is. We need another algorithm, which takes this indeterminism into account. Model ------ +````` Each algorithm searches for an answer to a precisely formulated question. When the question involves indeterministic systems, it has to @@ -55,7 +55,7 @@ generator detects duplicate packets on receive detection, and reports this as an error. Poisson distribution --------------------- +```````````````````` For given offered load, number of packets lost during trial measurement is assumed to come from `Poisson distribution`_, @@ -112,7 +112,7 @@ in an attempt to speed up convergence, but such methods tend to be scpecific for a particular system under tests. Fitting function coefficients distribution ------------------------------------------- +`````````````````````````````````````````` To accomodate systems with different behaviours, the fitting function is expected to have few numeric parameters affecting its shape (mainly @@ -136,7 +136,7 @@ stdev should refer to the equilibrium estimates based on the trend, not to immediate posterior values. Integration ------------ +``````````` The posterior distributions for fitting function parameters will not be integrable in general. @@ -146,7 +146,7 @@ time, so this time can be used for numeric integration (using suitable method, such as Monte Carlo) to achieve sufficient precision. Optimizations -------------- +````````````` After enough trials, the posterior distribution will be concentrated in a narrow area of parameter space. The integration method should take @@ -169,7 +169,7 @@ it only cares about overall counts of packets sent and packets lost), but debug output from traffic generator lists unidirectional values. Measurement delay ------------------ +````````````````` In a sample implemenation in FD.io CSIT project, there is roughly 0.5 second delay between trials due to restrictons imposed by packet traffic @@ -191,7 +191,7 @@ trial durations with configurable coefficients each subsequent trial being 0.2 second longer). Rounding errors and underflows ------------------------------- +`````````````````````````````` In order to avoid them, the current implementation tracks natural logarithm (instead of the original quantity) for any quantity which is never negative. @@ -202,7 +202,7 @@ Specific functions for frequent operations are defined to handle None correctly. Fitting functions ------------------ +````````````````` Current implementation uses two fitting functions. In general, their estimates for critical rate differ, @@ -257,7 +257,7 @@ During each measurement, two estimates are computed, even though only one (in alternating order) is used for next offered load. Stretch function -________________ +---------------- The original function (before applying logarithm) is primitive function to `logistic function`_. @@ -265,13 +265,13 @@ The name "stretch" is used for related function in context of neural networks with sigmoid activation function. Erf function -____________ +------------ The original function is double primitive function to `Gaussian function`_. The name "erf" comes from error function, the first primitive to Gaussian. Prior distributions -------------------- +``````````````````` The numeric integrator expects all the parameters to be distributed (independently and) uniformly on an interval (-1, 1). @@ -288,7 +288,7 @@ raised to a random power between zero and one; thus it follows a `reciprocal distribution`_. Integrator ----------- +`````````` After few measurements, the posterior distribution of fitting function arguments gets quite concentrated into a small area. @@ -314,17 +314,17 @@ is concentrated at. The second phase (dominated by whole sample population) is actually relevant for the critical rate estimation. Caveats -------- +``````` Current implementation does not constrict the critical rate (as computed for every sample) to the min_rate, max_rate interval. Earlier implementations were targeting loss rate (as opposed to loss ratio). -The chosen fitting functions do not even allow arbitrarily low loss ratios, -especially if the "spread" value is high enough (relative to "mrr" value). +The chosen fitting functions do allow arbitrarily low loss ratios, +but may suffer from rounding errors in corresponding parameter regions. Internal loss rate target is computed from given loss ratio -using the current trial offered load, which increases search instability -if measurements with surprisingly high loss count appear. +using the current trial offered load, which increases search instability, +especially if measurements with surprisingly high loss count appear. As high loss count measurements add many bits of information, they need a large amount of small loss count measurements to balance them, @@ -338,9 +338,16 @@ as the observed trends have varied characteristics. Probably, using a more realistic fitting functions will give better estimates than trend analysis. -.. TODO: Add a 1901 result section when results are available. +Graphical example +````````````````` + +The following picture shows the estimated average of the critical rate +computed by the two fitting functions after each trial measurement +within the 30 minute duration of a PLRsearch test run. -.. TODO: Add a graph of time evolution when 1901 run is available. +.. image:: PLRsearch.svg + +.. TODO: Add a 1901 result section when results are available. .. _plrsearch draft: https://tools.ietf.org/html/draft-vpolak-bmwg-plrsearch-00 .. _RFC 2544: https://tools.ietf.org/html/rfc2544 |